BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy12185
(317 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|350415610|ref|XP_003490694.1| PREDICTED: cathepsin O-like [Bombus impatiens]
Length = 355
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 163/316 (51%), Positives = 210/316 (66%), Gaps = 19/316 (6%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
+ V IV +++LCFLAIP++VS L+LF ++ RY KSY +E++ RFK F K
Sbjct: 4 RTVAVIVLVVSLCFLAIPIRVSPNTSNGDLKLFQNYVMRYNKSYRNDPTEYEERFKRFLK 63
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHD- 117
SL IE++N R S ESA YG+TEFSD+SE+EF + L + KHV S+H+ H
Sbjct: 64 SLRHIEKMNGLRPSQESAYYGLTEFSDMSEDEFLSLTLLPDLPARGEKHVNESYHRRHHL 123
Query: 118 -HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
N VKK IP++ DWR+ G+I VRNQ +CGACWAFSTVE ESM+A
Sbjct: 124 LQSTNRVKK---------SVSIPLRFDWRDKGVITPVRNQGSCGACWAFSTVEVVESMYA 174
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
+KNGTL +LSVQE+IDCA N N GC GGD C+LL W+ +KV + ES YPL+ K + CK
Sbjct: 175 IKNGTLHMLSVQEMIDCAKNSNFGCEGGDICSLLSWLLASKVQIFQESTYPLVGKTSMCK 234
Query: 237 --RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
+ +GVKI+ + CD + +E +L +ATHGPV AAVNAL+WQ YLGGVIQY+CD
Sbjct: 235 LGKMIDKASGVKIRDFNCDNFVDAEDELLITVATHGPVAAAVNALSWQNYLGGVIQYHCD 294
Query: 295 GSLANINHAVQIVGYD 310
S N+NHAVQIVGYD
Sbjct: 295 SSFDNLNHAVQIVGYD 310
>gi|328789602|ref|XP_623690.2| PREDICTED: cathepsin O-like [Apis mellifera]
Length = 368
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 157/314 (50%), Positives = 213/314 (67%), Gaps = 18/314 (5%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
K ++F + +++LCFLAIP+KV P+ + ++LF ++ RY KSY + SE++ RFK F++
Sbjct: 20 KTIVFTILVVSLCFLAIPIKVD-PDNNEDIKLFQNYVIRYNKSYRNNPSEYEERFKRFQR 78
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDH 118
SL IE +N R S ESA YG+TEFSD+SE EF L + KH+ S+H+ H
Sbjct: 79 SLQHIERMNGLRSSQESAYYGLTEFSDMSENEFLLHTLLPDLPIRGEKHMNASYHRKHQI 138
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ +K RSI+ IP++ DWR+ G+I VR+Q +CGACWAFST+E ESM A+K
Sbjct: 139 SIDRMK-RSIS--------IPLRFDWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIK 189
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-- 236
NGTL LSVQE+IDCA N N GC GGD C+LL W+ ++KV + ES YPL+ CK
Sbjct: 190 NGTLHSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLISKVQILQESIYPLVGMTGTCKLG 249
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
+ +KI+ +TCD+ + +E +L +ATHGPV AAVNAL+WQ YLGGVIQY+CDGS
Sbjct: 250 KMTDKTFNIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNYLGGVIQYHCDGS 309
Query: 297 LANINHAVQIVGYD 310
N+NHAVQI+GYD
Sbjct: 310 FNNLNHAVQIIGYD 323
>gi|380026170|ref|XP_003696831.1| PREDICTED: cathepsin O-like [Apis florea]
Length = 368
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/310 (50%), Positives = 210/310 (67%), Gaps = 10/310 (3%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
K + F + +++LCFLAIP+KV P+ + ++LF ++ RY KSY SE++ RFK F++
Sbjct: 20 KTIAFTILVVSLCFLAIPIKVD-PDNNEDIKLFQNYVVRYNKSYKNDPSEYEERFKRFQR 78
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
SL IE +N R S ESA YG+TEFSD+SE+EF L H++ + + KH + + H
Sbjct: 79 SLQHIERMNGLRSSQESAYYGLTEFSDMSEDEF----LLHTLLPDLPIRGEKHKNAPY-H 133
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + T + IP + DWR+ G+I VR+Q +CGACWAFST+E ESM A+KNGTL
Sbjct: 134 RKHQVSTDRMKRSISIPSRFDWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIKNGTL 193
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKAT 240
LSVQE+IDCA N N GC GGD C+LL W+ V+KV + ES YPL+ CK +
Sbjct: 194 HSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLVSKVQILQESIYPLVGMTGTCKLGKMTD 253
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
G+KI+ +TCD+ + +E +L +ATHGPV AAVNAL+WQ YLGGVIQY+CDGS N+
Sbjct: 254 KAFGIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNYLGGVIQYHCDGSFDNL 313
Query: 301 NHAVQIVGYD 310
NHAVQI+GYD
Sbjct: 314 NHAVQIIGYD 323
>gi|383852175|ref|XP_003701604.1| PREDICTED: cathepsin O-like [Megachile rotundata]
Length = 370
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 216/318 (67%), Gaps = 16/318 (5%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
+ V V +++LCFL IP++V + ++LF ++ RY K+Y +E++ RF+ F++
Sbjct: 20 RTVALTVLIVSLCFLVIPIRVDPDPSSEDIKLFKNYVTRYNKTYRNDPTEYEERFQRFQR 79
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDH 118
SL IE +N R SPESA YG+TEFSD++E+EF+++ L + KH +H+ H
Sbjct: 80 SLRHIETMNSLRSSPESAFYGLTEFSDMTEDEFRSQALSPDLAARGEKHATAPYHRLHRL 139
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
H++ +R+ T +P++ DWR+ G+I VR+Q CGACWAFSTVE AESM A++
Sbjct: 140 KHSNRVRRA--------TVVPLRFDWRDKGVITPVRSQGACGACWAFSTVEVAESMFAIQ 191
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
NGTL LSVQE+IDCA N N GC GGD C+LL W+ ++KV + E YPL K CK +
Sbjct: 192 NGTLYPLSVQEMIDCAKNSNFGCEGGDICSLLSWLLLSKVQIFQEHAYPLTRKTDTCKLE 251
Query: 239 ATSP--NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
T+ +GV+IK +TCD+ + +E +++ +ATHGPV AAVNAL+WQ YLGGVIQ++CDGS
Sbjct: 252 KTAGKISGVRIKDFTCDSFVDAEDELVSTLATHGPVAAAVNALSWQNYLGGVIQFHCDGS 311
Query: 297 LANINHAVQIVGYDNYSR 314
++NHAVQIVGYD ++
Sbjct: 312 FDSLNHAVQIVGYDKSAK 329
>gi|340710428|ref|XP_003393792.1| PREDICTED: cathepsin O-like [Bombus terrestris]
Length = 355
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 153/304 (50%), Positives = 199/304 (65%), Gaps = 19/304 (6%)
Query: 17 CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNR 74
CFLAIP++VS L+LF ++ RY KSY + +E++ RFK F KSL IE++N R
Sbjct: 16 CFLAIPIRVSPDTSNGDLKLFQNYVMRYNKSYRNNPTEYEERFKRFRKSLRHIEKMNGLR 75
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSHHKHHD--HHHNHVKKRSI 128
S ESA YG+TEFSD+SE+EF + L ++ KH S+H+ H N VKK
Sbjct: 76 PSQESAYYGLTEFSDMSEDEFLSLTLLPDLSARGEKHANESYHRRHHLLQSTNRVKK--- 132
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
IP++ DWR+ G+I VR+Q +CGACWAFST+E ESM+A+KNGTL +LSVQ
Sbjct: 133 ------SVSIPLRFDWRDKGVITPVRSQGSCGACWAFSTIEVVESMYAIKNGTLYMLSVQ 186
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN--GVK 246
E+IDCA N N GC GGD +LL W+ +KV + ES YPL+ K + CK N GVK
Sbjct: 187 EMIDCAKNKNFGCEGGDIYSLLSWLLASKVQIFQESTYPLVGKTSMCKLGKMIDNAFGVK 246
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
I+ + CD + +E +L +ATHGPV A VNAL+WQ YLGGVIQY+CD + N NHAVQI
Sbjct: 247 IRDFNCDNFVDAEDELLIKVATHGPVAAVVNALSWQNYLGGVIQYHCDSTYDNRNHAVQI 306
Query: 307 VGYD 310
+GYD
Sbjct: 307 IGYD 310
>gi|307206026|gb|EFN84119.1| Cathepsin O [Harpegnathos saltator]
Length = 353
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 143/300 (47%), Positives = 197/300 (65%), Gaps = 9/300 (3%)
Query: 15 ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNK 72
+LCF +P++V + ++LF + RY KSY E++ RF F++SL IE +N
Sbjct: 14 SLCFFMVPIRVGPDKNAEDIKLFVDYVARYNKSYRHDPPEYNERFDRFQRSLRHIERMNG 73
Query: 73 NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
R S ESA YG+TEFSDLSE+EF R L ++ M HK ++H H K + +
Sbjct: 74 FRSSQESAYYGLTEFSDLSEDEFVQRTLLPDLSSRGQM--HKAASYYHRHTKNTNNRS-- 129
Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
T +P K DWR+ G++G +++Q+ CGACWAFST+ AESM+A+KNGTL SVQE+ID
Sbjct: 130 ERETNVPPKIDWRDKGVVGPIQSQEICGACWAFSTIGVAESMYAMKNGTLYPFSVQEMID 189
Query: 193 CAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKATSPNGVKIKSY 250
C G+ GC GGD C+LL W+ +K + PES YPL +D CK + + +GV I +
Sbjct: 190 CM-PGDFGCQGGDICSLLSWLLTSKTKIIPESAYPLTRRDDQCKLLKLSAKTSGVGITDF 248
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
TCD+ +E +L +A+HGPV AAVNA++WQ YLGGVIQY+CDGS +++NHAVQIVGYD
Sbjct: 249 TCDSFADAEDELLALLASHGPVAAAVNAISWQNYLGGVIQYHCDGSFSSLNHAVQIVGYD 308
>gi|332024588|gb|EGI64786.1| Cathepsin O [Acromyrmex echinatior]
Length = 356
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 145/306 (47%), Positives = 200/306 (65%), Gaps = 14/306 (4%)
Query: 17 CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNR 74
CF +P+KV E+ ELF+++ RY KSY ++++ RF++F+KSL IE+LN R
Sbjct: 16 CFFIVPIKVDFDKTEKDAELFANYIARYNKSYRNDPAKYEERFEHFQKSLRHIEKLNSLR 75
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDHHHNHVKKRSITT 130
S ESA YG+TEFSDLS++EF + L + KH S++ H + KR I
Sbjct: 76 SSQESAYYGLTEFSDLSDDEFIQQALIPDLPLRGQKHTTASYYHQHFMGSVNRMKRMIPI 135
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
GIP K DWR+ G++G V +Q+ CGACWAFSTV AESM+A++NGTL SVQE+
Sbjct: 136 -----IGIPSKFDWRDKGVVGPVMSQENCGACWAFSTVGVAESMYAIENGTLHSFSVQEM 190
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKATSPNGVKIK 248
IDC GN GC GGD C+LL W+ +K + E +YPL L+ C+ + + +GV+I
Sbjct: 191 IDCM-PGNFGCQGGDICSLLSWLLASKTRIISEIDYPLTLQTDTCRLHKISAKTSGVRIT 249
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+TCD+ + +E+ +LT + THGPV AVNA++WQ YLGG+IQYNCD S ++NHAVQIVG
Sbjct: 250 DFTCDSFVDAETELLTLLVTHGPVAVAVNAISWQNYLGGIIQYNCDSSFNSLNHAVQIVG 309
Query: 309 YDNYSR 314
YD +R
Sbjct: 310 YDTEAR 315
>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
Length = 358
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 151/308 (49%), Positives = 206/308 (66%), Gaps = 18/308 (5%)
Query: 17 CFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKN 73
CF IP+KV KPN + +LF ++ +Y KSY +E+ RF+ F+KSL IE++N
Sbjct: 16 CFFIIPIKV-KPNKNVEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSF 74
Query: 74 RQSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSH-HKHHDHHHNHVKKRSI 128
+ S ESA YG+T+FSDLSE+EF + L ++ KH S+ H++ + NH KR+I
Sbjct: 75 QSSQESAYYGLTKFSDLSEDEFLQQTLLPDLSLRNQKHTTASYYHQYFTNSSNH-GKRAI 133
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
IP IP K DWR G++G V+ Q CGACWAFST+ ESM+A+KNGTL SVQ
Sbjct: 134 -----IPPPIPSKVDWRNRGVVGPVQYQDNCGACWAFSTIGVVESMYAIKNGTLYPFSVQ 188
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP--NGVK 246
E+IDC G+ GC GGD CALL W+ +K + E+ YPL L++ CK TS GVK
Sbjct: 189 EMIDCMP-GSYGCQGGDTCALLSWLLESKTKIISENVYPLTLRNDPCKLSKTSAKTTGVK 247
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
I +TC++ + +ES++LT + THGPV+A VNA++WQ YLGG+IQY+CDGS +++NHAVQI
Sbjct: 248 ITDFTCNSFVNAESNLLTLLGTHGPVVAGVNAISWQNYLGGIIQYHCDGSFSHLNHAVQI 307
Query: 307 VGYDNYSR 314
VGYD +R
Sbjct: 308 VGYDMAAR 315
>gi|156553312|ref|XP_001599758.1| PREDICTED: cathepsin O-like [Nasonia vitripennis]
Length = 345
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/286 (48%), Positives = 186/286 (65%), Gaps = 14/286 (4%)
Query: 37 FSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
F ++ Q YKK Y E++ RF F++SL IE LN+ R S +SARYG+T++SD++E+E
Sbjct: 26 FEAYVQDYKKPYKNDPDEYERRFGRFQQSLRKIESLNRLRSSADSARYGLTDYSDMTEQE 85
Query: 95 FKTRHLRHSVNKHV--LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
F +LR ++ H HH+H N +R++T +P K DWR G +
Sbjct: 86 FLALNLRPDLSNRSEKHHQCHYHHNHSDNKRYERAVTV-------LPDKFDWRTKGAVTA 138
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q +CGACWAFS VETAESM A+ N TL SVQE+IDCAGN N GC GGD C+LLDW
Sbjct: 139 VKSQGSCGACWAFSAVETAESMFAISNKTLRAFSVQEMIDCAGNSNFGCEGGDICSLLDW 198
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSP---NGVKIKSYTCDTLIPSESSILTDIATH 269
+ V+K + PE YPL ACK + T+ G++I +TCD + +E +L +AT
Sbjct: 199 LLVSKTEILPEINYPLTRTTDACKLQKTATKIQEGIRISDFTCDNYVGAEDKLLKVLATK 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
GPV AAVNAL+WQ YLGGVIQ++CDGS ++NHAVQIVGYD + T
Sbjct: 259 GPVAAAVNALSWQNYLGGVIQFHCDGSFKSLNHAVQIVGYDKTATT 304
>gi|401758202|gb|AFQ01136.1| cathepsin O2-like protease [Chilo suppressalis]
Length = 368
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 191/324 (58%), Gaps = 15/324 (4%)
Query: 1 MFDVKNVLFIVALIALC--FLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKS--EHDI 55
M+ + +++ ++ C F+ +P+ S +++L+ +F + ++Y KSY + E++
Sbjct: 1 MYKINWWTWVLGIVLFCLLFIVVPISYSASTSKEQLKPIFDQYIEKYNKSYKNNPEEYET 60
Query: 56 RFKNFEKSLDIIEELNKNRQSPES--ARYGITEFSDLSEEEFKTRHL------RHSVNKH 107
RF++F S+ I+ LN + PE ARYG T+ SD+S E+K HL +
Sbjct: 61 RFQHFLVSMSEIDRLNSESRGPEQYRARYGPTKLSDMSPTEYKDLHLSDEKLTKSPATYD 120
Query: 108 VLMSHHKHHDHHH-NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFS 166
H D++H V +R +P+ DWR G +G VRNQ CGACWAFS
Sbjct: 121 RSWRKHNQRDYYHVQDVNERKENLIRKKRASLPMLVDWRVKGAVGAVRNQGLCGACWAFS 180
Query: 167 TVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEY 226
TV T ESM A+ G L LSVQEVIDCA GN GCSGGD C LLDW+ + +E E +Y
Sbjct: 181 TVGTMESMAAINTGKLPALSVQEVIDCARLGNQGCSGGDICLLLDWLMITNTPVEVEKDY 240
Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLG 286
PL L + CK K + GV++ S+TCD + +E I+ +A HGPV AVNALTWQ YLG
Sbjct: 241 PLQLTNGVCKAKKNT-TGVRVTSFTCDDFVGTEQKIIEALALHGPVAVAVNALTWQNYLG 299
Query: 287 GVIQYNCDGSLANINHAVQIVGYD 310
GVIQY+C G ++NHAVQ+VGYD
Sbjct: 300 GVIQYHCSGDAMDLNHAVQLVGYD 323
>gi|357609157|gb|EHJ66323.1| putative Cathepsin O precursor [Danaus plexippus]
Length = 382
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 195/342 (57%), Gaps = 47/342 (13%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKS 63
N + +VAL+ L F+AIP+ E +F + + + K+Y +E++ R ++F S
Sbjct: 6 NWILVVALVCLLFVAIPLSYPDRTKESLRPMFDEYIENFNKTYKDDPAEYEKRLEHFVAS 65
Query: 64 LDIIEELNKNRQSPES--ARYGITEFSDLSEEEFKTRHLR----HSVNKHVL-------- 109
+ I+ LN + PE ARYG+T+ SD+S++EF+ HL H +H L
Sbjct: 66 VKEIDRLNSAARGPEQHRARYGLTQMSDMSKDEFRDVHLSDEQPHRYRRHKLGKSWSKGR 125
Query: 110 -----------------MSHHKHHDHHHNHV----KKRSITTGITIPTGIPVKKDWREAG 148
K HHN KKR++ +P++ DWR G
Sbjct: 126 VKDIEDVADNMDDYDDEDDDDKEGSPHHNIYIVIRKKRAM---------LPLQVDWRTKG 176
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+IG VR+Q CGACWAFST+ T E+M A+ G L+ LSVQEVIDCAG GN GC+GGD C
Sbjct: 177 VIGPVRDQGLCGACWAFSTIGTMEAMAAIDTGKLNTLSVQEVIDCAGLGNSGCAGGDICL 236
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
LLDW+ + ++ E EYPL L + C+ K + GVK+ +TC L+ +E I+ IAT
Sbjct: 237 LLDWLLMTDTAVQVEKEYPLKLTNGVCQAKKNA-TGVKVAKFTCTDLVGAEDKIIESIAT 295
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
HGPV AVNALTWQ YLGGVIQY+C GS +NHAV++VGYD
Sbjct: 296 HGPVAVAVNALTWQNYLGGVIQYHCSGSPKELNHAVELVGYD 337
>gi|321475753|gb|EFX86715.1| hypothetical protein DAPPUDRAFT_187469 [Daphnia pulex]
Length = 360
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 192/317 (60%), Gaps = 20/317 (6%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPN-LEQKLELFSSFQQRYKKSYSKS--EHDIRFKNFE 61
KNV+ + L +LCFL IP+++ +P+ ++Q+ F F +++ KSY + E+ R F+
Sbjct: 6 KNVICALGLFSLCFLGIPIRIDQPDSMKQE---FKQFIEKHNKSYGRDPVEYGRRLSYFK 62
Query: 62 KSLDIIEELN--KNRQSPESARYGITEFSDLSEEEFKTRHLRH--SVNKHVLMSHHKHHD 117
S +E N K+ Q A +GIT+FSDL EF+ LRH S V+ S+ H +
Sbjct: 63 ASHSRAKEYNMLKHNQDNGHASFGITKFSDLDANEFQEMLLRHKPSSLSCVIGSNLNHVN 122
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ +KR I +P DWRE ++ V+NQ +CGACWAFSTV+T ESMHA+
Sbjct: 123 RNR---RKREIPNAQKNFKQLPSYVDWREKNVVTAVKNQHSCGACWAFSTVQTVESMHAI 179
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK- 236
G L+ LS Q+VIDCA NGN GC GGD C L WM + V L E +YPL LKD CK
Sbjct: 180 ATGELNELSTQQVIDCARNGNKGCIGGDTCTALTWMSASNVSLLEEKQYPLTLKDQRCKT 239
Query: 237 --RKATSPNGVKIKS-YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
+TS GV++ S +TC L+ +E + +A HGPV AAV+A+TWQ YLGG+IQY+C
Sbjct: 240 VFEGSTSSGGVRLASNFTCYNLVDNEEQLKHILAFHGPVTAAVDAVTWQDYLGGIIQYHC 299
Query: 294 DGSLANINHAVQIVGYD 310
+ NHAVQIVGYD
Sbjct: 300 RD---HTNHAVQIVGYD 313
>gi|189236657|ref|XP_970512.2| PREDICTED: similar to cathepsin o [Tribolium castaneum]
Length = 329
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 177/301 (58%), Gaps = 28/301 (9%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
+ IAL F IP+++ P +Q F + +R+ K+Y S + R F++SL IE
Sbjct: 11 IFYIALLFFVIPIRIKGP--DQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIET 68
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
LN +++ SA YG+T+FSDL EEF +L+ ++++ + K H H KR+
Sbjct: 69 LNSKKRNG-SALYGLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHH------KRAT- 120
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
+P K DWRE + ++ NQ +CGACWA+S +ET ESM+A+K LSVQE
Sbjct: 121 --------VPNKVDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQE 172
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+IDCAGN N GC+GGD C LL W+ ++ ++Y C R + GV ++
Sbjct: 173 IIDCAGN-NKGCNGGDICTLLSWIKATNFTIQRHADY-----GGKCGRGSA---GVHVRD 223
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ C+ L+ SE +L +A +GP+ A+NA TWQ Y+GGVI+Y+CDG + +NHAVQIVGY
Sbjct: 224 FMCEGLVGSEDVMLRLLADNGPLAVAINAQTWQNYIGGVIEYHCDGDPSKLNHAVQIVGY 283
Query: 310 D 310
D
Sbjct: 284 D 284
>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
Length = 346
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 188/312 (60%), Gaps = 21/312 (6%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKN 59
F K + + IAL F +P K+ L+ ++ E F + + KSY ++E RF
Sbjct: 8 FTYKTYIELGFYIALLFFVVPCKIK---LDSEIREQFHEYLSDFNKSYPQEAEFQFRFAA 64
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF-KTRHLRHSVNKHVLMSHHKHHDH 118
F+KSL IE+LN N+ + SA+YG+T+FSD + EEF ++ R V + +
Sbjct: 65 FKKSLANIEQLNANK-TKSSAQYGLTKFSDFTAEEFLDLQNNRAGVRRDL-------RGA 116
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ +KK ++ + +P DWR ++ KV+NQ+ CGACWAF+ ET ESM A+K
Sbjct: 117 AQSRLKKVALRSAY--EKELPQIVDWRNKNVVSKVKNQKNCGACWAFAVSETIESMQAIK 174
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
L+ LS+Q++IDC+ N GC GGD CALL W+ VN + + E++YPL+L+D C++
Sbjct: 175 TQQLTDLSIQQLIDCSSYNN-GCKGGDTCALLRWIKVNNIAIMNETDYPLVLEDQKCQKT 233
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
S GVK+ +Y C++ + E IL +A +GPV A++ TWQ Y+GGVIQ++C+G L+
Sbjct: 234 DMS-EGVKVGTYQCNSFVGREDIILKLLAINGPVAVAISGETWQNYVGGVIQFHCEGDLS 292
Query: 299 NINHAVQIVGYD 310
HAVQIVGY+
Sbjct: 293 ---HAVQIVGYN 301
>gi|241111179|ref|XP_002399230.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
gi|215492918|gb|EEC02559.1| cysteine protease and A protease inhibitor, putative [Ixodes
scapularis]
Length = 363
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 169/289 (58%), Gaps = 15/289 (5%)
Query: 24 KVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
+ + P++E F + +RY K+Y+ +E+ R F +L IE+ N++ A
Sbjct: 37 RTADPSVEAA---FEQYVKRYNKTYASGSAEYSKRLNAFRDALIRIEDRNRHGNHSNGAL 93
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
YG+T +SDL+ +EF R + + + + H + G T P P K
Sbjct: 94 YGLTPYSDLTPDEF-----RALLATFAPAENTRTEANEVEHDDLQLALPGATSPR-YPPK 147
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
DWR G++ VRNQ+ CGACWAFSTVET E+MHAL GTL+ SVQ++IDC+ N N GC
Sbjct: 148 FDWRTRGVVTAVRNQRDCGACWAFSTVETVETMHALAAGTLTGFSVQQMIDCSNNSNHGC 207
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
+GGD CA L W+ VN++ L +S YP +C+ A+ ++ YTCD L+ +E
Sbjct: 208 NGGDTCAALKWLKVNRIKLVRDSVYPFKAVTGSCQHPASDVT-AEVSDYTCDRLVGNEER 266
Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
++ +A GP++ AV+A TWQ YLGGVIQ++CD A NHAVQIVGYD
Sbjct: 267 MIDMLANVGPLVVAVDATTWQDYLGGVIQFHCD---AGRNHAVQIVGYD 312
>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
Length = 326
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/301 (39%), Positives = 175/301 (58%), Gaps = 31/301 (10%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
+ IAL F IP+++ P +Q F + +R+ K+Y S + R F++SL IE
Sbjct: 11 IFYIALLFFVIPIRIKGP--DQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIET 68
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
LN +++ SA YG+T+FSDL EEF +L+ ++++ + K H H KR+
Sbjct: 69 LNSKKRNG-SALYGLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHH------KRAT- 120
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
+P K DWRE + ++ NQ +CGACWA+S +ET ESM+A+K LSVQE
Sbjct: 121 --------VPNKVDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQE 172
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+IDCAGN N GC+GGD C LL W+ ++ ++Y C R + GV ++
Sbjct: 173 IIDCAGN-NKGCNGGDICTLLSWIKATNFTIQRHADY-----GGKCGRGSA---GVHVRD 223
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ L+ SE +L +A +GP+ A+NA TWQ Y+GGVI+Y+CDG + +NHAVQIVGY
Sbjct: 224 F---ILVGSEDVMLRLLADNGPLAVAINAQTWQNYIGGVIEYHCDGDPSKLNHAVQIVGY 280
Query: 310 D 310
D
Sbjct: 281 D 281
>gi|157134825|ref|XP_001656461.1| cathepsin o [Aedes aegypti]
gi|108884338|gb|EAT48563.1| AAEL000420-PA [Aedes aegypti]
Length = 375
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 191/340 (56%), Gaps = 40/340 (11%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS--EHDIRFK 58
M +V ++ I+ ++ LCFL IP + ++ + + F +F + Y K Y + E+D RF+
Sbjct: 1 MSEVIEMIMILIIVTLCFLMIPFNLQPNSVIEARKKFDTFIKLYDKPYRYNVREYDHRFQ 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---RHSV------NKHV- 108
F SL+ I LN +R ++A YGIT+++DL+++EF HL +H N+ V
Sbjct: 61 IFRVSLNKIASLNAHRVENDTAIYGITQYADLTDQEFLRLHLADLKHETTPGTANNRGVS 120
Query: 109 ----LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
+ K + + + R+ + I +P DWR+ G++ VR+Q +CGACWA
Sbjct: 121 VLDKFIIESKSAEMKDDIIFSRA-KRDLKILDYLPKVVDWRDKGVVAPVRSQGSCGACWA 179
Query: 165 FSTVETAESMHALK-NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
S V+T S+ A+K S L + +VI+CAGNGN GC GGD C LL+W+ KV L
Sbjct: 180 ISVVDTITSISAIKRQQNFSELCLDQVINCAGNGNFGCEGGDTCRLLEWLKEEKVKLNTL 239
Query: 224 SEYPLLLKDAACKRKATSPNG-------------VKIKSYTCDTLIPSESSILTDIATHG 270
+ C+ TS NG + + ++C +L+ E +L +ATHG
Sbjct: 240 KQ---------CEALDTSKNGPNCTFQQASNGEYLSLNQFSCVSLVDREHLMLRYLATHG 290
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P++AAVNA +W+YYLGGVIQY+C+ + ++NHAV+IVGY+
Sbjct: 291 PIVAAVNAASWKYYLGGVIQYHCEEAYEDLNHAVEIVGYN 330
>gi|328711164|ref|XP_003244460.1| PREDICTED: cathepsin O-like [Acyrthosiphon pisum]
Length = 339
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 174/307 (56%), Gaps = 17/307 (5%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEK 62
V NVL +++ L + +S L + + F+ F + Y KSY +++EH+ RF++F+K
Sbjct: 3 VSNVLKASLVVSSVVLILFFIMSITQLNRDQDKFNKFIKMYNKSYMNETEHNKRFEHFKK 62
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
SL I+ L+ ++ YGITEFSDLS EEF +L V + +
Sbjct: 63 SLKTIQLLS--QKCNGCTNYGITEFSDLSTEEFTKIYLNS-----VTLRTPRTGTFSMAR 115
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
KRSITT DWR+ G++ VRNQ+ CGACWA S VE ES++A+K G L
Sbjct: 116 -SKRSITTATLSSI------DWRDKGVVTSVRNQKNCGACWAISVVELIESVYAIKTGLL 168
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
SVQE++DC+G N GC+GG LL W+ N + + E YP + KD C T
Sbjct: 169 QTFSVQEMLDCSGGINQGCTGGSVVYLLLWLVENNITVYKEENYPTIYKDQMCTLDKTFD 228
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
GVK+KS+ L+ E +L+ I+ PV A+NAL WQ+Y+GGV+ CD S+A++NH
Sbjct: 229 KGVKVKSFLTLNLVDREDLLLSYIS-KSPVSVALNALPWQFYVGGVLS-QCDNSMASLNH 286
Query: 303 AVQIVGY 309
A +IVGY
Sbjct: 287 AAEIVGY 293
>gi|347968429|ref|XP_312205.5| AGAP002720-PA [Anopheles gambiae str. PEST]
gi|333468007|gb|EAA08145.5| AGAP002720-PA [Anopheles gambiae str. PEST]
Length = 383
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 117/340 (34%), Positives = 180/340 (52%), Gaps = 32/340 (9%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFK 58
M +V +L I+ ++ LCFL IP + + F F + Y K Y E+ RF+
Sbjct: 1 MTEVIEMLMIILIVTLCFLMIPFNTKPSAVIESRRKFDVFVRLYDKPYRGDAREYAYRFQ 60
Query: 59 NFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEEFKTRHLR--------------HS 103
F SL I LN+ R++ ++A YGIT+++DL++ EF R L +
Sbjct: 61 IFRTSLSKIRALNEWAREANDTAIYGITQYADLTDREFVARQLADLLPDEPGGGAGGPRA 120
Query: 104 VNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACW 163
K+V+ S + + ++ + R+ + +P + DWRE G+I V+NQ CGACW
Sbjct: 121 YQKYVIES--RSAEMKNDIIFSRARRDALPAVRNLPHRVDWREQGVISTVKNQGGCGACW 178
Query: 164 AFSTVETAESMHALKNGTLSLLSV--QEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVL 220
A S V+T ++ A+K L+ + + V+ CA NGN GC GGD C LL+W+ + + +
Sbjct: 179 AISVVDTIAALAAIKRNDRKLIDLCHERVVRCAANGNNGCDGGDTCRLLEWLAEESYRIG 238
Query: 221 EPESEYPLLLKD----------AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
ES L D + +R+ + N +K ++C E +L +AT G
Sbjct: 239 AAESCLERNLADQEGGLNCTGESGVRREDGALNATLVKRFSCQGYENEEHLMLRHLATKG 298
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P++AAVNA++W+YYLGGVIQY+CD +NHAV IVGYD
Sbjct: 299 PIVAAVNAISWKYYLGGVIQYHCDSDYELLNHAVAIVGYD 338
>gi|410914437|ref|XP_003970694.1| PREDICTED: cathepsin O-like [Takifugu rubripes]
Length = 328
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 104/276 (37%), Positives = 152/276 (55%), Gaps = 25/276 (9%)
Query: 37 FSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
F F++R+ ++Y + + D R F++S LN + +SA+YGI +FSDLS+ E
Sbjct: 32 FEWFRERFGRNYEVNSPQFDRRLFFFQESTTRHAYLNSFSAASQSAKYGINQFSDLSQRE 91
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ +LR S ++ S K G+P K DWR+ I+ V+
Sbjct: 92 FQDLYLRASADRAPAFSGQK--------------------AEGLPAKFDWRDHAIVAPVQ 131
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQQ CG+CWAFS V +S+HA+ L LSVQ+V+DC+ N GC+GG A L W+
Sbjct: 132 NQQACGSCWAFSVVGAVQSVHAIGGSQLVELSVQQVLDCSFQ-NKGCNGGTPVAALKWLT 190
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
+V L P+SEYP + C + S GV +K++T E +++ + HGP+
Sbjct: 191 QTRVKLVPQSEYPYKAQTRMCHFFSGSHGGVGVKNFTALDFSGQEEAMMGHLVKHGPLSV 250
Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
V+AL+WQ YLGG+IQY+C S NHAV +VGYD
Sbjct: 251 VVDALSWQDYLGGIIQYHC--SSKRSNHAVLVVGYD 284
>gi|342305190|dbj|BAK55649.1| cathepsin O [Oplegnathus fasciatus]
Length = 338
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 100/276 (36%), Positives = 153/276 (55%), Gaps = 25/276 (9%)
Query: 37 FSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
F SF++ + + Y + E + R NF+ + LN +P+SA+YGI FSDLS++E
Sbjct: 42 FDSFREHFHRMYEVNGEEFNRRHLNFQNATKRHAYLNSLSTAPQSAKYGINRFSDLSQKE 101
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ +LR S ++ L S K G+P K DWR+ ++ V+
Sbjct: 102 FRGLYLRASADRAPLFSGLK--------------------TEGLPAKFDWRDKAVVAPVQ 141
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQQ CG+CWAFS V +S+HA+ L+ LSVQ+V+DC+ N GC+GG L W+
Sbjct: 142 NQQACGSCWAFSVVGAMQSVHAIGGSPLAQLSVQQVLDCSFQ-NHGCNGGSPFRALTWLK 200
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
+V L P+SEY + C + S GV +K++T E +++ + HGP+ A
Sbjct: 201 QTRVKLVPQSEYSYKAETGICHFFSQSHAGVAVKNFTAHDFSGQEEAMMGQLVEHGPLAA 260
Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
V+A++WQ YLGG+IQ++C + NHAV +VGY+
Sbjct: 261 IVDAVSWQDYLGGIIQHHCSSQWS--NHAVLVVGYN 294
>gi|401758200|gb|AFQ01135.1| cathepsin O1-like protease [Chilo suppressalis]
Length = 371
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/292 (38%), Positives = 158/292 (54%), Gaps = 22/292 (7%)
Query: 34 LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
L F + +++ KSY+ E R +N+EKS+ I LN +S +G+T+FSD +++
Sbjct: 42 LNSFIGYMKKFNKSYTDYEFMRRMRNYEKSVQEIRRLNTIH---DSKVFGLTKFSDWADD 98
Query: 94 EFKTRHLRHSVNK-------HVLMSHHKHHDH----HHNHVKKRSITTGITIPTG-IPVK 141
EF L + L K+ + + K R+I I+ G IPVK
Sbjct: 99 EFSAFMLSGRSERACKEQSMKCLPKRKKYQNFSPSIRYMMFKNRTIDVKISPTYGNIPVK 158
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
DWR+ G++ V NQ+ C ACWAFS V ESM A+ L+ LS+QE+IDC+ N GC
Sbjct: 159 IDWRDFGVVSPVLNQKLCSACWAFSIVGVMESMVAIYKKGLTRLSIQELIDCSKYNN-GC 217
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVKIKSYT--CDTLIPS 258
GD L ++ N + E EY L L+D +C+ PNG +I Y C+
Sbjct: 218 HMGDIRLALQFLCQNDYPIVTEKEYSLTLRDESCRIPDDQKPNGERIAEYANLCNV---D 274
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
E +L IA HGPV+A+VNA W+YY+GGVI+ C G+ +NHAVQIVGYD
Sbjct: 275 EKKLLKLIAMHGPVVASVNAAPWRYYIGGVIKSACPGTWHLVNHAVQIVGYD 326
>gi|348511930|ref|XP_003443496.1| PREDICTED: cathepsin O-like [Oreochromis niloticus]
Length = 338
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 107/312 (34%), Positives = 165/312 (52%), Gaps = 32/312 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPN-------LEQKLELFSSFQQRYKKSY--SKSEHDIRFK 58
+FI A++AL L PV N L F +F++++ ++Y S E R
Sbjct: 6 VFIPAVVALGLLVSPVCCQNVNSSEIRTQLNGSAADFGAFRKQFHRTYEVSSEEFSRRHL 65
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
+F+++ LN +SA+YGI FSDLS+EEF+ +L + L S
Sbjct: 66 SFQRATIRHTYLNSFSTETQSAKYGINRFSDLSQEEFRDLYLGAVYERAPLFS------- 118
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
G+++ +P K DWR+ + V++QQ CG+CWAFS V +S+HA+
Sbjct: 119 ------------GLSVKE-LPDKFDWRDKAAVAAVQDQQACGSCWAFSVVGAIQSVHAIG 165
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
L LSVQ+V+DC+ N GC+GG L+W+ +V L +SEYP K C
Sbjct: 166 GSQLEQLSVQQVVDCSYQ-NAGCNGGSTTRALNWLKQTRVKLVTQSEYPYKAKTEICHFF 224
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
+ S GV IK++T E +++ + +GP++A V+A++WQ YLGG+IQ++C +
Sbjct: 225 SQSHGGVAIKNFTTHDFSGQEKAMMGQLVQYGPLVAIVDAVSWQDYLGGIIQHHCSSQWS 284
Query: 299 NINHAVQIVGYD 310
NHA+ IVGYD
Sbjct: 285 --NHAILIVGYD 294
>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
Length = 333
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 168/313 (53%), Gaps = 33/313 (10%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
+F VK LFI+ +L + V P L F SF Y ++Y+ K EH+ RF+
Sbjct: 5 VFIVKATLFILISTSL---VLSESVHSPT--DLLARFKSFITDYNRNYTTKEEHEFRFQT 59
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+K+ I N N A YG+ +F+D ++EEFK V +++ HH
Sbjct: 60 FKKNFRRIASTNAN-----GATYGVNKFADWTDEEFKELLGNRQVPTQEIVNSELHH--- 111
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWRE--AGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
S++T P DWRE I+G VRNQ CG CWAFSTVET S AL
Sbjct: 112 -------SLSTA-----KFPSSLDWREHKRNIVGPVRNQGRCGCCWAFSTVETIASAWAL 159
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
+ + LSVQ+++ C N + GC GG F +W+ N+V LE ES P L K C +
Sbjct: 160 AGNSFTELSVQQLLSC-DNMDGGCRGGSFYLACNWLTKNRVPLETESANPYLGKRDKCVK 218
Query: 238 KATSPNGVKIKSYTCDTLIPSESS-ILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
AT+ G+ +K +T I ESS ++ + +GP+ AV+A +W+ Y+GG+IQ++CDG
Sbjct: 219 HATN-TGIILKKFTTSNFIYQESSSMIAALNQNGPLSIAVDATSWRDYVGGIIQHHCDGK 277
Query: 297 LANINHAVQIVGY 309
+ +NHAVQ+VGY
Sbjct: 278 V--LNHAVQVVGY 288
>gi|312371319|gb|EFR19540.1| hypothetical protein AND_22253 [Anopheles darlingi]
Length = 403
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 122/358 (34%), Positives = 179/358 (50%), Gaps = 48/358 (13%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS--EHDIRFK 58
M +V +L I+ ++ LCFL IP + + + F F + Y K Y E+D RF+
Sbjct: 1 MTEVIEMLMIILIVTLCFLMIPFNTKPSPVLEARKKFDIFVRLYDKPYRYDVREYDYRFQ 60
Query: 59 NFEKSLDIIEELNKNRQSP----ESARYGITEFSDLSEEEFKTRHLRHSVNKH---VLMS 111
F SL+ I +LN R + + A YG+T+++DL++ EF +HL + V
Sbjct: 61 IFRTSLNRIRQLNDRRSATGNETDGAIYGVTQYADLTDREFIAQHLADLLAAEEMAVPRL 120
Query: 112 HHKHHDHHHNHVKKRSITTGIT--------------------IPTGIPVKKDWREAGIIG 151
H K+ + K I +PT +P DWR GII
Sbjct: 121 HQKYAIESRSAEMKNDIIFSRARRDLPLKEQQQQQQQQQQQHLPTNLPPTVDWRAKGIIT 180
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV-------QEVIDCAGNGNMGCSGG 204
V++Q +CGACWA S V+T ++ A+K L+ ++V+ CAGNGN GCSGG
Sbjct: 181 PVKSQGSCGACWAISVVDTIAALAAIKRNEQQPLTTPVTDLCHEQVVHCAGNGNNGCSGG 240
Query: 205 DFCALLDWMDVNKVVLEPESEYP---LLLKDAACKRKATSPNGV---------KIKSYTC 252
D C LL+W+ + +E P L D C + G ++K ++C
Sbjct: 241 DTCLLLEWLKQESFPIGAAAECPYRRLADTDQNCTLPGSVVAGAWQPGQHRETRVKRFSC 300
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
D E +L +AT GP++AAVNA++W+YYLGGVIQY+CD +NHAVQIVGY+
Sbjct: 301 DRFENREHLMLQHLATKGPLVAAVNAVSWKYYLGGVIQYHCDSGPQLLNHAVQIVGYE 358
>gi|443732032|gb|ELU16924.1| hypothetical protein CAPTEDRAFT_222012 [Capitella teleta]
Length = 342
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/315 (36%), Positives = 169/315 (53%), Gaps = 31/315 (9%)
Query: 4 VKNVLF--IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKN 59
V+ +LF I LI+ C ++VS ++ +LF F ++Y K+Y E+ R
Sbjct: 5 VQQILFFLICVLISHC-----LRVSNEEID---DLFVKFTEKYHKTYLIGSLEYMHRRGI 56
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F + LN R + SA YG+T+FSDL++EEF R L + + +
Sbjct: 57 FRDNFKKHVALNSLRTNNASAWYGVTQFSDLTQEEFTNRFLSNFTTSPTVPA-------- 108
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK- 178
+++G I + P K DWR+ +I ++NQ +CG CWA++ ESMHALK
Sbjct: 109 ----LPTLLSSGQLIDS-FPRKWDWRDKKVITSMKNQDSCGGCWAYAATAVLESMHALKV 163
Query: 179 NGTLSLLSVQEVIDCA---GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
G L LS Q++IDC+ GC GG+ CA L WM N V L E YP + KD C
Sbjct: 164 PGDLKSLSTQQMIDCSYGFAYALYGCKGGNPCAALHWMKQNNVGLISEKLYPTVNKDQKC 223
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K + P+ V + +Y+C + SE S+L I++ GPV +V+A W Y GG+IQ++C G
Sbjct: 224 YIKKSKPDEVHVAAYSCQNFVGSEESLLRYISSVGPVAVSVDARMWINYQGGIIQHHC-G 282
Query: 296 SLANINHAVQIVGYD 310
+++ NHAV IVGYD
Sbjct: 283 EVSS-NHAVTIVGYD 296
>gi|390344145|ref|XP_798313.2| PREDICTED: cathepsin O-like [Strongylocentrotus purpuratus]
Length = 361
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 103/283 (36%), Positives = 156/283 (55%), Gaps = 26/283 (9%)
Query: 36 LFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
F F Q++ K+Y++ E+ R++ F++SL E LN + A YGIT+FSDL+ E
Sbjct: 53 FFQIFIQKFNKTYTRGSQEYFKRYRIFKESLLKHEMLNAIATHRDHATYGITKFSDLTSE 112
Query: 94 EFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWR--EAGI 149
EF+ ++L S+ + RS+ + P +P+ D R + +
Sbjct: 113 EFQFQYLGTASIPDQSV----------------RSVPGPVRRPLKTMPLVYDLRSIKPPV 156
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
+ V+NQ++CGACWAFS VET E+ ALK L+ LS QE++DC G+ GC GG C
Sbjct: 157 VTPVKNQKSCGACWAFSVVETMETQIALKTKRLTQLSAQELVDCGTAAGDGGCRGGIPCK 216
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA- 267
LDW++ K L PES YP + K C+ S + +++C E ++ +
Sbjct: 217 TLDWLNRTKTSLVPESTYPYIAKKGDCRINKNSTLNAVVTNFSCGNYAADEEHVMPAMLY 276
Query: 268 THGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ +V+A +WQYYLGG+IQY+C + +NHAVQIVG+D
Sbjct: 277 NQGPLSISVDAESWQYYLGGIIQYHCTPTY--LNHAVQIVGFD 317
>gi|291232495|ref|XP_002736191.1| PREDICTED: cysteine protease and A protease inhibitor,
putative-like [Saccoglossus kowalevskii]
Length = 367
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 170/304 (55%), Gaps = 22/304 (7%)
Query: 12 ALIALCFL--AIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSL-DI 66
++ LC+L AI V N+++ ++ F F +++K Y +E++ RF+ F++SL I
Sbjct: 17 VVLVLCYLPCAIQYDVQPGNIDEDVQ-FKEFILKHRKPYIAGTTEYEHRFRVFQQSLHRI 75
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVK 124
+ ++ +RQ ++A YGIT+FSDL+ +EF+ +L R S + + +S V+
Sbjct: 76 RKRISLSRQLNDTAVYGITQFSDLTPDEFQQMYLTLRPSKSSQIPVSL----------VQ 125
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
S +P +P K D R+ + V++Q +CG CW+FSTV+ E+ L G ++
Sbjct: 126 FPSAFNSSNVPPDMPKKYDLRDKSAVSAVKDQGSCGGCWSFSTVQGMETKWVLNGGKMTE 185
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LSVQ++IDC + + GC+GGD C + W+ V L YP C+ K + G
Sbjct: 186 LSVQQLIDCDTSSS-GCAGGDTCIAMAWLKTKNVGLITSHNYPFTGHTGECRIKNYT-EG 243
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAV 304
V +K +TC I E ++ ++ +G ++ A+NA +WQ YLGG+IQ++C NHAV
Sbjct: 244 VHLKDFTCKEYIGKEDKMVENLYYNGSLVVALNARSWQDYLGGIIQHHCSAGFN--NHAV 301
Query: 305 QIVG 308
QIVG
Sbjct: 302 QIVG 305
>gi|410956684|ref|XP_003984969.1| PREDICTED: cathepsin O [Felis catus]
Length = 390
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 168/323 (52%), Gaps = 41/323 (12%)
Query: 12 ALIALCFLAIPVKVSKP--NLEQKLEL------FSSFQQR---YKKSYSKSEHDIRFKNF 60
A +A P++ S+P LE+ L ++FQ R KK+ K+E +F +F
Sbjct: 51 AALASPAPGTPLRRSRPYSRLEEAPLLAVPWTALTAFQSRSFTVKKNQIKAEETRQF-SF 109
Query: 61 EKSLDIIEELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH 113
+ +E L+++R SA YGI +FS L EEFK +LR ++
Sbjct: 110 RRG--ALESLHRHRYLNSVFPGENSSAVYGINQFSHLFPEEFKAIYLRSKPSR------- 160
Query: 114 KHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
+ + +IP+ +P++ DWR+ ++ +VRNQQTCG CWAFS V E
Sbjct: 161 ---------LPRYRAEVQTSIPSVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVE 211
Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
S +A+K L LSVQ+VIDC+ N N GC+GG L+W++ V L +SEYP ++
Sbjct: 212 SAYAIKGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKTHVKLVRDSEYPFKAQN 270
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
C+ + S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++
Sbjct: 271 GLCRYFSDSHSGFPIKGYSAYDFSDQEDEMAKALVTFGPLVVVVDAVSWQDYLGGIIQHH 330
Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
C S NHAV I G+D T
Sbjct: 331 C--SSGEANHAVLITGFDKIGNT 351
>gi|301777930|ref|XP_002924382.1| PREDICTED: cathepsin O-like [Ailuropoda melanoleuca]
Length = 300
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/256 (37%), Positives = 137/256 (53%), Gaps = 27/256 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SA YGI +FS L EEFK +LR ++
Sbjct: 25 ESLNRHRYLNSVFPHENSSAVYGINQFSYLFPEEFKAIYLRSKSSR-------------- 70
Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ + +IP +P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 71 --LPRYRAEAQTSIPNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKG 128
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LSVQ+VIDC+ N N GCSGG + L W++ +V L +SEYP ++ C +
Sbjct: 129 EPLEALSVQQVIDCSYN-NYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQNGLCHYFS 187
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 188 DSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGE 245
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 246 ANHAVLITGFDKIGST 261
>gi|281354027|gb|EFB29611.1| hypothetical protein PANDA_013700 [Ailuropoda melanoleuca]
Length = 266
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/256 (37%), Positives = 137/256 (53%), Gaps = 27/256 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SA YGI +FS L EEFK +LR ++
Sbjct: 1 ESLNRHRYLNSVFPHENSSAVYGINQFSYLFPEEFKAIYLRSKSSR-------------- 46
Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ + +IP +P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 47 --LPRYRAEAQTSIPNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKG 104
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LSVQ+VIDC+ N N GCSGG + L W++ +V L +SEYP ++ C +
Sbjct: 105 EPLEALSVQQVIDCSYN-NYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQNGLCHYFS 163
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 164 DSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGE 221
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 222 ANHAVLITGFDKIGST 237
>gi|426345827|ref|XP_004040600.1| PREDICTED: cathepsin O [Gorilla gorilla gorilla]
Length = 321
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSHLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 207 FSGSHSGFSIKGYSAHDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 265 GEANHAVLITGFDKTGST 282
>gi|4557501|ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens]
gi|1168795|sp|P43234.1|CATO_HUMAN RecName: Full=Cathepsin O; Flags: Precursor
gi|574804|emb|CAA54562.1| cathepsin O [Homo sapiens]
gi|29351630|gb|AAH49206.1| Cathepsin O [Homo sapiens]
gi|312153238|gb|ADQ33131.1| cathepsin O [synthetic construct]
Length = 321
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 207 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 265 GEANHAVLITGFDKTGST 282
>gi|119625288|gb|EAX04883.1| cathepsin O [Homo sapiens]
Length = 336
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 59 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 115
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 116 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 162
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 163 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 221
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 222 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 279
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 280 GEANHAVLITGFDKTGST 297
>gi|397504019|ref|XP_003822607.1| PREDICTED: cathepsin O [Pan paniscus]
Length = 321
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 207 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 265 GEANHAVLITGFDKTGST 282
>gi|351707349|gb|EHB10268.1| Cathepsin O, partial [Heterocephalus glaber]
Length = 266
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 136/256 (53%), Gaps = 27/256 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R +A YGI +FS L EEFK +LR ++
Sbjct: 1 ESLNRHRYLNSLFPHENSTAFYGINQFSYLFPEEFKAIYLRSKPSR-------------- 46
Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
K + ++P T +P++ DWR ++ +VRNQQ CG CWAFS V ES A++
Sbjct: 47 --FPKYAAKVQASVPNTPLPLRFDWRNKHVVTQVRNQQMCGGCWAFSVVGAVESAWAIRG 104
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L LS Q+VIDC+ N N GC+GG + L W++ +V L +SEYP +D C +
Sbjct: 105 GPLEDLSAQQVIDCSYN-NYGCNGGSPLSALSWLNKTRVKLVRDSEYPFKAQDGPCHYFS 163
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S G+ I+ Y+ E+ + + HGP++ V+A++WQ YLGGVIQ++C A
Sbjct: 164 QSQPGLSIQGYSAYDFSGQEAEMARALLAHGPLVVIVDAVSWQDYLGGVIQHHCSSGRA- 222
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 223 -NHAVLITGFDRTDST 237
>gi|114596533|ref|XP_517502.2| PREDICTED: cathepsin O [Pan troglodytes]
gi|410212082|gb|JAA03260.1| cathepsin O [Pan troglodytes]
gi|410330245|gb|JAA34069.1| cathepsin O [Pan troglodytes]
Length = 318
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 41 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 97
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 98 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 204 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 262 GEANHAVLITGFDKTGST 279
>gi|332217574|ref|XP_003257933.1| PREDICTED: cathepsin O [Nomascus leucogenys]
Length = 318
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/258 (38%), Positives = 138/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 41 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 97
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P+K DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 98 -------------MSIPNVSLPLKFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 204 FLGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 262 GEANHAVLITGFDKTGST 279
>gi|291401083|ref|XP_002716930.1| PREDICTED: cathepsin O [Oryctolagus cuniculus]
Length = 309
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 137/250 (54%), Gaps = 25/250 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R +A YGI +FS L EEFK +LR S +
Sbjct: 34 ESLNRHRYLNSFFSHENSTAFYGINQFSYLFPEEFKAIYLR---------SQPSSSPRYP 84
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
VK T+ +T+P +P++ DWR+ ++ +VRNQQ CG CWAFS V ES A+K
Sbjct: 85 AEVK----TSLLTVP--LPLRFDWRDKHVVSQVRNQQMCGGCWAFSVVGAVESTWAIKGH 138
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N N GCSGG + L W++ +V L +SEYP + C +
Sbjct: 139 PLEDLSVQQVIDCSYN-NYGCSGGSTLSALKWLNKTQVRLVNDSEYPFKARSGLCHYFPS 197
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G+ IK Y+ E + + +GP++ V+A++WQ YLGGVIQ++C S
Sbjct: 198 SHSGLSIKGYSAYDFSDQEDEMAKSLLIYGPLVVIVDAVSWQDYLGGVIQHHC--SSGEA 255
Query: 301 NHAVQIVGYD 310
NHAV I G+D
Sbjct: 256 NHAVLITGFD 265
>gi|355681662|gb|AER96817.1| Cathepsin O precursor [Mustela putorius furo]
Length = 265
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 130/238 (54%), Gaps = 20/238 (8%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TG 137
SA YGI +FS L EEFK +LR ++ + + +IP
Sbjct: 19 SAIYGINQFSYLFPEEFKAIYLRSKSSR----------------LPRYRTEVQTSIPNVS 62
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P + DWR+ ++ +VRNQQTCG CWAFS V ES +A+K L LSVQ+VIDC+ N
Sbjct: 63 LPSRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYN- 121
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
N GC GG + L+W++ +V L +SEYP ++ C + S +G IK Y+
Sbjct: 122 NYGCQGGSTLSALNWLNKTQVRLVRDSEYPFKAQNGLCHYFSDSQSGFSIKGYSAYDFSD 181
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
E + + T GP++ V+A++WQ YLGG+IQ++C S NHAV I G+D T
Sbjct: 182 QEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGEANHAVLITGFDKIGNT 237
>gi|345780796|ref|XP_539782.3| PREDICTED: cathepsin O [Canis lupus familiaris]
Length = 456
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/255 (36%), Positives = 140/255 (54%), Gaps = 25/255 (9%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R + SA YGI +FS LS EEFK +LR ++ ++
Sbjct: 181 ESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSKPSRS-----PRYPAEVR 235
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
++ S+ P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 236 TSIRNVSL----------PLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 285
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L+ +SVQ+VIDC+ N N GCSGG L+W++ +V L +SEYP ++ C +
Sbjct: 286 PLADISVQQVIDCSYN-NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSD 344
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G I+ Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 345 SYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGEA 402
Query: 301 NHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 403 NHAVLITGFDKIGST 417
>gi|395735444|ref|XP_002815290.2| PREDICTED: cathepsin O [Pongo abelii]
Length = 318
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 98/258 (37%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K
Sbjct: 41 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK------------ 88
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ S ++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 89 ----FPRYSAEVRMSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 204 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 262 GEANHAVLITGFDKTGST 279
>gi|402870704|ref|XP_003899346.1| PREDICTED: cathepsin O [Papio anubis]
Length = 321
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)
Query: 68 EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SP +A YGI +FS L EEFK +LR +K S H
Sbjct: 46 ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 100
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
RSI P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 101 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 150
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG L+W++ +V L +SEYP ++ C +
Sbjct: 151 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 209
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 210 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 267
Query: 301 NHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 268 NHAVLITGFDKTGST 282
>gi|355687683|gb|EHH26267.1| hypothetical protein EGK_16186 [Macaca mulatta]
gi|384945482|gb|AFI36346.1| cathepsin O preproprotein [Macaca mulatta]
Length = 321
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)
Query: 68 EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SP +A YGI +FS L EEFK +LR +K S H
Sbjct: 46 ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 100
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
RSI P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 101 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 150
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG L+W++ +V L +SEYP ++ C +
Sbjct: 151 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 209
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 210 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 267
Query: 301 NHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 268 NHAVLITGFDKTGST 282
>gi|395542489|ref|XP_003773162.1| PREDICTED: cathepsin O-like [Sarcophilus harrisii]
Length = 407
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 99/274 (36%), Positives = 145/274 (52%), Gaps = 25/274 (9%)
Query: 49 SKSEHDIRFKNFEKSLDIIEELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLR 101
S+S D ++ ++S E L ++R ++ SA YGI +FS L EEF+ +LR
Sbjct: 113 SRSRLDSPERSEKRSAAFRESLKRHRYLNSFSSRANTSAIYGINQFSHLFPEEFRAIYLR 172
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
++ L +HK H+ +P++ DWR+ ++ KVRNQQ CG
Sbjct: 173 SKPSQLPL--YHKELKMPATHMP-------------LPIRFDWRDKNVVTKVRNQQMCGG 217
Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLE 221
CWAFS V ES +A+K +L LSVQ+VIDC+ N N GCSGG L+W++ +V L
Sbjct: 218 CWAFSVVGGIESAYAIKGESLEDLSVQQVIDCSYN-NFGCSGGSTVNALNWLNKTQVRLV 276
Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW 281
+SEY + C + S GV IK Y+ E + + +GP+ V+A++W
Sbjct: 277 RDSEYSFKAQTGLCHYFSGSHAGVSIKGYSSYDFSDKEDEMAKVLLAYGPLAVIVDAISW 336
Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
Q YLGG+IQ++C S NHAV I G+D T
Sbjct: 337 QDYLGGIIQHHC--SSGEANHAVLITGFDKTGNT 368
>gi|189528132|ref|XP_695717.3| PREDICTED: cathepsin O [Danio rerio]
Length = 334
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 160/310 (51%), Gaps = 28/310 (9%)
Query: 6 NVLFIVALIALCFLA--IPVKVSKPNLEQ--KLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
++ FIV +I L I V+V + +L + +L+ +FQQ + R+ N++
Sbjct: 4 SLTFIVLIIYQELLTGIISVEVIRKSLTEGERLQHSDTFQQDVNNELYQ-----RWINYQ 58
Query: 62 KSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
SL LN +S +SA+YG+ +FS LS+++FK ++L K
Sbjct: 59 SSLQRQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPKFDQSKSE---- 114
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
I + P + DWR+ G++G V NQ +CG CWAFS VE ES+ A
Sbjct: 115 -----------IKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGE 163
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG L W+ +K+ L E+EYP D C+
Sbjct: 164 KLQQLSVQQVIDCSYQ-NQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQ 222
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
+ GV +++Y+ E +++ + GP++ V+A++WQ YLGG+IQ++C A
Sbjct: 223 AHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHHCSSHKA-- 280
Query: 301 NHAVQIVGYD 310
NHAV I GYD
Sbjct: 281 NHAVLITGYD 290
>gi|403272508|ref|XP_003928101.1| PREDICTED: cathepsin O [Saimiri boliviensis boliviensis]
Length = 465
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 138/256 (53%), Gaps = 27/256 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R +A YGI +FS L EEFK +LR +K+
Sbjct: 190 ESLNRHRYLNSLFPNENSTAFYGINQFSYLFPEEFKAIYLRSKPSKY------------- 236
Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ S ++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES A+K
Sbjct: 237 ---PRYSAEVRMSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESACAIKG 293
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LSVQ+VIDC+ N N GC+GG + L+W++ +V L +SEYP ++ C +
Sbjct: 294 KPLEDLSVQQVIDCSYN-NYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQNGLCHYFS 352
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 353 GSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGE 410
Query: 300 INHAVQIVGYDNYSRT 315
NHAV + G+D T
Sbjct: 411 ANHAVLVTGFDKTGST 426
>gi|297293584|ref|XP_001093045.2| PREDICTED: cathepsin O [Macaca mulatta]
Length = 421
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)
Query: 68 EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SP +A YGI +FS L EEFK +LR +K S H
Sbjct: 146 ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 200
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
RSI P++ DWR+ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 201 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 250
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG L+W++ +V L +SEYP ++ C +
Sbjct: 251 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 309
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 310 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 367
Query: 301 NHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 368 NHAVLITGFDKTGST 382
>gi|344239864|gb|EGV95967.1| Cathepsin O [Cricetulus griseus]
Length = 291
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 136/259 (52%), Gaps = 18/259 (6%)
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
F F++SL+ LN SA YG+ +FS LS EEFK +L
Sbjct: 12 FLFFQESLNRHRYLNSFSHDNSSASYGLNQFSYLSPEEFKALYL----------GSKPAW 61
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+ +++ I +P++ DWR+ ++ +VRNQ+ CG CWAFS V ES A
Sbjct: 62 SPRYPAAEQKPIPN-----VSLPLRFDWRDKHVVNQVRNQKMCGGCWAFSVVTAIESACA 116
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
++ L LSVQ+VIDC+ N N GCSGG + L W++ +V L +SEYP ++ C+
Sbjct: 117 IQGKPLDYLSVQQVIDCSFN-NYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAENGLCR 175
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
S +GV IK ++ E + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 176 YFPQSQSGVSIKDFSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGIIQHHC--S 233
Query: 297 LANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 234 SGEANHAVLITGFDKTGNT 252
>gi|296195327|ref|XP_002745330.1| PREDICTED: cathepsin O [Callithrix jacchus]
Length = 453
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 136/256 (53%), Gaps = 27/256 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R +A YGI +FS L EEFK +LR K+ S H
Sbjct: 178 ESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPFKYRRYSAEVH----- 232
Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES A+K
Sbjct: 233 -----------MSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESACAIKG 281
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C +
Sbjct: 282 KPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFS 340
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C A
Sbjct: 341 GSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEA- 399
Query: 300 INHAVQIVGYDNYSRT 315
NHAV + G+D T
Sbjct: 400 -NHAVLVTGFDKTGST 414
>gi|355749637|gb|EHH54036.1| hypothetical protein EGM_14772, partial [Macaca fascicularis]
Length = 311
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)
Query: 68 EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R SP +A YGI +FS L EEFK +LR +K S H
Sbjct: 36 ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 90
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
RSI P++ DW++ ++ +VRNQQTCG CWAFS V ES +A+K
Sbjct: 91 -----RSIPN-----VSWPLRFDWQDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 140
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG L+W++ +V L +SEYP ++ C +
Sbjct: 141 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 199
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 200 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 257
Query: 301 NHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 258 NHAVLITGFDKTGST 272
>gi|449272742|gb|EMC82496.1| Cathepsin O, partial [Columba livia]
Length = 275
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 131/255 (51%), Gaps = 28/255 (10%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
++S I LN + + +A YGI +FS L EEFK +LR +K
Sbjct: 1 LQESTKRIRLLNSSSKDNMTAFYGINQFSHLFPEEFKAIYLRSIPHK------------- 47
Query: 120 HNHVKKRSITTGITIPTG----IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
+ + +P G +P K DWR+ +I +VRNQQTCG CWAFS V ES +
Sbjct: 48 --------LPRYLKVPKGEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAY 99
Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
A+K L LSVQ+VIDC+ N N GCSGG + L W++ KV L +SEY + C
Sbjct: 100 AIKGHNLEELSVQQVIDCSYN-NYGCSGGSTVSALSWLNQTKVKLVRDSEYAFKAQTGLC 158
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
S GV I + E ++ + GP+ V+A++WQ YLGG+IQY+C
Sbjct: 159 HYFGHSDFGVSITGFAAYDFSGQEEEMMRMLVNWGPLAVTVDAVSWQDYLGGIIQYHCSS 218
Query: 296 SLANINHAVQIVGYD 310
A NHAV I G+D
Sbjct: 219 GRA--NHAVLITGFD 231
>gi|224049669|ref|XP_002196637.1| PREDICTED: cathepsin O [Taeniopygia guttata]
Length = 299
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/247 (38%), Positives = 129/247 (52%), Gaps = 26/247 (10%)
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR---HSVNKHVLMSHHKHHDHHHNHV 123
I LN + +A YGI +FS L EEFK +LR H + +++ + K
Sbjct: 32 IRLLNSLAKDNTTAVYGINQFSHLFPEEFKAIYLRSIPHKLPRYIKVPKGKEKP------ 85
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+P K DWR+ +I +VRNQQTCG CWAFS V ES +A+K TL
Sbjct: 86 --------------LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKRNTLE 131
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LSVQ+VIDC+ N N GC+GG + L W++ KV L +SEY + C S
Sbjct: 132 ELSVQQVIDCSYN-NYGCNGGSTVSALSWLNQTKVKLVRDSEYTFKAQTGLCHYFERSDF 190
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
GV I + E ++ + + GP+ V+A++WQ YLGG+IQY+C A NHA
Sbjct: 191 GVSITGFAAYDFSGQEEEMMRMLVSWGPLAVTVDAVSWQDYLGGIIQYHCSSGRA--NHA 248
Query: 304 VQIVGYD 310
V I G+D
Sbjct: 249 VLITGFD 255
>gi|395861575|ref|XP_003803057.1| PREDICTED: cathepsin O [Otolemur garnettii]
Length = 320
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/258 (37%), Positives = 138/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K
Sbjct: 43 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK------------ 90
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + IP +P++ DWR+ ++ +VRNQQTCG CWAFS V ES A+
Sbjct: 91 ----FPRYPAELQMPIPNVSLPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESACAI 146
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 147 KGEPLEDLSVQQVIDCSYN-NYGCNGGSTVNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 205
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G+ IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 206 FSGSHSGISIKDYSEYDFNEQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 263
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 264 GEANHAVLITGFDKTGST 281
>gi|213512532|ref|NP_001134063.1| Cathepsin O precursor [Salmo salar]
gi|209730446|gb|ACI66092.1| Cathepsin O precursor [Salmo salar]
Length = 341
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 154/323 (47%), Gaps = 51/323 (15%)
Query: 8 LFIVALIALCFLAIP---------VKVSKPNLEQKLEL-FSSFQQRYKKSY---SKSEHD 54
LF++ L+ L L P + K N ++ F SF++++ ++Y S H
Sbjct: 6 LFLMFLLNLGILTFPDVARCSGVWKTIRKSNCSAGTDVDFESFREQFHRNYKLHSDCYHR 65
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
R F+ S+ LN +SA+YGI +FSDLS EF+ +L
Sbjct: 66 RR-SYFKNSIKRHAYLNSLSTDKDSAKYGINQFSDLSIHEFRELYL-------------- 110
Query: 115 HHDHHHNHVKKRSITTGITIP-------TGIPVKKDWREAGIIGKVRNQQTCGACWAFST 167
T T+P G+P K DWR +G V+NQQ CG CWAFS
Sbjct: 111 -------------TATAETVPPYSGLKTEGLPAKFDWRVKAAVGSVQNQQACGGCWAFSV 157
Query: 168 VETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYP 227
V ES++A LSVQ+VIDC+ N GC+GG L W+ +V L +SEYP
Sbjct: 158 VGAIESVYAKSGQPFKQLSVQQVIDCSYK-NQGCNGGSITRALSWLKQTRVKLVKQSEYP 216
Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
+ C + S +GV +K + E +++ + GP+ V+A++WQ YLGG
Sbjct: 217 YKAETGICHLFSQSHDGVLVKDFAAHDYSGHEEAMMGRLVEWGPLAVTVDAISWQDYLGG 276
Query: 288 VIQYNCDGSLANINHAVQIVGYD 310
++Q++C S + NHAV + GYD
Sbjct: 277 IMQHHC--SCHHANHAVLVTGYD 297
>gi|432961003|ref|XP_004086527.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin O-like [Oryzias latipes]
Length = 333
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 142/278 (51%), Gaps = 27/278 (9%)
Query: 37 FSSFQQRYKKSYSKSEHDI--RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
F F++ + + + S R +F+++ LN +SA YG +FSDLS+EE
Sbjct: 37 FDKFRKNFNRLFDGSGDQFKRRLLHFQEAAVRHTHLNSFSTEAQSATYGFNQFSDLSQEE 96
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKV 153
F+ +L+ + + S +P G+P + DWR+ ++ V
Sbjct: 97 FRGIYLQATSGRAPPFS---------------------GLPAEGLPARFDWRDKAVVAAV 135
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q CG+CWAFS V +S A+ L LSVQ+++DC+ N GC GG A L W+
Sbjct: 136 QDQLACGSCWAFSVVGAVQSARAVGGSRLQRLSVQQLLDCSFT-NKGCGGGSPTAALSWL 194
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ L +EYP + C+ + + GV +K++T E +++ + HGP++
Sbjct: 195 LQTREKLVTAAEYPYQAEAQICRFFSQTHQGVAVKNFTVHNFRGQEPAMMAQLVEHGPLV 254
Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
A V+A++WQ YLGG+IQ++C NHAV +VGYD
Sbjct: 255 AVVDAVSWQDYLGGIIQHHCSSQWP--NHAVLVVGYDT 290
>gi|344293694|ref|XP_003418556.1| PREDICTED: cathepsin O-like [Loxodonta africana]
Length = 327
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 136/255 (53%), Gaps = 25/255 (9%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN++R +A YGI +FS L EEFK +LR ++ +
Sbjct: 52 ESLNRHRYLNSLFPNENSTASYGINQFSYLFPEEFKAIYLRSKPSRF----------PRY 101
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ SIT +PV+ DWRE ++ +VRNQ+ CG CWAFS V ES A+K
Sbjct: 102 PTDLQMSITN-----VSLPVRFDWREKHVVTQVRNQKMCGGCWAFSVVGAVESACAIKGE 156
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+VIDC+ N GC+GG + L+W++ +V L +SEYP ++ C+ +
Sbjct: 157 PLEDLSVQQVIDCS-YSNYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQNGLCQYFSV 215
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + T GP+I V+A++WQ YLGGVIQ++C S
Sbjct: 216 SHSGFSIKGYSAYDFSDREDEMAKALLTFGPLIVVVDAVSWQDYLGGVIQHHC--SSGEA 273
Query: 301 NHAVQIVGYDNYSRT 315
NHAV + G+D T
Sbjct: 274 NHAVLVTGFDTTGST 288
>gi|426247636|ref|XP_004017585.1| PREDICTED: cathepsin O [Ovis aries]
Length = 288
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 93/251 (37%), Positives = 138/251 (54%), Gaps = 18/251 (7%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+++SL+ LN +A YGI +FS L EEFK +LR S ++ ++
Sbjct: 12 WQESLNRQRYLNSFPHENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY---- 67
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
SI+ +P+K DWR+ +I +VRNQ+TCG CWAFS V ES+ A+K
Sbjct: 68 ------TSISN-----LSLPLKFDWRDKHVITQVRNQKTCGGCWAFSVVGAVESVCAIKG 116
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L +LSVQ+VIDC+ N GC+GG L W++ +V L +SEYP ++ C+ +
Sbjct: 117 QPLEVLSVQQVIDCS-YSNYGCNGGSPLNALYWLNKLQVKLVRDSEYPFQAQNGLCRYFS 175
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S +G IK Y+ E + + GP+I V+A++WQ YLGG+IQ++C S
Sbjct: 176 DSHSGSSIKGYSAYDFSGQEDKMAKALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGE 233
Query: 300 INHAVQIVGYD 310
NHAV + G+D
Sbjct: 234 SNHAVLVTGFD 244
>gi|431901237|gb|ELK08303.1| Cathepsin O [Pteropus alecto]
Length = 322
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/257 (37%), Positives = 137/257 (53%), Gaps = 19/257 (7%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +L+ ++ S
Sbjct: 45 FRESLNRHRYLNSLFPSENSTAVYGINQFSHLFPEEFKAIYLKSKTSRFPKYS------- 97
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+T +P +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+K
Sbjct: 98 ------ADLLTVISKLP--LPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAIK 149
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
L LSVQ+VIDC+ N N GC+GG L W++ +V L +SEYP ++ C
Sbjct: 150 GKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALYWLNKTQVKLVRDSEYPFKAQNGLCLYF 208
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
A + +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 209 ADTHSGFSIKGYSAHDFSDQEDEMAKALLTFGPLVGIVDAVSWQDYLGGIIQHHC--SSG 266
Query: 299 NINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 267 EANHAVIITGFDKTGST 283
>gi|126331447|ref|XP_001375261.1| PREDICTED: cathepsin O-like [Monodelphis domestica]
Length = 414
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/264 (38%), Positives = 137/264 (51%), Gaps = 25/264 (9%)
Query: 56 RFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
R F +SL LN S SA YGI +FS L EEFK +LR + L S
Sbjct: 133 RSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDIYLRSKPSVLPLYSE-- 190
Query: 115 HHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
+ +PT +PV+ DWR+ ++ KVRNQQ CG CWAFS V +
Sbjct: 191 ----------------ALKMPTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAFSVVGSI 234
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
ES +A+K +L LSVQ+VIDC+ N N GCSGG L+W++ +V L +SEY +
Sbjct: 235 ESAYAIKGESLEDLSVQQVIDCSYN-NFGCSGGSTVNALNWLNKTQVRLVKDSEYSFKAQ 293
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQY 291
C + S GV IK Y+ E+ + + GP+ V+A++WQ YLGG+IQ+
Sbjct: 294 TGLCHYFSGSHAGVSIKDYSSYDFSGKENEMANVLLAFGPLAVIVDAVSWQDYLGGIIQH 353
Query: 292 NCDGSLANINHAVQIVGYDNYSRT 315
+C S NHAV I G+D T
Sbjct: 354 HC--SSGEANHAVLITGFDRTGNT 375
>gi|149698347|ref|XP_001499302.1| PREDICTED: cathepsin O-like [Equus caballus]
Length = 367
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/263 (37%), Positives = 139/263 (52%), Gaps = 19/263 (7%)
Query: 54 DIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
D + F +SL+ LN S S A YGI +FS L EEFK +LR S
Sbjct: 84 DRQAAAFRESLNRHRYLNSLFPSENSTAVYGINQFSYLFPEEFKAIYLR---------SK 134
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
+ V+ S++ +P++ DWR+ ++ +VRNQQ CG CWAFS V E
Sbjct: 135 PSRFPRYPAEVQT-SLSN-----VSLPLRFDWRDRHVVTQVRNQQACGGCWAFSVVGAVE 188
Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
S+ A+K L LSVQ+VIDC+ N N GCSGG L+W++ +V L +SEYP +
Sbjct: 189 SVCAIKGEPLEDLSVQQVIDCSYN-NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQS 247
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
C + S +G IK ++ E + + T GP++ V+A++WQ YLGGVIQ++
Sbjct: 248 GLCHYFSDSHSGFSIKGFSAYDFSDQEDQMAKALLTFGPLVVVVDAVSWQDYLGGVIQHH 307
Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
C S NHAV I G+D T
Sbjct: 308 C--SSGEANHAVLITGFDRTGST 328
>gi|440911897|gb|ELR61520.1| Cathepsin O, partial [Bos grunniens mutus]
Length = 276
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)
Query: 68 EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN+ R +A YGI +FS L EEFK +LR S ++ ++
Sbjct: 1 ESLNRQRYLNSLFPHENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY----- 55
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
SI+ +P++ DWR+ ++ +VRNQ+TCG CWAFS V ES+ A+K
Sbjct: 56 -----TSISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 105
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L +LSVQ+VIDC+ N GC+GG + L W++ +V L +SEYP ++ C+ +
Sbjct: 106 PLGVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 164
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + GP+I V+A++WQ YLGG+IQ++C S
Sbjct: 165 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 222
Query: 301 NHAVQIVGYD 310
NHAV + G+D
Sbjct: 223 NHAVLVTGFD 232
>gi|326918260|ref|XP_003205408.1| PREDICTED: cathepsin O-like, partial [Meleagris gallopavo]
Length = 283
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 98/272 (36%), Positives = 130/272 (47%), Gaps = 38/272 (13%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR---HSVNKHVLMSHHKHH 116
+S I LN SA YG +FS L EEFK +LR H + +++ K
Sbjct: 9 LRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPHKLPRYIKAPKGKEK 68
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+P K DWR+ +I +VRNQQTCG CWAFS V ES +A
Sbjct: 69 P--------------------LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYA 108
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
+K L LSVQ+VIDC+ N GCSGG L W++ KV L +SEY + C
Sbjct: 109 IKGNNLEELSVQQVIDCS-YSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCH 167
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
A S GV I + E ++ + GP+ V+A++WQ YLGG+IQY+C S
Sbjct: 168 YFARSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYHC--S 225
Query: 297 LANINHAVQIVGYD------------NYSRTW 316
NHAV I G+D ++ RTW
Sbjct: 226 SGKANHAVLITGFDRTGSIPYWIVQNSWGRTW 257
>gi|358416284|ref|XP_874012.4| PREDICTED: cathepsin O [Bos taurus]
gi|359074588|ref|XP_002694471.2| PREDICTED: cathepsin O [Bos taurus]
Length = 313
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)
Query: 68 EELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN+ R +A YGI +FS L EEFK +LR S ++ ++
Sbjct: 38 ESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY----- 92
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
SI+ +P++ DWR+ ++ +VRNQ+TCG CWAFS V ES+ A+K
Sbjct: 93 -----TSISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 142
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L +LSVQ+VIDC+ N GC+GG + L W++ +V L +SEYP ++ C+ +
Sbjct: 143 PLEVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 201
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + GP+I V+A++WQ YLGG+IQ++C S
Sbjct: 202 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 259
Query: 301 NHAVQIVGYD 310
NHAV + G+D
Sbjct: 260 NHAVLVTGFD 269
>gi|354474585|ref|XP_003499511.1| PREDICTED: cathepsin O-like [Cricetulus griseus]
Length = 311
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 129/246 (52%), Gaps = 18/246 (7%)
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
LN SA YG+ +FS LS EEFK +L + +++ I
Sbjct: 45 LNSFSHDNSSASYGLNQFSYLSPEEFKALYL----------GSKPAWSPRYPAAEQKPIP 94
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
+P++ DWR+ ++ +VRNQ+ CG CWAFS V ES A++ L LSVQ+
Sbjct: 95 N-----VSLPLRFDWRDKHVVNQVRNQKMCGGCWAFSVVTAIESACAIQGKPLDYLSVQQ 149
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
VIDC+ N N GCSGG + L W++ +V L +SEYP ++ C+ S +GV IK
Sbjct: 150 VIDCSFN-NYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAENGLCRYFPQSQSGVSIKD 208
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ E + + GP++ V+A++WQ YLGG+IQ++C S NHAV I G+
Sbjct: 209 FSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEANHAVLITGF 266
Query: 310 DNYSRT 315
D T
Sbjct: 267 DKTGNT 272
>gi|429327035|gb|AFZ78846.1| cathepsin O-like protein [Coptotermes formosanus]
Length = 227
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 124/190 (65%), Gaps = 11/190 (5%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
+ V +V L+ALCFL IP+++ L QK ELF F QR+ K+Y +++E+ + NF++
Sbjct: 6 RRVFIVVGLVALCFLGIPIRIDDNEL-QKRELFRGFLQRFNKTYEGNETEYMKHYNNFKE 64
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK--HHDHHH 120
SL+II+ELN++R + SA YG+T +SDLS++EF +L+ + H+ + K H+ H +
Sbjct: 65 SLNIIDELNRDRLTEHSAVYGLTAYSDLSKDEFLHLYLQPWLPDHLNLMKQKQSHYSHKY 124
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
V K ++ + P++ DWR+ +I +VRNQ+TCGACWAFS T E+M+A+K G
Sbjct: 125 FAVNKEAVVDDL------PLRVDWRDRNVITEVRNQKTCGACWAFSAAATIEAMYAIKTG 178
Query: 181 TLSLLSVQEV 190
L LSVQEV
Sbjct: 179 LLHKLSVQEV 188
>gi|296478683|tpg|DAA20798.1| TPA: cathepsin O preproprotein-like [Bos taurus]
Length = 375
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)
Query: 68 EELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
E LN+ R +A YGI +FS L EEFK +LR S ++ ++
Sbjct: 100 ESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEYT---- 155
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
SI+ +P++ DWR+ ++ +VRNQ+TCG CWAFS V ES+ A+K
Sbjct: 156 ------SISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 204
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L +LSVQ+VIDC+ N GC+GG + L W++ +V L +SEYP ++ C+ +
Sbjct: 205 PLEVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 263
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
S +G IK Y+ E + + GP+I V+A++WQ YLGG+IQ++C S
Sbjct: 264 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 321
Query: 301 NHAVQIVGYD 310
NHAV + G+D
Sbjct: 322 NHAVLVTGFD 331
>gi|345307542|ref|XP_001510786.2| PREDICTED: cathepsin O-like [Ornithorhynchus anatinus]
Length = 358
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 20/233 (8%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTG 137
+A YG +FS L EEFK +LR +K + + S + ++I P
Sbjct: 101 TAYYGTNQFSYLFPEEFKAIYLRSKTSK----------------LPRYSESEEMSIKPMP 144
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+PV+ DWR+ ++ +VRNQ+ CG CWAFS V ES +A++ L LSVQ+VIDC+ N
Sbjct: 145 LPVRFDWRDKHVVTQVRNQEACGGCWAFSIVGEIESAYAIRGKPLEELSVQQVIDCSYN- 203
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
N GCSGG L+W++ +V L ++EY + C + S G+ I+ Y+
Sbjct: 204 NFGCSGGSTINALNWLNKTQVKLVRDAEYSFKAQTGICHYFSGSHYGISIRGYSAYDFSG 263
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
E ++ + + GP+ V+A++WQ YLGG+IQ++C A NHAV I GYD
Sbjct: 264 QEDEMVKVLLSFGPLAVIVDAVSWQDYLGGIIQHHCSSGEA--NHAVLITGYD 314
>gi|71895793|ref|NP_001026300.1| cathepsin O precursor [Gallus gallus]
gi|53127320|emb|CAG31043.1| hypothetical protein RCJMB04_1m17 [Gallus gallus]
Length = 320
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 99/273 (36%), Positives = 129/273 (47%), Gaps = 40/273 (14%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+S I LN SA YG +FS L EEFK +LR K
Sbjct: 46 LRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPYK------------- 92
Query: 120 HNHVKKRSITTGITIPTG----IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
+ I +P G +P K DWR+ +I +VRNQQTCG CWAFS V ES +
Sbjct: 93 --------LPRYIKVPKGEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAY 144
Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
A+K L LSVQ+VIDC+ N GCSGG L W++ KV L +SEY + C
Sbjct: 145 AIKGHNLEELSVQQVIDCS-YSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLC 203
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
S GV I + E ++ + GP+ V+A++WQ YLGG+IQY+C
Sbjct: 204 HYFPHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYHC-- 261
Query: 296 SLANINHAVQIVGYD------------NYSRTW 316
S NHAV I G+D ++ RTW
Sbjct: 262 SSGKANHAVLITGFDTTGSIPYWIVQNSWGRTW 294
>gi|301607871|ref|XP_002933519.1| PREDICTED: cathepsin O-like [Xenopus (Silurana) tropicalis]
Length = 370
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 141/290 (48%), Gaps = 35/290 (12%)
Query: 37 FSSFQQRYKKSYSKSEHDI--RFKNFEKSLDIIEELNK---NRQSPESARYGITEFSDLS 91
F F Q+Y + Y R++ F KS + LN +A YGI +FSDLS
Sbjct: 66 FLDFIQKYGRGYKDGSQVFQERYQIFLKSTERQNYLNAIALPTNLTSAAHYGINQFSDLS 125
Query: 92 EEEFKTRHLR------HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
EEF +LR ++ NK S ++ +P++ DWR
Sbjct: 126 AEEFFYTYLRSFPTGNYTSNKPFKNSAQQYF---------------------LPLRFDWR 164
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
+ ++ V+NQ +CGACWAFS V ES +A+K TL LSVQ+VIDC+ + GC+GG
Sbjct: 165 DKKLVTPVKNQLSCGACWAFSVVGAVESAYAIKWHTLEELSVQQVIDCS-YLDSGCNGGS 223
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L W+ K L SEY K C + GV I Y +E +++
Sbjct: 224 TNGALKWLYQTKTKLVRASEYNFKAKTGLCHYFPKTDFGVSINGYETQDFSGTEDAMMKM 283
Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
+ GP++ VNA++WQ YLGG+IQ++C S NHAV ++GYD T
Sbjct: 284 LVDLGPMVVIVNAVSWQDYLGGIIQHHC--SSGAPNHAVLVIGYDKTGDT 331
>gi|327273973|ref|XP_003221753.1| PREDICTED: cathepsin O-like [Anolis carolinensis]
Length = 376
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 120/233 (51%), Gaps = 18/233 (7%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+A YG+ +FS L EEF+ +L+ +K + + I +
Sbjct: 119 TAFYGMNQFSHLFPEEFRAIYLQSKSSKVPKFTPEVRVEE---------------IDKPL 163
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P K DWR+ GI+ KVRNQ CG CWAFS V ES+HA+K L LSVQ+VIDC+ N
Sbjct: 164 PAKFDWRDKGIVTKVRNQGVCGGCWAFSVVGIIESVHAIKRNVLEELSVQQVIDCS-YIN 222
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GC GG L W++ +V L +SEY + C+ + + GV IK Y L
Sbjct: 223 SGCRGGSPVGALGWINQTRVKLVRDSEYHFQAETGLCRYFSRADFGVSIKGYAAYDLSDQ 282
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
E + + GP+ V+A +WQ YLGG+IQY+C S NHAV I GYD
Sbjct: 283 EDKMKKLLLEWGPLAVVVDAASWQDYLGGIIQYHC--SSGEPNHAVLITGYDT 333
>gi|444519298|gb|ELV12725.1| Cathepsin O [Tupaia chinensis]
Length = 428
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 106/178 (59%), Gaps = 3/178 (1%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P++ DWR+ I+ VRNQQTCGACWAFS V ES A+ L LSVQ+V+DCA +
Sbjct: 33 LPLRFDWRDKHIVTPVRNQQTCGACWAFSVVSAVESACAMAGAPLRELSVQQVLDCAYD- 91
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
+ GC GG + L+W++ +V L ESEYP +D C+ S GV I+ Y
Sbjct: 92 DRGCGGGSTLSALNWLNKTQVKLVGESEYPFTARDGICRFFPASCPGVSIRGYLAYDFSA 151
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
E + + GP++A V+A++WQ YLGGVIQ++C S NHAV + G+D +T
Sbjct: 152 QEDEMAKALVALGPLVAVVDAVSWQDYLGGVIQHHC--SSGEANHAVLVTGFDKAGQT 207
>gi|29244082|ref|NP_808330.1| cathepsin O precursor [Mus musculus]
gi|67460397|sp|Q8BM88.1|CATO_MOUSE RecName: Full=Cathepsin O; Flags: Precursor
gi|26329979|dbj|BAC28728.1| unnamed protein product [Mus musculus]
gi|74139152|dbj|BAE38466.1| unnamed protein product [Mus musculus]
gi|74141620|dbj|BAE38573.1| unnamed protein product [Mus musculus]
Length = 312
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+SL LN +A YG+ +FS L EEFK +L +K+ +
Sbjct: 36 LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+R I +P++ DWR+ ++ VRNQ+ CG CWAFS V ES A++
Sbjct: 91 -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
+L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP + C+
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 258 ANHAVLITGFDRTGNT 273
>gi|148683493|gb|EDL15440.1| cathepsin O [Mus musculus]
Length = 312
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+SL LN +A YG+ +FS L EEFK +L +K+ +
Sbjct: 36 LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+R I +P++ DWR+ ++ VRNQ+ CG CWAFS V ES A++
Sbjct: 91 -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
+L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP + C+
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 258 ANHAVLITGFDRTGNT 273
>gi|28278727|gb|AAH44664.1| Ctso protein [Mus musculus]
Length = 292
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+SL LN +A YG+ +FS L EEFK +L +K+ +
Sbjct: 16 LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 70
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+R I +P++ DWR+ ++ VRNQ+ CG CWAFS V ES A++
Sbjct: 71 -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 120
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
+L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP + C+
Sbjct: 121 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 179
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 180 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 237
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 238 ANHAVLITGFDRTGNT 253
>gi|68086379|gb|AAH98219.1| Cathepsin O [Mus musculus]
Length = 312
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 131/263 (49%), Gaps = 18/263 (6%)
Query: 53 HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
H +SL LN +A YG+ +FS L EEFK +L +K+
Sbjct: 29 HQREAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPR 85
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
+ +R I +P++ DWR+ ++ VRNQ+ CG CWAFS V E
Sbjct: 86 YPAEG-------QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIE 133
Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
S A++ +L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP +
Sbjct: 134 SARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPPCALRWLNETQLKLVADSQYPFKAVN 192
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
C+ S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++
Sbjct: 193 GQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH 252
Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
C S NHAV I G+D T
Sbjct: 253 C--SSGEANHAVLITGFDRTGNT 273
>gi|348582234|ref|XP_003476881.1| PREDICTED: cathepsin O-like [Cavia porcellus]
Length = 478
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/237 (35%), Positives = 123/237 (51%), Gaps = 18/237 (7%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+A YGI +FS L EEFK +LR ++ + K + G +
Sbjct: 221 TAFYGINQFSYLFPEEFKAIYLRSKPSRS------------PRYPSKVQTSVG---SVSL 265
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P + DWR+ ++ +VRNQQ CG CWAFS V ES A++ L LS Q+VIDC+ N N
Sbjct: 266 PPRFDWRDKHVVTQVRNQQACGGCWAFSVVGAVESAWAIRGEPLEDLSAQQVIDCSYN-N 324
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GC+GG + L W+ +V L +SEYP ++ C ++S G I+ Y
Sbjct: 325 FGCNGGSPLSALTWLKKTRVKLVKDSEYPFKAQNGLCHYFSSSHPGFSIQDYAAYDFSAQ 384
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
E + + GP++ V+A++WQ YLGGVIQ++C S NHAV + G+D T
Sbjct: 385 EDEMARVLLLSGPLVVIVDAVSWQDYLGGVIQHHC--SSGEANHAVLVTGFDQTGST 439
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 92/276 (33%), Positives = 140/276 (50%), Gaps = 26/276 (9%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
+ F F +Y KSY+ E R + F +LD I+ LN + ARYG+ +F+DL+ +E
Sbjct: 30 QQFRQFAAQYGKSYASEEFGERLRIFRDNLDRIDALNS---ANTGARYGVNKFADLTPKE 86
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FK +L+ + KK + T + + +P + DWR+ G + +
Sbjct: 87 FKATYLKGA---------------RSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTK 131
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWM 213
+Q CG WAFS E ES L L L+ Q+++DC GNG+ GC GGD +++
Sbjct: 132 DQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYV 189
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ L+ E YP +D C K S G KI ++T T +E+ + +A+ GP+
Sbjct: 190 -IKAGGLDTEESYPYTAEDGQCAFKP-SAVGAKISNWTYITTTKNETEMQYGLASRGPLS 247
Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V+A +WQYY+GGVI C+ SL +H V I GY
Sbjct: 248 ICVDASSWQYYIGGVITSLCEDSL---DHCVMITGY 280
>gi|26340204|dbj|BAC33765.1| unnamed protein product [Mus musculus]
Length = 312
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 130/263 (49%), Gaps = 18/263 (6%)
Query: 53 HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
H +SL LN +A YG+ + S L EEFK +L +K+
Sbjct: 29 HQREAAALRESLHRHRYLNSFPHENSTAFYGVNQLSYLFPEEFKALYLG---SKYAWAPR 85
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
+ +R I +P++ DWR+ ++ VRNQ+ CG CWAFS V E
Sbjct: 86 YPAEG-------QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIE 133
Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
S A++ +L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP +
Sbjct: 134 SARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVN 192
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
C+ S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++
Sbjct: 193 GQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH 252
Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
C S NHAV I G+D T
Sbjct: 253 C--SSGEANHAVLITGFDRTGNT 273
>gi|380254588|gb|AFD36229.1| cysteine proteinase [Acanthamoeba castellanii]
Length = 359
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 155/305 (50%), Gaps = 16/305 (5%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F V ++A+ LA V + + L+ + F+ + +++ +SY E R+ + +++ +
Sbjct: 8 FFAVVVLAVASLAQGVSIEERELQGR---FNGWMRQHARSYDSDEFLERYNIWRENMAFV 64
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
EE N R +S + ++ DL+ EEF + H + K K D + ++
Sbjct: 65 EEFN--RAGDKSFTVAMNQYGDLAPEEFSRLYKGHMLPKDEEEQMRKRLDEQ-DPAEEEP 121
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+T G T+P DWR G + + NQ +C +CWAF++ E + N TL LS
Sbjct: 122 VTVGATVP----ASWDWRSVGAVTGIENQGSCASCWAFASAYALEGARKIANSTLVSLSK 177
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q+++DC+G+ GN+GC GG+ WM N L E+ YP AAC+ + SP V
Sbjct: 178 QQLVDCSGSGGNLGCYGGNVGLTYTWMRRNNAKLMTEANYPYTGVQAACRYTSASPAVVG 237
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
+K+Y SES +L + A GPV A+++ ++ YY GG Y+ S + ++HAV
Sbjct: 238 VKNYA-SVKAGSESDLLANAAV-GPVTVAIDSSKRSFIYYSGGYY-YDQTCSSSYLDHAV 294
Query: 305 QIVGY 309
+VG+
Sbjct: 295 TVVGW 299
>gi|440793487|gb|ELR14669.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 342
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 155/305 (50%), Gaps = 16/305 (5%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F V ++A+ LA V + + L+ + F+ + +++ +SY E R+ + +++ +
Sbjct: 8 FFAVVVLAVASLAQGVSIEERELQGR---FNGWMRQHARSYDSDEFLERYNIWRENMAFV 64
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
EE N R +S + ++ DL+ EEF + H + K K D + ++
Sbjct: 65 EEFN--RAGDKSFTVAMNQYGDLAPEEFSRLYKGHMLPKDEEEQMRKRLDEQ-DPAEEEP 121
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+T G T+P DWR G + + NQ +C +CWAF++ E + N TL LS
Sbjct: 122 VTVGATVP----ASWDWRSVGAVTGIENQGSCASCWAFASAYALEGARKIANSTLVSLSK 177
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q+++DC+G+ GN+GC GG+ WM N L E+ YP AAC+ + SP V
Sbjct: 178 QQLVDCSGSGGNLGCYGGNVGLTYTWMRRNNAKLMTEANYPYTGVQAACRYTSASPAVVG 237
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
+K+Y SES +L + A GPV A+++ ++ YY GG Y+ S + ++HAV
Sbjct: 238 VKNYA-SVKAGSESDLLANAAV-GPVTVAIDSSKRSFIYYSGGYY-YDQTCSSSYLDHAV 294
Query: 305 QIVGY 309
+VG+
Sbjct: 295 TVVGW 299
>gi|260832906|ref|XP_002611398.1| hypothetical protein BRAFLDRAFT_210717 [Branchiostoma floridae]
gi|229296769|gb|EEN67408.1| hypothetical protein BRAFLDRAFT_210717 [Branchiostoma floridae]
Length = 283
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 87/244 (35%), Positives = 126/244 (51%), Gaps = 20/244 (8%)
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E+LN R + SA YG+ FSDL+ EF+ ++ +K+ S +
Sbjct: 14 EQLNHGR-TAGSALYGLNRFSDLTPAEFRGSNV---TSKNSAQSTYD------------- 56
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
P+ +P DWR + +R+Q +CG CWAFS VET ES ++ L SV
Sbjct: 57 PGYSFEAPSDVPPIWDWRNNKTVTAIRDQGSCGGCWAFSIVETIESQWSIAGHLLEEYSV 116
Query: 188 QEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q+V+DC G+ GC GGD C L WM+ L P+ +YP KD C+ + + V
Sbjct: 117 QQVLDCDRTKGSHGCRGGDTCNALSWMNQTTANLVPKKDYPYTGKDGECRFFTNTTDSVH 176
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
+ +YTC E ++ + HG + V+A +WQ YLGG+IQ++C S NHAVQI
Sbjct: 177 LTNYTCRGYENHEDEMVRLLHGHGTLAIIVDATSWQDYLGGIIQHHC--SHDYNNHAVQI 234
Query: 307 VGYD 310
VGY+
Sbjct: 235 VGYN 238
>gi|293345419|ref|XP_001070844.2| PREDICTED: cathepsin O-like [Rattus norvegicus]
Length = 307
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 87/256 (33%), Positives = 133/256 (51%), Gaps = 22/256 (8%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+SL+ LN +A YG+ +FS L EEFK +L +K +
Sbjct: 35 LRESLNRHRYLNSFPHDNSTAFYGVNQFSYLFPEEFKALYLG---SKPAWAPRYP----- 86
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
K ++ ++ +P++ DWR+ ++ VRNQ+TCG CWAFS V ES A++
Sbjct: 87 ---AKGQTPIPNVS----LPLRFDWRDKHVVNHVRNQKTCGGCWAFSVVSAVESAGAIQG 139
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP ++ C+
Sbjct: 140 KPLDYLSVQQVIDCSFN-NYGCRGGSPLGALSWLNETQLKLVADSQYPFKAENGLCRYFP 198
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S N V I S+ + E + + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 199 QSFNYVYISSFGSN----QEDEMARALLSFGPLVVIVDAVSWQDYLGGIIQHHC--SSGE 252
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 253 ANHAVLITGFDKTGNT 268
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 141/278 (50%), Gaps = 22/278 (7%)
Query: 35 ELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E+F ++++++K Y +E + R NF+++L I E N R+S + G+ +F+DLS E
Sbjct: 48 EVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNE 107
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EF+ +L V K + + + H H P DWR G++ V
Sbjct: 108 EFREMYLSK-VKKPITIEEKRKHRHLQT--------------CDAPSSLDWRNKGVVTAV 152
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q CG+CW+FST E+++A+ G L LS QE++DC N GC GGD + W+
Sbjct: 153 KDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWV 212
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ ++ E++YP D C V I+ Y + PS+S++L P+
Sbjct: 213 -IGNGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYV--DVDPSDSALLC-ATVQQPIS 268
Query: 274 AAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ AL +Q Y GG+ +C G +I+HA+ IVGY
Sbjct: 269 VGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGY 306
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/294 (32%), Positives = 151/294 (51%), Gaps = 31/294 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ +ELF F +Y+K+YS E +R F+ F+ +L+ I+E NK G+ EF+D
Sbjct: 46 ERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK---ITGYWLGLNEFAD 102
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ +EFK +L ++ + +D + + + + +P + DWR+ G
Sbjct: 103 LTHDEFKAAYLGLTLTP----ARRNSNDQLFRYEEVEAAS--------LPKEVDWRKKGA 150
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +V+NQ CG+CWAFSTV E ++A+ G L+ LS QE+IDC +GN GCSGG
Sbjct: 151 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYA 210
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN-------GVKIKSYTCDTLIPSESSI 262
++ N L E YP L+++ C+R +T + V I Y D +E ++
Sbjct: 211 FSYIAANG-GLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYE-DVPRNNEQAL 268
Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
L +A H PV A+ A +Q+Y GGV C ++H V VGY S+
Sbjct: 269 LKALA-HQPVSVAIEASGRNFQFYSGGVFDGPCG---TRLDHGVTAVGYGTASK 318
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 152/289 (52%), Gaps = 37/289 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
NL + LF+ FQ ++K++Y + E +RF+ F+++L +IEELN+N Q SA+YGITEF
Sbjct: 158 NLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG--SAKYGITEF 215
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
+D++ E+K R + S+ K IP +P + DWRE
Sbjct: 216 ADMTSPEYKQRTGLWQRDPQKAASNPK-----------------AEIPNIDLPKEFDWRE 258
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G I V+NQ CG+CWAFS E +HA++ G L S QE++DC + C+GG
Sbjct: 259 KGAISAVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDC-DTSDSACNGG-- 315
Query: 207 CALLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L D + + K+ LE ES+YP + C +T + VK+K + L +E++I
Sbjct: 316 --LPDNAYEAIEKIGGLELESDYPYHARKDQCHFNSTKIH-VKVKGHV--DLPKNETAIA 370
Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
+ +GP+ +NA Q+Y GGV C S N++H V IVGY
Sbjct: 371 QWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVGY 417
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 152/289 (52%), Gaps = 37/289 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
NL + LF+ FQ ++K++Y + E +RF+ F+++L +IEELN+N Q SA+YGITEF
Sbjct: 158 NLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG--SAKYGITEF 215
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
+D++ E+K R + S+ K IP +P + DWRE
Sbjct: 216 ADMTSPEYKQRTGLWQRDPQKAASNPK-----------------AEIPNIDLPKEFDWRE 258
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G I V+NQ CG+CWAFS E +HA++ G L S QE++DC + C+GG
Sbjct: 259 KGAISAVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDC-DTSDSACNGG-- 315
Query: 207 CALLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L D + + K+ LE ES+YP + C +T + VK+K + L +E++I
Sbjct: 316 --LPDNAYEAIEKIGGLELESDYPYHARKDQCHFNSTKIH-VKVKGHV--DLPKNETAIA 370
Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
+ +GP+ +NA Q+Y GGV C S N++H V IVGY
Sbjct: 371 QWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVGY 417
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 143/290 (49%), Gaps = 31/290 (10%)
Query: 31 EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
EQ LF ++ +Y K YS + EHD RF NF+ + II N S + G+ ++D
Sbjct: 219 EQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKE---SSYKLGMNHYAD 275
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS +EF T V V D H+ RSI P DWR
Sbjct: 276 LSNKEFNTL-----VKPKVARPSVTGADSVHDDESLRSI----------PSTVDWRNQNC 320
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
+ V++Q CG+CW F + + E + + NG L LS Q+++DCA G+ GC GG +
Sbjct: 321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASS 380
Query: 209 LLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+ M++ L ES YP L+++ C+ + +P+GV I Y + SES++ IA
Sbjct: 381 AFQYVMEIGS--LATESNYPYLMQNGLCRDRTVTPSGVSITGYV-NVTSGSESALQNAIA 437
Query: 268 THGPVIAAVNALT--WQYYLGGVIQYN---CDGSLANINHAVQIVGYDNY 312
T GPV A++A ++YY+ GV YN C L +++H V +GY Y
Sbjct: 438 TTGPVAIAIDASVDDFRYYMSGV--YNNPACKNGLDDLDHEVLAIGYGTY 485
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 153/281 (54%), Gaps = 34/281 (12%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
LF++F Y ++YS E ++RFK F ++L+ IEEL + Q + YG+ F+D+S++EF
Sbjct: 469 LFNNFMTTYNRTYSSLERNLRFKIFRENLNFIEELRETEQG--TGIYGVNMFADMSQKEF 526
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+TR+L + N + ++ I +P+ DWR+ G++ V+
Sbjct: 527 RTRYL-----------GLRPDLQSENEIPLPKAEIPDIDLPSSF----DWRQKGVVTPVK 571
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--W 212
NQ CG+CWAFS E +A+K+G L LS QE++DC + + GC+GG L D +
Sbjct: 572 NQGQCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDC-DHLDEGCNGG----LPDNAY 626
Query: 213 MDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
+ ++ LE ES+YP ++ C K N VK++ + + +E+ I + +GP
Sbjct: 627 RAIEQLGGLELESDYPYEAENEKCHFKQ---NLVKVELASAVNITSNETQIAQWLVQNGP 683
Query: 272 VIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ +NA Q+Y+GGV ++ C+ + N+NH V IVGY
Sbjct: 684 IAIGINANAMQFYMGGVSHPLKILCNPN--NLNHGVLIVGY 722
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 156/307 (50%), Gaps = 32/307 (10%)
Query: 16 LCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNR 74
+ F A V + N EL++ F++ + K+Y S E +RF F+ +L I N
Sbjct: 5 IAFAAFVVAI---NAASDQELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNAKY 61
Query: 75 QSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
+S ES Y I +FSD+++EEF+ +++ ++ L ++ ++T G
Sbjct: 62 ESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSL-----------EDMEIANLTVGAA 110
Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
P DWR G + +RNQ+ CG+CWAFS V E A+K+G+ + LSVQ+++DC
Sbjct: 111 -----PESIDWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDC 165
Query: 194 AG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GN GC+GG D++ N LE +++YP D +CK +S VK+ Y
Sbjct: 166 STEGGNSGCNGGLMNGAFDYIKANG--LESDAKYPYTGTDDSCKADKSSSL-VKLTGYK- 221
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVGY-- 309
+ SE+S+ + T GP+ AV A W+ Y GG+ C G ++H V VGY
Sbjct: 222 -KVASSEASLKEAVGTVGPISVAVYADLWRSYGGGIFNNILCLG--FGLDHGVTAVGYGT 278
Query: 310 DNYSRTW 316
DN + W
Sbjct: 279 DNGKKYW 285
>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
Length = 341
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 143/302 (47%), Gaps = 27/302 (8%)
Query: 11 VALIALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
+ALIA+ + V + F ++ + K Y + E +R NF +++ IE
Sbjct: 5 LALIAIMLAVVSAYNVRLSTADDYTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIE 64
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
++N RQ +A +G+ +FSDLS +EFK KH LM ++K K R
Sbjct: 65 KMN--RQYGRTATFGLNKFSDLSLDEFK---------KHYLMPNYKP--------KARVT 105
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
P+ IP DWR G + V+NQ CG+CWAFS E E+ + + G + LS Q
Sbjct: 106 KETFNYPSNIPATLDWRTKGYVTPVKNQLMCGSCWAFSATEQIETANIMAGGQVEYLSEQ 165
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
+++DC + GC GGD ++ N L YP + AC +T+P V++
Sbjct: 166 QIVDCDPY-DGGCGGGDPYTAYQYVQ-NNGGLTLNVTYPYTAANGACYANSTAP-AVQVT 222
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
++ + +E+ + +A GP+ VNA W Y G+ C L +H VQIVG
Sbjct: 223 AFGYASSQGNETQLREAMAARGPLSICVNAEPWMSYQSGIFSSTCSDDL---DHCVQIVG 279
Query: 309 YD 310
YD
Sbjct: 280 YD 281
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 158/313 (50%), Gaps = 40/313 (12%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
+ I+ ++ L +A K+S LE+ F FQ +Y K Y+ SE+ RF F+ +L +I
Sbjct: 4 VIILTVLLLVSMAAAKKLS---LEETQ--FRQFQIKYNKQYTSSEYAERFATFKSNLKVI 58
Query: 68 EELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+E N++ S +S+ R+G+ EF+DLS+ EF+ +L +SV V+
Sbjct: 59 DEKNRDAASRKSSVRFGVNEFADLSQSEFRATYL-NSVQA----------------VRDP 101
Query: 127 SITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ +P +P DWR G + V+NQ CG+CW+FST E L TL+ L
Sbjct: 102 NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWSFSTTGNVEGQWFLAGNTLTGL 161
Query: 186 SVQEVIDC-------AGNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
S Q ++DC G+ + GC+GG ++ N + + E+ YP D C
Sbjct: 162 SEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIKNGGI-DTEASYPYQGVDGTCS 220
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
KA + G KI ++T + +E+ + + +GP+ A +A+ WQ+YLGGV C +
Sbjct: 221 FKAANI-GAKISNWT--YVSSNETQMAAYLVANGPLAIAADAVEWQFYLGGVFDVPCGNT 277
Query: 297 LANINHAVQIVGY 309
L +H + IVGY
Sbjct: 278 L---DHGILIVGY 287
>gi|281207374|gb|EFA81557.1| hypothetical protein PPL_05546 [Polysphondylium pallidum PN500]
Length = 341
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 148/308 (48%), Gaps = 27/308 (8%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEK 62
+ F+V L+A+ + + + + F+Q +++KSY+ SE+ +R ++ K
Sbjct: 5 IAFLVCLVAIASVDAIRIQNNSGFHRARDFEGEFRQWMTKHEKSYADDSEYYLRLSHYIK 64
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+L + + NK A++ +FSDLS EEF+ +L + NK +
Sbjct: 65 NLRTVADYNKKHAG--MAKFAPNKFSDLSIEEFRAGYLNYVPNKLI-------------- 108
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K RS P IPV DWR+ G + V+NQ+ CG+CWAFS E E+ + +
Sbjct: 109 -KDRSTKQNFDYPANIPVSLDWRQKGFVTPVKNQEQCGSCWAFSAGEQIETAYIMAGNAA 167
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
+S Q+++DC + GC GGD ++ + + ++YP D C + T P
Sbjct: 168 QNVSEQQIVDCDPY-DGGCGGGDPMTAYQYVQ-SAGGITTNTDYPYTATDGTCYAQNT-P 224
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
+I SY + +E+ + IA GP+ V+A TW Y GV+ NC L +H
Sbjct: 225 KFTQIASYGYASNKGNETELKQAIAARGPLSICVDAETWMNYQSGVLNSNCPDEL---DH 281
Query: 303 AVQIVGYD 310
VQIVGYD
Sbjct: 282 CVQIVGYD 289
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 152/312 (48%), Gaps = 47/312 (15%)
Query: 8 LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
LF V+ LI C +A+P + EL+ F++ Y K Y+ + RF F+ +L
Sbjct: 3 LFTVSCFVLIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++L Q +ARYG+T+FSDL+ EEF ++L VN N
Sbjct: 55 MRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQV 97
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
KR TG+ P + DWR G + V NQ +CG+CWAFST E +K G L
Sbjct: 98 KRVRPTGLK---AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVS 154
Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q+++DC GC+GG + L+ M + LE ES+YP + + C + N
Sbjct: 155 LSKQQLVDC-DRAAQGCNGGWPASSYLEIMYMGG--LESESDYPYVGVEQTC-----ALN 206
Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ---YNCDGSLA 298
K+ + D+++ P E +A HGP+ +NA+ QYY GV++ C +
Sbjct: 207 KEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPTFEECPDT-- 264
Query: 299 NINHAVQIVGYD 310
+NHAV VGYD
Sbjct: 265 ELNHAVLTVGYD 276
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 102/308 (33%), Positives = 148/308 (48%), Gaps = 39/308 (12%)
Query: 8 LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
LF V+ LI C +A+P + EL+ F++ Y K Y+ + RF F+ +L
Sbjct: 3 LFTVSCFVLIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++L Q +ARYG+T+FSDL+ EEF ++L VN N
Sbjct: 55 MRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQV 97
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
KR TG+ P + DWR G + V NQ +CG+CWAFST E +K G L
Sbjct: 98 KRVRPTGLK---AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVS 154
Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q+++DC + GC+GG + L+ M + LE + +YP C +
Sbjct: 155 LSKQQLVDCDRAAD-GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER-- 209
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINH 302
+ K L PSE +A HGP+ +NA+T QYY G+I + + S ++NH
Sbjct: 210 -LLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEECSPVDLNH 268
Query: 303 AVQIVGYD 310
AV VGYD
Sbjct: 269 AVLTVGYD 276
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 152/282 (53%), Gaps = 35/282 (12%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F +YKK Y +K E ++RF+ F+ +L++IEEL +N + RYG+T+F+DL++ E
Sbjct: 730 LFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMG--TGRYGVTQFTDLTKAE 787
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKV 153
FK RHL K L S N + TIP +P DWR ++ V
Sbjct: 788 FKARHLGL---KPTLKSE--------NDIP----MPMATIPDIELPSDYDWRHHNVVTPV 832
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD-- 211
++Q +CG+CWAFS E +A+K+G L LS QE++DC + GC+GG L D
Sbjct: 833 KDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDC-DKLDSGCNGG----LPDTA 887
Query: 212 WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+ + ++ LE ES+YP +D C + N VK+ + + +E+ + + +G
Sbjct: 888 YRAIEELGGLELESDYPYDAEDEKCH---FNKNKVKVNIVSGLNITSNETQMAQWLVKNG 944
Query: 271 PVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
P+ +NA Q+Y+GGV ++ C S +++H V IVGY
Sbjct: 945 PMSIGINANAMQFYMGGVSHPFKFLC--SPDSLDHGVLIVGY 984
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 150/286 (52%), Gaps = 27/286 (9%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFS 88
E+ LE+F ++++++K Y +E + RF+NF+ +L I E N R++ + + G+ +F+
Sbjct: 43 ERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFA 102
Query: 89 DLSEEEFKTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
D+S EEF+ +L + +NK + +S + +R + + P DWR
Sbjct: 103 DMSNEEFRKAYLSKVKKPINKGITLSRNM----------RRKVQS-----CDAPSSLDWR 147
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
G++ V++Q +CG+CWAFS+ E ++AL G L LS QE+++C N GC GG
Sbjct: 148 NYGVVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD-TSNYGCEGGY 206
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+W+ +N ++ ES+YP D C V I Y + S+S++L
Sbjct: 207 MDYAFEWV-INNGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQ--DVEQSDSALLCA 263
Query: 266 IATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A PV ++ A+ +Q Y GG+ +C +I+HAV IVGY
Sbjct: 264 VAQQ-PVSVGIDGSAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGY 308
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 152/316 (48%), Gaps = 28/316 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+ A + L +A + ++E L + F ++Q Y +SY + +E RF+ + ++++
Sbjct: 10 LLCACLMLVLMAGAASGGRVDVEDMLMMDRFRAWQATYNRSYLTAAERLRRFEVYRQNME 69
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV-- 123
+IE N R++ S + T F+DL+ EEF H S H + +H + H
Sbjct: 70 LIEATN--RRAELSYQLSETPFTDLTSEEFLATHT-MSTRLHASEAARRHRELITTHAGP 126
Query: 124 --------KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
+R+ TT + +P + DWR G + V++Q CG CW+F+TV E +H
Sbjct: 127 VSDGGRQWNRRNYTTDLDVPESV----DWRTKGAVTTVKDQGACGGCWSFATVAAIEGLH 182
Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
++ G L LS QEV+DC+ N GC GG+ A +DW+ N L ES+YP + C
Sbjct: 183 KIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAIDWVSANG-GLTTESDYPYEGRQGKC 241
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNC 293
K + KI+ L+ + ++A PV +N Q+Y GV C
Sbjct: 242 KLDKARNHVAKIRG---RKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPC 298
Query: 294 DGSLANINHAVQIVGY 309
D ++NHAV +VGY
Sbjct: 299 DPE--DLNHAVTMVGY 312
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 154/311 (49%), Gaps = 40/311 (12%)
Query: 8 LFIVALIALCFLAIPVKVSKPNL-EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
LF V+ AL ++ + VS + + EL+ F++ Y K Y+ + RF F+ +L
Sbjct: 3 LFTVSCFAL-IVSCAIAVSAGRVPDSARELYEQFKRGYGKVYANEDDQKRFAIFKDNLVR 61
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
++L Q +ARYG+T+FSDL+ EEF ++L VN + VK+
Sbjct: 62 AQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN--------------DDQVKRM 105
Query: 127 SITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
PTG+ P + DWR G + V NQ +CG+CWAFST E +K G L
Sbjct: 106 R-------PTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLV 158
Query: 184 LLSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q+++DC GC+GG + L+ M + LE ES+YP + + C +
Sbjct: 159 SLSKQQLVDC-DRAAQGCNGGWPASSYLEIMYMGG--LESESDYPYVGVEQTC-----AL 210
Query: 243 NGVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL-AN 299
N K+ + D+++ P E +A HGP+ +NA+ Q+Y GV++ D
Sbjct: 211 NKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPTFDECPDTE 270
Query: 300 INHAVQIVGYD 310
+NHAV VGYD
Sbjct: 271 LNHAVLTVGYD 281
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 152/307 (49%), Gaps = 30/307 (9%)
Query: 14 IALCFLAIPVKVSKPNLEQKLELFSSFQQRY-------KKSYSK-SEHDIRFKNFEKSLD 65
+AL F+ + + S+ L + + ++ + R+ +K Y +E ++RF+ F+++++
Sbjct: 12 LALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKENVE 71
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE N + + G +FSDL+ EEF+ H + + +M+ K H
Sbjct: 72 RIEAFNAGED--KGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFR----- 124
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
T T IP DWR+ G + +++Q+ CG CWAFS V E +H LK G L L
Sbjct: 125 ------YTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPL 178
Query: 186 SVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
S QE++DC G + GCSGG D++ NK L E YP +D C +K ++ +
Sbjct: 179 SEQELVDCDVEGEDEGCSGGLLDTAFDFILKNK-GLTTEVNYPYKGEDGVCNKKKSALSA 237
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINH 302
KI Y D SE ++L +A PV A++ + +Q+Y GV +C L NH
Sbjct: 238 AKITGYE-DVPANSEKALLQAVANQ-PVSVAIDGSSFDFQFYSSGVFSGSCSTWL---NH 292
Query: 303 AVQIVGY 309
AV VGY
Sbjct: 293 AVTAVGY 299
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 140/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V ES A+ L+ LS Q+++ C + + GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201
Query: 215 VN-KVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N + E YP + AC + G +I Y T+ SE+ + +A G
Sbjct: 202 RNMNGTMXTEDSYPYVSSTGDVPACTNSSQLVPGARIDGYV--TIESSETVMAAWLAKSG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 260 PISIAVDASSFMSYXSGVLT-SCAGK--XLNHGVLLVGYN 296
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 163/340 (47%), Gaps = 57/340 (16%)
Query: 1 MFDVKNVLFIVALIALC--FLAIPVKVSKPNLEQ--------KLEL-----FSSFQQRYK 45
M ++ + +VA + L A+ V P +EQ +LEL F+SF QR+
Sbjct: 1 MARLRRLPIVVAAVLLLSGVAALSSPVEDPLIEQVVGGDEKNELELNAEAHFASFVQRFN 60
Query: 46 KSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---- 100
KSY + EH R F +L ++++ SA +G+T+FSDL+ +EF+ R L
Sbjct: 61 KSYRDADEHAHRLSVFTANL---RRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRK 117
Query: 101 -RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQT 158
R S K + S H +PT G+P + DWRE G +G V++Q +
Sbjct: 118 YRRSFLKGLSGSAHD----------------APALPTDGLPTEFDWREHGAVGPVKDQGS 161
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALL 210
CG+CW+FST E H L G L +LS Q+++DC + GC+GG
Sbjct: 162 CGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAF 221
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ LE E +YP + ACK S ++K+++ T+ E I ++ HG
Sbjct: 222 SYL-AKAGGLETEKDYPYTGRGGACKFD-KSKIAAQVKNFS--TVAVDEDQIAANLVKHG 277
Query: 271 PVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
P+ +NA+ Q Y+GGV + C +++H V +VGY
Sbjct: 278 PLAIGINAVFMQTYIGGVSCPFICG---RHLDHGVLLVGY 314
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 145/285 (50%), Gaps = 33/285 (11%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEF 95
F +FQQ+Y K YS +E+ +F+ F+ +L +I +LN+ + +S ++G+ EF+DLS EF
Sbjct: 29 FVAFQQKYNKVYSSNEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLSAAEF 88
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ +L V K + + T + T IP DWR G + V+N
Sbjct: 89 RKYYLNAQVAKP------------DASLPMAPLLTEEVLET-IPTAFDWRTKGAVTGVKN 135
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC---------AGNGNMGCSGGDF 206
Q CG+CW+FST E L TL LS Q ++DC + + GC GG
Sbjct: 136 QGQCGSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQ 195
Query: 207 CALLDWMDVNKVVLEPESEYPLL-LKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILT 264
++ + L+ E+ YP L + +CK K+ + KI ++ T+IP +E+ +
Sbjct: 196 PNAYRYV-IENGGLDSENSYPYLAVTGDSCKFKSGNV-AAKISNF---TMIPQNETQMAG 250
Query: 265 DIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ATHGP+ A +A WQ+Y+GGV C SL +H + IVG+
Sbjct: 251 YLATHGPLAIAADAAEWQFYIGGVFDLPCGQSL---DHGILIVGF 292
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 88/262 (33%), Positives = 133/262 (50%), Gaps = 22/262 (8%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+E ++RFK F+++++ IE N + + G+ +FSDL+ E+F+ H + + +M
Sbjct: 57 NEKEMRFKIFKENVERIEAFNAGED--KGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVM 114
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
S K H T IP DWR+ G + +++Q+ CG CWAFS V
Sbjct: 115 SSSKPKTHFR-----------YANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAA 163
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E +H LK G L LS QE++DC G + GCSGG D++ NK L E+ YP
Sbjct: 164 TEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNK-GLTTEANYPYK 222
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGG 287
+D C +K ++ + KI Y D SE ++L +A PV A++ + +Q+Y G
Sbjct: 223 GEDGVCNKKKSALSAAKIAGYE-DVPANSEKALLQAVANQ-PVSVAIDGSSFDFQFYSSG 280
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
V +C L NHAV VGY
Sbjct: 281 VFSGSCSTWL---NHAVTAVGY 299
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 140/279 (50%), Gaps = 29/279 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F + Y K+Y S E R+K F K+L +IE+L K Q +A YG+T F+DL+ EE
Sbjct: 578 LFEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQG--TAVYGVTMFADLTPEE 635
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FKT++L N N + + +P K DWRE + V+
Sbjct: 636 FKTKYLGLKTN--------------LNQENDIPLQEAVIPDIDLPPKFDWREYNAVTPVK 681
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + E +A+K+ L LS QE++DC N + GC GG + +
Sbjct: 682 DQGQCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDC-DNLDDGCGGG--YMINAYKT 738
Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
V K+ LE E++YP ++ C N K++ + + E + + +GP+
Sbjct: 739 VEKLGGLELETDYPYDARNEKCHFLK---NKAKVQVASALNITNDEKKMAQWLVKNGPIS 795
Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+NA Q+Y GGV ++ CD AN++H V IVGY
Sbjct: 796 VGINANAMQFYFGGVSHPFKFLCDP--ANLDHGVLIVGY 832
>gi|403352840|gb|EJY75943.1| Oryzain gamma chain [Oxytricha trifallax]
Length = 338
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 155/310 (50%), Gaps = 33/310 (10%)
Query: 7 VLFIVALIALC-FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS- 63
L IV +++L A + + L E F ++ R+ KSY +K+E R K F K+
Sbjct: 5 TLAIVGIVSLSSVFASDAFLKESGLVSSTEEFLNYIARFGKSYATKAEFQKRAKLFLKTK 64
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH--HKHHDHHHN 121
++I++ + N S + R G +FSD +EEEF+ +L + + HD +H
Sbjct: 65 MEIMQAASSN--SVPTFRLGFNQFSDWTEEEFQA----------ILGNKPSEEEHDVYHE 112
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
H+K I +P KDWR+ G++ V++Q CG+CWAFST ES A++ G
Sbjct: 113 HLK-------ILEDAILPASKDWRDDGVVNPVKDQGRCGSCWAFSTAAGVESHFAIQFGK 165
Query: 182 LSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LS Q+++DC+ N GC+GG D+ V LE E++YP L D C R +
Sbjct: 166 LYSLSEQQLVDCSTAYDNAGCNGGLATQGYDY--VKSYGLEQEADYPYLAADGTCHRDKS 223
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLAN 299
+T TL PS+ + +AT GP +V+A ++ Y G++ C SL
Sbjct: 224 KIVAYVEDFHTVQTLSPSQ--LKAALATQGPASVSVDASGVFKNYQSGILNAGCGTSL-- 279
Query: 300 INHAVQIVGY 309
NHA+ VGY
Sbjct: 280 -NHAILAVGY 288
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 142/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C N GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHGLTALSEQQLVSCDDKDN-GCGGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 152/312 (48%), Gaps = 36/312 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
I+A++ L LA K+S ++ F FQ +Y K Y E +F F+ +L+ I+
Sbjct: 5 IIAILFLVALAAARKLSPEEIQ-----FRDFQVKYNKVYGSHEFSQKFVTFKDNLNRIDT 59
Query: 70 LNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
LN N + S ++G+ EF+DLS +EF+ ++ ++V V D+ +
Sbjct: 60 LNANAAASGSDTKFGVNEFADLSVQEFRKFYM-NAVPASVPSDAQVAGDYSDETLAS--- 115
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
IP DWR G + V+NQ CG+CW+FST E L TL+ LS Q
Sbjct: 116 ---------IPSSFDWRTKGAVTPVKNQGQCGSCWSFSTTGNVEGQWFLAGNTLTGLSEQ 166
Query: 189 EVIDC-----AGNGNM----GCSGGDFCALLDWMDVNKVVLEPESEYPLL-LKDAACKRK 238
++DC +G GC+GG ++ + ++ E+ YP L + C+ K
Sbjct: 167 NLVDCDHHCMTYDGQQSCDDGCNGGLQPNAFQYI-IGNGGIDTETSYPYLAVAQDKCQFK 225
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
A++ G KI ++ L +E+ I +A +GPV A +A WQ+Y+GGV C +L
Sbjct: 226 ASNI-GAKISNW--QMLSTNETQIAAYLALNGPVSIAADAAEWQFYIGGVFDLPCGKAL- 281
Query: 299 NINHAVQIVGYD 310
+H + IVGYD
Sbjct: 282 --DHGILIVGYD 291
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 152/313 (48%), Gaps = 41/313 (13%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSL 64
N L IVAL+A C A + S + F F Q Y K YS EH + R F+++L
Sbjct: 2 NKLIIVALLAACVFA---RFSTMQDQDIAAAFKKFTQTYNKKYSSEEHYNARLSIFKENL 58
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
IE NKN + A++GIT+F+DL+ EEF +L + N
Sbjct: 59 RRIELFNKN----DEAQHGITQFADLTHEEFADMYL-------------GYKPQLRNSQA 101
Query: 125 KRSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK-NGTL 182
K S++ T T PT I DW G + V+NQ +CG+CWAFST + E + L+ L
Sbjct: 102 KVSLSSTPFTAPTAI----DWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNL 157
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
+ S Q+++DC + GC+GG +++ K LE ES YP D +CK S
Sbjct: 158 TSFSEQQLVDCDTKEDQGCNGGLMDNAFTYLESAK--LETESAYPYTAVDGSCKYN-QSL 214
Query: 243 NGVKIKSYT----CDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDG 295
V + S+ T+ +E+++ + GP+ A+NA Q+Y GG+ + N +G
Sbjct: 215 GVVGVASFVDIEQGKTVADTENTMGVALDNIGPLSVAINANNLQFYAGGISNPLICNPNG 274
Query: 296 SLANINHAVQIVG 308
+NH V IVG
Sbjct: 275 ----LNHGVLIVG 283
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPYAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C + + GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCGGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGD--TLNHGVLLVGYN 296
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPYAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C + + GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCGGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAG--ITLNHGVLLVGYN 296
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C N GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSCDDKDN-GCAGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSSQLVPGARIDGYL--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 145/284 (51%), Gaps = 39/284 (13%)
Query: 36 LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF +F +++ K+Y+ ++ + RFK F+++L IIEEL + +A YG+T F+DL+ +E
Sbjct: 578 LFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERG--TAEYGVTMFADLTPKE 635
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP------TGIPVKKDWREAG 148
FK R+L L KH + I +P +P+K DWR+
Sbjct: 636 FKARYLG-------LRPELKHENE-------------IPLPEAEIPDVSLPLKFDWRDHS 675
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ V++Q CG+CWAFS E +A+K+ L LS QE++DC + + GC+GGD
Sbjct: 676 VVTPVKDQGQCGSCWAFSVTGNVEGQYAIKHNQLLSLSEQELVDC-DSLDEGCNGGDMEN 734
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
++ LE ES+YP KD C N K++ + + E + +
Sbjct: 735 AYKAIE-RLGGLELESDYPYDAKDEKCHFLQ---NKAKVQVVSAVNITSDEKRMAQWLVK 790
Query: 269 HGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA Q+Y GGV + + C+ N++H V IVGY
Sbjct: 791 NGPISVGINANAMQFYFGGVSHPLNFLCNPK--NLDHGVLIVGY 832
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C N GCSGG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHGLTALSEQQLVSCDDKDN-GCSGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I+ Y T+ SE+ +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIEGYM--TIESSETVKGAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/286 (33%), Positives = 146/286 (51%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +LF F + Y K Y ++ EH +R++ F+ +L E L + Q+ + +YG+T+F
Sbjct: 46 SVDKTQDLFQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQA--TGQYGVTKF 103
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
DLSEEEF+ K+ L + D H +KK I G P DWR+A
Sbjct: 104 MDLSEEEFR---------KYYLTPVWRGSDPH---MKKAEIPKGTP-----PAAFDWRDA 146
Query: 148 --GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG- 204
+ KV+NQ TCG+CWAFST E +K GTL LS QE++DC + GC+GG
Sbjct: 147 DKNAVTKVKNQGTCGSCWAFSTTGNIEGQWKIKKGTLVSLSEQELVDCD-KLDQGCNGGL 205
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
A + M ++ E +YP +D CK AT N V I + E + +
Sbjct: 206 PSNAYQEIMRFGGIM--SEDDYPYTGRDQDCKLNATL-NKVYINGSM--NISKDEGDMAS 260
Query: 265 DIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+A +GP+ +NA Q+Y GGV + + N++H V IVGY
Sbjct: 261 WLAANGPISIGINANAMQFYFGGVSHPWKIFCNPENLDHGVLIVGY 306
>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
Length = 260
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 135/278 (48%), Gaps = 28/278 (10%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
EL+ F++ Y K Y+ + RF F+ +L ++L Q +ARYG+T+FSDL+ EE
Sbjct: 4 ELYEQFKRXYGKVYANEDDQKRFAIFKDNLMRAQKLQLKDQG--TARYGVTQFSDLTPEE 61
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F ++L VN N KR TG+ P + DWR G + V
Sbjct: 62 FAAKYLSAPVN---------------NDQVKRVRPTGLK---AAPERIDWRAKGAVTAVE 103
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALLDWM 213
NQ +CG+CWAFST E +K G L LS Q+++DC + GC+GG + L+ M
Sbjct: 104 NQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-GCNGGWPASSYLEIM 162
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ LE + +YP C + + K L PSE +A HGP+
Sbjct: 163 HMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALXPSEDDNAAYLAEHGPLS 217
Query: 274 AAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
+NA+T QYY G+I + S ++NHAV VGYD
Sbjct: 218 TLLNAITLQYYQSGIIHPSYXXCSPVDLNHAVLTVGYD 255
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 145/292 (49%), Gaps = 25/292 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESA 80
V VS L++ F SF+ ++ K+Y +++E RF F ++L IE N +Q S
Sbjct: 12 VAVSATLLKEDGAHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSY 71
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
GI +F+D++ EFK K +++ K + G+++P I
Sbjct: 72 TQGINKFADMTRAEFKAMLATQVKTKPSIVA-----------TKTFQLADGVSVPESI-- 118
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR ++ +++Q CG+CWAF+ V + E +AL G L+ S Q+++DC + N G
Sbjct: 119 --DWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYG 176
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C GG ++ N LE ES+YP D C + +S K+ SY ++ +E
Sbjct: 177 CDGGYLDDTFPYIQTNG--LELESDYPYTGYDGYCSYE-SSKVVTKVSSYV--SVPANEQ 231
Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGYDN 311
++L + T GPV A+NA Q+Y G+I CD ++H V VGYD+
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIIDDKYCDPEY--LDHGVLAVGYDS 281
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V ES A+ L+ LS Q+++ C + + GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N +L E YP + C + G +I Y T+ SE+ + +A
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 142/289 (49%), Gaps = 26/289 (8%)
Query: 34 LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
++ F ++Q Y +SY + +E RF+ + +++++IE N R++ S + T F+DL+
Sbjct: 4 MDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATN--RRAELSYQLSETPFTDLTS 61
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV----------KKRSITTGITIPTGIPVKK 142
EEF H S H + +H + H +R+ TT + +P +
Sbjct: 62 EEFLATHT-MSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESV---- 116
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
DWR G + V++Q CG CW+F+TV E +H ++ G L LS QEV+DC+ N GC
Sbjct: 117 DWRTKGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCH 176
Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
GG+ A +DW+ N L ES+YP + CK + KI+ L+ +
Sbjct: 177 GGNPAAAIDWVSANG-GLTTESDYPYEGRQGKCKLDKARNHVAKIRG---RKLVDQNNEA 232
Query: 263 LTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++A PV +N Q+Y GV CD ++NHAV +VGY
Sbjct: 233 ALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPE--DLNHAVTMVGY 279
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 147/287 (51%), Gaps = 31/287 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I+E NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNK---IVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK ++L VN +S + + + +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVN----LSQRRESSNEEEFTYR---------DVDLPKSVDWRKK 142
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 143 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGG--- 199
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ V L E +YP +++++ C+ K V I Y D +E S+L
Sbjct: 200 -LMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQVVTINGYH-DVPQNNEQSLLK 257
Query: 265 DIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A + +Q+Y GGV +C ++++H V VGY
Sbjct: 258 ALANQ-PLSVAIEASSRDFQFYSGGVFDGHCG---SDLDHGVSAVGY 300
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q Y++ Y+ E R NF+++L+++ E N AR+GIT+F DLSEEE
Sbjct: 37 LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K H+ V G + T P DWRE G + V+
Sbjct: 94 FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + ES L +L LS QE++ C + + GC+GG DW+
Sbjct: 143 DQGMCGSCWAFSAIGNIESQWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N+ + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 202 NNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GGV+ +CDG +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q Y++ Y+ E R NF+++L+++ E N AR+GIT+F DLSEEE
Sbjct: 37 LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K H+ V G + T P DWRE G + V+
Sbjct: 94 FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + ES L +L LS QE++ C + + GC+GG DW+
Sbjct: 143 DQGMCGSCWAFSAIGNIESQWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N+ + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 202 NNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GGV+ +CDG +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 99/310 (31%), Positives = 153/310 (49%), Gaps = 43/310 (13%)
Query: 8 LFIV---ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
LF V ALI C +A+P + EL+ F++ Y K Y+ + RF F+ +L
Sbjct: 3 LFTVSCFALIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++L Q +ARYG+T+FSDL+ EEF ++L +N +
Sbjct: 55 VRAQKLQLRDQG--TARYGVTQFSDLTPEEFAAKYLSPPLNSDQV--------------- 97
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+R TG+ P + DWR G + V NQ CG+CWAFST E +K G L
Sbjct: 98 ERVQPTGLK---AAPERMDWRAKGAVTPVENQGECGSCWAFSTAGNVEGQWFIKTGQLVS 154
Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q+++DC GC+GG + L+ MD+ LE E++YP + + C + N
Sbjct: 155 LSKQQLVDCDMAAE-GCNGGWPSSSYLEIMDMGG--LESENDYPYVGVEQTC-----ALN 206
Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANI 300
K+ + D ++ SE+ + +A HGP+ +NA+ Q+Y G++ + D ++
Sbjct: 207 KEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPSHKDCPDDDL 266
Query: 301 NHAVQIVGYD 310
NHAV VGYD
Sbjct: 267 NHAVLTVGYD 276
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 20/300 (6%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN-KNRQSPES 79
P V +E ELF + ++++K Y+ E R+ NF +L + + N + R++P S
Sbjct: 36 PEDVGAGGVEGGQELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSS 95
Query: 80 AR-YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+ G+ F+DLS EEF R + VL + + G P +
Sbjct: 96 GQGVGMNVFADLSNEEF-----REVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASL 150
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
DWR+ G + V+NQ CG+CWAFS+ E ++A+ G L LS QE++DC N
Sbjct: 151 ----DWRKRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD-TTN 205
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK-DAACKRKATSPNGVKIKSYTCDTLIP 257
GC GG +W+ +N ++ E+ YP + D+ C V I Y + +
Sbjct: 206 EGCDGGYMDYAFEWV-INNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGY--EDVAT 262
Query: 258 SESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
SES++L A PV ++ +L +Q Y GG+ +C G+ +I+HAV +VGY T
Sbjct: 263 SESALLC-AAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGT 321
>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
Length = 345
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 152/326 (46%), Gaps = 55/326 (16%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+ F++ L AL + L + F +FQ +Y K YS E+ ++F+ F+ +L
Sbjct: 5 LFFVLMLTAL--------AAGRRLSVEESQFIAFQNKYNKIYSAEEYLVKFETFKSNLLN 56
Query: 67 IEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHS---VNKHVLMSHHKHHDHHHNH 122
I+ LNK + S ++G+ +F+DLS+EEFK +L + + M + D
Sbjct: 57 IDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDD----- 111
Query: 123 VKKRSITTGITIPTGIPVKKDWREAG---------IIGKVRNQQTCGACWAFSTVETAES 173
I + P DWR G + V+NQ CG+CW+FST E
Sbjct: 112 -----------IISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEG 160
Query: 174 MHALKNGTLSLLSVQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPES 224
H L GTL LS Q ++DC N GC GG +++ N + + E+
Sbjct: 161 QHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGI-QTEA 219
Query: 225 EYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGPVIAAVNALTWQY 283
YP D CK + G KI S+ T++P +E+ I + + +GP+ A +A WQ+
Sbjct: 220 TYPYTAVDGECKFNSAQV-GAKISSF---TMVPQNETQIASYLFNNGPLAIAADAEEWQF 275
Query: 284 YLGGVIQYNCDGSLANINHAVQIVGY 309
Y+GGV + C +L +H + IVGY
Sbjct: 276 YMGGVFDFPCGQTL---DHGILIVGY 298
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 143/281 (50%), Gaps = 21/281 (7%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+LE+ LF F + Y K Y +SE + RFK F +L I +N + +A YGI +FS
Sbjct: 33 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 89
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS+EEF K+ + + +H KK + + P + DWR+ G
Sbjct: 90 DLSKEEFI---------KYYTGLKREESPSNEDH-KKTDLPESFNVTA--PDQFDWRKKG 137
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ ++NQ+ CG+CWAFS ES+HA+K G L +S Q+++DC + GCSGG
Sbjct: 138 VVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYDSGCSGGLPWD 196
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
L + N + YP + K+ C R +S +++K Y + I SE I +
Sbjct: 197 ALRYFVANGAM--SLKSYPYVAKEGKC-RYDSSKVEIRLKGYKIFSKI-SEDQIKEHLYN 252
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GP+ A++ + Y+GG++ C + +NHAV +VGY
Sbjct: 253 IGPLSIAIDVSPIKPYVGGIVMEECH-EVCQVNHAVLLVGY 292
>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
[Glycine max]
Length = 400
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 140/286 (48%), Gaps = 24/286 (8%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
P+ E +ELF +++ KK Y E + +RF+NF+++L I E N R SP G+ +
Sbjct: 41 PSEEGVVELFQRWKEENKKIYRNPEEEKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQ 100
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F+D+S EEFK++ MS K N V + + P DWR+
Sbjct: 101 FADMSNEEFKSK----------FMSKVKKPFSKRNGVSSKDHSC-----EDEPYSLDWRK 145
Query: 147 AGIIG-KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
G++ V++Q CG+ WAFS+ + E ++A+ L LS QE++DC N GC GG
Sbjct: 146 KGVVTLAVKDQGYCGSYWAFSSTDAIEGINAIVTADLISLSEQELVDCDST-NDGCDGGX 204
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+W+ N + + E+ YP + D C + I Y + S+SS+L
Sbjct: 205 MDYAFEWVMYNGGI-DTETNYPYIGADGTCNVTKEKTKVIGIDGYY--DVGQSDSSLLCA 261
Query: 266 IATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A ++ +W Q Y+GG+ +C +I+HA+ +VGY
Sbjct: 262 TVKQ-PISAGIDGTSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGY 306
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 100/309 (32%), Positives = 150/309 (48%), Gaps = 32/309 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSL 64
LFI IA F + ++++ +ELF S+ ++ K+Y E + RF+ F +L
Sbjct: 16 TLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNL 75
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
I+E NK S G+ EF+DLS EEFK+++L V +
Sbjct: 76 KHIDETNKK---VSSYWLGLNEFADLSHEEFKSKYLGLRVE----------------FPR 116
Query: 125 KRSITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
KRS + G + +P DWR G + V+NQ +CG+CWAFSTV E ++ + G L
Sbjct: 117 KRS-SRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
+ LS QE+IDC + N GC GG ++ N L E +YP L+++ C R+
Sbjct: 176 TSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNS-GLRKEEDYPYLMEEGRCIREKEQF 234
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANI 300
V I Y D E S+L + +H PV A+ A + +Q+Y GG+ C +
Sbjct: 235 EVVTISGYE-DVPANDEQSLLKAL-SHQPVSVAIEASSRNFQFYKGGIFTGRCG---TQM 289
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 290 DHGVTAVGY 298
>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
Length = 352
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 150/311 (48%), Gaps = 24/311 (7%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLE-----LFSSFQQRYKKSYSKSEH-DIRFKNFEKS 63
++ + F+A V ++ N + ++ LF + ++ K Y SE + RF NF+ +
Sbjct: 7 LIIIFCFVFVAQSVNININNAYRTIDGPSKDLFHHWTKQNGKIYETSEEFEKRFSNFKTN 66
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
L IE LN + A +G+ ++SDLSEEEF +L K+ + D+
Sbjct: 67 LKKIENLNNLHKG--KASFGMNKYSDLSEEEFSNFYLM----KNFKGKPEEERDYIKKPE 120
Query: 124 KKRSITTGITIPTGIPVKK----DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
S G + T +K DWR G++ V++Q CG+C+ FS E ES +
Sbjct: 121 NPSSNLIGGYLNTDDGLKAMYQVDWRNKGLVTPVKDQGQCGSCYIFSATEQIESEYIRAG 180
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
LLS Q+ +DC + GC GGD + +++ ++ + E +YP +D C
Sbjct: 181 HKAILLSEQQSVDCD-TMDGGCGGGDPANVYNYI-ISAGGVSTEKDYPYTAQDGTC---F 235
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
+ V I + T E +++T IA HGPV V+A TWQ Y GG+I C+ N
Sbjct: 236 NTTRAVSITGFQYVTQNSDEDTLITTIANHGPVSICVDASTWQSYTGGIITTGCE---QN 292
Query: 300 INHAVQIVGYD 310
I+H VQ+VG D
Sbjct: 293 IDHCVQVVGLD 303
>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
Length = 531
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 153/297 (51%), Gaps = 30/297 (10%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
L +KV + +L++K F +F+ Y+KSY +K EHD+RFKN++ + + I N S
Sbjct: 210 LGDSLKVKESDLQEK---FVAFKSEYEKSYENKEEHDMRFKNYKVAHNKIVSHNAKNLS- 265
Query: 78 ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
+ G ++DLS+ EF T ++ V + H HD +
Sbjct: 266 --YKLGFNHYADLSDHEFNTL-IKPKVARPSNNGAHSVHDDEDIYT-------------- 308
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-N 196
IP DWR + V++Q CG+CW F + + E + + NG L LS Q+++DCA
Sbjct: 309 IPQSVDWRNQKCVTPVKDQGVCGSCWTFGSTGSLEGTNCVTNGYLVSLSEQQLVDCAYLM 368
Query: 197 GNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
G+ GC+GG + + MD + ES+Y L+++A CK K+T+ +GV + SY +
Sbjct: 369 GSQGCNGGFAASAFQYIMDAGGIAT--ESDYQYLMQNALCKDKSTTFSGVGVSSYV-NVT 425
Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQY-NCDGSLANINHAVQIVGY 309
S +++L +AT GPV A++A ++YY G+ +C +++H V +GY
Sbjct: 426 AGSINALLNAVATQGPVAIAIDASVDDFRYYQSGIYSNPSCKNGPDDLDHEVLAIGY 482
>gi|118399607|ref|XP_001032128.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89286466|gb|EAR84465.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/309 (30%), Positives = 150/309 (48%), Gaps = 23/309 (7%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRY--KKSY-SKSEHDIRFKNFEKSLDI 66
I+ALI L + P+ N LF+ + RY K++Y S E R + F ++ +
Sbjct: 3 IIALITLLLVVSPIIADSTNNFSVSALFAYNKWRYANKRTYFSLEEQQFRQQIFFETHER 62
Query: 67 IEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I+ N N PE+ + +FSD+ +EEF +R L S L+ + ++N +
Sbjct: 63 IQNHNSN---PEATYKLAHNQFSDMPQEEFASRVLMKSSQ---LIPRNAVQAQNNNSTTQ 116
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ + +P DWR+ GI+ V++Q CG+CWAFST E+++ ++N
Sbjct: 117 QHTAQDVQLPASF----DWRDYGILSDVKDQGQCGSCWAFSTTGILEALYFMENRQKISF 172
Query: 186 SVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
S Q+++DCA N N GCSGG L + V K + E +YP L D+ CK + +
Sbjct: 173 SEQQLVDCATNSNGFNSYGCSGGWPEEALKY--VAKFGILKEEQYPYLAVDSKCKVSSPT 230
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANIN 301
+G K++S+ I + L + PV V+A TW Y GV + N+N
Sbjct: 231 SDGFKVQSF---YFIDKTADALKNTVARIPVSVLVDASTWGSYSSGVYNGCGNTQTYNLN 287
Query: 302 HAVQIVGYD 310
HAV +GYD
Sbjct: 288 HAVVAIGYD 296
>gi|84028184|sp|Q9R014.2|CATJ_MOUSE RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|5306071|gb|AAD41898.1|AF158182_1 preprocathepsin P [Mus musculus]
gi|12838143|dbj|BAB24099.1| unnamed protein product [Mus musculus]
gi|74199838|dbj|BAE20748.1| unnamed protein product [Mus musculus]
gi|74355544|gb|AAI03770.1| Cathepsin J [Mus musculus]
gi|148709363|gb|EDL41309.1| cathepsin J, isoform CRA_a [Mus musculus]
Length = 334
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 30/304 (9%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L+ LCF +A + P L+ + + ++ +Y KSYS E +R +E+++ +I+
Sbjct: 5 VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYSPKEEALRRAVWEENMRMIKL 61
Query: 70 LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK N + + +F D + EEF R S++ ++ + H NHV
Sbjct: 62 HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 111
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
G+P KDWRE G + VRNQ CG+CWAF+ E K G L+ LSVQ
Sbjct: 112 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 164
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC G +++ NK LE E+ YP KD C+ ++ + + I
Sbjct: 165 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 222
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
Y L P+E + +A+ GPV AA++A ++++Y GG I Y + S +NHAV
Sbjct: 223 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 279
Query: 306 IVGY 309
+VGY
Sbjct: 280 VVGY 283
>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 443
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/302 (31%), Positives = 145/302 (48%), Gaps = 24/302 (7%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
+ LC L + Q E + +++ + KSYS E R ++++L+ I+ N
Sbjct: 4 FLVLCVLVASSRGWSVRFGQDSE-WVAWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHN 62
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
S + A + DL+E+EF+ +L HHN K+ T
Sbjct: 63 AEDHSYKMA---MNHLGDLTEDEFRYFYLGVRA--------------HHNSTKRGWATYM 105
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
IP DW + G + V+NQ CG+CWAFST + E H K G+L LS Q +I
Sbjct: 106 PPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLI 165
Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
DC+G+ GN GC GG +++ N + + ES YP L + +C ++S G ++ Y
Sbjct: 166 DCSGSYGNNGCQGGLMDNAFRYIESNGGI-DTESSYPYLGQQGSC-HFSSSHVGARVTGY 223
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
D SE ++ + +AT GPV AV+A WQ+Y GV N S ++H V ++GY
Sbjct: 224 Q-DIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGVYD-NPYCSSTQLDHGVLVIGYG 281
Query: 311 NY 312
NY
Sbjct: 282 NY 283
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 142/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q Y++ Y+ E R NF+++L+++ E N AR+GIT+F DLSEEE
Sbjct: 37 LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K ++ V G + T P DWRE G + V+
Sbjct: 94 FATRYLSGATH---FAKAKKFASQYYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + ES L +L LS QE++ C + + GC+GG DW+
Sbjct: 143 DQGMCGSCWAFSAIGNIESKWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N+ + + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 202 NNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GGV+ +CDG +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 145/289 (50%), Gaps = 31/289 (10%)
Query: 28 PNLEQKLELFSSFQQ---RYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
P LE+ +EL F++ +Y K YS + E D R F ++L E+L Q SA YG
Sbjct: 165 PPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQG--SAEYG 222
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
+T+FSDL+EEEF++ +L +++ L H +K S G P D
Sbjct: 223 VTKFSDLTEEEFRSTYLNPLLSQWTL----------HRPMKPASPAKGPA-----PASWD 267
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
WR+ G + V+NQ CG+CWAFS E LKNGTL LS QE++DC G + C+G
Sbjct: 268 WRDHGAVSSVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGL-DQACNG 326
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G + ++ LE E++Y + K +C AT I S L E I
Sbjct: 327 GLPSNAYEAIE-KLGGLETETDYSYIGKKQSCDF-ATKKVAAYINSSV--ELSKDEKEIA 382
Query: 264 TDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+A +GPV A+NA Q+Y GV ++ C+ + I+HAV +VGY
Sbjct: 383 AWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLMVGY 429
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 101/289 (34%), Positives = 145/289 (50%), Gaps = 31/289 (10%)
Query: 28 PNLEQKLELFSSFQQ---RYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
P LE+ +EL F++ +Y K YS + E D R F ++L E+L Q SA YG
Sbjct: 165 PPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQG--SAEYG 222
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
+T+FSDL+EEEF++ +L +++ L H +K S G P D
Sbjct: 223 VTKFSDLTEEEFRSTYLNPLLSQWTL----------HRPMKPASPAKGPA-----PASWD 267
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
WR+ G + V+NQ CG+CWAFS E LKNGTL LS QE++DC G + C+G
Sbjct: 268 WRDHGAVSSVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGL-DQACNG 326
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G + ++ LE E++Y + K +C AT I S L E I
Sbjct: 327 GLPSNAYEAIE-KLGGLETETDYSYIGKKQSCDF-ATKKVAAYINSSV--ELSKDEKEIA 382
Query: 264 TDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+A +GPV A+NA Q+Y GV ++ C+ + I+HAV +VGY
Sbjct: 383 AWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLMVGY 429
>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
Length = 394
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGARIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ V+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPIAIGVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 139/287 (48%), Gaps = 31/287 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
N ++ LELF S+ + K+Y E + RF+ F ++L I++ N S G+ EF
Sbjct: 43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK R+L + + + R IT +P DWR+
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V++Q CG+CWAFSTV E ++ + G LS LS QE+IDC N GC+GG
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203
Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ ++ L E +YP L+++ C+ + V I Y + + ++ L
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A +Q+Y GGV C +++H V VGY
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKGGVFNGKCG---TDLDHGVAAVGY 304
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 155/314 (49%), Gaps = 37/314 (11%)
Query: 11 VALIALCFLAIPVKVSKPNL-----------EQKLELFSSFQQRYKKSYSKSEHDI-RFK 58
VA++ LC A + S ++ ++ +ELF + +++K+Y+ E + RF+
Sbjct: 7 VAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFE 66
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F+ +L +I+E+N+ S G+ EF+DL+ +EFKT +L ++
Sbjct: 67 VFKDNLKLIDEINREVTS---YWLGLNEFADLTHDEFKTTYL--GLSPPPARRSSSRSFR 121
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ N +P DWR+ G + V+NQ CG+CWAFSTV E ++A+
Sbjct: 122 YEN-----------VAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIV 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC-KR 237
G L+ LS QE+IDC+ +GN GC+GG ++ + L E YP L+++ +C
Sbjct: 171 TGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYI-ASSGGLHTEEAYPYLMEEGSCGDG 229
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDG 295
K + V I Y D E +++ +A H PV A+ A +Q+Y GGV C
Sbjct: 230 KKSESEAVSISGYE-DVPTKDEQALIKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCG- 286
Query: 296 SLANINHAVQIVGY 309
A ++H V VGY
Sbjct: 287 --AQLDHGVAAVGY 298
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS ++ C + N GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEHHLVSCH-DKNSGCTGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G ++NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAG--ISLNHGVLLVGYN 296
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 148/322 (45%), Gaps = 37/322 (11%)
Query: 4 VKNVLFIVALI-----ALCFLAIPVKVS--------KPNLEQKLELFSSFQQRYKKSY-S 49
+K LF++ L+ LC+ +P + S P+ E +ELF +++ KK Y S
Sbjct: 5 LKTQLFLLFLVWGSWTFLCY-GLPSEYSILALEIDKFPSEEGVIELFQRWKEENKKIYRS 63
Query: 50 KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
+ +RF+NF+++L I E N R SP G+ F+D+S EEFK++
Sbjct: 64 PDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSK 123
Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
+ DH P DWR+ G++ V++Q CG CWAFS+
Sbjct: 124 RNGLSGKDHSCEDA---------------PYSLDWRKKGVVTAVKDQGYCGCCWAFSSTG 168
Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E ++A+ +G L LS E++DC N GC GG +W+ N + + E+ YP
Sbjct: 169 AIEGINAIVSGDLISLSEPELVDCD-RTNDGCDGGHMDYAFEWVMHNGGI-DTETNYPYS 226
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGG 287
D C + I Y + S+ S+L P+ A ++ +W Q Y+GG
Sbjct: 227 GADGTCNVAKEETKVIGIDGYY--NVEQSDRSLLCATVKQ-PISAGIDGSSWDFQLYIGG 283
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
+ +C +I+HA+ +VGY
Sbjct: 284 IYDGDCSSDPDDIDHAILVVGY 305
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 137/278 (49%), Gaps = 24/278 (8%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E F +FQQ+Y K Y + SE +R + F+++L IEE NK +Q+ S G+ +FSDL+E
Sbjct: 22 ETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQNLVSYELGLNQFSDLTE 81
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EF+ L++ D ++K + I PV +W E G++
Sbjct: 82 AEFQ-----------ALLTMSPLTDQLTKQMEKYNSEFDIKTA---PVSVNWAEKGVVTP 127
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CW F+T T ES ALK G+L LS Q+++DC N GC GG L +
Sbjct: 128 VKNQGNCGSCWTFTTTGTIESRLALKTGSLVSLSEQQLLDC-NRVNAGCDGGVLSYALQY 186
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
V L E EYP + C P K YT SES ++ +A GPV
Sbjct: 187 --VESAGLTTEDEYPYKAWNGTC-NSTHKPVAAYTKGYTL-IYTRSESDLMKAVA-EGPV 241
Query: 273 IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
A+NA QYY G+ +N + +NH +VGY+
Sbjct: 242 AVALNADLLQYYSKGI--FNPSACSSTVNHGGLVVGYE 277
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 149/295 (50%), Gaps = 36/295 (12%)
Query: 26 SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
++PN+ + FS F++++ K Y S+ EHD RF F+ +L ++++ SAR+G+
Sbjct: 40 AEPNVLSSEDHFSLFKKKFGKVYASREEHDYRFSVFKSNL---RRARRHQKLDPSARHGV 96
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
T+FSDL+ EFK +HL V D + + +PT +P + D
Sbjct: 97 TQFSDLTRSEFKRKHL------GVKGGFKLPKDANKAPI----------LPTENLPEEFD 140
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
WRE G + V+NQ +CG+CW+FS E + L G L LS Q+++DC AG
Sbjct: 141 WRERGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAG 200
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ + GC+GG + ++ + L E +YP KD A + S + +++ ++
Sbjct: 201 SCDSGCNGGLMNSAFEYT-LKTGGLMREEDYPYTGKDGATCKLDKSKIVASVSNFSVISI 259
Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ +GP+ A+NA Q Y+GGV Y C + +NH V +VGY
Sbjct: 260 --DEEQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYIC---MRRLNHGVLLVGY 309
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 141/282 (50%), Gaps = 29/282 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L+ LS Q+++ C + + GC GG +W+
Sbjct: 143 NQGACGSCWAFSVVGNIESQWAVAGHRLTALSEQQLVSC-DDMDSGCGGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
++N + E YP + C + G +I Y +I S +++ +A
Sbjct: 202 RNMNGTMFT-EDSYPYVSTFGYVPECTNSSQLVPGARIDGY---VMIESNETVMAAWLAK 257
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ V+A ++ Y GGV+ +C G +NH V +VGY+
Sbjct: 258 SGPISIGVDASSFMSYHGGVLT-SCAGK--QLNHGVLLVGYN 296
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMTAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
Length = 443
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARVGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/286 (32%), Positives = 143/286 (50%), Gaps = 31/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ ++ K+Y E + RF+ F +L I+E NK S G+ EF
Sbjct: 39 SMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKK---VSSYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP--TGIPVKKDWR 145
+DLS EEFK+++L V +KRS + G + +P DWR
Sbjct: 96 ADLSHEEFKSKYLGLRVE----------------FPRKRS-SRGFSYGDVEDLPESVDWR 138
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
G + V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC + N GC GG
Sbjct: 139 TKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGL 198
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
++ N L E +YP L+++ C R+ V I Y D E S+L
Sbjct: 199 MDYAFQYIMSNS-GLRKEEDYPYLMEEGRCIREKEQFEVVTISGYE-DVPANDEQSLLKA 256
Query: 266 IATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ +H PV A+ A + +Q+Y GG+ C ++H V VGY
Sbjct: 257 L-SHQPVSVAIEASSRNFQFYKGGIFTGRCG---TQMDHGVTAVGY 298
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 144/280 (51%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q Y++ Y+ E R NF+++L+++ E N +P AR+GIT+F DLSEEE
Sbjct: 29 LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQAN--NPH-ARFGITKFFDLSEEE 85
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K ++ V G + T P DWRE G + V+
Sbjct: 86 FATRYLSGATH---FAKAKKFASQYYRKV-------GADLSTA-PAAVDWREKGAVTPVK 134
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + ES L +L LS QE++ C + + GC+GG DW+
Sbjct: 135 DQGMCGSCWAFSAIGNIESKWYLATHSLISLSEQELVSCD-DVDEGCNGGLMGQAFDWLL 193
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N+ + + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 194 NNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GGV+ +CDG +NH V +VGY+
Sbjct: 252 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 288
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y + Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H H K R+ + + P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQH-HRKARADLSAV------PDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V ES A+ L+ LS Q+++ C + + GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N +L E YP + C + G +I Y T+ SE+ + +A
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)
Query: 34 LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF ++ ++K+Y E +RF+ F+ +L I+E NK +S G+ EF+DLS
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L + ++ + + + R + +P DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E +YP +++ C+ + V I + D E S+L +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y GGV C +++H V VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 141/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q YK+ Y+ +E R NF+++L+++ E N AR+GIT+F DLSE E
Sbjct: 37 LFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANN---PHARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K H+ V G + T P DWR+ G + V
Sbjct: 94 FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWRQMGAVTPVN 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS + ES + +L LS QE++ C + + GC+GG DW+
Sbjct: 143 DQGACGSCWAFSAIGNIESQWYVTTHSLITLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
NK + + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 202 NNKNGAVYTGASYPYVSGNGSVPECSESSELVVGAYIDGHV--TIESNEDTMAAWLAVNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GG++ +CDG +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGILT-SCDGR--QLNHGVLLVGYN 296
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHCRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y + Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H H K R+ + + P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQH-HRKARADLSAV------PDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V ES A+ + L LS Q+++ C + + GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVADHRLXXLSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N +L E YP + C + G +I Y T+ SE+ + +A
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)
Query: 34 LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF ++ ++K+Y E +RF+ F+ +L I+E NK +S G+ EF+DLS
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L + ++ + + + R + +P DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E +YP +++ C+ + V I + D E S+L +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y GGV C +++H V VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYTSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTV-STEKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMTAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 47/312 (15%)
Query: 8 LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
LF V+ LI C + +P + EL+ F++ Y K Y+ + RF F+ +L
Sbjct: 3 LFTVSCFVLIVSCAVVVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++L Q +ARYG+T+FSDL+ EEF ++LR +VN N
Sbjct: 55 VRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLRAAVN---------------NDQV 97
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+R TG+ P + DWRE G + V NQ +CG+CWAFS E +K G L
Sbjct: 98 ERVRPTGLK---AAPERMDWREKGAVTAVENQGSCGSCWAFSAAGNVEGQWFIKTGQLVS 154
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPN 243
LS Q+++DC GC+GG + ++++ + LE ES+YP + + C + N
Sbjct: 155 LSKQQLVDCDRVAE-GCNGG--WPVSSYLEIKHMGGLESESDYPYVGAEQTC-----ALN 206
Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ---YNCDGSLA 298
K+ + D ++ E +A HGP+ +NA+ Q+Y GV+ C +
Sbjct: 207 KEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPTYEECPDT-- 264
Query: 299 NINHAVQIVGYD 310
+NHAV VGYD
Sbjct: 265 ELNHAVLTVGYD 276
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|4733887|gb|AAD02173.3| cysteine proteinase [Acanthamoeba culbertsoni]
Length = 482
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 144/309 (46%), Gaps = 34/309 (11%)
Query: 14 IALCFLAIPVKVSKPNLEQKLEL---FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
+AL LA VS +L ++ EL F+S+ +R+ +SYS E R+ + +++D IEE
Sbjct: 39 LALLVLACLTLVSCVSLRER-ELQGQFNSWMRRHARSYSNDEFLERYNTWRENMDFIEEF 97
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHL-------RHSVNKHVLMSHHKHHDHHHNHV 123
N+ + A + E DL+ EEF ++ + + + +HHH
Sbjct: 98 NRGNHTFTVA---MNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRA 154
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
IP DWR G + V+NQ +C +CWAF E + + G+L
Sbjct: 155 S-------------IPANWDWRTKGAVTPVKNQGSCASCWAFVATGAVEGVRKIAGGSLV 201
Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q ++DCA G GN GCSGG+ WM N L ++ YP + + + C+ +
Sbjct: 202 SLSDQMLLDCAVGTGNQGCSGGNVEITYRWMISNNARLMTQASYPYIARQSTCRYVPS-- 259
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
GV+ SES +L A PV A++ ++ +Y GG Y+ S N+
Sbjct: 260 QGVQGIRNIMRVRAGSESDLLAKAAI-APVTVAIDGSKRSFMFYSGGYY-YDPTCSSTNL 317
Query: 301 NHAVQIVGY 309
NHAV +VG+
Sbjct: 318 NHAVLVVGW 326
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 94/281 (33%), Positives = 143/281 (50%), Gaps = 30/281 (10%)
Query: 36 LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+ R+ ++Y S EH++RF+ F+ +L IE+LNK Q +A+YGIT F+D++ E
Sbjct: 1145 LFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQG--TAKYGITHFADMTSAE 1202
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ R V+ +H N + + I + +P DWRE G + +V+
Sbjct: 1203 YRAR------TGLVVPREGDEVNHIRNPMAE--IDEHMELPDAF----DWRELGAVSEVK 1250
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V E +H +K L S QE++DC + C+GG +D D
Sbjct: 1251 NQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDC-DTVDSACNGG----FMD--D 1303
Query: 215 VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
K + LE ESEYP L K + V++K L +E++I + +
Sbjct: 1304 AYKAIEKIGGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAV--DLPKNETAIAQFLVAN 1361
Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GPV +NA Q+Y GG+ + S N++H V IVGY
Sbjct: 1362 GPVSIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGY 1402
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 141/281 (50%), Gaps = 30/281 (10%)
Query: 34 LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
++LF S+ ++ K Y E +RF+ F+ +L I+E NK + Y G+ EFSDL
Sbjct: 30 IDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNK-----KVVNYWLGLNEFSDL 84
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
S EEFK ++L V+ MS + N+ SI P DWR+ G +
Sbjct: 85 SHEEFKNKYLGLKVD----MSERRECSQEFNYKDVMSI----------PKSVDWRKKGAV 130
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE++DC N GC+GG
Sbjct: 131 TDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAF 190
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ ++ L E +YP ++++ C+ + V I Y D SE S+L +A
Sbjct: 191 SYI-ISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYH-DVPQNSEESLLKALANQ- 247
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A+ A +Q+Y GGV +C ++H V VGY
Sbjct: 248 PLSVAIEASGRDFQFYSGGVFDGHCG---TQLDHGVAAVGY 285
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 144/286 (50%), Gaps = 32/286 (11%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ +ELF + +++K+Y+ E + RF+ F+ +L I+++N+ S G+ EF+D
Sbjct: 43 ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTS---YWLGLNEFAD 99
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ +EFK +L S RS + +P DWR+ G
Sbjct: 100 LTHDEFKAAYLGLDAAPARRGS-------------SRSFRYEDVSASDLPKSVDWRKKGA 146
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +V+NQ CG+CWAFSTV E ++A+ G L+ LS QE+IDC+ +GN GC+GG L
Sbjct: 147 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGG----L 202
Query: 210 LDWM---DVNKVVLEPESEYPLLLKDAAC-KRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+D+ + L E YP L+++ +C K V I Y D E +++
Sbjct: 203 MDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYE-DVPANDEQALIKA 261
Query: 266 IATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A H PV A+ A +Q+Y GGV C A ++H V VGY
Sbjct: 262 LA-HQPVSVAIEASGRHFQFYSGGVFDGPCG---AQLDHGVAAVGY 303
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAVKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
Length = 324
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/310 (30%), Positives = 158/310 (50%), Gaps = 33/310 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
+IAL L + + N EL++ F++ + ++Y S E +RF F+ +L I E N
Sbjct: 4 IIALAALIVVI-----NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58
Query: 72 KNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
++ ES Y I +FSD+++EEF+ +++ ++ L ++ +T
Sbjct: 59 VKYENGESTYYLAINKFSDITDEEFRDMLMKNEASRPNL-----------EGLEVADLTV 107
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G P DWR G++ VRNQ CG+CWA ST ES A+K+G+ LS Q++
Sbjct: 108 GAA-----PESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQL 162
Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ + GN GC+GG +++ N LE +++YP K+ CK S + V++
Sbjct: 163 VDCSTSYGNHGCNGGFAVNGFEYVKDNG--LESDADYPYSGKEDKCKANDKSRSVVELTG 220
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
Y + SE+S+ + T GP+ A V + Y GG+ +C G N++H V +VG
Sbjct: 221 YK--KVTASETSLKEAVGTIGPISAVVFGKPMKSYGGGIFDDSSCLGD--NLHHGVNVVG 276
Query: 309 Y--DNYSRTW 316
Y +N + W
Sbjct: 277 YGIENGQKYW 286
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 142/282 (50%), Gaps = 19/282 (6%)
Query: 31 EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E +E+F ++ R++K Y +E + R++NF+++L I E + + G+ +F+D
Sbjct: 44 ESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFAD 103
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS EEFK +L V K + + D ++R++ T P DWR+ G+
Sbjct: 104 LSNEEFKELYL-SKVKKPINIKRSTARDW-----RQRNLQT-----CDAPSSLDWRKKGV 152
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CW+FST E ++A+ G L LS QE++DC N GC GG
Sbjct: 153 VTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDC-DTTNYGCEGGYMDYA 211
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+W+ +N ++ E+ YP D C V I YT + ++S++L
Sbjct: 212 FEWV-INNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYT--DVDETDSALLC-ATVQ 267
Query: 270 GPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ ++ AL +Q Y GG+ +C +I+HAV IVGY
Sbjct: 268 QPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGY 309
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 147/283 (51%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ KSY +K EHD RF F+ +L I +L++NR +A +GIT+FSDL+ EF
Sbjct: 48 FTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDP--TAEHGITKFSDLTASEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + K + + H I T +P DWRE G + V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q +CG+CWAFST E H L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+++ + V++ E +Y +D +CK S + +++ TL E I ++
Sbjct: 210 NAFEYLLESGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVTL--DEDQIAANLV 265
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA Q Y+ GV Y C + + ++H V +VG+
Sbjct: 266 KNGPLAVAINAAWMQTYMSGVSCPYVC--AKSRLDHGVLLVGF 306
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 145/304 (47%), Gaps = 27/304 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDII 67
F + L+ C A P +E + +Y+++Y+ S E + R K F+++L+ I
Sbjct: 7 FCIILLWAC--AYPTMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYI 64
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E N N +S + G+ +SDL+ EEF H V+ + S K RS
Sbjct: 65 E--NFNNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDS------------KMRS 110
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+ + +P DWRE G++ V+NQ+ CG CWAF+ V E + +KNG L LS
Sbjct: 111 VAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSE 170
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
Q+++DC + GC GGDF D + ++ +++ E +YP D + P +I
Sbjct: 171 QQLVDCDRQSS-GCGGGDFVLAFDSIIKSRGIVK-EDDYPYKANDVQTCQLGQIPGAAQI 228
Query: 248 KSYTCDTLIPS--ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
Y +P+ E +L + +A + + +Y+GGV + +C L NHAV
Sbjct: 229 NGY---FKVPANDEQQLLRAVLQQPVSVAISTSYDFHHYMGGVYEGSCGPKL---NHAVT 282
Query: 306 IVGY 309
I+GY
Sbjct: 283 IIGY 286
>gi|449668436|ref|XP_002162416.2| PREDICTED: cathepsin O-like [Hydra magnipapillata]
Length = 365
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 128/236 (54%), Gaps = 20/236 (8%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV---KKRSITTGITIP 135
+A+YGI ++SD S EEFK L ++ S K + ++ N + +K++
Sbjct: 101 TAKYGINQYSDWSLEEFKNYRLTSNLGMFSDFSTPKIYLNNGNEICSIQKKAY------- 153
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
P KDW G+ K+++Q+ CG+CWAF E E+ A+ + LS QE+I C+
Sbjct: 154 ---PSSKDW--IGMSTKIKDQKNCGSCWAFVASEQVETYLAIAGKPIVELSPQELISCS- 207
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT-CDT 254
+MGC GG+ C L W+ L+ E EYP + + C + + +I + C +
Sbjct: 208 -PSMGCHGGNTCTALSWLKQTHSCLKTEKEYPYEAQVSKCLYSNCTTSDARIYAVCGCQS 266
Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+ +E ++ ++ GP+ V+A++WQ Y+GG+IQ++C +INHAVQ++GY+
Sbjct: 267 FVGNEEYMIRVLSQKGPLSVNVDAVSWQDYIGGIIQHHCTNK--DINHAVQLIGYN 320
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 135/282 (47%), Gaps = 25/282 (8%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ LELF S+ + K Y E + RF+ F ++L I++ N S G+ EF+D
Sbjct: 45 EKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEFAD 101
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EEFK R+L + + + R IT +P DWR+ G
Sbjct: 102 LTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKKGA 148
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CWAFSTV E ++ + G LS LS QE+IDC N GC+GG
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++ ++ L E +YP L+++ C+ + V I Y + + ++ L H
Sbjct: 209 FQYI-ISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLVKALAH 265
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A +Q+Y GGV C +++H V VGY
Sbjct: 266 QPVSVAIEASGRDFQFYKGGVFNGQCG---TDLDHGVAAVGY 304
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 141/283 (49%), Gaps = 35/283 (12%)
Query: 36 LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+ ++ + Y + EH++RF+ F+ +L IE+LNK Q +A+YGIT F+D++ E
Sbjct: 857 LFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQG--TAKYGITHFADMTSAE 914
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ R + + D +H K I + +P DWRE G + V+
Sbjct: 915 YRQR---------TGLVIPRDEDRNHVGNPKAEIDENMELPESF----DWRELGAVSPVK 961
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V E +H +K L S QE++DC + C GG +MD
Sbjct: 962 NQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAV-DSACQGG-------YMD 1013
Query: 215 -----VNKV-VLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+ K+ LE ESEYP L K C +T V ++ L +E+++ +
Sbjct: 1014 DAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTE---VHVRVKGAVDLPKNETAMAQYLV 1070
Query: 268 THGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+GP+ +NA Q+Y GG+ + S N++H V IVGY
Sbjct: 1071 ANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGY 1113
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|45822201|emb|CAE47497.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 315
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 143/298 (47%), Gaps = 26/298 (8%)
Query: 14 IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKN 73
+ L LA + V+ E ++SF+ + KSY+ E +RF F+ +L IEE N
Sbjct: 1 MKLFILAAALIVATSANLGAFEKWTSFKATHNKSYNVIEDKLRFAVFQDNLKKIEEHNAK 60
Query: 74 RQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
+S E Y + +F+D S EF+ R NK K+ I +
Sbjct: 61 YESGEETYYLAVNKFADWSSAEFQAMLARQMANKP----------------KQSFIAKHV 104
Query: 133 TIPTGIPVKK-DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
P V++ DWR++ ++G V++Q CG+CWAFST + E A+ LS QE++
Sbjct: 105 ADPNVQAVEEVDWRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKNQRVPLSEQELV 163
Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
DC + N GC+GG ++ V + L ES+Y +D CK P I Y
Sbjct: 164 DCDTSRNAGCNGGLMTDAFNY--VKRHGLSSESQYAYTGRDDRCKNVENKPLS-SISGYV 220
Query: 252 CDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
L +E ++ + +A+ GPV AV+A TWQ Y GG+ +N N+NH V VGY
Sbjct: 221 --ELETTEDALASAVASVGPVSIAVDADTWQLYGGGL--FNNKNCRTNLNHGVLAVGY 274
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 141/280 (50%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+Q YK+ Y+ +E R NF+++L+++ E N AR+GIT+F DLSE E
Sbjct: 37 LFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANN---PHARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F TR+L + + K H+ V G + T P DWR+ G + V+
Sbjct: 94 FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWRQMGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWA S + ES + +L LS QE++ C + + GC+GG DW+
Sbjct: 143 DQGACGSCWALSAIGNIESQWYVTTHSLITLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201
Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
NK + + YP + + + C + G I + T+ +E ++ +A +G
Sbjct: 202 NNKNGAVYTGASYPYVSGNGSVPECSESSELVVGAYIDGHV--TIESNEDTMAAWLAVNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ AV+A + Y GG++ +CDG +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGILT-SCDGR--QLNHGVLLVGYN 296
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 25/284 (8%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I++ NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---IVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK ++L V+ +S + + + +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVD----LSQRRESSNEEEFTYR---------DVDLPKSVDWRKK 142
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 143 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMD 202
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ N L E +YP +++++ C+ K V I Y D +E S+L +A
Sbjct: 203 YAFSFIGQNG-GLHKEEDYPYIMEESTCEMKKEETQVVTINGYH-DVPQNNEQSLLKALA 260
Query: 268 THGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A+ A + +Q+Y GGV +C ++++H V VGY
Sbjct: 261 NQ-PLSVAIEASSRDFQFYSGGVFDGHCG---SDLDHGVSAVGY 300
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 141/281 (50%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C N GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSCDDKDN-GCRGGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +V Y+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAG--MPLNHGVLLVWYN 296
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 90/279 (32%), Positives = 142/279 (50%), Gaps = 22/279 (7%)
Query: 34 LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF + +Y+K+Y+ E +R F+ F+ +L+ I+++NK S G+ EF+DL+
Sbjct: 48 IELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS---YWLGLNEFADLTH 104
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
+EFK +L + S+ KH+ K S +P + DWR+ + +
Sbjct: 105 DEFKATYL--GLTPPPTRSNSKHYSSEEFRYGKMSNGE-------VPKEMDWRKKNAVTE 155
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFSTV E ++A+ G L+ LS QE+IDC+ +GN GC+GG +
Sbjct: 156 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSY 215
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ + L E YP +++ C + V I Y D E +++ +A H PV
Sbjct: 216 I-ASTGGLRTEEAYPYAMEEGDCDEGKGAAV-VTISGYE-DVPANDEQALVKALA-HQPV 271
Query: 273 IAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y GGV C L +H V VGY
Sbjct: 272 SVAIEASGRHFQFYSGGVFDGPCGEQL---DHGVTAVGY 307
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 145/292 (49%), Gaps = 25/292 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESA 80
V VS L++ F SF+ ++ K+Y +++E RF F ++L IE N +Q S
Sbjct: 12 VAVSATLLKEDGVHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSY 71
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
GI +F+D++ EFK K +++ K + G+++P I
Sbjct: 72 TQGINKFADMTRAEFKAMLATQVKTKPSIVA-----------TKTFQLADGVSVPESI-- 118
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR ++ +++Q CG+CW+F+ V + E +AL G L+ S Q+++DC + N G
Sbjct: 119 --DWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYG 176
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C GG ++ N LE ES+YP D +C +S K+ SY ++ +E
Sbjct: 177 CDGGYLDDTFPYIQTNG--LELESDYPYTGYDGSCSYD-SSKVVTKVSSYV--SVPANEQ 231
Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGYDN 311
++L + T GPV A+NA Q+Y G+I CD ++H V VGY++
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIIDDKYCDPEW--LDHGVLAVGYNS 281
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 91/318 (28%), Positives = 153/318 (48%), Gaps = 30/318 (9%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD---IRFKNFEKSL 64
LF+ +++ CF +S+P L++ + ++ + Y+ + D RF F++++
Sbjct: 8 LFVALVLSFCFSIQLAGLSRPLLDEDSMRHEEWMSQHGRVYADEQEDHKNKRFNVFKENV 67
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ IEE N + + + I +F+DL+ EEF+ + + +++S + +
Sbjct: 68 ERIEEFNDGK----TFKLAINQFADLTNEEFRASY--NGFKGPMVLS---------SQIT 112
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
K + + + +PV DWR+ G + V+NQ CG CWAFS V E + + G L
Sbjct: 113 KPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLIS 172
Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE++DC G + GC GG +++ +N L ES YP +D C T+P
Sbjct: 173 LSEQELVDCDTKGIDHGCEGGLMDTAFEFI-INNGGLTTESNYPYKGEDGTCNFNKTNPI 231
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
V I Y D E +++ +A H PV A+ A +Q+Y GV C L +
Sbjct: 232 AVSITGYE-DVPANDEQALMKAVA-HQPVSVAIEAGGSDFQFYSSGVFTGECGTEL---D 286
Query: 302 HAVQIVGY---DNYSRTW 316
HAV VGY ++ S+ W
Sbjct: 287 HAVTAVGYGESEDGSKYW 304
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 139/283 (49%), Gaps = 26/283 (9%)
Query: 30 LEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+++ + F S+ ++ K Y E + RF+ F ++L+ I+E NK S G+ EF+
Sbjct: 397 IDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE---VSSYWLGLNEFA 453
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EEFK+++L L + + + R + +P DWR+ G
Sbjct: 454 DLSHEEFKSKYLG-------LRAEFPRSRDYSGEFRYRDVAD-------LPESVDWRKKG 499
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 500 AVTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDY 559
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
++ N L E +YP L+++ C+ + + V I Y D E S+L +A
Sbjct: 560 AFAFIASNG-GLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYE-DVPEKDEESLLKALA- 616
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H P+ A+ A +Q+Y GGV C L +H V VGY
Sbjct: 617 HQPLSVAIEASGRDFQFYSGGVFNGPCGTEL---DHGVAAVGY 656
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 138/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYTSTFGYVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 151/305 (49%), Gaps = 38/305 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
+A + C A+ + P + EL+ F++ Y K Y+ + RF F+ +L ++L
Sbjct: 9 LAFLVGCAFAVS---TVPVPDNARELYEQFKRDYGKVYANDDDQKRFAIFKDNLVRAQKL 65
Query: 71 N-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
K+R + ARYG+T+FSDL+ EEF ++L +N V +R
Sbjct: 66 QLKDRGT---ARYGVTQFSDLTPEEFAAKYLSRPMNDQV----------------ERVRP 106
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
TG+ P + DWRE G +G V NQ +CG+CWAFS E LK G L LS Q+
Sbjct: 107 TGLK---AAPERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQ 163
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
++DC + GC GG +M++ ++ LE +S+YP + C N K+
Sbjct: 164 LVDCDVM-DYGCGGG--WPTNAYMEIMRMGGLELQSDYPYVGVQQQCYL-----NKEKLL 215
Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQ 305
+ D ++ E +A HGP+ +A+NA Q+Y G+ + + S A++NHAV
Sbjct: 216 AKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPASLNHAVL 275
Query: 306 IVGYD 310
VGYD
Sbjct: 276 TVGYD 280
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 150/300 (50%), Gaps = 28/300 (9%)
Query: 14 IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK 72
+A+ F + V +S E+ F +F+ + K+Y +++E RF F ++ IE N
Sbjct: 3 VAIFFSLLVVAISASISEELGAKFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNA 62
Query: 73 -NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
Q S + GI +F+D+S+EEFKT + K L ++VK TG
Sbjct: 63 LYEQGKVSYKKGINKFTDMSQEEFKTMLTLSASRKPTL--------ETTSYVK-----TG 109
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
+ IP+ + DWR+ G + V++Q CG+CWAFS + E +A K+G L LS Q++I
Sbjct: 110 VEIPSSV----DWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLI 165
Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
DC + + GC GG + V K L+ E Y +D ACK S K+ YT
Sbjct: 166 DCCTDTSAGCDGGSLDDNFKY--VMKDGLQSEESYTYKGEDGACKYNVASVV-TKVSKYT 222
Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
IP+ E ++L +AT GPV ++A Y G+ + + D S A +NHA+ VGY
Sbjct: 223 S---IPAEDEDALLEAVATVGPVSVGMDASYLSSYDSGIYE-DQDCSPAGLNHAILAVGY 278
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 154/307 (50%), Gaps = 48/307 (15%)
Query: 24 KVSKPNLEQKLEL-----FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSP 77
+V + E +LEL F+SF +R+ KSY + EH+ R F +L ++++
Sbjct: 40 QVVGGDAENELELNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANL---RRARRHQRLD 96
Query: 78 ESARYGITEFSDLSEEEFKTRHL-----RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
SA +GIT+FSDL+ +EF+ R L R S K + S H
Sbjct: 97 PSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHD----------------AP 140
Query: 133 TIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
+PT G+P + DWRE G +G V++Q +CG+CW+FST E + L G L +LS Q+++
Sbjct: 141 ALPTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLV 200
Query: 192 DC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
DC + GC+GG ++ LE E +YP +++ACK S
Sbjct: 201 DCDHECDPSEPRACDAGCNGGLMTTAFSYL-AKAGGLETEKDYPYTGRNSACKFD-KSKI 258
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINH 302
++K+++ T+ E I ++ HGP+ +NA+ Q Y+GGV Y C +++H
Sbjct: 259 AAQVKNFS--TVAIDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPYICG---RHLDH 313
Query: 303 AVQIVGY 309
V +VGY
Sbjct: 314 -VFLVGY 319
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 156/313 (49%), Gaps = 34/313 (10%)
Query: 8 LFIVALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLD 65
LF+ +++A F + ++++ +ELF S+ + K+Y+ E + RF+ F+++L
Sbjct: 17 LFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLHRFEVFKENLK 76
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I++ NK S G+ EF+DLS EEFK++ L + KK
Sbjct: 77 HIDQRNKEVTS---YWLGLNEFADLSHEEFKSKFLGL---------------YPEFPRKK 118
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
S +P DWR+ G + V+NQ +CG+CWAFSTV E ++ + G L+ L
Sbjct: 119 SSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSL 178
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
S Q++IDC + N GC+GG L+D+ VN L E +YP L+++ C K
Sbjct: 179 SEQQLIDCDTSFNNGCNGG----LMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEM 234
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
V I Y D E S+L +A H P+ A++A +Q+Y GGV C ++
Sbjct: 235 EVVTISGYH-DVPRNDEQSLLKALA-HQPLSVAIDASGRDFQFYSGGVFSGPCG---TDL 289
Query: 301 NHAVQIVGYDNYS 313
+H V VGY + S
Sbjct: 290 DHGVAAVGYGSSS 302
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 139/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQRRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAV 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 144/278 (51%), Gaps = 24/278 (8%)
Query: 35 ELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E F++F QRY KSY+ +E + RF F ++L LN + ++GIT+F+D+S+E
Sbjct: 32 EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEG--KTQFGITKFADMSQE 89
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR-EAGIIGK 152
EF++R VLMS+ + + G T P+ DWR + G++
Sbjct: 90 EFQSR---------VLMSNPPPPPTEKPYRGPK--FEGFTAPSTF----DWRNKPGVVTP 134
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V +Q CG+CWAFS E ES AL L+ LS+Q+++DC+ + GC GG D+
Sbjct: 135 VYDQGQCGSCWAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDD-GCGGGFPSYAYDY 193
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ ++ L+ + YP +C K S KI S+T T +E + +A HGP+
Sbjct: 194 V-IDAPGLDALANYPYTAVGGSCAFK-ESQVVAKISSWTYTTTDSNEHQMANYLAQHGPI 251
Query: 273 IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
V+A +W Y GGV + + G+ +I+H V VGY+
Sbjct: 252 SVCVDAESWPSYTGGVYRASACGT--SIDHCVLAVGYN 287
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKACADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAFS V ES A+ L LS Q+++ C N GC GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N V E YP + + C + G +I Y ++ SE + +A +
Sbjct: 202 RNMNGTV-STEKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 135/280 (48%), Gaps = 26/280 (9%)
Query: 35 ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E+F S+ ++ KSY+ E D RFK F +L I+E KN S + G+ F+D++ E
Sbjct: 48 EMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDE--KNSLENRSYKLGLNRFADITNE 105
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
E++T +L D N VK +S +P DWRE G + V
Sbjct: 106 EYRTGYL------------GAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGV 153
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q +CG+CWAFST+ E ++ L G L LS QE++DC N GC+GGD ++
Sbjct: 154 KDQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFI 213
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGP 271
+ ++ E +YP KD C + N K+ S +P +E S+ +A P
Sbjct: 214 -IKNGGIDSEEDYPYTGKDGKC--DSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQ-P 269
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V A+ A +Q Y G+ +C +++H V VGY
Sbjct: 270 VSVAIEAGGYDFQLYSSGIFTGSCG---TDLDHGVAAVGY 306
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 143/295 (48%), Gaps = 32/295 (10%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESA 80
PV + + E F SF+ Y KSY+ E R+ F+ +L I N Q S
Sbjct: 104 PVNIWEWKEEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHN---QQGYSY 160
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
+ F DLS EEF+ ++L ++K + N++ + ++ P+ +P
Sbjct: 161 SLKMNHFGDLSREEFRRKYL----------GYNKSRNLKSNNLGVATELLKVS-PSDVPS 209
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNM 199
DWRE G + V++Q+ CG+CWAFS E H K G L LS QE++DC+ GN
Sbjct: 210 AVDWREKGCVTPVKDQRDCGSCWAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQ 269
Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR---KATSPNGVKIKSYTCDTLI 256
GCSGG+ ++ V+ L E YP L +D CKR K + +G K D
Sbjct: 270 GCSGGEMNDAFQYV-VDSGGLCSEEGYPYLARDGECKRACKKVVTISGFK------DVPR 322
Query: 257 PSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
SE+++ +A H PV A+ A L +Q+Y GV +C +++H V +VGY
Sbjct: 323 KSETAMKAALA-HSPVSIAIEADQLPFQFYHEGVFDASCG---TDLDHGVLLVGY 373
>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
Length = 333
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 31/304 (10%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L+ LCF +A + P L+ + + ++ +Y KSYS E +R +E+++ +I+
Sbjct: 5 VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYS-PEEALRRAVWEENMRMIKL 60
Query: 70 LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK N + + +F D + EEF R S++ ++ + H NHV
Sbjct: 61 HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 110
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
G+P KDWRE G + VRNQ CG+CWAF+ E K G L+ LSVQ
Sbjct: 111 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 163
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC G +++ NK LE E+ YP KD C+ ++ + + I
Sbjct: 164 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 221
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
Y L P+E + +A+ GPV AA++A ++++Y GG I Y + S +NHAV
Sbjct: 222 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 278
Query: 306 IVGY 309
+VGY
Sbjct: 279 VVGY 282
>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
Length = 333
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 154/314 (49%), Gaps = 42/314 (13%)
Query: 9 FIVALIALCFLAIPVK-VSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDI 66
F+ A++AL F + +S+ LE LF + R+ +SY+ E +I R + F +L+
Sbjct: 3 FVFAVLALVFAPTASELISEGELEAHFNLF---KTRFGRSYANFEEEIFRKRVFASNLEF 59
Query: 67 IEELNKNRQ---SPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHN 121
I N NR+ ++ + F+D+S EF+ R LRHS + H +
Sbjct: 60 I--FNHNREFFAGNKNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSAPAIHSASAE---- 113
Query: 122 HVKKRSITTGITIPTGIPVKKDWREA-GIIGKVRNQQTCGACWAF-STVETAESMHALKN 179
G+P DW + ++ ++NQ+ CG+CWAF S V + E H LK
Sbjct: 114 ---------------GLPATVDWTKVKNVVTPIKNQEQCGSCWAFFSAVASMEGQHGLKT 158
Query: 180 GTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS Q ++DC A GNMGC GG ++ NK + + E YP D + + K
Sbjct: 159 GKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAFQYVIANKGI-DTEMSYPYKAIDESWEFK 217
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY-NCDG 295
S G IKSY D SESS+ + +AT GP+ ++A L++Q+Y GV + C
Sbjct: 218 KNSV-GATIKSYV-DVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSGVYEEPACST 275
Query: 296 SLANINHAVQIVGY 309
++ ++H V VGY
Sbjct: 276 TI--LDHGVTAVGY 287
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/297 (34%), Positives = 141/297 (47%), Gaps = 33/297 (11%)
Query: 31 EQKLELFSSFQQRYKKS-----YSKSEHDIRFKNFEKSLDII----EELNKNRQSPESAR 81
++ L +SS+ + Y K YS E F+ F+K+LD+I EE N+ QS E
Sbjct: 21 QKYLSAWSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYE--- 77
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
G+ F+ L+ EEF ++L + V + H K RS IP
Sbjct: 78 MGLNGFAHLTFEEFSAQYLGYG-GAEVEQPKTRRAGKHER--KSRSE---------IPAS 125
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
DWRE G + +V+NQ CG+CWAFS V E H L +G L LS Q+++DC+ GN G
Sbjct: 126 VDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHG 185
Query: 201 CSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK--IKSYTCDTLIP 257
C+GG A WM+ + E +YP D CK S +GV+ I Y D
Sbjct: 186 CAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCK---FSADGVRATISGYN-DVKQG 241
Query: 258 SESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313
+E+ +L +A GPV A++A Q+YL GV +NH V VGY S
Sbjct: 242 NETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFGPLNHGVTAVGYGTAS 298
>gi|118350036|ref|XP_001008299.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290066|gb|EAR88054.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 332
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 150/321 (46%), Gaps = 39/321 (12%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKSEHDIRFKNFE 61
K ++F+ A + A+ + S+ ++E+ + ++ ++ Q++++ K H I +K E
Sbjct: 3 KLIVFVAAAFIIASTAVLIIESQSSVEEVIINSDIIAA--QKWQEFLKK--HSITYKTIE 58
Query: 62 KSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ L N + E + YGIT+F DL+ EEF+ R+LR N
Sbjct: 59 EKLHRFAVFRDNLKKIEGHSNYGITKFMDLTSEEFQQRYLRLKTNTI-----------KR 107
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ K + + G + DW + G + V++Q+ CG+CWAFS ES + G
Sbjct: 108 QNFKSNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGALESATFISTG 167
Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
TL LS QE++DC+ + GN GC GGD A ++ N + E E Y D CK
Sbjct: 168 TLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNNIATEKEYTYRGF--DQKCK-GT 224
Query: 240 TSPNGVKIKSY----TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
P + S+ +CD L+ + PV AV+A WQYY G +C
Sbjct: 225 QYPTTYGLSSFVDVQSCDELVAA--------IQQQPVSVAVDATNWQYYEFGTFN-DC-- 273
Query: 296 SLANINHAVQIVGYDNYSRTW 316
N+NH V +VGY++ + W
Sbjct: 274 -FDNLNHGVLLVGYNSKTHQW 293
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/299 (33%), Positives = 139/299 (46%), Gaps = 46/299 (15%)
Query: 27 KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
+PNL E K LF S Y K+YS E I R F K ++++ P SA
Sbjct: 39 RPNLLGTHTESKFRLFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-SA 92
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
+G+T+FSDL+EEEFK + + + R T G P
Sbjct: 93 VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGGTVGAEAPMVEVD 135
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
G+P DWRE G + +V+NQ CG+CWAFST AE H + G L LS Q+++DC
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195
Query: 197 G----NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GC GG +++ + LE E YP K CK P V ++
Sbjct: 196 DKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAVRVLNF 251
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
T+ E+ I ++ HGP+ +NA+ Q Y+GGV +C S N+NH V +VGY
Sbjct: 252 TTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV---SCPLICSKRNVNHGVLLVGY 307
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 156/315 (49%), Gaps = 26/315 (8%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 ARTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+S+ E + + A +G+T+FSD+S EEF+ +L + K++
Sbjct: 67 QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALK 113
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+K +T+ TG P DWR+ G + V++Q+ CG+CWAFS + E +
Sbjct: 114 RPRKV-----VTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFSAIGNIEGQWKVAGH 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L+ LS Q ++ C N + GC GG L W+ NK + E YP D
Sbjct: 169 ELTSLSEQMLVSC-DNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S V K L E++I +A +GP+ AV+A ++ Y GGV+ +C S
Sbjct: 228 KSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT-SC--SSDA 284
Query: 300 INHAVQIVGYDNYSR 314
+NH V +VGYD+ S+
Sbjct: 285 LNHDVLLVGYDDSSK 299
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 146/319 (45%), Gaps = 29/319 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEK 62
V N+ + L+ FL+ E + +Y K Y S E ++R K F++
Sbjct: 6 VLNITSLTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKE 65
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
++ IE N +S + GI +F+DL+ EEFK R+ H+ + + + H
Sbjct: 66 NVQRIEAFN--NAGNKSYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEH 120
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
V T +P DWR+ G + +++Q CG CWAFS V E + L G L
Sbjct: 121 V------------TSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKL 168
Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC G + GC GG ++ NK L E++YP DA C A +
Sbjct: 169 ISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNK-GLNTEAKYPYQGVDATCNANAEA 227
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
+ IK + D SES++L +A P+ A++A +Q+Y GV +C L
Sbjct: 228 KDAASIKGFE-DVPANSESALLKAVANQ-PISVAIDASGSEFQFYSSGVFTGSCGTEL-- 283
Query: 300 INHAVQIVGY--DNYSRTW 316
+H V VGY D ++ W
Sbjct: 284 -DHGVTAVGYGSDGGTKYW 301
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 149/317 (47%), Gaps = 33/317 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+F++ + + +V +P L K E + F + YK + +E + RF+ F+ +++
Sbjct: 11 MFLIFTTWMLPYVMSSRVLEPYLSNKHEKWMTQFGKSYKDA---AEKEKRFQIFKNNVEF 67
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
IE N P I F+DL+ EEFK L + K HD +
Sbjct: 68 IELFNAVGNKP--FNLSINHFADLTNEEFKAS----------LNGNKKLHDKFD--ILNE 113
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+ + T +P DWR+ G + ++NQ +CG+CWAFSTV + E +H + G L LS
Sbjct: 114 TTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLS 173
Query: 187 VQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
QE+IDC + GCSGG ++ K + E+ YP D CK K S + +
Sbjct: 174 EQELIDCVRGNSSGCSGGYLEDAFKFI-AKKGGMASETNYPYKETDEKCKFKKESKHVAE 232
Query: 247 IKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
IK Y +P SE+ +L +A PV V+A +Q+Y GG+ C + +H
Sbjct: 233 IKGY---EKVPSNSENDLLKAVANQ-PVSVYVDAGDYVFQFYSGGIFTGKCG---TDTDH 285
Query: 303 AVQIVGYD---NYSRTW 316
V IVGY +Y+ W
Sbjct: 286 VVTIVGYGVSLDYTEYW 302
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 161/319 (50%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
L+ LS Q ++ C N + GC GG W+ NK + E YP AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPAC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G KI+ + L E++I +A +GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 D-KSGKVVGAKIRDHV--DLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCIS 282
Query: 296 SLANINHAVQIVGYDNYSR 314
+++H V +VGYD+ S+
Sbjct: 283 E--HLDHGVLLVGYDDTSK 299
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 139/282 (49%), Gaps = 29/282 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
BQ CG+CWAFS V ES A+ L LS Q+++ C + + GC GG +W+
Sbjct: 143 BQGACGSCWAFSAVGNIESQWAVAGHRLXXLSEQQLVSC-DDKDSGCXGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
+N + E YP + C + G +I Y +I S +++ +A
Sbjct: 202 RXMNGTMFT-EDSYPYVSSTGDVPECTNSSELVPGARIDGY---VMIESNETVMAAWLAK 257
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ V+A ++ Y GV+ +C G ++NH V +VGY+
Sbjct: 258 SGPISIGVDASSFMSYESGVLT-SCAGK--HLNHGVLLVGYN 296
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 145/288 (50%), Gaps = 35/288 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I+E NK + G++EF
Sbjct: 40 SMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLSEF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
+DLS EF ++L V+ + ++R T +P DWR+
Sbjct: 97 ADLSHREFNNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 141 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ AC+ V I Y D +E S+L
Sbjct: 199 --LMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKEETQVVTISGYH-DVPQNNEQSLL 255
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C ++++H V VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 153/316 (48%), Gaps = 35/316 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
++ + L+ +C + + P E LF F+ + ++Y S E RF+ F ++
Sbjct: 3 IVIVTVLLMVCTV-----MGAPTTEV---LFRDFKTTHARNYASADEERKRFEIFAANMK 54
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
ELN R++P A +G EF+D+S EEF+TRH + +H+ K
Sbjct: 55 KAAELN--RKNPM-ATFGPNEFADMSSEEFQTRH-----------NAARHYAAVMARPPK 100
Query: 126 RSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ T T I + K DWR G + V+NQ +CG+CW+FST E HA+ G L
Sbjct: 101 NTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFSTTGNIEGQHAIATGQLVS 160
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDA---ACKRKAT 240
LS QE++ C + GCSGG W + + + E+ YP + + AC +
Sbjct: 161 LSEQELVSC-DTVDDGCSGGLMDNAFGWLLSAHNGQITTEASYPYVSGNGIVPACTFNSN 219
Query: 241 S-PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S P G I S+ + +E + + +GP+ V+A +WQ Y+GG++ + D
Sbjct: 220 SNPVGATITSF--HDIPKTERDMAAFVFKYGPLSIGVDASSWQSYIGGILSHCSD---VQ 274
Query: 300 INHAVQIVGYDNYSRT 315
I+H V IVG+D+ + T
Sbjct: 275 IDHGVLIVGFDDTAST 290
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 138/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y + Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V ES A+ L+ LS Q+++ C + + GC GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A
Sbjct: 202 RNMNGTMFT-EDSYPYVSSXGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ V+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIGVDASSFMSYESGVLT-SCAGB--XLNHGVLLVGYN 296
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 140/284 (49%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F++F+ ++ KSY ++ EHD RF F +L + + SA +G+T+FSDL+ EEF
Sbjct: 44 FTTFKTKFGKSYATQEEHDYRFGVFRANL---RRAKLHAKLDPSAEHGVTKFSDLTPEEF 100
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
K ++L + L S + +PT +P DWR+ G + V+
Sbjct: 101 KRQYL--GLKPLRLPS---------------TANKAPILPTSDLPENFDWRDKGAVTPVK 143
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
NQ +CG+CWAFST E H L G L LS Q+++DC G + GC+GG
Sbjct: 144 NQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLM 203
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
D++ + ++ E +YP +D CK S + +++ +L E I ++
Sbjct: 204 NNAFDYI-LQAGGVQTEKDYPYSGRDETCKFD-KSKVAATVANFSVVSL--DEDQIAANL 259
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA+ Q Y+GGV Y C N++H V +VGY
Sbjct: 260 VKHGPLAVGINAIFMQTYIGGVSCPYICG---KNLDHGVLLVGY 300
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 136/281 (48%), Gaps = 28/281 (9%)
Query: 34 LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
++LF + +Y+K+Y+ E + RF+ F+ +L I+E NK + G+ F+DL+
Sbjct: 63 IKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTT---YWLGLNAFADLTH 119
Query: 93 EEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+EFK +L R K S ++ G +P DWR+ G +
Sbjct: 120 DEFKATYLGLRQPETKKTTDSRFRY---------------GGVADDDVPASVDWRKKGAV 164
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V+NQ CG+CWAFSTV E ++ + G L+ LS QE++DC+ +GN GC+GG
Sbjct: 165 TDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAF 224
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ + L E YP L+++ C KA V S D E +++ +A H
Sbjct: 225 SYI-ASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALA-HQ 282
Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A+ A +Q+Y GGV C L +H V VGY
Sbjct: 283 PLSVAIEASGRHFQFYSGGVFNGPCGSEL---DHGVAAVGY 320
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 141/287 (49%), Gaps = 32/287 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +LF S+ ++ KSY E + RF+ F+ +L I+E NK S G+ EF
Sbjct: 40 SMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSS---YWLGLNEF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS EEFK ++L + + K D K +P DWR+
Sbjct: 97 ADLSHEEFKRKYL------GLKIELPKRRDSPEEFSYKDVAD--------LPKSVDWRKK 142
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 143 GAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGG--- 199
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ ++ L E +YP ++++ C K V I Y D +E S L
Sbjct: 200 -LMDYAFAFIISNGGLRKEEDYPYVMEEGTCGEKKEELEVVTISGYH-DVPEDNEQSFLK 257
Query: 265 DIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A + +Q+Y GG+ +C L +H V VGY
Sbjct: 258 ALANQ-PLSVAIEASSRGFQFYSGGIFNGHCGTEL---DHGVAAVGY 300
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 145/283 (51%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ KSY +K EHD RF F+ +L I+ + P +A +GIT+FSDL+ EF
Sbjct: 43 FTSFKSKFSKSYATKEEHDYRFGVFKANL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 99
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L +NK + + H I T +P DWRE G + V++
Sbjct: 100 RRQFL--GLNKRLRLPAHAQ-------------KAPILPTTNLPEDFDWREKGAVTPVKD 144
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q +CG+CWAFST E H L G L LS Q+++DC AG+ + GC+GG
Sbjct: 145 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMN 204
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+++ + V++ E +Y +D +CK S + +++ +L E I ++
Sbjct: 205 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 260
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA Q Y+ GV Y C + A ++H V +VG+
Sbjct: 261 KNGPLAVAINAAWMQAYMSGVSCPYVC--AKARLDHGVLLVGF 301
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 143/287 (49%), Gaps = 32/287 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I++ NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---VVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK ++L V+ +S + + +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVD----LSQRRESSEEEFTYR----------DVDLPKSVDWRKK 141
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 142 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGG--- 198
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ V L E +YP +++++ C+ K V I Y D +E S+L
Sbjct: 199 -LMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVVTINGYH-DVPQNNEQSLLK 256
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C L +H V VGY
Sbjct: 257 ALANQ-PLSVAIEASGRDFQFYSGGVFDGHCGSEL---DHGVSAVGY 299
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 146/288 (50%), Gaps = 35/288 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I+E NK + G+ EF
Sbjct: 40 SMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
+DLS +EFK ++L V+ + ++R T +P DWR+
Sbjct: 97 ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + +V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 141 KGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 199 --LMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKEETEVVTISGYH-DVPQNNEQSLL 255
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C ++++H V VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 154/312 (49%), Gaps = 22/312 (7%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
++K +++I L+A+ A + P Q+L F F+ ++ K Y ++ EH F N++
Sbjct: 2 NMKFIVYIFVLVAVASCAYMNETIDP---QRLAEFEEFKSKFNKYYHNEHEHHSSFHNYK 58
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
S E + K++ +A++G T+FSD+S EEF+ + L + +
Sbjct: 59 TSR---EHIVKHQMENPNAKFGHTKFSDMSPEEFENKMLNFDFS--LFKKAKSQGIKLKA 113
Query: 122 HVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
K + G + + +P DWR+ GII + Q TCG+CW F+T ES +ALK G
Sbjct: 114 EPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTCGSCWTFATTGVIESQYALKYG 173
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L S Q ++DC N N GC GG ++ + + ++ D K+
Sbjct: 174 ELLHFSEQMLLDC-DNINQGCRGGLMTDAYQFLQQSGGIQTADT-----YGDYKNKKDIC 227
Query: 241 SPNGVKIKSYTCDTL-IP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSL 297
+ + K+K+ D IP +E +I ++ +GPV +NA T Q+Y GG++ NCD
Sbjct: 228 NFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINARTLQFYEGGIVDPKNCDDK- 286
Query: 298 ANINHAVQIVGY 309
INHAV IVGY
Sbjct: 287 --INHAVLIVGY 296
>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
Length = 353
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 136/283 (48%), Gaps = 30/283 (10%)
Query: 33 KLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
K+ F F R+K+ Y S E RF F ++L++IEE N+ ++ P + + +F+D+S
Sbjct: 47 KVARFHEFATRHKRVYGSLVELRERFVTFSRNLELIEETNR-KELPYT--LAVNQFADMS 103
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EEFK KH L S N V+ P P KKDWR+ I+
Sbjct: 104 WEEFK---------KHNLFSSQNCSATATNSVR------AFLTP---PSKKDWRDDKIVS 145
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
V+NQQ CG+CW FST ES HA G + +LS Q+++DCAG N GCSGG
Sbjct: 146 PVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCSGGLPSQAF 205
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATH 269
+++ N L+ E YP D C N + K Y + +E ++ +A +
Sbjct: 206 EYIRYNG-GLDTEDSYPYTAHDGKCMYNQ---NSIGAKVYDVVNITEGAEDELIHAVAFN 261
Query: 270 GPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGYD 310
PV A L +++Y GV N C +NHAV VGY+
Sbjct: 262 RPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYN 304
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 138/278 (49%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F +YKK Y E RF F +S+ ++E NK + S A + EF+D++ EEF
Sbjct: 29 FAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQHSYSLA---VNEFADMTFEEF 85
Query: 96 K-TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ +R ++ N + +H + TG ++P KDWRE GI+ +V+
Sbjct: 86 RDSRLMKGEQNCSATVGNH--------------VLTGESLPK----TKDWREEGIVSQVK 127
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ +CG+CW FST E+ HA G + LLS Q+++DCAG N GC GG +++
Sbjct: 128 NQASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYI 187
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N + + E YP KD+ C+ + G ++ + +E+ + IAT PV
Sbjct: 188 RYNGGI-DTEDSYPYNAKDSQCRFHKNTI-GAQVWD-VVNITEGAETQLKHAIATMRPVS 244
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + ++ Y GGV NC +NHAV VGY
Sbjct: 245 VAFEVVHDFRLYNGGVYTSLNCHTGPQTVNHAVLAVGY 282
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 136/285 (47%), Gaps = 37/285 (12%)
Query: 28 PNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
P+ EQ +ELF +++ ++K Y E +R +NF+++L I E N R SP G+
Sbjct: 42 PSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNR 101
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F+D+S EEFK + +S + D P DWR+
Sbjct: 102 FADMSNEEFKNK----------FISKVESCDD-------------------APYSLDWRK 132
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G++ V++Q CG+CW+FS+ E ++A+ G L LS QE++DC N GC GG
Sbjct: 133 KGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCD-TTNDGCEGGYM 191
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+W+ +N ++ E++YP + C V I YT + S+S++
Sbjct: 192 DYAFEWV-INNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYT--DVTQSDSALFCAT 248
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ ++ L +Q Y GG+ +C + +I+HAV IVGY
Sbjct: 249 VKQ-PISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGY 292
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 161/318 (50%), Gaps = 32/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + +T+ TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVTG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
L+ LS Q ++ C ++GC+GG W+ N+ + E YP K
Sbjct: 168 HNLTSLSEQMLVSC-DTEDLGCAGGLMDNAFKWIVSSNRHNVFTEESYPYASKGGNVPPC 226
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
R + G KI+ + L E++I +A +GPV AV++ ++Q Y GGV+ +C
Sbjct: 227 RMSGKVVGAKIRDHV--DLPKDENAIAEWLAKNGPVAIAVDSTSFQSYTGGVLT-SCISK 283
Query: 297 LANINHAVQIVGYDNYSR 314
++H V +VGYD+ S+
Sbjct: 284 --QLDHGVLLVGYDDTSK 299
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 157/320 (49%), Gaps = 44/320 (13%)
Query: 6 NVLFIVALIALCFLAIP-------VKVSKPNL---EQKLELFSSFQQRYKKSYSKSEHDI 55
+ LF +A ++L FLA V + +L ++ ++LF S+ R+ + Y +E +
Sbjct: 7 SFLFFLA-VSLSFLAYSGFARDSIVGYAPEDLTSNDKLIDLFESWISRFGRVYESAEEKL 65
Query: 56 -RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF+ F+ +L I++ NK ++ G+ EF+DLS EEFK ++L + +
Sbjct: 66 ERFEIFKDNLFHIDDTNKKVRN---YWLGLNEFADLSHEEFKNKYL--GLKPDLSKRAQC 120
Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+ + V IP DWR+ G + V+NQ +CG+CWAFSTV E +
Sbjct: 121 PEEFTYKDV-------------AIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 167
Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVV---LEPESEYPLLLK 231
+ + G L+ LS QE+IDC N GC+GG L+D+ V L E +YP +++
Sbjct: 168 NQIVTGNLTSLSEQELIDCDTTYNNGCNGG----LMDYAFAYIVANGGLHKEEDYPYIME 223
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
+ C + + V I Y D SE S+L +A P+ A+ A +Q+Y GGV
Sbjct: 224 EGTCDMRKEESDAVTISGYH-DVPQNSEESLLKALANQ-PLSIAIEASGRDFQFYSGGVF 281
Query: 290 QYNCDGSLANINHAVQIVGY 309
+C L +H V VGY
Sbjct: 282 DGHCGTEL---DHGVAAVGY 298
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 28/310 (9%)
Query: 12 ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEEL 70
A+I L + + P + +LFS F+ + ++Y S E RF+ F ++ EL
Sbjct: 3 AVIVTALLMVCTVMGAPTTD---DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL 59
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N R++P A +G EF+D+S EEF+TRH + +H + + H + K+
Sbjct: 60 N--RKNPM-ATFGPNEFADMSSEEFQTRH---NAARHYAAAKARRAKHTKSFTKEE---- 109
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
I K DWR G + V+NQ +CG+CW+FST E +A+ G L LS QE+
Sbjct: 110 ---IKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQEL 166
Query: 191 IDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGV 245
+ C N GC+GG W+ + E+ YP + + AC + P G
Sbjct: 167 VSCDTTDN-GCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGA 225
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
I ++ + +E + + +GP+ V+A TWQ Y GG+I Y D I+H V
Sbjct: 226 TISNF--QDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPD---VQIDHGVL 280
Query: 306 IVGYDNYSRT 315
IVGYD+ + T
Sbjct: 281 IVGYDDTAPT 290
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 154/315 (48%), Gaps = 26/315 (8%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 ARTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+S+ E + + A +G+T+FSD+S EEF+ +L + K++
Sbjct: 67 QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALK 113
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+K + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 114 RPRKV-----VNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFSAIGNIEGQWKVAGH 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L+ LS Q ++ C N + GC GG L W+ NK + E YP D
Sbjct: 169 ELTSLSEQMLVSC-DNMDYGCRGGFLDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S V K L E++I +A +GP+ AV+A ++ Y GGV+ +C S
Sbjct: 228 KSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT-SC--SSDA 284
Query: 300 INHAVQIVGYDNYSR 314
+NH V +VGYD+ S+
Sbjct: 285 LNHGVLLVGYDDSSK 299
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 148/308 (48%), Gaps = 31/308 (10%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSL 64
++ I L AL AI + ++ +K E + R+K+ YS + E +IR+K F++++
Sbjct: 11 SLALIFFLGALASQAIARTLQDASIHEKHE---EWMTRFKRVYSDAKEKEIRYKIFKENV 67
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
IE NK S +S + GI +F+DL+ EEFKT R+ H+ S + +
Sbjct: 68 QRIESFNK--ASEKSYKLGINQFADLTNEEFKTS--RNRFKGHMCSSQAGPFRYEN---- 119
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
T +P DWR+ G + +++Q CG+CWAFS V E + L L
Sbjct: 120 ----------ITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLIS 169
Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE++DC G + GC GG +++ N+ L E+ YP D C K + +
Sbjct: 170 LSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQ-GLTTEANYPYEGSDGTCNTKQEANH 228
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
KI + D +E +++ +A PV A++A +Q+Y G+ +C L +
Sbjct: 229 AAKINGFE-DVPANNEGALMKAVAKQ-PVSVAIDAGGFEFQFYSSGIFTGDCGTEL---D 283
Query: 302 HAVQIVGY 309
H V VGY
Sbjct: 284 HGVAAVGY 291
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q TCG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + + P G K+ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 28/310 (9%)
Query: 12 ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEEL 70
A+I L + + P + +LFS F+ + ++Y S E RF+ F ++ EL
Sbjct: 3 AVIVTALLMVCTVMGAPTTD---DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL 59
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N R++P A +G EF+D+S EEF+TRH + +H + + H + K+
Sbjct: 60 N--RKNPM-ATFGPNEFADMSSEEFQTRH---NAARHYAAAKARRAKHTKSFTKEE---- 109
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
I K DWR G + V+NQ +CG+CW+FST E +A+ G L LS QE+
Sbjct: 110 ---IKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQEL 166
Query: 191 IDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGV 245
+ C N GC+GG W+ + E+ YP + + AC + P G
Sbjct: 167 VSCDTTDN-GCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGA 225
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
I ++ + +E + + +GP+ V+A TWQ Y GG+I Y D I+H V
Sbjct: 226 TISNF--QDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPD---VQIDHGVL 280
Query: 306 IVGYDNYSRT 315
IVGYD+ + T
Sbjct: 281 IVGYDDTAPT 290
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 146/288 (50%), Gaps = 35/288 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ ++ K Y S E +RF+ F+ +L I+E NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
+DLS +EFK ++L V+ + ++R T +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 139
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 140 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 197
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C ++++H V VGY
Sbjct: 255 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 141/284 (49%), Gaps = 34/284 (11%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RYGITEFSDLSE 92
E + F+ + KSY S E RF+ F+++L IE N+ + ES ++G+T+F+DL+E
Sbjct: 21 EEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFTDLTE 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
+EF ++ VL + + + H H+ + +P DWR+ G + +
Sbjct: 81 KEF--------LDLLVLSKNARPNRTHATHL--------LAPLRDLPSAFDWRDKGAVTE 124
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CW FST + E+ H LK G L LS Q ++DCA + GC GG W
Sbjct: 125 VKDQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGG-------W 177
Query: 213 MD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
MD + K + E +YP D C R S KI ++T E + +A
Sbjct: 178 MDKALEYIEKGGIMSEKDYPYEGVDDNC-RFDISKVAAKISNFTY-IKKNDEEDLKNAVA 235
Query: 268 THGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
GP+ A++A T+Q Y+ G++ C ++NH V +VGY
Sbjct: 236 AKGPISVAIDASATFQLYVSGILDDTECSNEFDSLNHGVLVVGY 279
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 141/306 (46%), Gaps = 22/306 (7%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
VL ++AL L+IP+K E L L+ ++ + S + RF F++++
Sbjct: 7 VLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVSRDLDQKQKRFNVFKENVK 66
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I E NKN+ + + + +F D++ +EF+ ++ V+ H M +H
Sbjct: 67 FIHEFNKNKDV--TFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMY 124
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ P DWRE G + V+NQ CG+CWAFS + E ++ + L L
Sbjct: 125 ENAVA--------PPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPL 176
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
S QE+IDC + N GCSGG +++ N + E YP +DA CK+ SP V
Sbjct: 177 SEQELIDCDTDQNQGCSGGLMDYAFEFIKNNGGIT-TEDVYPYQAEDATCKK--NSP-AV 232
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
I Y D E +++ +A PV A+ A +Q+Y GV C L +H
Sbjct: 233 VIDGYE-DVPTNDEDALMKAVANQ-PVAVAIEASGYVFQFYSEGVFTGRCGTEL---DHG 287
Query: 304 VQIVGY 309
V +VGY
Sbjct: 288 VAVVGY 293
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 143/283 (50%), Gaps = 32/283 (11%)
Query: 34 LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF + +++K+Y+ E + RF+ F+ +L I+++N+ S G+ EF+DL+
Sbjct: 147 IELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTS---YWLGLNEFADLTH 203
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L + S K ++ +P DWR G + +
Sbjct: 204 EEFKATYLGLAPPAPARESR--------GSFKYEDVSA-----DDLPKSVDWRTKGAVTE 250
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFSTV E ++A+ G L+ LS QE+IDC+ +GN GC+GG L+D+
Sbjct: 251 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGG----LMDY 306
Query: 213 M---DVNKVVLEPESEYPLLLKDAAC-KRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ L E YP L+++ +C K + V I Y D +E +++ +A
Sbjct: 307 AFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYE-DVPAHNEQALIKALA- 364
Query: 269 HGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A +Q+Y GGV C L +H V VGY
Sbjct: 365 HQPVSVAIEASGRHFQFYSGGVFDGPCGTQL---DHGVAAVGY 404
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 152/313 (48%), Gaps = 35/313 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
++ L L + V LE++ + F FQ ++ K YS E+ RF+ F+ +L IEE
Sbjct: 3 VILLFVLAVFTVFVSSRGIPLEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
Query: 70 LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
LN N ++ ++G+ +F+DLS +EFK +L NK + + +++
Sbjct: 62 LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + IP DWR G + V+NQ CG+CW+FST E H + L LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166
Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
Q ++DC + GC+GG +++ N + + ES YP +
Sbjct: 167 EQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
++ G KI ++ T+IP +++ I + GP+ A +A+ WQ+Y+GGV C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282
Query: 297 LANINHAVQIVGY 309
+++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 99/308 (32%), Positives = 147/308 (47%), Gaps = 45/308 (14%)
Query: 11 VALIAL--CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDII 67
VAL+AL C A+P F+ ++ + + Y S E +R + + +L++I
Sbjct: 7 VALLALVACATAMP--------------FAEWKALHNRQYASAQEEALRQEIYLSNLELI 52
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E N S G+ EF DL+ EF ++L N N K +
Sbjct: 53 NE--HNAAGRHSYTLGMNEFGDLAHHEFAAKYLGVRFNGV-------------NATKSFA 97
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+T + +P DWR AGI+ V+NQ CG+CW+FST + E HA K GTL LS
Sbjct: 98 SSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSE 157
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC+GG +++ N + + E+ YP CK A + G
Sbjct: 158 QNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGI-DTEASYPYTATTGTCKFNAANI-GAT 215
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLANIN 301
+ SY D + SES + +AT GPV A++A + +Q+Y GV YN C S ++
Sbjct: 216 VASYQ-DIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGV--YNEKKC--STTQLD 270
Query: 302 HAVQIVGY 309
H V VGY
Sbjct: 271 HGVLAVGY 278
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 149/307 (48%), Gaps = 41/307 (13%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+V ++ F V+V + EL+ F++ Y K+Y+ + RF F+ +L ++
Sbjct: 9 LVVVVGCAFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
Q +A+YG+T+FSDL+ EEF +L +++ V + V+ +
Sbjct: 65 YQTQEQG--TAKYGVTQFSDLTNEEFAAMYLGSRIDERV------------DRVQLNDLQ 110
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
T P DWRE G +G V +Q +CG+CWAFS E LK G L LS Q+
Sbjct: 111 TA-------PASVDWREKGAVGPVEHQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQ 163
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
++DC + GCSGG + ++ ++ LE +S YP + AC+ + K+
Sbjct: 164 LVDC-DRLDHGCSGG--YPPYTYKEIKRMGGLELQSAYPYTGWEQACRLDRS-----KLF 215
Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHA 303
+ D+++ +E +A HGP+ +NA Q+Y G++ +Y C S +NHA
Sbjct: 216 AKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYAC--SPEGLNHA 273
Query: 304 VQIVGYD 310
V VGYD
Sbjct: 274 VLTVGYD 280
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 145/316 (45%), Gaps = 38/316 (12%)
Query: 6 NVLFIVALIAL-CFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFE 61
N L+ ++L L C ++V+ L+ E + +Y K Y E + RFK F+
Sbjct: 5 NQLYHISLALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFK 64
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++++ IE N N +S + GI +F+DL+ EEF R+ H+ S + +
Sbjct: 65 ENVNYIETFN-NADDTKSYKLGINQFADLTNEEFIAS--RNKFKGHMCSSIMRTTSFKYE 121
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+V +GIP DWR+ G + V+NQ CG CWAFS V E +H L G
Sbjct: 122 NV------------SGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGK 169
Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAAC 235
L LS QE++DC G + GC GG L+D D K + L E++YP D C
Sbjct: 170 LISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLSTEAQYPYEGVDGTC 223
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
S V I Y D SE ++ +A P+ A++A +Q+Y GV C
Sbjct: 224 NANKASVQAVTITGYE-DVPANSEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGAC 281
Query: 294 DGSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 282 GTEL---DHGVTAVGY 294
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 144/286 (50%), Gaps = 32/286 (11%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F+ +Y KSY ++ EHD R F+ +L +++ SA +G+T+FSDL+ +EF
Sbjct: 47 FTLFKSKYGKSYATQEEHDYRLSVFKANL---RRAKRHQMLDPSAVHGVTKFSDLTPKEF 103
Query: 96 KTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGK 152
+ +L R S + + D H + +PT +P +WR+ G +
Sbjct: 104 RRTYLGIRKSSSSKQKLKLKLPADAHAAEI----------LPTSDLPFDFEWRDYGAVTG 153
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
V++Q CG+CW+FST T E + L G L L+ QE++DC AG + GC+GG
Sbjct: 154 VKDQGLCGSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGG 213
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
+++ + LE E +YP +D CK S + +++ +L E I
Sbjct: 214 LMTTAYEYV-LQSGGLEKEKDYPYTGRDGTCKFD-KSKIAAAVANFSVVSL--DEDQIAA 269
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ HGP+ +N++ Q Y+GGV Y C S N++H V IVGY
Sbjct: 270 NLVKHGPLSVGINSIFMQTYIGGVSCPYIC--SKKNLDHGVLIVGY 313
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 147/295 (49%), Gaps = 37/295 (12%)
Query: 26 SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
++P L +S F++R+KKSY S+ EHD RFK F+ +L +++ SA +G+
Sbjct: 47 AEPQLLTAEHHYSLFKKRFKKSYGSQKEHDYRFKIFQVNL---RRAARHQNLDPSATHGV 103
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
T+FSDL+ EF+ +L + + L + T +PT +P D
Sbjct: 104 TQFSDLTPGEFRKAYL--GLRRLRL---------------PKDATEAPILPTDNLPQDFD 146
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
WRE G + V+NQ +CG+CW+FST E + L G L LS Q+++DC AG
Sbjct: 147 WREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAG 206
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ + GC+GG + ++ + L E +YP D + + K+ +++ +L
Sbjct: 207 SCDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGTDRGTCKFDNTKVAAKVANFSVVSL 265
Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 266 --DEDQIAANLFKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 315
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 144/300 (48%), Gaps = 41/300 (13%)
Query: 24 KVSKPNLEQKLE-------LFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQ 75
KV P+ Q LE F F +Y K YS E D R + F ++L E+L Q
Sbjct: 157 KVEDPSTSQPLEESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQ 216
Query: 76 SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
SA YG+T+FSDL+EEEF++ +L +++ L H +K + G +
Sbjct: 217 G--SAEYGVTKFSDLTEEEFRSTYLNPLLSQWTL----------HQPMKPATPAKGPS-- 262
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
P DWR+ G + V+NQ CG+CWAFS + E LKNGTL LS QE++DC G
Sbjct: 263 ---PDSWDWRDHGAVSPVKNQGMCGSCWAFSVIGNIEGQWFLKNGTLLSLSEQELVDCDG 319
Query: 196 NGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
+ C GG + + K+ LE ES+Y C K+ +Y +
Sbjct: 320 L-DQACRGGLPSNAYE--AIEKLGGLETESDYSYTGHKQRCDFTTG-----KVAAYINSS 371
Query: 255 --LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
L E I +A +GPV A+NA Q+Y G+ ++ C+ + I+HAV +VGY
Sbjct: 372 VELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPLKIFCNPWM--IDHAVLLVGY 429
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 159/318 (50%), Gaps = 32/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR- 237
L+ LS Q ++ C N + GC GG W+ NK + E YP
Sbjct: 168 HELTSLSEQMLVSCDTN-DFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPTC 226
Query: 238 -KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
K+ G KI+ + L E++I +A +GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 DKSGKVVGAKIRDHV--DLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCISE 283
Query: 297 LANINHAVQIVGYDNYSR 314
+++H V +VGYD+ S+
Sbjct: 284 --HLDHGVLLVGYDDTSK 299
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 142/286 (49%), Gaps = 42/286 (14%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F +RY K YS EH+ RF F+ +L + L + P A +G+T+FSDL++EEF
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEEF 113
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ ++L + D H + +PT +P DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CWAFST E + LK G L LS Q+++DC A + + GC+GG
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
+ + + LE E +YP KD C S N KI ++ + + S E I
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ +GP+ +NA Q Y+GGV Y C S N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 142/286 (49%), Gaps = 42/286 (14%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F +RY K YS EH+ RF F+ +L + L + P A +G+T+FSDL++EEF
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEEF 113
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ ++L + D H + +PT +P DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CWAFST E + LK G L LS Q+++DC A + + GC+GG
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
+ + + LE E +YP KD C S N KI ++ + + S E I
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ +GP+ +NA Q Y+GGV Y C S N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS+V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGECGSCWAFSSVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I S+ + SE ++ +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSELVVGAQIDSHVL--IGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--EVNHAVLLVGYD 296
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 140/285 (49%), Gaps = 39/285 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F++++KKSY S+ EHD RF F+ +L ++++ +A +G+T+FSDL+ EF
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHGVTQFSDLTSAEF 109
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + K L + T +PT +P DWRE G +G V+
Sbjct: 110 RKQVL--GLRKLRL---------------PKDANTAPILPTNDLPEDFDWREKGAVGPVK 152
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC G+ + GC+GG
Sbjct: 153 NQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+ ++ + L E +YP D ACK N V + E I +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGMDRGACK---FDKNKVAAGVANFSAVSLDEDQIAAN 268
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 144/288 (50%), Gaps = 35/288 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF F+ +L I+E NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNK---VVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
+DLS +EFK ++L V+ + ++R T +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDFELPKSVDWRK 139
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + +V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 140 KGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 197
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ P+ A+ A +Q+Y GGV +C ++++H V VGY
Sbjct: 255 KALVNQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 140/285 (49%), Gaps = 39/285 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F++++KKSY S+ EHD RF F+ +L ++++ +A +G+T+FSDL+ EF
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHGVTQFSDLTSAEF 109
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + K L + T +PT +P DWRE G +G V+
Sbjct: 110 RKQVL--GLRKLRL---------------PKDANTAPILPTNDLPEDFDWREKGAVGPVK 152
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC G+ + GC+GG
Sbjct: 153 NQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+ ++ + L E +YP D ACK N V + E I +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGMDRGACK---FDKNKVAAGVANFSVVSLDEDQIAAN 268
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 139/303 (45%), Gaps = 50/303 (16%)
Query: 27 KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
+PNL E K LF S Y K+YS E I R F K ++++ P SA
Sbjct: 39 RPNLLGTHTESKFRLFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-SA 92
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
+G+T+FSDL+EEEFK + + + R T G P
Sbjct: 93 VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGGTVGAEAPMVEVD 135
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--- 193
G+P DWRE G + +V+NQ CG+CWAFST AE H + G L LS Q+++DC
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195
Query: 194 -----AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
+ GC GG +++ + LE E YP K CK P V ++
Sbjct: 196 CDPKDKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAVR 251
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQI 306
T+ E+ I ++ HGP+ +NA+ Q Y+GGV +C S N+NH V +
Sbjct: 252 VLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV---SCPLICSKRNVNHGVLL 308
Query: 307 VGY 309
VGY
Sbjct: 309 VGY 311
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 145/284 (51%), Gaps = 36/284 (12%)
Query: 34 LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
++LF S+ +++K Y E RF+ F+ +L I+E NK + Y G+ EF+DL
Sbjct: 30 IDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNK-----KVVNYWLGLNEFADL 84
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
S EEFK ++L +V+ +S+ + + SI P DWR+ G +
Sbjct: 85 SHEEFKNKYLGLNVD----LSNRRECSEEFTYKDVSSI----------PKSVDWRKKGAV 130
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE++DC N GC+GG L+
Sbjct: 131 TDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGG----LM 186
Query: 211 DWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ ++ L E +YP ++++ C+ + V I Y D SE S+L +A
Sbjct: 187 DYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYH-DVPQNSEESLLKALA 245
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A++A +Q+Y GGV +C L +H V VGY
Sbjct: 246 NQ-PLSVAIDASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 285
>gi|39930363|ref|NP_058817.1| cathepsin J precursor [Rattus norvegicus]
gi|84028185|sp|Q63088.2|CATJ_RAT RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|28196048|gb|AAL26793.2| cathepsin P [Rattus norvegicus]
gi|66910531|gb|AAH97263.1| Cathepsin J [Rattus norvegicus]
gi|149039736|gb|EDL93852.1| cathepsin J [Rattus norvegicus]
Length = 334
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 156/305 (51%), Gaps = 32/305 (10%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L+ LCF +A PNL+ + + ++ +Y KSYS E +++ +E++L +I+
Sbjct: 5 VFLVILCFGVASGAPARDPNLDAEWQ---DWKTKYAKSYSPVEEELKRAVWEENLKMIQL 61
Query: 70 LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK N + F+D + EEF R S++ ++ + V S
Sbjct: 62 HNKENGLGKNGFTMEMNAFADTTGEEF-----RKSLSDILIPAA----------VTNPSA 106
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
++I G+P KDWR+ G + VRNQ CG+CWAF+ V E K G L+ LSVQ
Sbjct: 107 QKQVSI--GLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC G +++ NK LE E+ YP KD C+ + + + I
Sbjct: 165 NLLDCSKSEGNNGCRWGTAHQAFNYVLKNK-GLEAEATYPYEGKDGPCRYHSENAS-ANI 222
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
+ L P+E + +A+ GPV AA++A ++++Y GGV + NC + +NHAV
Sbjct: 223 TGFV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYV--VNHAV 278
Query: 305 QIVGY 309
+VGY
Sbjct: 279 LVVGY 283
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 143/288 (49%), Gaps = 35/288 (12%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ R+ K Y E + RF+ F+ +L I+E NK + G+ EF
Sbjct: 40 SMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
+DLS EF ++L V+ + ++R T +P DWR+
Sbjct: 97 ADLSHREFNNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 141 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 199 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETQVVTISGYH-DVPQNNEQSLL 255
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C ++++H V VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q +CG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + + P G K+ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 144/287 (50%), Gaps = 32/287 (11%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFS 88
++ + ++ + Q++ K+Y++ E RF+ F+ +L I+E N +NR + + G+T+F+
Sbjct: 22 DEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNR----TYKVGLTKFA 77
Query: 89 DLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
DL+ +E++ L S K LM N ++ + G +P + DWR
Sbjct: 78 DLTNQEYRAMFLGTRSDPKRRLMKSK-------NPSERYAYKAGDKLPESV----DWRGK 126
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +++Q +CG+CWAFSTV E ++ + G L LS QE++DC N GC+GG
Sbjct: 127 GAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGG--- 183
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ +N L+ E +YP L D C R V I + + ++P + L
Sbjct: 184 -LMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGF--EDVLPFDEKALQ 240
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A + Q+Y GV C +L +H V +VGY
Sbjct: 241 KAVAHQPVSVAIEASGMALQFYQSGVFTGECGTAL---DHGVVVVGY 284
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 143/288 (49%), Gaps = 29/288 (10%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+EL+ + ++KK+Y+ E RF F+ + I + N Q S + G+ +F+DLS
Sbjct: 41 MELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQ--HNNQGNPSYKLGLNQFADLSH 98
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L ++ +S+ + + + G +P I DWRE G +
Sbjct: 99 EEFKATYLGAKLDTKKRLSNSPSPRYQY--------SDGEDLPESI----DWREKGAVTA 146
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q +CG+CWAFSTV E ++ + G L+ LS QE++DC + N GC+GG L+D+
Sbjct: 147 VKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGG----LMDY 202
Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+N L+ E +YP D +C + + V I Y + + ++ L A +
Sbjct: 203 AFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 260
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
P+ A+ A +Q+Y GV C L +H V +VGY + S T
Sbjct: 261 QPISVAIEASGRAFQFYESGVFTSTCGTQL---DHGVTLVGYGSESGT 305
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 138/282 (48%), Gaps = 29/282 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYWRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
BQ CG+CWAFS V ES A+ L LS Q+++ C + + GC GG +W+
Sbjct: 143 BQGACGSCWAFSAVGNIESQWAVAXHGLVRLSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
++N + E YP + C + G +I Y +I S +++ +A
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGDVPECTNSSELVPGARIDGY---VMIESXETVMAAWLAK 257
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A + Y GV+ +C G +NH V +VGY+
Sbjct: 258 SGPISIAVDASPFMSYESGVLT-SCVGK--XLNHGVLLVGYN 296
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 148/283 (52%), Gaps = 35/283 (12%)
Query: 35 ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+LF++F Y ++YS E ++R + F ++L II+ L K + +A Y + F+D+S E
Sbjct: 580 QLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERG--TAHYDVNMFADMSPE 637
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGK 152
EF++R+L + N + R IP +P K DWRE ++
Sbjct: 638 EFRSRYL-----------GLRPDLRSENDIPLREAE----IPDVELPPKFDWREKSVVTP 682
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD- 211
V++Q CG+CWAFS E +A+K+G L LS QE++DC + + GC+GG L D
Sbjct: 683 VKDQGMCGSCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDC-DDLDEGCNGG----LPDN 737
Query: 212 -WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+ + K+ LE ES+YP ++ C K N K++ + + +E+ + + +
Sbjct: 738 AYRAIEKLGGLELESDYPYEAENEKCHFKK---NLAKVQLASAVNITSNETQMAQWLVQN 794
Query: 270 GPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
GP+ +NA Q+Y+GGV ++ C+ N++H V IVGY
Sbjct: 795 GPISIGINANAMQFYVGGVSHPFKFLCNPK--NLDHGVLIVGY 835
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 145/291 (49%), Gaps = 26/291 (8%)
Query: 30 LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
L Q+ LFS F + Y K+Y K EH+ RF F+ +L I N+ + +A YG+TEFS
Sbjct: 159 LSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEG--TAHYGLTEFS 216
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EF+ RH + ++ HK + + I G + +P DWR G
Sbjct: 217 DLSPSEFE----RHYLGLKKDLAEHK--------AEVKPIKVG-PVNEPLPDLFDWRTKG 263
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ +V+NQ CG+CWAFS E L L LS QE++DC +G+ GC GG
Sbjct: 264 AVTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDC-DHGDHGCKGGYMGQ 322
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ + + LE ESEYP D C+ T +++S+ L +E+ + +
Sbjct: 323 AMKAV-IEMGGLETESEYPYKGVDGTCEFNKTESK-ARVQSFV--GLPQNETELAYWLMK 378
Query: 269 HGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGYDNYSRTW 316
HGPV +NA Q+Y GG+ ++ C S +++H V +VG+ R++
Sbjct: 379 HGPVSIGINANAMQFYFGGISHPWKFLC--SPTDLDHGVLLVGFGVDKRSF 427
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 140/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS Q+++ C + + GC +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCRARLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSIQLVPGARIDGYM--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQRGVVT-SCAG--MPLNHGVLLVGYN 296
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V E L L LS Q+++ C N GCSGG DW+
Sbjct: 143 NQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDN-GCSGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|414887429|tpg|DAA63443.1| TPA: hypothetical protein ZEAMMB73_816727 [Zea mays]
Length = 334
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 141/300 (47%), Gaps = 25/300 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+ A + L +A + ++E L + F +Q Y +SY + +E RF+ + ++++
Sbjct: 31 LLCACLMLVLMAGAASGGRVDVEDMLMMDRFRGWQATYNRSYLTAAERLRRFEVYRQNME 90
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK- 124
+IE NR++ S + G T F+DL+ EEF H S H + +H + H
Sbjct: 91 LIEA--TNRRAGLSYQLGETPFTDLTSEEFLATHT-MSTRLHASEAARRHRELITTHAGP 147
Query: 125 --------KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
R+ TT + +P + DWR G + V++Q CG+CW+F TV E +H
Sbjct: 148 VSDGGRQWNRNYTTDLDVPESV----DWRTKGAVTPVKDQGACGSCWSFVTVAAIEGLHK 203
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
++ G L LS Q V+DC+ N GC+ GD A +DW+ N L ES+YP + + CK
Sbjct: 204 IRTGQLVSLSEQAVLDCSSPPNHGCNRGDPAAAIDWVSANG-GLTTESDYPYVGRQGKCK 262
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNCD 294
+ KIK L+ + ++A PV +N Q+Y GV CD
Sbjct: 263 LDKARNHVAKIKGR---KLVDQNNEAALEVAVAQQPVAVDMNVDPILQHYKSGVFHGPCD 319
>gi|45822205|emb|CAE47499.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 317
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 142/286 (49%), Gaps = 38/286 (13%)
Query: 35 ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFSDLSE 92
+ ++ F+ + K Y E +RF+ F ++L IE+ N Q+ E + Y G+ +F+D++
Sbjct: 14 QQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTS 73
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT------GITIPTGIPVKKDWRE 146
EEFK D H KR IT+ +T+P I DWRE
Sbjct: 74 EEFKAML-----------------DSQLIHKPKRDITSRFVADPQLTVPESI----DWRE 112
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGD 205
G + VR+Q+ CG+CWAFS E LK G L +LS Q+++DC+ + N GC+GG
Sbjct: 113 KGAVNPVRDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGW 172
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
D++ N + LE + +Y CK P KI Y+ ++ +E ++
Sbjct: 173 PHWAYDYIKDNGLCLESKYKYQ-GYDGYYCKE--CIPAIKKINGYS--SINQTEEALKEA 227
Query: 266 IATHGPVIAAVNA-LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+ T GP+ VNA WQ Y GG+++ +C G +INHAV VGY
Sbjct: 228 VGTAGPIAVCVNANDDWQLYSGGILESQSCPGG-ESINHAVLAVGY 272
>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
Length = 353
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/283 (33%), Positives = 137/283 (48%), Gaps = 30/283 (10%)
Query: 33 KLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
K+ F F R+K+ Y S E RF F ++L++IEE N+ ++ P + + +F+D+S
Sbjct: 47 KVARFHEFATRHKRVYGSLVELRERFVTFSRNLELIEETNR-KELPYTL--AVNQFADMS 103
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EEFK KH L S N V+ P P KKDWR+ I+
Sbjct: 104 WEEFK---------KHNLFSSQNCSATTTNSVR------AFLTP---PSKKDWRDDKIVS 145
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
V+NQQ CG+CW FST ES HA G + +LS Q+++DCAG N GC+GG
Sbjct: 146 PVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCNGGLPSQAF 205
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATH 269
+++ N L+ E YP D C + N + K Y + +E ++ +A +
Sbjct: 206 EYIRYNG-GLDTEDSYPYTGHDGKC---TYNQNSIGAKVYDVVNITEGAEDELIHAVAFN 261
Query: 270 GPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGYD 310
PV A L +++Y GV N C +NHAV VGY+
Sbjct: 262 RPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYN 304
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 33/281 (11%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF +F Y ++Y ++ E ++R F ++L II L KN Q + +YG+ +F+D+S EE
Sbjct: 726 LFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQG--TGQYGVNQFADVSTEE 783
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F +L + +N +++ I +P DWR+ G + V+
Sbjct: 784 FHAFYLGLRPDLRT----------ENNIPLRQAEIPDIELPNSF----DWRQKGAVTPVK 829
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--W 212
NQ CG+CWAFS E +A+K+ L LS QE++DC + + GC+GG L D +
Sbjct: 830 NQGMCGSCWAFSVTGNVEGQYAIKHNKLLSLSEQELVDC-DDLDEGCNGG----LPDNAY 884
Query: 213 MDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
+ K+ LE ES+YP ++ C K N K++ + + +E+ I + +GP
Sbjct: 885 RAIEKLGGLELESDYPYEAENERCHFKK---NMAKVQVGSAVNITSNETQIAQWLVANGP 941
Query: 272 VIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ +NA Q+Y+GGV ++ C+ N++H V IVGY
Sbjct: 942 ISIGINANAMQFYMGGVSHPFKFLCNPK--NLDHGVLIVGY 980
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GCSGG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCSGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 447
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 139/284 (48%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 25 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 81
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 82 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 126
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q +CG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 127 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 185
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + +D + + P G ++ + T IP E +I +A +G
Sbjct: 186 VKENSGKVYTEKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 243
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 244 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 284
>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
Length = 357
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 128/277 (46%), Gaps = 28/277 (10%)
Query: 37 FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y + RF F K++++IE N A I EF+D++ EEF
Sbjct: 58 FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLA---INEFADITWEEF 114
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
++L S N S+HK D P KKDWRE GI+ V+N
Sbjct: 115 HGQYLGASQNCSATKSNHKFTDAQP------------------PTKKDWREEGIVSPVKN 156
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G +LS Q+++DCAG N GCSGG +++
Sbjct: 157 QAHCGSCWTFSTTGALEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIK 216
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N L+ E YP KD C + GVK+ + + + +E + + + PV
Sbjct: 217 YNG-GLDTEEAYPYTAKDGVCNYDVNNV-GVKVAD-SVNISLGAEDKLKSAVGLVRPVSV 273
Query: 275 AVNALT-WQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
A + +++Y GV C ++NHAV VGY
Sbjct: 274 AFQVIQDFRFYKEGVFTSTTCGQGPMDVNHAVLAVGY 310
>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
Length = 357
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 128/277 (46%), Gaps = 28/277 (10%)
Query: 37 FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y + RF F K++++IE N A I EF+D++ EEF
Sbjct: 58 FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLA---INEFADITWEEF 114
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
++L S N S+HK D P KKDWRE GI+ V+N
Sbjct: 115 HGQYLGASQNCSATKSNHKFTDAQP------------------PTKKDWREEGIVSPVKN 156
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G +LS Q+++DCAG N GCSGG +++
Sbjct: 157 QAHCGSCWTFSTTGALEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIK 216
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N L+ E YP KD C + GVK+ + + + +E + + + PV
Sbjct: 217 YNG-GLDTEEAYPYTAKDGVCNYDVNNV-GVKVAD-SVNISLGAEDELKSAVGLVRPVSV 273
Query: 275 AVNALT-WQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
A + +++Y GV C ++NHAV VGY
Sbjct: 274 AFQVIQDFRFYKEGVFTSTTCGQGPMDVNHAVLAVGY 310
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 151/329 (45%), Gaps = 46/329 (13%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKV--SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFK 58
+F LF++ A C + P E+ + ++ + YK SY K + +++
Sbjct: 6 LFHCTLALFLI--FAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQ---KYQ 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +++ IE N P + GI F+DL+ EEFK ++N+
Sbjct: 61 IFMENVQRIEAFNNAGXKP--YKLGINHFADLTNEEFK------AINRF---------KG 103
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
H + R+ T T +P DWR+ G + +++Q CG CWAFS V E + L+
Sbjct: 104 HVCSKRTRTTTFRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLR 163
Query: 179 NGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKD 232
G L LS QE++DC G + GC GG L+D D K +L+ E+ YP D
Sbjct: 164 TGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFILQNKGLATEAIYPYEGFD 217
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ 290
C KA + IK Y D SES++L +A PV A+ A +Q+Y GGV
Sbjct: 218 GTCNAKADGNHAGSIKGYE-DVPANSESALLKAVANQ-PVSVAIEASGFKFQFYSGGVFT 275
Query: 291 YNCDGSLANINHAVQIVGY---DNYSRTW 316
+C N++H V VGY D+ ++ W
Sbjct: 276 GSCG---TNLDHGVTSVGYGVGDDGTKYW 301
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GCSGG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCSGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 149/312 (47%), Gaps = 42/312 (13%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDI 66
L +V + +A P+ ++ K LF +F+ ++ K Y +E + R F F +++D
Sbjct: 5 LVLVCALVGAAMAEPLSLTV----NKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDF 60
Query: 67 IEELNKNRQSPESAR------YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
I NR + E+AR + +F+DL+ EE++ +LR + L+ +
Sbjct: 61 I-----NRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYLRPYPTE--LLGRERQE---- 109
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ P V DWR+ G + ++NQ CG+CW+FST + E HA+ G
Sbjct: 110 ---------VWLDGPNAGSV--DWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATG 158
Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q+++DC+G+ GN GC+GG ++ ++ L+ E +YP +D C +
Sbjct: 159 NLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYI-ISNGGLDTEQDYPYTARDGVCDKSK 217
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
S + V I Y D +E + + GPV A+ A ++Q Y GV C
Sbjct: 218 ESKHAVSISGYK-DVPQNNEDQLAAAV-EKGPVSVAIEADQQSFQMYSSGVFSGPCG--- 272
Query: 298 ANINHAVQIVGY 309
N++H V +VGY
Sbjct: 273 TNLDHGVLVVGY 284
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 143/287 (49%), Gaps = 36/287 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F+ +Y KSY ++ EHD R F+ +L +++ SA +G+T+FSDL+ +EF
Sbjct: 47 FTLFKSKYGKSYATQEEHDYRLSVFKANL---RRAKRHQLLDPSAVHGVTKFSDLTPKEF 103
Query: 96 KTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIG 151
+ L + S K L D H + +PT +P DWR+ G +
Sbjct: 104 RRTFLGIRKSSSGKRKL---KLPADAHAAEI----------LPTSDLPSDFDWRDYGAVT 150
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSG 203
V++Q +CG+CW+FST E + L G L LS Q+++DC AG + GC+G
Sbjct: 151 GVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNG 210
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G +++ + LE E +YP KD CK S + +++ +L E I
Sbjct: 211 GLMTTAYEYV-LQSGGLEKEKDYPYTGKDGTCKFD-KSKIAAAVANFSVVSL--DEDQIA 266
Query: 264 TDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ HGP+ +NA+ Q Y+GGV Y C S N++H V +VGY
Sbjct: 267 ANLVKHGPLSVGINAVFMQTYIGGVSCPYIC--SKRNLDHGVLLVGY 311
>gi|118365750|ref|XP_001016095.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297862|gb|EAR95850.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 335
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 157/311 (50%), Gaps = 28/311 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LC LA + V +KL ++ + ++++ Y +EH+ F+ F ++L
Sbjct: 6 LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS-VNKHVLMSHHKHHDHHHNHV 123
I+E N + + S + +FSD+++EEF + L S + H++ + H+ +
Sbjct: 58 QKIQEHNSDSNNTYSVH--LNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNK 115
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + + G+++ I DWR G + V+NQ CG+CW+FS ES + +KN L
Sbjct: 116 ETQLNSKGLSLADSI----DWRTKGAVTSVKNQGNCGSCWSFSAAAVMESFNFIKNKALV 171
Query: 184 LLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
S Q+++DC G + GCSGG + LD+ +KV + +YP + C
Sbjct: 172 DFSEQQLVDCVIPANGYNSYGCSGGWPASCLDY--ASKVGITTLDKYPYVAVQKNCNVTG 229
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
T+ NG K S+ IP+ S+ L PV V+A TW Y G+ CD S +
Sbjct: 230 TN-NGFKPISW---IQIPNTSNDLKSALNFSPVSVVVDASTWGSYYSGIFN-GCDQSHIS 284
Query: 300 INHAVQIVGYD 310
+NHAV VGYD
Sbjct: 285 LNHAVLAVGYD 295
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)
Query: 14 IALCFL-AIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
I++ FL A PV +LE F FQ+++ KSY E ++ R F +L+ IEE+N
Sbjct: 4 ISVVFLLAFPV-CKAVDLEAAGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVN 62
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
S + G+ E++DL+ EEF L S V TT
Sbjct: 63 AQNL---SYKLGVNEYTDLTLEEFAALKLS---------STDMSEGMGDGFVAGAGPTT- 109
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
T+PT + DWR+ G++ V++Q CG+CWAFS + E +A+ G L LS Q+++
Sbjct: 110 TTLPTSV----DWRKKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLV 165
Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS-PNGVKIKS 249
DCAG GN GC+GG +++ V + ES YP + D C+ + +G+ +
Sbjct: 166 DCAGAYGNEGCNGGLMDKAFEYIKATGV--DKESTYPYVGSDETCQATVENKTDGLPVGE 223
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVI-QYNCDGSLANINHAVQI 306
T + ++ L + PV A+ N ++Q+Y GV NC+ +I+H V
Sbjct: 224 VTGNQMLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVA 283
Query: 307 VGY 309
VGY
Sbjct: 284 VGY 286
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 137/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS+V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGECGSCWAFSSVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QVNHAVLLVGYD 296
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 131/273 (47%), Gaps = 34/273 (12%)
Query: 44 YKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLR 101
YK + K+ R + F+ ++ IE N ++ RY G+ +F+DL+ EEFK
Sbjct: 55 YKDAAEKAR---RLEVFKANVAFIESFNAGGKN----RYWLGVNQFADLTSEEFK----- 102
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGKVRNQQ 157
M++ K +N V+ ++TG +P DWR G + ++++Q
Sbjct: 103 ------ATMTNSKGFSTPNNGVR---VSTGFKYENVSADALPASVDWRTKGAVTRIKDQG 153
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVN 216
CG CWAFS V E + L G L LS QE++DC +GN GC GG+ ++ N
Sbjct: 154 QCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSN 213
Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
L E+ YP +D CK A + I+ Y D E S++ +A PV AV
Sbjct: 214 G-GLTAEANYPYTAEDGRCKTTAAADVAASIRGYE-DVPANDEPSLMKAVAGQ-PVSVAV 270
Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A +Q+Y GGV+ C SL +H V ++GY
Sbjct: 271 DASKFQFYGGGVMAGECGTSL---DHGVTVIGY 300
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 156/318 (49%), Gaps = 32/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + +T+ TG P DWR+ G + VR+++ C + WAFS + E +
Sbjct: 111 ALKRPRKV---VTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR- 237
L+ LS Q ++ C + GC GG W+ NK + E YP D R
Sbjct: 168 HELTSLSEQMLLSCDTRED-GCGGGLMDRAFQWIVSSNKGNVFTEQSYPYASTDGDVPRC 226
Query: 238 -KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
K+ G KI Y L E++I +A +GPV AV A + Q Y GGV+ +C
Sbjct: 227 NKSGKVVGAKISDYV--DLPQDENAIAEWLAKNGPVAIAVEATSLQRYTGGVLT-SCISE 283
Query: 297 LANINHAVQIVGYDNYSR 314
++H V +VGYD+ S+
Sbjct: 284 --QLDHGVLLVGYDDTSK 299
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/314 (32%), Positives = 152/314 (48%), Gaps = 41/314 (13%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
IA C L V VS LE+ F +F+ ++ K+Y ++ E RF F+ +L IE+ N
Sbjct: 4 FIAACLL---VAVSATVLEETGVKFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHN 60
Query: 72 K-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
Q S + GI F+D+++EEF+ L S K H + HV T
Sbjct: 61 VLYEQGLVSYKKGINRFTDMTQEEFRAFL--------TLSSSKKPHFNTTEHV-----LT 107
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G+ +P I DWR G + V++Q CG+CWAFS + E+ + K G L LS Q++
Sbjct: 108 GLAVPDSI----DWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQL 163
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA----TSPNGVK 246
+DC+ + N GC+GG + V LE ES YP D +CK A T +G K
Sbjct: 164 VDCSTDINAGCNGGYLDETFTY--VKSKGLEAESTYPYKGTDGSCKYSASKVVTKVSGHK 221
Query: 247 -IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAV 304
+KS E+++L + GPV A++A Y G+ + + C S + +NH V
Sbjct: 222 SLKS-------EDENALLDAVGNVGPVSVAIDATYLSSYESGIYEDDWC--SPSELNHGV 272
Query: 305 QIVGY--DNYSRTW 316
+VGY N + W
Sbjct: 273 LVVGYGTSNGKKYW 286
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/323 (30%), Positives = 158/323 (48%), Gaps = 36/323 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFK 58
+ IV+LI+ L+I + S+P + +L + Q+R+ + +K E + R+
Sbjct: 8 IFLIVSLISSFCLSITL--SRPLDDNELIM----QKRHDEWMAKHGRVYADMKEKNNRYV 61
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F+++++ IE LN N + + + + +F+DL+ +EF++ + + VL S
Sbjct: 62 VFKRNVERIERLN-NVPAGRTFKLAVNQFADLTNDEFRSMYTGYK-GGSVLSSQSGTKTS 119
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ +++++G +PV DWR+ G + ++NQ TCG CWAFS V E +K
Sbjct: 120 SFRY---QNVSSG-----ALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIK 171
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS Q+++DC N + GCSGG + + + L ES YP KDA CK K
Sbjct: 172 KGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHI-MATGGLTTESNYPYKGKDATCKIK 229
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
T P I Y D + E +++ +A H PV + +Q+Y GV C
Sbjct: 230 NTKPTATSITGYE-DVPVNDEKALMKAVA-HQPVSIGIEGGGFDFQFYGSGVFTGECTTY 287
Query: 297 LANINHAVQIVGY---DNYSRTW 316
L +HAV VGY N S+ W
Sbjct: 288 L---DHAVTAVGYGQSSNGSKYW 307
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 150/314 (47%), Gaps = 34/314 (10%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE +NK S + GI EF+D++ EEF T+ ++ ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFK 120
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
N + + P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 121 INDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 170
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L S QE++DC N N GC+GG D++ N + ES+Y + C+ +
Sbjct: 171 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SSESDYEYQGQQYTCRSQE 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
+ V+I SY ++P + L T PV IAA L Q+Y GG DGS
Sbjct: 229 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 278
Query: 297 LAN-INHAVQIVGY 309
A+ INHAV +GY
Sbjct: 279 CADRINHAVTAIGY 292
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 150/308 (48%), Gaps = 38/308 (12%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
F+ ++ ++ + + + L+ F+ +YKK+YS + ++RF+ F+ +L+ +
Sbjct: 4 FVCCVLVTTIWSVFARTTPFEPDDARALYEEFKLKYKKTYSNDDDELRFRIFKDNLERAK 63
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
L Q +A YG+T+FSDL+ EEFKTR+LR ++ ++ + + + +
Sbjct: 64 RLQAMEQG--TAEYGVTQFSDLTSEEFKTRYLRMRFDEPIV---------NEDPTPQEDV 112
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T + DWR+ G +G V +Q CG+CWAFS + E K G L LS Q
Sbjct: 113 TMDNS-------NFDWRDHGAVGPVLDQGDCGSCWAFSVIGNVEGQWFRKTGDLLGLSEQ 165
Query: 189 EVIDCAGNGNMGCSGG----DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
++IDC + + GC GG + A+ + LE S+YP KD C +
Sbjct: 166 QLIDCD-HSDQGCDGGYPPQTYSAIEEMGG-----LELRSDYPYTGKDGICYMDQS---- 215
Query: 245 VKIKSYT-CDTLIP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANIN 301
K +Y T +P E + + GP+ + +NA+ Q Y G+++ C+ A +N
Sbjct: 216 -KFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAVLLQLYKRGIMRPRWCN--PAELN 272
Query: 302 HAVQIVGY 309
HAV VGY
Sbjct: 273 HAVLTVGY 280
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q +CG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDSKDN-GCGGGFMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + + P G ++ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 138/279 (49%), Gaps = 31/279 (11%)
Query: 40 FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFKT 97
F+ Y KSY S++ R FE +L+ I + N ++ Q S G+ EF+DL+ +EF
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 98 RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
++ N+ M ++ + +P DWR G + ++NQ
Sbjct: 61 LYVPSKFNR--TMPYNT-----------------VYLPATSEDSVDWRTKGAVTPIKNQG 101
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVN 216
CG+CW+FST + E HA+ G L LS Q+++DC+G+ GN GC+GG ++ N
Sbjct: 102 QCGSCWSFSTTGSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISN 161
Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
K L+ E +YP +D C ++ + + I SY+ D +E + +A GPV A+
Sbjct: 162 K-GLDTEEDYPYTAQDGTCNKEKEAKHAATISSYS-DVPKNNEDQLAAAVA-KGPVSVAI 218
Query: 277 NA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY-DNY 312
A +Q Y GV NC N++H V +VGY D+Y
Sbjct: 219 EADQSGFQLYKSGVFDGNCG---TNLDHGVLVVGYTDDY 254
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 141/313 (45%), Gaps = 29/313 (9%)
Query: 4 VKNVLFIVALIAL-C--FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
KN + ++L L C FLA V E + RY K Y E + RFK
Sbjct: 3 AKNQFYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+++++ IE N P + GI +F+DL+ EEF R+ H+ S +
Sbjct: 63 FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +V T IP DWR+ G + +++Q CG CWAFS V E +HAL
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS QEV+DC G + GC+GG ++ N L E YP D C K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
A + + I Y D + +E ++ +A PV A++A +Q+Y GV +C
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283
Query: 297 LANINHAVQIVGY 309
L +H V VGY
Sbjct: 284 L---DHGVTAVGY 293
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 130/273 (47%), Gaps = 34/273 (12%)
Query: 44 YKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLR 101
YK + K+ R + F+ ++ IE N ++ RY G+ +F+DL+ EEFK
Sbjct: 55 YKDAAEKAR---RLEVFKANVAFIESFNAGGKN----RYWLGVNQFADLTSEEFK----- 102
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGKVRNQQ 157
M++ K +N V+ ++TG +P DWR G + ++++Q
Sbjct: 103 ------ATMTNSKGFSTPNNGVR---VSTGFKYENVSADALPASVDWRTKGAVTRIKDQG 153
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVN 216
CG CWAFS V E L G L LS QE++DC +GN GC GG+ ++ N
Sbjct: 154 QCGCCWAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSN 213
Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
L E+ YP +D CK A + I+ Y D E S++ +A PV AV
Sbjct: 214 G-GLTAEANYPYTAEDGRCKTTAAADVAASIRGYE-DVPANDEPSLMKAVAGQ-PVSVAV 270
Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A +Q+Y GGV+ C SL +H V ++GY
Sbjct: 271 DASKFQFYGGGVMAGECGTSL---DHGVTVIGY 300
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 139/282 (49%), Gaps = 39/282 (13%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
+ +F++ Y+++ +E++ R+ F +++ +E N+ Q +A+YG T+F+D++E E
Sbjct: 158 KFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQG--TAKYGPTKFADMTEAE 215
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F R L+ +KK I IP G +P + DWR G + V
Sbjct: 216 F--RKLQS------------------GPLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPV 255
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
+NQ CG+CWAFS + E +K G L LS QE++DC + GC GG+
Sbjct: 256 KNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCD-KVDGGCEGGEMS------ 308
Query: 214 DVNKVVLE-----PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
D + +++ E +YP ++ CK T VKI Y + +E+ + +A
Sbjct: 309 DAYEAIIKLGGAMSEEKYPYRGENEKCKFNMTDVR-VKINGYV--NISKNETEMAGWLAA 365
Query: 269 HGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
HGP+ +NAL Q+Y GG+ + S +++H V IVGY
Sbjct: 366 HGPISIGINALMMQFYFGGIAHPWKIFCSPDSLDHGVLIVGY 407
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 29/313 (9%)
Query: 4 VKNVLFIVALIAL---CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
KN + ++L L FLA V E + RY K Y E + RFK
Sbjct: 3 AKNQFYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+++++ IE N P + GI +F+DL+ EEF R+ H+ S +
Sbjct: 63 FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +V T IP DWR+ G + +++Q CG CWAFS V E +HAL
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS QEV+DC G + GC+GG ++ N L E YP D C K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
A + + I Y D + +E ++ +A PV A++A +Q+Y GV +C
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283
Query: 297 LANINHAVQIVGY 309
L +H V VGY
Sbjct: 284 L---DHGVTAVGY 293
>gi|12024965|gb|AAG45727.1| cathepsin L-like cysteine protease [Leishmania chagasi]
Length = 381
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 133/270 (49%), Gaps = 26/270 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
GP+ AV+A ++ Y GV + YN G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 136/279 (48%), Gaps = 29/279 (10%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++F++F ++Y K+YS +E RF F+ +++ I N + S G+ EF+DLS EE
Sbjct: 40 DMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRL--HNTLANASYTMGLNEFADLSFEE 97
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FK ++ + KHV + ++ H P DWR + + ++
Sbjct: 98 FKGKYFGY---KHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141
Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
+Q CG+CWAFS + E L+ TL+ LS Q+++DC+ + GN GC+GG ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEY 201
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ NK + ES YP C++ T V I Y D E+S+L + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGYK-DVASGDEASLLNAVGTVGPV 257
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y GV C N++H V VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 144/287 (50%), Gaps = 33/287 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF S+ ++ K Y S E +RF+ F+ +L I+E NK + G+ EF
Sbjct: 39 SMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 95
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
+DLS +EFK ++L V+ + ++R T +P DWR+
Sbjct: 96 ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 139
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G + V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC + GC+GG
Sbjct: 140 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGG-- 197
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254
Query: 264 TDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A +A + +Q+Y GGV +C ++++H V VGY
Sbjct: 255 KALANQSLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 148/307 (48%), Gaps = 41/307 (13%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+V ++ F V+V + EL+ F++ Y K+Y+ + RF F+ +L ++
Sbjct: 9 LVVVVGCSFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
Q +A+YG+T+FSDL+ EEF+ ++L +++ V + V+ +
Sbjct: 65 YQMQEQG--TAKYGVTQFSDLTPEEFEAKYLGLRIDEQV------------DRVQLNDLQ 110
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
T P DWRE G +G + NQ +CG+CWAFS V E LK G L LS Q+
Sbjct: 111 TA-------PASVDWREKGAVGPIENQGSCGSCWAFSVVGNIEGQWFLKTGYLVSLSKQQ 163
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
++DC N GC GG + ++ ++ LE +S+YP C+ + K+
Sbjct: 164 LVDCDTVDN-GCYGG--YPPYTYKEIKRMGGLELQSDYPYTGWGHGCRLDRS-----KLF 215
Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHA 303
+ D+++ E +A HGP+ +NA Q+Y G++ + C S +NHA
Sbjct: 216 AKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPSKAMC--SPEGLNHA 273
Query: 304 VQIVGYD 310
V VGYD
Sbjct: 274 VLTVGYD 280
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 155/325 (47%), Gaps = 38/325 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ EEF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 121 KINDISDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIRENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
S AN INHAV +GY +N + W
Sbjct: 279 SCANRINHAVTAIGYGTDENGQKYW 303
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
partial [Trypanosoma vivax Y486]
Length = 323
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q CG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGRCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + +D + + P G ++ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTEKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 137/284 (48%), Gaps = 37/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF + + K Y + E++ RFK F+ +L + L P +A +G+T FSDL+EEEF
Sbjct: 56 FESFIKEFGKVYHTVEEYEHRFKVFKSNL--LRALKHQALDP-TASHGVTMFSDLTEEEF 112
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
T++L + + +S + T +PTG +P DWRE G +G V+
Sbjct: 113 ATQYL--GLKRPSALS---------------TAPTAEPLPTGDLPPSFDWREKGAVGPVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CWAFST E H L G L LS Q+++DC A + GC GG
Sbjct: 156 NQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLM 215
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ LE ES+YP +D C+ +PN V K + E + +
Sbjct: 216 TNAYKYVE-EAGGLELESDYPYKGRDGKCQ---FNPNKVAAKVSNFTNIPIDEDQVAAYL 271
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
GP+ +NA Q Y+ GV C+ N++H V +VGY
Sbjct: 272 IKSGPLAIGINAEFMQTYVAGVSCPIFCNKR--NLDHGVLLVGY 313
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 148/333 (44%), Gaps = 54/333 (16%)
Query: 5 KNVLFIVALIALC----FLAIPVKV----SKPNLEQKLELFSSFQQRYKKSYSKSEHDIR 56
K VLF +ALC F A P E+ + + + Y SY K + +
Sbjct: 4 KKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQ---K 60
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKT--RHLRHSVNKHVLMSHHK 114
++ F++++ IE N P + GI F+DL+ EEFK R H +K +
Sbjct: 61 YQTFKENVQRIEAFNHAGNKP--YKLGINHFADLTNEEFKAINRFKGHVCSKITRTPTFR 118
Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+ + T +P DWR+ G + +++Q CG CWAFS V E +
Sbjct: 119 YENM-----------------TAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGI 161
Query: 175 HALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPL 228
L G L LS QE++DC G + GC GG L+D D K +L+ E+ YP
Sbjct: 162 TKLSTGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFILQNKGLAAEAIYPY 215
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
D C KA + IK Y D SES++L +A PV A+ A +Q+Y G
Sbjct: 216 EGVDGTCNAKAEGNHATSIKGYE-DVPANSESALLKAVANQ-PVSVAIEASGFEFQFYSG 273
Query: 287 GVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
GV +C N++H V VGY D+ ++ W
Sbjct: 274 GVFTGSCG---TNLDHGVTAVGYGVSDDGTKYW 303
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C N GCSGG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDN-GCSGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ KSYS K EHD RF F+ +L I+ + P +A +GIT+FSDL+ EF
Sbjct: 48 FTSFKSKFSKSYSTKEEHDYRFGVFKSNL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + K + + H I T +P DWRE G + V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q +CG+CWAFST E H L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+++ + V++ E +Y +D +CK S + +++ +L E I ++
Sbjct: 210 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 265
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA Q Y+ GV Y C S ++H V +VG+
Sbjct: 266 KNGPLAVGINAAWMQTYMSGVSCPYVCAKS--RLDHGVLLVGF 306
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 135/284 (47%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DW G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q TCG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + E YP + + P G K+ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ KSYS K EHD RF F+ +L I+ + P +A +GIT+FSDL+ EF
Sbjct: 48 FTSFKSKFSKSYSTKEEHDYRFGVFKSNL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + K + + H I T +P DWRE G + V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q +CG+CWAFST E H L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+++ + V++ E +Y +D +CK S + +++ +L E I ++
Sbjct: 210 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 265
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA Q Y+ GV Y C S ++H V +VG+
Sbjct: 266 KNGPLAVGINAAWMQTYMSGVSCPYVCAKS--RLDHGVLLVGF 306
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 151/313 (48%), Gaps = 35/313 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
++ L L + V E++ + F FQ ++ K YS E+ RF+ F+ +L IEE
Sbjct: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
Query: 70 LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
LN N ++ ++G+ +F+DLS +EFK +L NK + + +++
Sbjct: 62 LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + IP DWR G + V+NQ CG+CW+FST E H + L LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166
Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
Q ++DC + GC+GG +++ N + + ES YP +
Sbjct: 167 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
++ G KI ++ T+IP +++ I + GP+ A +A+ WQ+Y+GGV C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282
Query: 297 LANINHAVQIVGY 309
+++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 141/280 (50%), Gaps = 25/280 (8%)
Query: 34 LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF ++ ++K+Y E + RF+ F+ +L I+E NK +S G+ EF+DLS
Sbjct: 48 IELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKS---YWLGLNEFADLSH 104
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L + ++ + + + R + +P DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E +YP +++ C+ + V I + D E S+L +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQ-DVPTNDEKSLLKALA-HQPL 269
Query: 273 IAAVNA--LTWQYYLG-GVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y G V C +++H V VGY
Sbjct: 270 SVAIDASGREFQFYSGVSVFDGRCG---VDLDHGVAAVGY 306
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)
Query: 14 IALCFL-AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
I++ FL A PV +LE F FQ+++ KSY +K E R F +L+ IEE+N
Sbjct: 4 ISVVFLLAFPV-YKAVDLETSSLAFIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVN 62
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
S + G+ E++DL+ EEF L S V TT
Sbjct: 63 AQNL---SYKLGVNEYTDLTLEEFAALKLS---------STDMSEGMGDGFVAGAGPTT- 109
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
T+PT + DWR+ G++ V++Q CG+CWAFS + E +A+ G L LS Q+++
Sbjct: 110 TTLPTSV----DWRKKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLV 165
Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS-PNGVKIKS 249
DCAG GN GC+GG +++ V + ES YP + D C+ + +G+ +
Sbjct: 166 DCAGAYGNEGCNGGLMDKAFEYIKATGV--DKESTYPYVGSDETCQATVENKTDGLPVGE 223
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVI-QYNCDGSLANINHAVQI 306
T + ++ L + PV A+ N ++Q+Y GV NC+ +I+H V
Sbjct: 224 VTGNQMLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVA 283
Query: 307 VGY 309
VGY
Sbjct: 284 VGY 286
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
Y486]
Length = 389
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 29/284 (10%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF++F+Q+Y +SY + +E R + FE D + + A +G+T FSDL+ EE
Sbjct: 33 LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
F+TR+ H+ H + + T + +P G P DWR G + V
Sbjct: 90 FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q TCG+CW+FS + E A L+ LS Q ++ C N GC GG +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 193
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
N + YP + +D + + P G ++ + T IP E +I +A +G
Sbjct: 194 VKENSGKVYTGKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 251
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
PV AV+A T+ Y GGV+ +C +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292
>gi|394331830|gb|AFN27134.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 138/283 (48%), Gaps = 31/283 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGALTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG----GDFCALL 210
NQ CG+CWAFS V + +S AL L+ LS Q+++ C N GC G F +L
Sbjct: 143 NQGACGSCWAFSAVGSIQSQWALAGHRLTALSEQQLVSCHDKDN-GCPGRLMLQAFVGVL 201
Query: 211 DWMDVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
M+ E YP + C + G +I Y T+ S + + +A
Sbjct: 202 QNMNGTMFT---EDSYPYVSSTGYVPECSNSSQLVPGARIDGYM--TMESSGTVMAACLA 256
Query: 268 THGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 257 KNGPISIAVDASSFMSYQSGVLT-SCAG--MPLNHGVLLVGYN 296
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/286 (32%), Positives = 141/286 (49%), Gaps = 42/286 (14%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F +RY K YS EH+ RF F+ +L + L + P A +G+T+FSDL++E F
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEGF 113
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ ++L + D H + +PT +P DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CWAFST E + LK G L LS Q+++DC A + + GC+GG
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
+ + + LE E +YP KD C S N KI ++ + + S E I
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ +GP+ +NA Q Y+GGV Y C S N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313
>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
Length = 443
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 139/284 (48%), Gaps = 33/284 (11%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG----DFCALL 210
CG+CWAFS V ES A L LS Q+++ C N GC+GG F LL
Sbjct: 143 XXGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEXLL 201
Query: 211 DWMDVNKVVLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-I 266
M ++ E YP + A C + G +I Y +IPS +++ +
Sbjct: 202 RHM---YGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWL 255
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
A +GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 256 AENGPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 138/272 (50%), Gaps = 28/272 (10%)
Query: 44 YKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH 102
Y+K+Y+ E +R F+ F+ +L+ I+++NK S G+ EF+DL+ +EFK +L
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS---YWLGLNEFADLTHDEFKATYL-- 90
Query: 103 SVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGAC 162
+ S+ KH+ K S +P + DWR+ + +V+NQ CG+C
Sbjct: 91 GLTPPPTRSNSKHYSSEEFRYGKMSNGE-------VPKEMDWRKKNAVTEVKNQGQCGSC 143
Query: 163 WAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD---VNKVV 219
WAFSTV E ++A+ G L+ LS QE+IDC+ +GN GC+GG L+D+ +
Sbjct: 144 WAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGG----LMDYAFSYIASTGG 199
Query: 220 LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL 279
L E YP +++ C + V I Y D E +++ +A H PV A+ A
Sbjct: 200 LRTEEAYPYAMEEGDCDEGKGAAV-VTISGYE-DVPANDEQALVKALA-HQPVSVAIEAS 256
Query: 280 T--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q+Y GGV C L +H V VGY
Sbjct: 257 GRHFQFYSGGVFDGPCGEQL---DHGVTAVGY 285
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 159/325 (48%), Gaps = 42/325 (12%)
Query: 1 MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
+F V ++LF+ +++C + V ++P + + F+ F++++ K Y S EH
Sbjct: 7 LFSV-SLLFVFVSVSICGDEDLLIRQVVDEAEPKVLSSEDHFTLFKKKFGKDYGSIEEHY 65
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF F+ +L ++++ SAR+G+T+FSDL+ EF+ +HL V
Sbjct: 66 YRFSVFKANL---RRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHL------GVTGGFKL 116
Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
D + + +PT +P + DWR+ G + V+NQ +CG+CW+FST E
Sbjct: 117 PKDANQAPI----------LPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 166
Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
H L G L LS Q+++DC AG+ + GC+GG + ++ + L E +
Sbjct: 167 AHFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYT-LKTGGLMREED 225
Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
YP D + S + +++ ++ +E I ++ +GP+ A+NA Q Y+
Sbjct: 226 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLVKNGPLAVAINAAYMQTYI 283
Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
GGV Y C L NH V ++GY
Sbjct: 284 GGVSCPYICSRRL---NHGVLLMGY 305
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 155/324 (47%), Gaps = 39/324 (12%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFEKS 63
N L ++A++ L L S+ E +EL ++ +Y + Y + E + RFK F+++
Sbjct: 7 NKLVLMAML-LVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKEN 65
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
++ IE N N P + GI F+DL+ EEF+ H ++++ S ++ + +V
Sbjct: 66 VEFIESFNNNGNKP--YKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV 123
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
T +P DWR G + +++Q CG CWAFS V E + L GTL
Sbjct: 124 ------------TAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLI 171
Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAACKR 237
LS QE++DC +G + GC GG L+D D + ++E E+ YP D +C
Sbjct: 172 SLSEQELVDCDTSGMDQGCEGG----LMD--DAFEFIIENNGLTTEANYPYEGVDGSCNT 225
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
+ + + KI Y + + + L + PV A++A +Q+Y G+ +C
Sbjct: 226 RKAANHAAKITGY--ENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGT 283
Query: 296 SLANINHAVQIVGY---DNYSRTW 316
L +H V +VGY D+ ++ W
Sbjct: 284 EL---DHGVTVVGYGTSDDGTKYW 304
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)
Query: 36 LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
+F F+ +++ Y+ S EH++RF F +L IE+LNK + +A+YG+T+F+D++ E
Sbjct: 642 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 699
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ + +++ H +H N V G+ +P DWR+ G + +V+
Sbjct: 700 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 749
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ +CG+CWAFS V E +H +K L S QE+IDC N GC GG +D D
Sbjct: 750 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 802
Query: 215 VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
K + LE E++YP K S + V++K + +E+ I + +
Sbjct: 803 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 860
Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GP+ +NA Q+Y GG+ ++ + +I+H V IVGY
Sbjct: 861 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 901
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF QR+ KSY + EH R F+ D + +++ SA +G+T+FSDL+ EF
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFK---DNLRRARRHQLLDPSAEHGVTKFSDLTPAEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ +L ++ L+ H +PT G+P DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L +LS Q+ +DC + + GC+GG
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ LE E +YP D CK S +++++ ++ E+ I ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 137/279 (49%), Gaps = 11/279 (3%)
Query: 34 LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ LF + R+ K Y S E R + F +L I NKN S S R G+ +F+DL+
Sbjct: 40 VRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNS--SFRLGLNKFADLTN 97
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFKTR+ + + + V K+++ + + + I DWR+ G +
Sbjct: 98 EEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCS-IASSLDWRKKGAVTG 156
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CWAFST E ++ + G L LS QE++ C N GC GGD W
Sbjct: 157 VKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDAT-NYGCEGGDMDYAFTW 215
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ + ++ E +Y D+ C + V I YT + P +S++L + PV
Sbjct: 216 V-IQNGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYT--DVSPDDSALLCAAGSQ-PV 271
Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ A+ +Q Y GG+ +C G+ +I+HAV +VGY
Sbjct: 272 SVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY 310
>gi|209882566|ref|XP_002142719.1| papain family cysteine protease [Cryptosporidium muris RN66]
gi|209558325|gb|EEA08370.1| papain family cysteine protease, putative [Cryptosporidium muris
RN66]
Length = 400
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 140/290 (48%), Gaps = 26/290 (8%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
P+ ++ F F+Q+YKK YS +E R+ F K+++ I+ N S + E
Sbjct: 77 PSEQEFKNQFEDFKQKYKKEYSNLTEEKYRYSIFRKNMNFIKMSN---NQGFSYVLEMNE 133
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
+ DL+ EEF H M +H H + +++ T P +W +
Sbjct: 134 YGDLTHEEFM----------HNFMGYHPQHKNKRFSDSHNILSSNKVENTSPPRFVNWVD 183
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAES-MHALKNGTLSLLSVQEVIDCA-GNGNMGCSGG 204
AG + VR+Q+ CG+CWAFS V + ES + A KN L LS Q+ +DC NGN GC GG
Sbjct: 184 AGCVNPVRDQRYCGSCWAFSVVTSLESAVCAQKNEKLVKLSEQQFVDCTRNNGNFGCDGG 243
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVKIKSYTCDTLIPSESSIL 263
++ + L E EYP + + +CK +P + SY ++P+ + L
Sbjct: 244 SLDLAFQYV-MEHQYLCTEEEYPYIANEKSCKFSNCKNPIRYILDSYR--NVVPNNINAL 300
Query: 264 -TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+A +GP+ A+ A +Q+Y GV C ++NHAV +VGYD
Sbjct: 301 KVAVAKYGPISVAIQADQAPFQFYKKGVFDAPCG---TDVNHAVVLVGYD 347
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 158/322 (49%), Gaps = 40/322 (12%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + +T+ TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRK 238
L+ LS Q ++ C C GG W + NK + E YP ++ R
Sbjct: 168 HELTSLSEQTLVSCDPT-EYACEGGFMDNAFRWIISSNKGKVFTEQSYPY----SSGGRN 222
Query: 239 ATSPN------GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
+ N G I Y L E++I +A +GPV V+A ++Q Y GGV+ +
Sbjct: 223 VPACNMSGKVVGANISDYV--DLPQDENAIAEWLAKNGPVSVIVDATSFQSYTGGVLT-S 279
Query: 293 CDGSLANINHAVQIVGYDNYSR 314
C + +NHAV +VGYD+ S+
Sbjct: 280 CLSKI--LNHAVLLVGYDDTSK 299
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 23/284 (8%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE--LNKNRQSPESARYGITEF 87
E+ + + R+ K+Y+ E + RF+ F +L I+E L+ NR S + G+ +F
Sbjct: 30 EEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNR----SYKVGLNQF 85
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EE+++ +L V+ + ++ + + + + + P K DWRE
Sbjct: 86 ADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--------FPAKVDWRER 137
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV + E ++ + G L LS QE++DC N GC+GG
Sbjct: 138 GAVSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMD 197
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ V+ ++ ES+YP A C V I Y + + P L
Sbjct: 198 YAFQFI-VSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGY--EDVPPMNEKALMKAV 254
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV + A +Q Y GV+ +C N++H V +VGY
Sbjct: 255 AHQPVSVGIEASGRAFQLYTSGVLTGSCG---TNLDHGVVVVGY 295
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 152/320 (47%), Gaps = 41/320 (12%)
Query: 7 VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
VLF VA A F + P+++ EQ L++ F+ F RY K Y S E
Sbjct: 9 VLFCVASAAAGFSFHDSNPIRMVSDVEEQLLQVIGESRHAVSFARFANRYGKRYDSVDEM 68
Query: 54 DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
+RFK F ++L++I NK R S + G+ F+D + EEF++ L + N L +
Sbjct: 69 KLRFKIFSENLELIRSSNKRRLS---YKLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 125
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
HK D + +P +KDWR+ GI+ V++Q +CG+CW FST E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALE 167
Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
S +A G LS Q+++DCAG N GCSGG +++ N LE E YP
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEEAYPYTGS 226
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
+ CK ++ VK+ + + + +E + IA PV A + ++ Y GV
Sbjct: 227 NGLCKFRSEHV-AVKVLG-SVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYT 284
Query: 291 YN-CDGSLANINHAVQIVGY 309
C + ++NHAV VGY
Sbjct: 285 STACGSTPMDVNHAVLAVGY 304
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 143/295 (48%), Gaps = 37/295 (12%)
Query: 31 EQKLEL-----FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
+ +LEL F+SF QR+ KSY + EH R F+ +L +++ SA +G+
Sbjct: 39 DNELELNAERHFASFVQRFGKSYRDADEHAYRLSVFKANL---RRARRHQLLDPSAEHGV 95
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
T+FSDL+ EF+ +L ++ + H +PT G+P D
Sbjct: 96 TKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHE-----------APVLPTDGLPDDFD 144
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
WR+ G +G V+NQ +CG+CW+FS E + L G + +LS Q+++DC
Sbjct: 145 WRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQMVDCDHECDSSEPD 204
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ + GC+GG ++ + LE E +YP +D CK S +++++ ++
Sbjct: 205 SCDAGCNGGLMTNAFSYL-LKSGGLESEKDYPYTGRDGTCKFD-KSKIVTSVQNFSVVSV 262
Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ HGP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 263 --DEDQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 312
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 142/319 (44%), Gaps = 29/319 (9%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
+N L VAL+ + A + E + +Y + Y SE + RF+ F +
Sbjct: 6 ENKLMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNN 65
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
++ IE NK P + I EF+DL+ EEFK + + V ++ + +
Sbjct: 66 VEFIESFNKLGNRP--YKLDINEFADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYAN--- 120
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
T +P DWR+ G + +++Q CG CWAFS V E + L G L
Sbjct: 121 -----------VTAVPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLI 169
Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS QE++DC +G + GC GG +++ N L E+ YP D C
Sbjct: 170 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNG-GLTTEANYPYQGTDGTCNTNKAGN 228
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
+ KI Y D SE ++L +A+ PV A++A +Q+Y GGV +C L
Sbjct: 229 DAAKITGYE-DVPANSEDALLKAVASQ-PVSVAIDASGSAFQFYSGGVFTGDCGTEL--- 283
Query: 301 NHAVQIVGY---DNYSRTW 316
+H V VGY D+ ++ W
Sbjct: 284 DHGVTAVGYGTSDDGTKYW 302
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 126/271 (46%), Gaps = 26/271 (9%)
Query: 43 RYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
+Y + Y SE + RF+ F +++ IE NK P + I EF+DL+ EEFK
Sbjct: 44 KYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRP--YKLDINEFADLTNEEFKASRNG 101
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
+ + +V +S + + T +P DWR+ G + +++Q CG
Sbjct: 102 YKRSSNVGLSEKSSFRYGN--------------VTAVPTSMDWRQKGAVTPIKDQGQCGC 147
Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVL 220
CWAFS V E + L G L LS QE++DC +G + GC GG +++ N L
Sbjct: 148 CWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNG-GL 206
Query: 221 EPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-- 278
E+ YP D C + KI Y D SE ++L +A+ PV A++A
Sbjct: 207 TTEANYPYQGTDGTCNTNKAGNDAAKITGYE-DVPANSEDALLKAVASQ-PVSVAIDASG 264
Query: 279 LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q+Y GGV +C L +H V VGY
Sbjct: 265 SAFQFYSGGVFTGDCGTEL---DHGVTAVGY 292
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 150/312 (48%), Gaps = 42/312 (13%)
Query: 14 IALCFLAIPVKVSKP----NLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
AL FL V+ P + E E F+ + ++Y+K+YS E++ R + + + IE
Sbjct: 8 FALFFLLASFTVALPFSPSDDEVMAESFNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIE 67
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
+LNK P + Y + +FSDL+ EFK + ++ +H + + +K
Sbjct: 68 QLNK-EHGPHT-EYELNQFSDLTFAEFK----------KIYLTEPQHCSATNGNFQK--- 112
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
+ PV DWRE +I V++Q CG+CW FST E+ HA+K G L LS Q
Sbjct: 113 ----PVNARDPVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLEAHHAIKTGQLISLSEQ 168
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK-----ATSP 242
+++DCAG N GC+GG +++ N + E ES Y KD C+ AT
Sbjct: 169 QLVDCAGAFNNHGCNGGLPSQAFEYIKYNGGI-ESESNYNYTAKDGVCRFNSSLVAATVS 227
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPV-IAAVNALTWQYYLGGVIQYN---CDGSLA 298
+ V I +E I T +A GPV IA ++Q+Y GV Q C S
Sbjct: 228 DVVNITK-------DAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGVYQGEIEVCSQSPD 280
Query: 299 NINHAVQIVGYD 310
+NHAV +VGY+
Sbjct: 281 KVNHAVLVVGYN 292
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF QR+ KSY + EH R F+ +L +++ SA +G+T+FSDL+ EF
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFKANL---RRARRHQLLDPSAEHGVTKFSDLTPAEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ +L ++ L+ H +PT G+P DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L +LS Q+ +DC + + GC+GG
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ LE E +YP D CK S +++++ ++ E+ I ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 141/294 (47%), Gaps = 35/294 (11%)
Query: 30 LEQKLEL---FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
L+ LEL F F QR+ K+Y +E H R F+ +L +++ SA +G+T
Sbjct: 43 LDNDLELDSQFVGFVQRFGKTYRDAEEHAHRLSVFKANL---RRARRHQLLDPSAEHGVT 99
Query: 86 EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDW 144
+FSDL+ EF+ +L + + H +PT G+P DW
Sbjct: 100 KFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAH-----------DAPVLPTDGLPEDFDW 148
Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGN 196
R+ G +G V+NQ +CG+CW+FS E + L +G + +LS Q+++DC +
Sbjct: 149 RDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDHECDPSEPDS 208
Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
+ GC+GG + ++ + LE E +YP KD CK S +++Y+ +
Sbjct: 209 CDAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGKDGTCKFD-KSKIAASVQNYS--VVA 264
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ +GP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 265 VDEEQIAANLVKYGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 315
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 145/294 (49%), Gaps = 32/294 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ L+ S+ + K+Y+ E + RF+ F+ +L I+E N+ ++ + G+T F+D
Sbjct: 56 EEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRT---YKVGLTRFAD 112
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE++ R L ++ +S K + + G +P + DWR+ G
Sbjct: 113 LTNEEYRARFLGGRFSRKPRLSAAKS--------GRYAAALGDDLPDDV----DWRKKGA 160
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CWAFS+V E ++ + G L LS QE++DC + NMGC+GG L
Sbjct: 161 VATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGG----L 216
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ + ++ E +YP +DAAC + V I Y D ESS+ +
Sbjct: 217 MDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYE-DVPENDESSLKKAV 275
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
A PV A+ A +Q Y GV C +++H V VGY DN + W
Sbjct: 276 ANQ-PVSVAIEAGGRAFQLYQSGVFTGRCG---TDLDHGVVAVGYGTDNGTDYW 325
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 142/280 (50%), Gaps = 23/280 (8%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ +++ + ++ K+Y+K E + RF+ F+ +L I+E N ++ + + G+T F+DL+
Sbjct: 45 ISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKN--RTYKVGLTRFADLTN 102
Query: 93 EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EE++ + L S K LM N ++ + G +P I DWR++G +
Sbjct: 103 EEYRAKFLGTKSDPKRRLMKSK-------NPSQRYAFKAGDVLPESI----DWRQSGAVS 151
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
+++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG
Sbjct: 152 AIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQ 211
Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
++ +N ++ + +YP D C V I + D + E ++ +A H P
Sbjct: 212 FI-INNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFE-DVMAFDEMALQKAVA-HQP 268
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V A+ A + Q+Y GV C +L +H V IVGY
Sbjct: 269 VSVAIEASGMALQFYQSGVFTGECGSAL---DHGVVIVGY 305
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 148/305 (48%), Gaps = 31/305 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
L+I +++AL + V N E F++ + K+Y EH +R F+++L I
Sbjct: 147 LYIASVLALV---VAVGADLTNFEH-------FKEHFGKTYEGDEHALRQGIFQRNLAHI 196
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E+ N + + GIT+F+D+S EF+ +L +N + K +++
Sbjct: 197 EKFNAEKAASRGYTLGITQFADMSTAEFRQTYLGLRMNASTIAKLRK--------LQREV 248
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+ +P + DWR+ G + V++Q CG+CWAFST E H LKNG L LS
Sbjct: 249 VADDRDLPEAV----DWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSE 304
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
Q+++DC+ + GC+GG ++++ N LE E+ YP +C S KI
Sbjct: 305 QQMVDCSWL-DFGCNGGQPMLAMEYVRFNG-GLELETAYPYKGVGGSCHSDKKSA-AAKI 361
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDG-SLANINHAV 304
+ SES++ +A GP+ ++A +Q+Y G+ YN + S ++HAV
Sbjct: 362 TGFWMAGFY-SESALQKAVAKVGPISVGMDASGEDFQHYKSGI--YNPESCSSIGLDHAV 418
Query: 305 QIVGY 309
VGY
Sbjct: 419 LAVGY 423
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 149/320 (46%), Gaps = 41/320 (12%)
Query: 7 VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
VLF V A F + P+++ EQ L++ F+ F RY K Y S E
Sbjct: 9 VLFCVTTAAAGFSFHDSNPIRMVSDAEEQLLQVIGESRHAVSFARFANRYGKLYDSVDEM 68
Query: 54 DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
+RFK F ++L++I NK R S + G+ F+D + EEFK+ L + N L +
Sbjct: 69 KLRFKIFSENLELIRSTNKRRLS---YKLGVNHFADWTWEEFKSHRLGAAQNCSATLKGN 125
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
HK D + +P +KDWR+ GI+ +V++Q CG+CW FST E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALE 167
Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
S +A G LS Q+++DCAG N GCSGG +++ N LE E YP
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEETYPYTGS 226
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
+ C K TS N + + + SE + +A PV A + ++ Y GV
Sbjct: 227 NGLC--KFTSENVALKVLGSVNITLGSEDELKHAVAFARPVSVAFEVVHDFRLYKSGVYT 284
Query: 291 YN-CDGSLANINHAVQIVGY 309
C + ++NHAV VGY
Sbjct: 285 STACGNTPMDVNHAVLAVGY 304
>gi|350587549|ref|XP_003482436.1| PREDICTED: cathepsin O-like [Sus scrofa]
Length = 209
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 96/172 (55%), Gaps = 11/172 (6%)
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
WR+ G CG CWAFS V ES +A+K L +LSVQ+VIDC+ N N GC+G
Sbjct: 10 WRKGG--------SKCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN-NYGCNG 60
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G L W++ +V + +SEYP ++ C + S +GV IK Y+ E +
Sbjct: 61 GSTLNALYWLNKTQVKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMA 120
Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
+ T GP+I V+A++WQ YLGG+IQ++C S NHAV + G+D T
Sbjct: 121 KTLLTLGPLIVIVDAVSWQDYLGGIIQHHC--SSGEANHAVLVTGFDKTGST 170
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FCARYL----NGAAYFAAAKRHTPQHYPKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QVNHAVLLVGYD 296
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 138/288 (47%), Gaps = 27/288 (9%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE--FSDLSEE 93
F + ++ ++Y+ E RF+ ++++L +IEE N Y +T+ F+DL+ E
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHG-----YTLTDNKFADLTNE 173
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EF+ + L + H G T +P DWR+ G + +V
Sbjct: 174 EFRAKMLGG-------LGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEV 226
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
+NQ +CG+CWAFS V E ++ +KNG L LS QE++DC +GC+GG +++
Sbjct: 227 KNQGSCGSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEA-VGCAGGFMSWAFEFV 285
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L E+ YP + AC+ + + V I Y + + SE+ +L +A PV
Sbjct: 286 MANH-GLTTEASYPYKGINGACQTAKLNESSVSITGYV-NVTVNSEAELL-KVAAVQPVS 342
Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
AV+A +Q Y GGV C A INH V +VGY D + W
Sbjct: 343 VAVDAGGFLFQLYAGGVFSGPCT---AQINHGVTVVGYGETDKAEKYW 387
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 147/295 (49%), Gaps = 29/295 (9%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
P K + E+ + L+ S+ + KSY+ E D RF+ F+ +L I+E +N + S
Sbjct: 34 PAKGLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDE--QNSRGDRS 91
Query: 80 ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
+ G+ F+DL+ EE+++ +L + ++ K ++ + G ++P I
Sbjct: 92 YKLGLNRFADLTNEEYRSTYLGAKTDARRRIAKTKSD-------RRYAPKAGGSLPDSI- 143
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
DWRE G + +V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N
Sbjct: 144 ---DWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNE 200
Query: 200 GCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
GC+GG L+D+ + ++ E++YP + C + + V I Y + +
Sbjct: 201 GCNGG----LMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGY--EDVT 254
Query: 257 PSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P + + L + PV A+ A +Q Y G+ +C +++H V VGY
Sbjct: 255 PYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCG---TDLDHGVTAVGY 306
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 29/313 (9%)
Query: 4 VKNVLFIVALIAL-C--FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
KN + ++L L C FL V E + RY K Y E + RFK
Sbjct: 3 AKNQFYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+++++ IE N P + GI +F+DL+ EEF R+ H+ S +
Sbjct: 63 FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +V T IP DWR+ G + +++Q CG CWAFS V E +HAL
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS QEV+DC G + GC+GG ++ N L E YP D C K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
A + + I Y D + +E ++ +A PV A++A +Q+Y GV +C
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283
Query: 297 LANINHAVQIVGY 309
L +H V VGY
Sbjct: 284 L---DHGVTAVGY 293
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 140/282 (49%), Gaps = 29/282 (10%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+EL+ + ++KK+Y+ E +F F+ + I + N Q S + G+ +F+DLS
Sbjct: 41 MELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQ--HNNQGNPSYKLGLNQFADLSH 98
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L ++ +S + + + G +P I DWRE G +
Sbjct: 99 EEFKAAYLGTKLDAKKRLSRSPSPRYQY--------SVGEDLPESI----DWREKGAVTA 146
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE++DC + N GC+GG L+D+
Sbjct: 147 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGG----LMDY 202
Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++ L+ E +YP + +C + + V I Y + + ++ L A +
Sbjct: 203 AFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 260
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A+ A +Q+Y GV NC L +H V +VGY
Sbjct: 261 QPISVAIEASGRAFQFYESGVFTSNCGTQL---DHGVTLVGY 299
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 149/324 (45%), Gaps = 36/324 (11%)
Query: 1 MFDVKNVLFIVALIAL------CFLAI----PVKVSKPNLEQKLELFSSFQQRYKKSYSK 50
M +LFI L C ++ P K + +Q L ++ + ++ K+Y+
Sbjct: 1 MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNA 60
Query: 51 -SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
E + RF+ F+ +L I+E N S R G+ F+DL+ EE++TR L +N +
Sbjct: 61 LGEKEKRFEIFKDNLGFIDEHNSKNLS---FRLGLNRFADLTNEEYRTRFLGTRINPN-- 115
Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
+ + ++ + + G +P + DWR+ G + V++Q +CG+CWAFS +
Sbjct: 116 ----RRNRKVNSQTNRYATRVGDKLPESV----DWRKEGAVVGVKDQGSCGSCWAFSAIA 167
Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEY 226
E ++ L G L LS QE++DC + N GC+GG L+D+ +N V L PE +Y
Sbjct: 168 AVEGVNKLATGDLISLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINMVALTPEEDY 223
Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV-NALTWQYYL 285
P D C + + V I Y D E ++ +A +A +Q Y
Sbjct: 224 PYRAIDGRCDQNRKNAKVVSIDQYE-DVPAYDEGALKKAVANQVIAVAVEGGGREFQLYD 282
Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
GV C +L +H V VGY
Sbjct: 283 SGVFTGRCGTAL---DHGVAAVGY 303
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 148/296 (50%), Gaps = 28/296 (9%)
Query: 24 KVSKP---NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
KV KP ++++ LF +F + Y K Y E + RFK F +L I +LN +A
Sbjct: 25 KVRKPVFYSMDEAPILFENFIREYNKKYDSKEKEERFKIFVNNLKRINDLN---HKSTNA 81
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS-ITTGITIPTGIP 139
+GI +F+DLS+EEFK + +K L +++KK S ++ IT P
Sbjct: 82 VHGINKFTDLSKEEFKKFYTGFKPDKSFL----------DDNIKKPSQLSFNITAPPAF- 130
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
DWR+ G++ +V+NQ TCG+CWAFST+ ES++A+K+G L LS Q+++DC +
Sbjct: 131 ---DWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAIKHGNLVELSEQQLVDCDSK-DE 186
Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSE 259
C G ++ + + E YP A C ++ V ++ + ++ SE
Sbjct: 187 ACDSGLPDNAQQYLVSHGAI--SEQSYPYKGYAANCTYDSSQ---VVVRLSNFEKVVLSE 241
Query: 260 SSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
+ + + P+ + A Y G++ C+ S ++NHAV +VGY N T
Sbjct: 242 CQMAEKLYSTAPLSIVIAAEVLGTYTKGILVNECEQS-QDLNHAVLLVGYGNEGGT 296
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 127 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 183
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 184 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 232
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 233 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 291
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 292 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 349
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 350 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 386
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 140/288 (48%), Gaps = 30/288 (10%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+EL+ + +K++Y+ E RF F+ + I E N Q S + G+ +F+DLS
Sbjct: 39 MELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN---QGNRSYKLGLNQFADLSH 95
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L ++ +S + + + G +P I DWRE G +
Sbjct: 96 EEFKATYLGAKLDTKKRLSRPPSRRYQY--------SDGEDLPESI----DWREKGAVTS 143
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q +CG+CWAFSTV E ++ + G L LS QE++DC + N GC+GG L+D+
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDY 199
Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+N L+ E +YP D +C + + V I Y + + ++ L A +
Sbjct: 200 AFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 257
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
P+ A+ A +Q+Y GV C ++H V +VGY + S T
Sbjct: 258 QPISVAIEASGREFQFYDSGVFTSTCG---TQLDHGVTLVGYGSESGT 302
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 137/286 (47%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++ + + LF S ++ K Y + + RF+ F +L I+E NK + G+ EF
Sbjct: 41 SIHKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKK---VSNYWLGLNEF 97
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK + L ++ K D + R +P DWR+
Sbjct: 98 ADLTHEEFKNKFLGFKGE----LAERK--DESIEQFRYRDFVD-------LPKSVDWRKK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L++LS QE+IDC N GC+GG
Sbjct: 145 GAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGG--- 201
Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D+ V + L E EYP ++ + C K + V I Y D +E S L
Sbjct: 202 -LMDYAFAYVTRNGLHKEEEYPYIMSEGTCDEKRDASEKVTISGYH-DVPRNNEDSFLKA 259
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C L +H V VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 97 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 153
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 154 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 202
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 203 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 261
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 262 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 319
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 320 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 356
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 143/320 (44%), Gaps = 30/320 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEK 62
V N+ + L+ FLA E + +Y K Y+ S E ++R F++
Sbjct: 6 VLNISSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKE 65
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
++ IE N P + GI +F+DL+ EEFK R+ H
Sbjct: 66 NVQRIEAFNNAGNKP--YKLGINQFADLTNEEFKARN---------------RFKGHMCS 108
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
R+ T + +P DWR+ G + +++Q CG CWAFS V E + L G L
Sbjct: 109 NSTRTPTFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKL 168
Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC G + GC GG ++ NK L E++YP DA C A +
Sbjct: 169 ISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNK-GLNTEAKYPYQGVDATCNANAEA 227
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
+ IK + D SES++L +A P+ A++A +Q+Y G+ +C L
Sbjct: 228 KDAASIKGFE-DVPANSESALLKAVANQ-PISVAIDASGSEFQFYSSGLFTGSCGTEL-- 283
Query: 300 INHAVQIVGY---DNYSRTW 316
+H V VGY D+ ++ W
Sbjct: 284 -DHGVTAVGYGVSDDGTKYW 302
>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 381
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 133/270 (49%), Gaps = 26/270 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 DQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
GP+ AV+A ++ Y GV + YN G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N +V S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S AN INHAV +GY
Sbjct: 279 SCANRINHAVTAIGY 293
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 142/292 (48%), Gaps = 24/292 (8%)
Query: 31 EQKLELFSSFQQRYKK-SYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
E ELF + R++K +Y+ E +R F+ F+ +L I+E N+ S G+ EF+
Sbjct: 42 ESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRK---VSSYWLGLNEFA 98
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK---------RSITTGITIPTGIP 139
DL+ +EFK +L S + H HHD ++ R G+ +P
Sbjct: 99 DLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAAR-LP 157
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
DWR G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE++DC +GN
Sbjct: 158 KSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGNN 217
Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSE 259
GC+GG ++ N L E YP L+++ C R +S V I Y D +E
Sbjct: 218 GCNGGLMDYAFSYIAHNG-GLHTEEAYPYLMEEGTCSR-GSSAAVVTISGYE-DVPRNNE 274
Query: 260 SSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++L +A H PV A+ A Q+Y GGV C ++H V VGY
Sbjct: 275 QALLKALA-HQPVSVAIEASGRNLQFYSGGVFDGPCG---TQLDHGVAAVGY 322
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 143/298 (47%), Gaps = 36/298 (12%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V ++P + + FS F+ ++ K Y S EHD RF F+ +L ++++ SAR
Sbjct: 37 VGGAEPQVLTSEDHFSLFKSKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAR 93
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPV 140
+G+T+FSDL+ EF+ +HL + +K +PT +P
Sbjct: 94 HGVTQFSDLTRSEFRKKHLGVRAGFKLPKDANK----------------APILPTENLPE 137
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC------- 193
DWR+ G + V+NQ +CG+CW+FS E + L G L LS Q+++DC
Sbjct: 138 DFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPE 197
Query: 194 -AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
AG+ + GC+GG + ++ + L E +YP KD + S + +++
Sbjct: 198 EAGSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSV 256
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ E I ++ +GP+ A+NA Q Y+GGV Y C +NH V +VGY
Sbjct: 257 ISI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 139/298 (46%), Gaps = 35/298 (11%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E ELF + R++++Y+ E +R F+ F+ +L I+E N+ S G+ EF+D
Sbjct: 53 ESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRK---VSSYWLGLNEFAD 109
Query: 90 LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
L+ +EFK +L R SV + + +P DWR
Sbjct: 110 LTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGAS-------LPKSVDWRSK 162
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC +GN GC+GG
Sbjct: 163 GAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMD 222
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS----PNG----------VKIKSYTCD 253
++ N L E YP L+++ C+R ++S P V I Y D
Sbjct: 223 YAFSYIAHNG-GLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYE-D 280
Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+E ++L +A PV A+ A +Q+Y GGV C ++H V VGY
Sbjct: 281 VPRNNEQALLKALAQQ-PVSVAIEASGRNFQFYSGGVFDGPCG---TQLDHGVAAVGY 334
>gi|118365752|ref|XP_001016096.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297863|gb|EAR95851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 153/311 (49%), Gaps = 28/311 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LC LA + + +KL ++ + ++++ Y +EH+ F+ F ++L
Sbjct: 7 LLSIIMLMPLC-LAQNISI------EKLLTYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 58
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
I+E N + S + +FSD+++EEF + L + + H + + H+ ++
Sbjct: 59 QKIQEHNSDPNKTYSIH--LNQFSDMTKEEFAEKILMKQDLVNHFIKEMDQQVTHNDSNS 116
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + + +TI I DWR G + V+NQ +CG+CW FS ES + +KN L
Sbjct: 117 ETQLNSKSLTIAASI----DWRTKGAVTSVKNQGSCGSCWTFSAAALMESFNFIKNKVLV 172
Query: 184 LLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
S Q+++DC G + GCSGG + LD+ +KV + +YP + C
Sbjct: 173 DFSEQQLVDCVTPANGYQSYGCSGGWPVSCLDY--ASKVGITTLDKYPYVAVQKNCNVTG 230
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
T+ NG K K + IP+ S+ PV V+A W Y G+ CD S N
Sbjct: 231 TN-NGFKPKGW---IYIPNTSNEFKTALNFSPVSVIVDATNWGNYQSGIFN-GCDQSHIN 285
Query: 300 INHAVQIVGYD 310
NHAV +VGYD
Sbjct: 286 YNHAVLVVGYD 296
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 137/286 (47%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++ + + LF S+ ++ K Y + + RF+ F +L I+E NK + G+ EF
Sbjct: 41 SIHKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN---YWLGLNEF 97
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK + L ++ K + S G +P DWR+
Sbjct: 98 ADLTHEEFKHKFLGFKGE----LAERK---------DESSKEFGYRDFVDLPKSVDWRKK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV E ++ + G L++LS QE+IDC N GC+GG
Sbjct: 145 GAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG--- 201
Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D+ V + L E EYP ++ + C K V I Y D E+S L
Sbjct: 202 -LMDYAFAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYH-DVPRNDEASFLKA 259
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C L +H V VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 144/285 (50%), Gaps = 29/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ S+ ++ KSY+ E + RF+ F+ +L I+E N ++ + G+ F+D
Sbjct: 40 DEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRT---YKVGLNRFAD 96
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ +E+++ +L +S K D + G ++P + DWRE G
Sbjct: 97 LTNDEYRSMYLGARTGSRRRLSTQKRSDRY-------VPVAGESLPDSV----DWREKGA 145
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 146 VVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGG----L 201
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ + ++ E +YP +D C + + V I Y D + +E ++ +
Sbjct: 202 MDYAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYE-DVPVNNEQALQKAV 260
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A PV A+ A + +Q+Y GV NC +L +H V VGY
Sbjct: 261 ANQ-PVSVAIEASGMAFQFYESGVFTGNCGTAL---DHGVTAVGY 301
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)
Query: 36 LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
+F F+ +++ Y+ S EH++RF F +L IE+LNK + +A+YG+T+F+D++ E
Sbjct: 1499 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 1556
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ + +++ H +H N V G+ +P DWR+ G + +V+
Sbjct: 1557 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 1606
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ +CG+CWAFS V E +H +K L S QE+IDC N GC GG +D D
Sbjct: 1607 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 1659
Query: 215 VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
K + LE E++YP K S + V++K + +E+ I + +
Sbjct: 1660 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 1717
Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GP+ +NA Q+Y GG+ ++ + +I+H V IVGY
Sbjct: 1718 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 1758
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 136/279 (48%), Gaps = 29/279 (10%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++F++F ++Y K+YS +E RF F+ +++ I N + S G+ EF+DLS EE
Sbjct: 40 DMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRL--HNTLANASYTMGLNEFADLSFEE 97
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FK ++ + KHV + ++ H P DWR + + ++
Sbjct: 98 FKGKYFGY---KHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141
Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
+Q CG+CWAFS + E L+ TL+ LS Q+++DC+ + G+ GC+GG ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEY 201
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ NK + ES YP C++ T V I Y D E+S+L + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGYK-DVASGDEASLLNAVGTVGPV 257
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y GV C N++H V VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293
>gi|28194643|gb|AAO33583.1|AF479265_1 cathepsin P [Meriones unguiculatus]
Length = 334
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 151/307 (49%), Gaps = 36/307 (11%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V ++ LCF LA+ V PNL+ + E ++++YKK+YS +R +E+++ I++
Sbjct: 5 VFVVILCFGLALGASVHDPNLDAQWE---EWKEKYKKNYSPEVEAVRRAIWEENMRIVKL 61
Query: 70 LN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N +N + F DL+ EF+ V + +
Sbjct: 62 HNGENGLGKNGFTMELNSFGDLTGGEFRNPMADIPVPAALTVERKDKK------------ 109
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
I G+P K+W G + VRNQ TCG+CWAF+ E K G L+ LSVQ
Sbjct: 110 -----IVDGLPKFKNWINEGYVTPVRNQGTCGSCWAFAATGAIEGQMFWKTGKLTPLSVQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVV-LEPESEYPLLLKDAACKRKATSPNGVK 246
++DC+ GN GC+ G A +M VN+ L+ E YP K C+ +++
Sbjct: 165 NLVDCSEKQGNKGCAQGS--AFRAFMYVNETKGLQDEISYPYEGKQGTCRYNSSNS---- 218
Query: 247 IKSYTCD-TLIP-SESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
++Y D L+P +E +L +A+ GPV AAV+A ++++Y GG I Y S ++NH
Sbjct: 219 -RAYVTDFRLLPQNEIYLLVAVASIGPVAAAVDASQDSFRFYRGG-IYYEPKCSQYSVNH 276
Query: 303 AVQIVGY 309
AV +VGY
Sbjct: 277 AVLVVGY 283
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 155/317 (48%), Gaps = 30/317 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACLVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+S+ E + + A +G+T+FSD+S EE + +L + K++
Sbjct: 67 QSM---ERAKEEAAANPYATFGVTQFSDMSPEELRATYLNGA----------KYYAAALK 113
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+K + + TG P DWR+ G + V++Q+ CG+CWAFS E +
Sbjct: 114 RPRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFSATGNIEGQWKVAGH 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L+ LS Q ++ C N + GC GG L W+ NK + E YP D
Sbjct: 169 ELTSLSEQMLVSC-DNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227
Query: 240 TSPN--GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
S G KI + L E++I +A +GPV AV+A ++ Y GGV+ +C S
Sbjct: 228 MSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDASSFLDYKGGVLT-SC--SS 282
Query: 298 ANINHAVQIVGYDNYSR 314
+NH V +VGYD+ S+
Sbjct: 283 DALNHDVLLVGYDDTSK 299
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 150/316 (47%), Gaps = 35/316 (11%)
Query: 2 FDVKNVLFIVALI----ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIR 56
F +N +ALI AL A+ + ++ +K E + S R+ + Y+ +E +IR
Sbjct: 3 FTTRNGCISLALIFLLGALVSQAMARTLQDASMHEKHEEWMS---RFGRVYNDGNEKEIR 59
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
+K F++++ IE NK S +S + GI +F+DL+ EEFKT R+ H+ S
Sbjct: 60 YKIFKENVQRIESFNK--ASGKSYKLGINQFADLTNEEFKTS--RNRFKGHMCSSQAGPF 115
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+ + T P DWR+ G + +++Q CG+CWAFS V E +
Sbjct: 116 RYEN--------------LTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQ 161
Query: 177 LKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
L L LS QE++DC G + GC GG +++ N+ L E+ YP D C
Sbjct: 162 LATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQ-GLTTEANYPYEGSDGTC 220
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
K + + KI + D +E +++ +A PV A++A +Q+Y G+ +C
Sbjct: 221 NTKQEANHAAKINGFE-DVPANNEGALMKAVAKQ-PVSVAIDAGGFGFQFYSSGIFTGDC 278
Query: 294 DGSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 279 GTEL---DHGVAAVGY 291
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 144/311 (46%), Gaps = 43/311 (13%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLD 65
+LFI+A A A + + ++ ++ E + RY + Y + E + RFK F+ ++
Sbjct: 14 LLFILA--AWASQATSRSLHEASMYERHE---DWMARYGRMYKDANEKEKRFKIFKDNVA 68
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE NK ++ + I EF+DL+ EEF R LR+ H+
Sbjct: 69 RIESFNKAMD--KTYKLSINEFADLTNEEF--RSLRNRFKAHIC---------------S 109
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ T T +P DWR+ G + +++QQ CG CWAFS V E + + G L L
Sbjct: 110 EATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISL 169
Query: 186 SVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVV----LEPESEYPLLLKDAACKRKAT 240
S QE++DC G N GCSGG L+D D + + L E+ YP D C K
Sbjct: 170 SEQELVDCDTGGENQGCSGG----LMD--DAFRFIKIHGLASEATYPYEGDDGTCNSKKE 223
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
+ KIK Y D +E ++ +A H PV A++A +Q+Y GV C L
Sbjct: 224 AHPAAKIKGYE-DVPANNEKALQKAVA-HQPVAVAIDAGGFEFQFYTSGVFTGQCGTEL- 280
Query: 299 NINHAVQIVGY 309
+H V VGY
Sbjct: 281 --DHGVAAVGY 289
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 147/320 (45%), Gaps = 35/320 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI--RFKNFE 61
+K VL L+A+ LA+P+ S N + + S++ +Y K+Y +E++ R F
Sbjct: 1 MKTVLAFACLVAVG-LALPL--SDDNQAE----WESYKAKYGKTYESNENEAARRTIYFM 53
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++E + Q S + G+ F+D+ EF+ K + +
Sbjct: 54 AKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFR-----------------KMMNGYRR 96
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ S+ + +P DWR G + ++NQ CG+CWAFST + E HALK G
Sbjct: 97 GTPRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEGQHALKKGK 156
Query: 182 LSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LS QE++DC A GN GC GG ++ N + + E YP +D C K
Sbjct: 157 LVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGI-DTEQSYPYTGEDGTCSFK-K 214
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLA 298
S + + D SES + AT GP+ A++A +W Q Y GV + D S
Sbjct: 215 SDVAATVTGFV-DVTSGSESGLQDASATIGPISVAIDASSWDFQLYESGVYDVS-DCSTT 272
Query: 299 NINHAVQIVGY--DNYSRTW 316
++H V +VGY D+ + W
Sbjct: 273 ELDHGVLVVGYGTDDGTAYW 292
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)
Query: 36 LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
+F F+ +++ Y+ S EH++RF F +L IE+LNK + +A+YG+T+F+D++ E
Sbjct: 1523 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 1580
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ + +++ H +H N V G+ +P DWR+ G + +V+
Sbjct: 1581 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 1630
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ +CG+CWAFS V E +H +K L S QE+IDC N GC GG +D D
Sbjct: 1631 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 1683
Query: 215 VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
K + LE E++YP K S + V++K + +E+ I + +
Sbjct: 1684 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 1741
Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GP+ +NA Q+Y GG+ ++ + +I+H V IVGY
Sbjct: 1742 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 1782
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 138/286 (48%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ + LF S+ K Y + I RF+ F+ +L I+E NK S G+ EF
Sbjct: 14 SIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKNSS---YWLGLNEF 70
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ +EFK +++ ++ + + HV + P I DWR+
Sbjct: 71 ADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--------VDYPESI----DWRQK 118
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFSTV T E ++ + G L LS QE++DC + GC GG
Sbjct: 119 GAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRRSH-GCKGGYQT 177
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V E EYP K C+ K + VKI Y +P+ E S++
Sbjct: 178 TSLQYVADNGV--HTEKEYPYEKKQGKCRAKDKKGSKVKITGY---KRVPANNEVSLIQA 232
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
IA PV V + +Q+Y GG+ + C ++HAV VGY
Sbjct: 233 IANQ-PVSVVVESKGRAFQFYKGGIFEGPCG---TKVDHAVTAVGY 274
>gi|118365722|ref|XP_001016081.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297848|gb|EAR95836.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 337
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 155/310 (50%), Gaps = 25/310 (8%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+L I+ L+ LCF A + + +KL ++ + ++++ Y ++ E R F ++L
Sbjct: 7 LLSIIVLMPLCF-AQDISI------EKLLAYNKWSSQHQRVYLNEDEKLFRQMVFFENLQ 59
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVK 124
I+E N N + S + +FSD++++EF + L + ++ H++ + H+ + K
Sbjct: 60 KIKEHNSNPNNTYSIH--LNQFSDMTKQEFAEKILMKQNIVDHLMKGISQEATHNDTNNK 117
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ + + I + DWRE G I V+NQ CG+CW+FS ES + ++N TL
Sbjct: 118 ETQLNSKSLI---LADSIDWREQGAITTVKNQGNCGSCWSFSAAALMESFNFIQNNTLVD 174
Query: 185 LSVQEVIDCA--GNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
S Q+++DC NG + GCSGG LD+ +KV + +YP + C T
Sbjct: 175 FSEQQLVDCVIPANGYYSYGCSGGAAVYCLDY--ASKVGITTLDKYPYVRIQKNCNVTGT 232
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
+ NG K K + +P+ S+ L PV V+A W Y G+ CD S ++
Sbjct: 233 N-NGYKPKQW---IKVPNTSNDLKSALNFSPVSVVVDATNWDNYESGIFN-GCDQSNISL 287
Query: 301 NHAVQIVGYD 310
NHAV +GYD
Sbjct: 288 NHAVLAIGYD 297
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K+Y S EHD R F+ ++ +++Q +A +G+T+FSDL+ EF
Sbjct: 49 FAVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQQLDPAAVHGVTQFSDLTPTEF 105
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDRGAVTPVK 149
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG+CW+FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D R + K+ +++ +L E I ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 307
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 155/330 (46%), Gaps = 49/330 (14%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNF 60
K + +I + +C V+V+ L Q ++ QQ +Y K Y+ E + RF+ F
Sbjct: 5 KQLYYISLALLMCLGLWAVQVTSRTL-QDASMYERHQQWMGQYAKIYNDHQEWEKRFQIF 63
Query: 61 EKSLDIIEELNKNRQSPESARY---GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
+++++ IE NK E R+ G+ +F DL+ EEF R+ H+ S + +
Sbjct: 64 KENVNYIETSNK-----EGGRFYKLGVNQFVDLTNEEFIAP--RNRFKGHMCSSIIRTNT 116
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + +V T +P DWR+ G + V++Q CG CWAFS V E +H L
Sbjct: 117 YKYENV------------TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQL 164
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
G L LS QE++DC G + GC GG L+D D K + L+ E++YP
Sbjct: 165 STGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLDTEAKYPYQGV 218
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
D C S N I SY D +E ++ +A P+ A++A +Q+Y GV
Sbjct: 219 DGTCNANEASINAATITSYE-DVPTNNEQALQKAVANQ-PISVAIDASGSDFQFYTSGVF 276
Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
+C L +H V VGY D+ ++ W
Sbjct: 277 TGSCGTEL---DHGVTAVGYGVSDDGTKYW 303
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 141/293 (48%), Gaps = 35/293 (11%)
Query: 31 EQKLEL---FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
+ LEL F+SF QR+ K+Y +E H R F+ +L +++ SA +GIT+
Sbjct: 44 DNDLELSSHFTSFVQRFGKTYKDAEEHAHRLSVFKANL---RRARRHQLLDPSAEHGITK 100
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWR 145
FSDL+ EF+ L ++ + H +PT G+P DWR
Sbjct: 101 FSDLTPAEFRRTFLGLKTSRRSFLREIGGSAH-----------DAPVLPTDGLPDDFDWR 149
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNG 197
+ G +G V+NQ +CG+CW+FS E + L G + +LS Q+ +DC +
Sbjct: 150 DHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSC 209
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
+ GC+GG + ++ + LE E +YP +D CK S +++++ ++
Sbjct: 210 DAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGRDGTCKFD-KSKIVASVQNFSVVSV-- 265
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ HGP+ +NA Q Y+GGV Y C SL +H V +VGY
Sbjct: 266 DEEQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRSL---DHGVLLVGY 315
>gi|394331828|gb|AFN27133.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 138/281 (49%), Gaps = 27/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWR+ G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
+Q CG+CWAFS V + ES AL L+ LS ++ C N G G +W+
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSDHHLVSCHDKDN-GRPAGLMLQAFEWLL 201
Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++N + E YP + C + G +I Y T+ SE+ + +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYV--TIESSETVMAAWLAKN 258
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ A++A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 259 GPISIALDASSFMSYQSGVVT-SCAG--MPLNHGVLLVGYN 296
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 41/320 (12%)
Query: 7 VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
VLF VA A F + P+++ EQ L++ F+ F RY K Y S E
Sbjct: 9 VLFCVASAAAGFSFHDSNPIRMVSDVEEQLLQVIGESRHAVSFARFANRYGKRYDSVDEM 68
Query: 54 DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
+RFK F +++++I NK R S + G+ F+D + EEF++ L + N L +
Sbjct: 69 KLRFKIFSENIELIRSSNKRRLS---YKLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 125
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
HK D + +P +KDWR+ GI+ V++Q +CG+CW FST E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALE 167
Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
S +A G LS Q+++DCAG N GCSGG +++ N LE E YP
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEEAYPYTGS 226
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
+ CK ++ VK+ + + + +E + IA PV A + ++ Y GV
Sbjct: 227 NGLCKFRSEHV-AVKVLG-SVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYT 284
Query: 291 YN-CDGSLANINHAVQIVGY 309
C + ++NHAV VGY
Sbjct: 285 STACGSTPMDVNHAVLAVGY 304
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 134/284 (47%), Gaps = 25/284 (8%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSP-ESARYGITEFSDLSEEE 94
F F+ +Y K Y +E + R F++SLD IE+ N + + G+ EF+DL+ EE
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90
Query: 95 FKTRH---LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
F+ H L +K ++ H D H H + + +GI DWR+ G +
Sbjct: 91 FRQHHVTRLPFDDDKRDPVTATLHLDEHAVHAADSNGDS-----SGI----DWRKRGAVT 141
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
VRNQ CG F+ VE E MHA+ +G L LS Q+VIDC +G GCSGG +
Sbjct: 142 PVRNQGQCGNPAIFAAVEAVEGMHAISSGNLVELSTQQVIDC--SGTPGCSGGSLVSFFK 199
Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
++ N L+ ++YP C + + + K+ Y+ + P + L P
Sbjct: 200 YIARNG-GLDSAADYPTSGAGGQCNKAKEARHVAKVGGYS--VVPPRNETKLAAAVFKMP 256
Query: 272 VIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY-DNY 312
V A+ A T +Q Y GV C L +HAV +VGY D Y
Sbjct: 257 VAVAIEADTPSFQMYTSGVYSGPCGTQL---DHAVLVVGYTDEY 297
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 157/316 (49%), Gaps = 31/316 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF---EKSLDI 66
++ + AL LA +S +LE F S++ ++ K Y E + + KN + L +
Sbjct: 4 LIVITALVALASATSISLEDLE-----FHSWKLKFGKIYKSVEEESQRKNTWLENRKLVL 58
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ + + Q +S R G+T F+D+ +E+ R SV K L S ++ H + +
Sbjct: 59 VHNMLAD-QGIKSYRLGMTYFADMDNQEY-----RQSVFKGCLGSFNRTKGHRASTFLLQ 112
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+ G +P + DWR+ G + +V++Q+ CG+CWAFS + E K G L LS
Sbjct: 113 A--GGAVLPDTV----DWRDKGYVAEVKDQKNCGSCWAFSATGSLEGQTFRKTGKLVSLS 166
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+G GNMGC GG ++++ NK + + E YP D C+ K + G
Sbjct: 167 EQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGI-DTEESYPYEATDGDCRFKPATV-GA 224
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINH 302
Y D E+++ +A GP+ A++A +++Q Y G+ + NC S +++H
Sbjct: 225 TCTGYV-DINSEDENALQKAVANIGPISVAIDAGHISFQLYGSGIYNEPNC--SSEDLDH 281
Query: 303 AVQIVGY--DNYSRTW 316
V VGY DN W
Sbjct: 282 GVLAVGYGTDNQQDYW 297
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 142/295 (48%), Gaps = 37/295 (12%)
Query: 31 EQKLEL-----FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
+ +LEL F SF QR+ KSY +E H R F+ +L +++ SA +G+
Sbjct: 37 DNELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFKANL---RRARRHQLLDPSAEHGV 93
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
T+FSDL+ EF+ +L ++ L+ +S +PT G+P D
Sbjct: 94 TKFSDLTPAEFRRTYLGLRKSRRALLRE-----------LGKSANEAPVLPTDGLPDDFD 142
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
WR+ G + V+NQ +CG+CW+FST E H L G L +LS Q+++DC
Sbjct: 143 WRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPD 202
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ + GC+GG ++ LE E +YP D CK S +++++ ++
Sbjct: 203 SCDSGCNGGLMTNAFSYLQ-KAGGLESEKDYPYTGSDDKCKFD-KSKIVASVQNFSVVSV 260
Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ HGP+ +NA Q Y+GGV Y C +L +H V +VGY
Sbjct: 261 --DEGQIAANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRTL---DHGVLLVGY 310
>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
Length = 354
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 157/322 (48%), Gaps = 43/322 (13%)
Query: 4 VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
V +LF+V ALIA L + ++ + + F++R+ K + + +E RF
Sbjct: 12 VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 64
Query: 59 NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F++++ LN + A Y ++ +F+DL+ +EF +L N + H K +
Sbjct: 65 AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 117
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
H HV S+ +G+ + DWRE G++ V+NQ CG+CWAF+T E AL
Sbjct: 118 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 168
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
KN +L LS Q ++ C N + GC+GG + W+ D N V E YP A
Sbjct: 169 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWIINDHNGTV-PTEDSYP--YTSAGG 224
Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
R N G KIK Y +L E I + +GPV AV+A TWQ Y GGV+
Sbjct: 225 TRPPCHDNGTVGAKIKGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 281
Query: 293 CDGSLANINHAVQIVGYDNYSR 314
C G ++NH V +VG++ ++
Sbjct: 282 CFG--LSLNHGVLVVGFNRQAK 301
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 145/298 (48%), Gaps = 36/298 (12%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V ++P + + FS F++++ K Y+ SE HD R F+ +L ++++ SAR
Sbjct: 42 VDGAEPKVLSSEDHFSLFKRKFGKVYASSEEHDYRLSVFKANL---RRARRHQKLDPSAR 98
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPV 140
+G+T+FSDL+ EF+ +HL V D + + +PT +P
Sbjct: 99 HGVTQFSDLTRSEFRKKHL------GVRGGFKLPKDANKAPI----------LPTENLPE 142
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC------- 193
DWR+ G + V+NQ +CG+CW+FS E + L G L LS Q+++DC
Sbjct: 143 DFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPE 202
Query: 194 -AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
AG+ + GC+GG + ++ + L E +YP KD + S + +++
Sbjct: 203 EAGSCDSGCNGGLMNSAFEYT-LKTGGLMREEDYPYTGKDGPTCKLDKSKIVASVSNFSV 261
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ E I ++ +GP+ A+NA Q Y+GGV Y C +NH V +VGY
Sbjct: 262 ISI--DEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYIC---ARRLNHGVLLVGY 314
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 145/315 (46%), Gaps = 33/315 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+ + AL+ +C + P E LF +F+ + ++Y S E RF+ F ++
Sbjct: 3 TVIVAALLMVC-----NAMGAPTTEV---LFGNFKAAHARNYASPDEERKRFEIFAGNMK 54
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
LN R++P A +G EF+D++ EEF+TRH K+ K
Sbjct: 55 KAAVLN--RKNP-MATFGPNEFADMTSEEFQTRHNAARHYAAAKARPPKNTKTFTAEEIK 111
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
++ I DWR G + V+NQ CG+CW+FST E HA+ G L +
Sbjct: 112 AAVGQQI----------DWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAV 161
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS 241
S QE++ C + GC+GG W+ +K + E+ YP + + AC S
Sbjct: 162 SEQELVSCDPIDD-GCNGGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPES 220
Query: 242 -PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
P G I ++ + +E + + HGP+ V+A TWQ Y GG++ Y C I
Sbjct: 221 KPVGATISAF--QDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGIMSY-CPQD--QI 275
Query: 301 NHAVQIVGYDNYSRT 315
+H V IVG+D+ + T
Sbjct: 276 DHGVLIVGFDDTAST 290
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 134/280 (47%), Gaps = 28/280 (10%)
Query: 34 LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ LF S+ ++ K Y D + FE +D ++ ++ + + G+ EF+DL+ E
Sbjct: 46 IHLFESWLAKHSKIYESL--DEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHE 103
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK + L + + +++ S + +P + DWR+ G + V
Sbjct: 104 EFKNKFLGLK---------GELPERKDESIEEFSYRDFVDLPKSV----DWRKKGAVAPV 150
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW- 212
+NQ CG+CWAFSTV E ++ + G L++LS QE+IDC N GC+GG L+D+
Sbjct: 151 KNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG----LMDYA 206
Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
V + L E EYP ++ + C K V I Y D +E S L +A P
Sbjct: 207 FAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSETVTISGYH-DVPRNNEDSFLKALANQ-P 264
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A+ A +Q+Y GGV +C L +H V VGY
Sbjct: 265 ISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 39/292 (13%)
Query: 31 EQKLEL-FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESA-RYGITEF 87
E +LE F F+ + + Y E ++ K+ F +L I N + + +S + F
Sbjct: 26 EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS EEF+ + V ++ H D N V+ +P DW
Sbjct: 86 TDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVE------------ALPATVDWTTK 130
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDF 206
G++ ++NQQ CG+CWAFS V + E HALK G L LS Q ++DC A G+MGCSGG
Sbjct: 131 GVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGG-- 188
Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
WMD + ++ E+ YP D +C+ K S G I S+ D ES
Sbjct: 189 -----WMDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSV-GATIHSFV-DVKTGDES 241
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
++ +A+ GP+ A++A ++Q+Y GV YN D S ++H V VGY
Sbjct: 242 ALQNAVASIGPISVAIDAAQPSFQFYSSGV--YNEPDCSTEILDHGVTAVGY 291
>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
Length = 501
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 143/283 (50%), Gaps = 24/283 (8%)
Query: 35 ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+LF +++ + K+Y + E ++R +NF+KS+ + E N R+S G+ +F+DLS E
Sbjct: 48 DLFGKWKELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNE 107
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGII 150
EFK ++ K N +K + +++ + P DWR+ G++
Sbjct: 108 EFKEMYMS------------KVKGSRSNELKMGGVKRNMSVSSRTCDAPTSLDWRDKGVV 155
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+++Q CG+CWAFS + ES +A+ G L LS QE++DC + GC GG+
Sbjct: 156 TPMKDQGQCGSCWAFSVSGSIESANAIATGDLIRLSEQELVDC-DTYDYGCDGGNMDTAY 214
Query: 211 DWMDVNKVVLEPESEYPLLL---KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
W+ + L+ E +YP +D C + ++ + V + SY + +E ++L +A
Sbjct: 215 RWI-IKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYV--EVESNEDAVLCAVA 271
Query: 268 THGPVIAAV-NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
T I V +A +Q Y GGV C +I+HAV IVGY
Sbjct: 272 TTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGY 314
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 135/281 (48%), Gaps = 26/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDAACKRKATSPN----GVKIKSYTCDTLIPSESSILTDIATH 269
N L E YP + + + S G +I + + SE ++ +A +
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHV--LIGSSEKAMAAWLAKN 259
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 GPIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 297
>gi|118366325|ref|XP_001016381.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89298148|gb|EAR96136.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 337
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 149/317 (47%), Gaps = 30/317 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY----SKSEHDIRFKN 59
+ N +AL+AL + + + PN +LE +++ Q + + +E R
Sbjct: 1 MNNKFISLALVALLICSSLAQQTNPN--HQLEALTAYNQWKNNNLRIYINDAEKQYRQSV 58
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F ++ I+E N N+ + + G+ +FSD+++EEF + +LMS+ +
Sbjct: 59 FLENFQKIKEHNANQ--ANTYQQGLNQFSDMTQEEFVQK---------ILMSNSQADSSQ 107
Query: 120 HNHVKKRSITTGITIPTGIPVKK--DWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ S T P+ DWR G + V+NQ CG+CWAFS+ ES + +
Sbjct: 108 SLSAPQSSSNNQNLTATASPIAASVDWRTKGAVTPVKNQGNCGSCWAFSSTGAMESFNFI 167
Query: 178 KNGTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
KN LS S Q+++DCA G + GC+GG + L + +KV ++ ES+YP
Sbjct: 168 KNKVLSSFSEQQLVDCAIQQNGYYSHGCNGGSYYQAL--LYASKVGMKTESQYPYTAIWG 225
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
C+ T+ NG K ++ + + L PV A++A Y GV NC
Sbjct: 226 TCQVSGTN-NGYKPVAFGS---VGQNTLALQTALNAAPVSIAMDATNLYLYTSGVYN-NC 280
Query: 294 DGSLANINHAVQIVGYD 310
+ S N+NHAV VGYD
Sbjct: 281 NPSSINLNHAVLAVGYD 297
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 144/309 (46%), Gaps = 25/309 (8%)
Query: 6 NVLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
N+ + +++LC LA P+L+ +L+ S+ + K Y + E R +EK+L
Sbjct: 15 NMNVCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSW---HSKDYHEREESWRRVVWEKNL 71
Query: 65 DIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
+IE N + S + G+ +F D++ EEF+ LM+ +KH +
Sbjct: 72 KMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQ-----------LMNGYKHKKSERKYR 120
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + P DWRE G + V++Q CG+CWAFST E H K G L
Sbjct: 121 GSQFLEPSFLEA---PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLV 177
Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q ++DC+ GN GC+GG ++ N + + E YP KD R
Sbjct: 178 SLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEY 236
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
N + D E +++ +A+ GPV A++A ++Q+Y G I Y D S ++
Sbjct: 237 NAANDTGFV-DIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDL 294
Query: 301 NHAVQIVGY 309
+H V +VGY
Sbjct: 295 DHGVLVVGY 303
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 140/303 (46%), Gaps = 23/303 (7%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
IVA++ L LA NL + F FQ +Y K Y E + F+ SL I+
Sbjct: 5 LIVAILLLVALA---SARTSNLSFEETQFREFQLKYNKHYESHEFAQKLATFKNSLKRIQ 61
Query: 69 ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
ELN +++ +G+ +F+DLS+EEF +L M + ++ K
Sbjct: 62 ELNDMAKRAKVDTEFGVNKFADLSKEEFANYYLNKGG-----MESTDSETYAPDYSDKE- 115
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+ +P DWR G + V++Q CG+CW+FST E L L+ LS
Sbjct: 116 -------ISNLPTSFDWRTQGAVTPVKDQGQCGSCWSFSTTGNVEGQWFLAGNDLTGLSE 168
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL-LKDAACKRKATSPNGVK 246
Q ++DC+ N GC+GG D++ N + + E+ YP L ++ C+ + G K
Sbjct: 169 QNLVDCS-TKNDGCNGGLMPLAYDYIVENNGI-DTEASYPYLAIQQKNCQFNPANI-GAK 225
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
I Y + +E+ + ++ +GP+ A +A WQYY G+ N++H + I
Sbjct: 226 IDGYY--NVSSNETQMQINLVNNGPLSIAADAAEWQYYKKGIFSGIFGICGKNLDHGILI 283
Query: 307 VGY 309
VGY
Sbjct: 284 VGY 286
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/275 (32%), Positives = 128/275 (46%), Gaps = 38/275 (13%)
Query: 43 RYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
RY + Y + E + RFK F+ ++ IE NK ++ + I EF+DL+ EEF R LR
Sbjct: 3 RYGRMYKDANEKEKRFKIFKDNVARIESFNKAMD--KTYKLSINEFADLTNEEF--RSLR 58
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
+ H+ + T T +P DWR+ G + +++QQ CG
Sbjct: 59 NRFKAHIC---------------SEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGC 103
Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVV- 219
CWAFS V E + + G L LS QE++DC G N GCSGG L+D D + +
Sbjct: 104 CWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGG----LMD--DAFRFIK 157
Query: 220 ---LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
L E+ YP D C K + KIK Y D +E ++ +A H PV A+
Sbjct: 158 IHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYE-DVPANNEKALQKAVA-HQPVAVAI 215
Query: 277 NA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A +Q+Y GV C L +H V VGY
Sbjct: 216 DAGGFEFQFYTSGVFTGQCGTEL---DHGVAAVGY 247
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 136/276 (49%), Gaps = 30/276 (10%)
Query: 30 LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+++ +ELF S+ R+ K Y S E +RF+ F+ +L I+E NK + G+ EF+
Sbjct: 1 MDKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNK---VVSNYWLGLNEFA 57
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EFK ++L V+ ++ S +P DWR+ G
Sbjct: 58 DLSHHEFKKQYLGLKVD---------------FSTRRESSEEFTYRDVDLPKSVDWRKKG 102
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ ++NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 103 AVTNIKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGG---- 158
Query: 209 LLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D+ V L E +YP ++++ C+ V I Y D +E S+L
Sbjct: 159 LMDYAFSFIVENGGLHKEDDYPYIMEEGTCEMSKEESQVVTISGYH-DVPQNNEQSLLKA 217
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
+A P+ A+ A +Q+Y GGV +C LA+
Sbjct: 218 LANQ-PLSVAIEASGRDFQFYSGGVFDGHCGTQLAS 252
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 158/325 (48%), Gaps = 42/325 (12%)
Query: 1 MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
+F V +++F+ +++C + V ++P + + F+ F++++ K Y S EH
Sbjct: 8 LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF F+ +L + + + P SAR+G+T+FSDL+ EF+ +HL V
Sbjct: 67 YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117
Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
D + + +PT +P + DWR+ G + V+NQ +CG+CW+FST E
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167
Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
H L G L LS Q+++DC G+ + GC+GG + ++ + L E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYT-LKTGGLMREKD 226
Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
YP D + S + +++ ++ +E I ++ +GP+ A+NA Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284
Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
GGV Y C L NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 41/308 (13%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+V ++ F V+V + EL+ F++ Y K+Y+ + RF F+ +L ++
Sbjct: 9 LVVVVGCAFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
Q +A+YG+T+FSDL+ EEF +L +++ V + V+ +
Sbjct: 65 YQMQEQG--TAKYGVTQFSDLTPEEFAAMYLGSRIDERV------------DRVQLNDLQ 110
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
T P DWR+ G +G V +Q +CG+CWAFS E LK G L LS Q+
Sbjct: 111 TA-------PASVDWRKKGAVGPVEDQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQ 163
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
++DC + GCSGG + ++ ++ LE +S YP AC+ + K+
Sbjct: 164 LVDC-DRLDHGCSGG--YPPYTYKEIKRMGGLELQSAYPYTSWKQACRIDRS-----KLV 215
Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHA 303
+ D+++ E +A HGP+ +NA Q+Y G++ + C S +NHA
Sbjct: 216 AKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPSKAMC--SPEGLNHA 273
Query: 304 VQIVGYDN 311
V VGYD
Sbjct: 274 VLTVGYDT 281
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 139/286 (48%), Gaps = 40/286 (13%)
Query: 37 FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF R+ K+Y+ +E + R K FE +L + ++ P SA +GIT+FSDL+EEEF
Sbjct: 21 FKSFIARFGKAYATAEAYAHRLKVFEANL--VRAVSHQALDP-SAVHGITQFSDLTEEEF 77
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
K + L V + R +PT +P DWRE G + +V+
Sbjct: 78 KQQFLGLRVPSRL-----------------REANKAPVLPTNDLPEDFDWREHGAVTEVK 120
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ CG+CWAFST E H L+ G L LS Q+++DC + + GC+GG
Sbjct: 121 NQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLM 180
Query: 207 CALLDWMDVNKVVLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
D++ + LE E++YP + C+ A N + T+ E I +
Sbjct: 181 TNAYDYV-MKSGGLETETDYPYTGNSNGKCQFNA---NKIVASVANFSTVSLDEDQIAAN 236
Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
+ HGP+ +NA+ Q Y+GGV +C S +I+H V +VGY
Sbjct: 237 LVKHGPLAIGINAVFMQTYIGGV---SCPIICSKHHIDHGVLLVGY 279
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 135/286 (47%), Gaps = 25/286 (8%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF +F+ + ++Y S E RF+ F ++ LN R++P A +G EF+D++ EE
Sbjct: 9 LFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLN--RKNP-MATFGPNEFADMTSEE 65
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+TRH K+ K ++ I DWR G + V+
Sbjct: 66 FQTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQI----------DWRLKGAVTPVK 115
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CW+FST E HA+ G L +S QE++ C + GC+GG W+
Sbjct: 116 NQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPIDD-GCNGGLMDNAFGWLI 174
Query: 214 DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGVKIKSYTCDTLIPSESSILTDIATH 269
+K + E+ YP + + AC S P G I ++ + +E + + H
Sbjct: 175 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAF--QDIARTEEDMAAFVFKH 232
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
GP+ V+A TWQ Y GG++ Y C I+H V IVG+D+ + T
Sbjct: 233 GPLSIGVDASTWQSYAGGIMSY-CPQD--QIDHGVLIVGFDDTAST 275
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 136/286 (47%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++ + + LF S+ ++ K Y + + RF+ F +L I+E NK + G+ EF
Sbjct: 41 SIHKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN---YWLGLNEF 97
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK + L ++ K + S G +P DWR+
Sbjct: 98 ADLTHEEFKHKFLGFKGE----LAERK---------DESSKEFGYRDFVDLPKSVDWRKK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG CWAFSTV E ++ + G L++LS QE+IDC N GC+GG
Sbjct: 145 GAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG--- 201
Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D+ V + L E EYP ++ + C K V I Y D E+S L
Sbjct: 202 -LMDYAFAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYH-DVPRNDEASFLKA 259
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ A+ A +Q+Y GGV +C L +H V VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 139/286 (48%), Gaps = 40/286 (13%)
Query: 37 FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF R+ K+Y+ +E + R K FE +L + ++ P SA +GIT+FSDL+EEEF
Sbjct: 58 FKSFIARFGKAYATAEAYAHRLKVFEANL--VRAVSHQALDP-SAVHGITQFSDLTEEEF 114
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
K + L V + R +PT +P DWRE G + +V+
Sbjct: 115 KQQFLGLRVPSRL-----------------REANKAPVLPTNDLPEDFDWREHGAVTEVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ CG+CWAFST E H L+ G L LS Q+++DC + + GC+GG
Sbjct: 158 NQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLM 217
Query: 207 CALLDWMDVNKVVLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
D++ + LE E++YP + C+ A N + T+ E I +
Sbjct: 218 TNAYDYV-MKSGGLETETDYPYTGNSNGKCQFNA---NKIVASVANFSTVSLDEDQIAAN 273
Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
+ HGP+ +NA+ Q Y+GGV +C S +I+H V +VGY
Sbjct: 274 LVKHGPLAIGINAVFMQTYIGGV---SCPIICSKHHIDHGVLLVGY 316
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 144/290 (49%), Gaps = 37/290 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+Q++ KSY SK EHD RF+ F+ +L + +++ SA +G+T+FSDL+ EF
Sbjct: 60 FSVFKQKFGKSYASKEEHDHRFRVFKANL---KRAQRHQALDPSATHGVTQFSDLTPSEF 116
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ L + L + K I +PT G+P DWR+ G + +V+
Sbjct: 117 RRSFLGLRSRRLGLPA----------DANKAPI-----LPTDGLPTDFDWRDKGAVSEVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E + L G L LS Q+++DC G+ + GC+GG
Sbjct: 162 NQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLM 221
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D + S + +++ +L E I ++
Sbjct: 222 NSAFEYT-LKSGGLMKEQDYPYTGTDRGTCKFDKSKIAASVANFSVVSL--DEEQIAANL 278
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY--DNYS 313
+GP+ A+NA+ Q Y+ GV Y C +++H V +VGY D Y+
Sbjct: 279 VKNGPLAVAINAVFMQTYIKGVSCPYICS---KHLDHGVLLVGYGSDGYA 325
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 136/285 (47%), Gaps = 39/285 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ ++KKSY S+ EHD RF F+ +L ++++ +A +G+T+FSDL+ EF
Sbjct: 53 FSLFKSKFKKSYGSQEEHDYRFSVFKANL---RRAARHQELDPTASHGVTQFSDLTPAEF 109
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ K VL N +PT +P DWR+ G +G ++
Sbjct: 110 R---------KQVLGLRRLRLPKDANEAP--------ILPTSDLPEDFDWRDKGAVGPIK 152
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L LS Q+++DC G+ + GC+GG
Sbjct: 153 NQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+ ++ + L E +YP D ACK N V + + E I +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGTDRDACK---FDKNKVAARVANFSVVSLDEDQIAAN 268
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 131/282 (46%), Gaps = 21/282 (7%)
Query: 34 LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
LE F + R+ + Y+ + E R + + ++++++E N R +F+DL+
Sbjct: 30 LERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG---YRLADNKFADLTN 86
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG---IPVKKDWREAGI 149
EEF+ + L + H + V I +G+ G +P DWRE G
Sbjct: 87 EEFRAKMLGFGRPRS---GGGAGHSTAPSTVA--CIGSGLMGRQGYSDLPKSVDWREKGA 141
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CWAFS V E ++ +KNG L LS QE++DC +GC+GG
Sbjct: 142 VAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWA 200
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+++ N+ L E YP + AC+ + V I Y + PS L A
Sbjct: 201 FEFVMKNR-GLTTERNYPYQGLNGACQTPKLKESAVSISGYM--NVTPSSEPDLLRAAAA 257
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV AV+A WQ Y GGV C A +NH V +VGY
Sbjct: 258 QPVSVAVDAGSFVWQLYGGGVFTGPC---TAELNHGVTVVGY 296
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 145/303 (47%), Gaps = 34/303 (11%)
Query: 24 KVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
K S E+ + +++ + ++ K+Y+ E + RF+ F+ +L ++E N +S +
Sbjct: 34 KSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRS---YKV 90
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG--IPV 140
G+ F+DL+ EE+++ L D +K +S + + +P
Sbjct: 91 GLNRFADLTNEEYRSMFLGTKT------------DSKRRFMKSKSASRRYAVQDSDMLPE 138
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWRE+G + +++Q +CG+CWAFSTV E ++ + G + LS QE++DC + G
Sbjct: 139 SVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG 198
Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
C+GG L+D+ +N ++ E +YP D C + + V I Y + + P
Sbjct: 199 CNGG----LMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDY--EDVPP 252
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYS 313
+ L H PV A+ A +Q YL GV C +L +H V +VGY DN +
Sbjct: 253 YDEMALKKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRAL---DHGVVVVGYGTDNGA 309
Query: 314 RTW 316
W
Sbjct: 310 DHW 312
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 144/298 (48%), Gaps = 33/298 (11%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDI---IEELNKNRQSP 77
P ++ + Q + LF F Y KSY+ + E R F ++L++ ++EL++
Sbjct: 139 PAPAAQEDSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRG---- 194
Query: 78 ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
SA YG+T+FSDL+EEEF+T +L + L+S + R++ G
Sbjct: 195 -SAEYGVTKFSDLTEEEFRTSYL------NPLLSS----------LPGRALRPGPATRGP 237
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
P DWR+ G + V+NQ CG+CWAFS E L+ G L LS QE++DC
Sbjct: 238 APASWDWRDHGAVTGVKNQGACGSCWAFSVTGNVEGQWFLRRGALLALSEQELVDC-DTL 296
Query: 198 NMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
+ C GG + + K+ LE E +Y + C + SP+ ++ + L
Sbjct: 297 DQACGGG--LPSNAYTAIEKLGGLETEKDYSYEGRKERC---SFSPDKARVYINSSVDLS 351
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGYDNYS 313
E + T +A +GPV A+NA Q+Y GV + S I+HAV +VGY + S
Sbjct: 352 RDEEELATWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWFIDHAVLLVGYGHRS 409
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 34/284 (11%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++ ++ ++ K+Y+ E + RFK F+ +L IEE N +S + G+ +F+DL+ EE
Sbjct: 47 VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEE--HNGAGDKSYKLGLNKFADLTNEE 104
Query: 95 FKTRHL----RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
++ L R NK +++ K D + + +P DWRE G +
Sbjct: 105 YRAMFLGTRTRGPKNKAAVVA--KKTDRYAYRAGEE-----------LPAMVDWREKGAV 151
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+++Q CG+CWAFSTV E ++ + G L+ LS QE++DC NMGC+GG L+
Sbjct: 152 TPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGG----LM 207
Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ V ++ E +YP KD C + V I Y D E S++ +A
Sbjct: 208 DYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYE-DVPTNDEKSLMKAVA 266
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A + +Q Y GV C N++H V VGY
Sbjct: 267 NQ-PVSVAIEAGGMEFQLYQSGVFTGRCG---TNLDHGVVAVGY 306
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 138/283 (48%), Gaps = 24/283 (8%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFS 88
E+ + ++ + ++ K+Y+ E + RF+ F+ +L I+E N +NR + + G+ F+
Sbjct: 40 EEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNR----TYKVGLNRFA 95
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DL+ EE++ +L + + K N + ++ G +P + DWRE G
Sbjct: 96 DLTNEEYRAIYLGTRSDPKRRFAKLK------NASPRYAVMPGEVLPESV----DWRETG 145
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ V++Q++CG+CWAFSTV E ++ + G L LS QE++DC +MGC+GG
Sbjct: 146 AVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDY 205
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
D++ + L+ E +YP D C S V I Y + + P + L
Sbjct: 206 AFDFI-IKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGY--EDVPPFDEKALQKAVA 262
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV AV A Q Y+ G+ C +L +H + VGY
Sbjct: 263 HQPVSVAVEAGGRALQLYVSGIFTGECGTAL---DHGIVAVGY 302
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 143/284 (50%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS+F+ ++ K+Y +K EHD RF F+ ++ + Q SA +G+T+FSDL+ EF
Sbjct: 51 FSTFKSKFGKTYATKEEHDHRFGVFKSNM---RRARLHAQLDPSAVHGVTKFSDLTPAEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ L + L +H +K I +PT +P DWR+ G + V+
Sbjct: 108 HRKFL--GLKPLRLPAH----------AQKAPI-----LPTNNLPKDFDWRDKGAVTNVK 150
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
+Q +CG+CW+FST E H L G L LS Q+++DC G+ + GC+GG
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ + ++ E +YP +D CK S + +Y+ +L E I ++
Sbjct: 211 NNAFEYL-IGSGGVQREKDYPYTGRDGTCKFD-KSKIAASVSNYSVISL--DEEQIAANL 266
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 267 VKNGPLAVAINAVYMQTYVGGVSCPYICG---KHLDHGVLLVGY 307
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 131/282 (46%), Gaps = 21/282 (7%)
Query: 34 LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
LE F + R+ + Y+ + E R + + ++++++E N R +F+DL+
Sbjct: 51 LERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG---YRLADNKFADLTN 107
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG---IPVKKDWREAGI 149
EEF+ + L + H + V I +G+ G +P DWRE G
Sbjct: 108 EEFRAKMLGFGRPRS---GGGAGHSTAPSTVA--CIGSGLMGRQGYSDLPKSVDWREKGA 162
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CWAFS V E ++ +KNG L LS QE++DC +GC+GG
Sbjct: 163 VAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWA 221
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+++ N+ L E YP + AC+ + V I Y + PS L A
Sbjct: 222 FEFVMKNR-GLTTERNYPYQGLNGACQTPKLKESAVSISGYM--NVTPSSEPDLLRAAAA 278
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV AV+A WQ Y GGV C A +NH V +VGY
Sbjct: 279 QPVSVAVDAGSFVWQLYGGGVFTGPC---TAELNHGVTVVGY 317
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 142/316 (44%), Gaps = 28/316 (8%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + G P KDWRE G + V+NQ CG+CWAF+ + E L
Sbjct: 105 FAAAQERARVPVDVEF-VGAPAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIECQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C N N GC GG W+ D N + E YP
Sbjct: 164 GNPLTRLSEQMLVSC-DNTNSGCGGGWPLVAFKWIVDRNNGTVYTEESYPYHSCIGISPP 222
Query: 238 KATSPN--GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
TS + G I Y T+ E+ I +A +GPV V+A +W +Y GGV+
Sbjct: 223 CTTSGHTVGATITGYV--TIPRDENGIAAWLAVNGPVAVVVDASSWIFYTGGVMTSCVSK 280
Query: 296 SLANINHAVQIVGYDN 311
L +HAV +VGY++
Sbjct: 281 QL---SHAVLLVGYND 293
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 39/292 (13%)
Query: 31 EQKLEL-FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESA-RYGITEF 87
E +LE F F+ + + Y E ++ K+ F +L I N + + +S + F
Sbjct: 26 EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS EEF+ + V ++ H D N V+ +P DW
Sbjct: 86 TDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVE------------ALPATVDWTTK 130
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDF 206
G++ ++NQQ CG+CWAFS V + E HALK G L LS Q ++DC A G+MGCSGG
Sbjct: 131 GVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGG-- 188
Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
WMD + ++ E+ YP D +C+ K S G I S+ D ES
Sbjct: 189 -----WMDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSI-GATIHSFV-DVKTGDES 241
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
++ +A+ GP+ A++A ++Q+Y GV YN D S ++H V VGY
Sbjct: 242 ALQNAVASIGPISVAIDASQPSFQFYSSGV--YNEPDCSTEILDHGVTAVGY 291
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 150/300 (50%), Gaps = 32/300 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
LI L +A+ + +E+ LF +F+ KSY ++ RF F ++ IE+ N
Sbjct: 6 LIGLLIVAVNASL----IEKHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHN 61
Query: 72 K-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
Q S + I +F+DL++EEFK H K VL + ++
Sbjct: 62 ALYEQGLVSYKKAINQFTDLTQEEFKAYLGLHV--KPVLNNTIQYE------------LK 107
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G+ +PT + DWR AG + V+NQ +CG+CW+F+ + E + K+ L LS Q++
Sbjct: 108 GLEVPTSV----DWRSAGQVTGVKNQGSCGSCWSFALTGSTEGAYYRKHKQLVSLSEQQL 163
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
+DC+ + N GC+GG A +++ + L+ ES YP D +CK +S KI +Y
Sbjct: 164 VDCSTSINYGCNGGFLDATFPYIE--QYGLQTESSYPYTGVDGSCKYD-SSKVVTKISNY 220
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
+L SES +L + + GPV ++A Y G+ N C + N+NHAV +VGY
Sbjct: 221 V--SLHGSESKVLEPVGSIGPVAITMDASYLSSYSSGIYAANKC--TTTNLNHAVLVVGY 276
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 135/283 (47%), Gaps = 42/283 (14%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F R+ K+Y SK E ++R + ++ ++ I N ++ S G +D + +E+
Sbjct: 42 FVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHN-SQNDGTSFTLGPNHLADYTHDEY 100
Query: 96 KTR---HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
K R+ K V + + IP DWRE G +
Sbjct: 101 KKMLGYKPRNKTGKEVYSTPNLKD---------------------IPESIDWREKGAVNA 139
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CWAFST+ + ES + ++ G L LS Q+++DC+ NGN GC+GGD +D+
Sbjct: 140 VKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDY 199
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD----TLIPSESSILTDIAT 268
+ + +E E +YP + KD C +A+ K D ++P + + L
Sbjct: 200 I-ASAGGVETEKDYPYVGKDQTCAFEAS-------KEVATDKGHINIVPGKFATLQAAIA 251
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GPV A+ A L +Q+Y G+ + G+ N++H V VGY
Sbjct: 252 EGPVSVAIEADSLFFQFYRSGIFDSSWCGT--NLDHGVAAVGY 292
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 141/314 (44%), Gaps = 38/314 (12%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
V I + C ++V+ L+ E + +Y K Y E + RFK F ++
Sbjct: 7 VYHISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTEN 66
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
++ +E N + +S + GI +F+DL+ EEF + S +K H + +
Sbjct: 67 VNYVEASNAD--DTKSYKLGINQFADLTNEEF-------------VASRNKFKGHMCSSI 111
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
R+ T + IP DWR+ G + V+NQ CG CWAFS V E +H L G L
Sbjct: 112 T-RTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLI 170
Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKR 237
LS QE++DC G + GC GG L+D D K + L E++YP D C
Sbjct: 171 SLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLSTEAQYPYEGVDGTCNA 224
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
S V I Y D SE ++ +A P+ A++A +Q+Y GV +C
Sbjct: 225 NKASVQAVTITGYE-DVPANSEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGT 282
Query: 296 SLANINHAVQIVGY 309
L +H V VGY
Sbjct: 283 EL---DHGVTAVGY 293
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 142/299 (47%), Gaps = 47/299 (15%)
Query: 27 KPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
K N++ +L+L F +F + K Y S E RF+ F ++ ++ L + Q SA Y
Sbjct: 267 KNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQG--SAIY 324
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----- 137
G T+F+DL++ EFK ++L S+T+ T+P
Sbjct: 325 GATQFADLTKNEFKKKYLGLD----------------------SSMTSKKTLPMAVIPQS 362
Query: 138 --IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
IP + DWR ++ V+NQ CG+CWAFS + E +ALK+ L LS QE+IDC
Sbjct: 363 ASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDC-D 421
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS--PNGVKIKSYTCD 253
N + GC GG + ++ N LE ES+YP + RK + VK+
Sbjct: 422 NLDNGCGGGLMTQAFEAVE-NLGGLETESDYPY---EGHADRKGCQLKKSDVKVSISKAV 477
Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ E I + HGP+ VNA Q+Y+GGV I C S +++H V IVGY
Sbjct: 478 NVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALC--SPKSLDHGVAIVGY 534
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 143/285 (50%), Gaps = 39/285 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F++R+ KSY S+ EHD RFK F+ +L +++Q SA +G+T+FSDL+ EF
Sbjct: 62 FSIFKRRFGKSYASQEEHDYRFKVFKANL---RRARRHQQLDPSATHGVTQFSDLTPAEF 118
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L L HD +K I +PT +P DWR+ G + V+
Sbjct: 119 RGTYLG-------LRPLKLPHD-----AQKAPI-----LPTNDLPEDFDWRDHGAVTAVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E + L G L LS Q++++C G+ + GC+GG
Sbjct: 162 NQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLM 221
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
++ + L E +YP D +CK T + +++ +L E I +
Sbjct: 222 NTAFEYT-LKAGGLMKEEDYPYTGTDRGSCKFDKTKI-AASVSNFSVISL--DEDQIAAN 277
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 278 LVKNGPLAVAINAVFMQTYVGGVSCPYICSKRL---DHGVLLVGY 319
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 142/299 (47%), Gaps = 47/299 (15%)
Query: 27 KPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
K N++ +L+L F +F + K Y S E RF+ F ++ ++ L + Q SA Y
Sbjct: 267 KNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQG--SAIY 324
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----- 137
G T+F+DL++ EFK ++L S+T+ T+P
Sbjct: 325 GATQFADLTKNEFKKKYLGLD----------------------SSMTSKKTLPMAVIPQS 362
Query: 138 --IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
IP + DWR ++ V+NQ CG+CWAFS + E +ALK+ L LS QE+IDC
Sbjct: 363 ASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDC-D 421
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS--PNGVKIKSYTCD 253
N + GC GG + ++ N LE ES+YP + RK + VK+
Sbjct: 422 NLDNGCGGGLMTQAFEAVE-NLGGLETESDYPY---EGHADRKGCQLKKSDVKVSISKAV 477
Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ E I + HGP+ VNA Q+Y+GGV I C S +++H V IVGY
Sbjct: 478 NVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALC--SPKSLDHGVAIVGY 534
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 152/315 (48%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ EEF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I +Y ++P + L T PV IAA + L Q+Y GG DG
Sbjct: 229 GKTA-AVQISNY---QVVPEGETSLLQAVTKQPVSIGIAASHDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S AN INHAV +GY
Sbjct: 279 SCANRINHAVTAIGY 293
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 149/284 (52%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ K+Y +K EHD RF F+ +L I+ + P SA++GIT+FSDL+ EF
Sbjct: 51 FTSFKSKFSKNYATKEEHDYRFGVFKSNL--IKAKLHQKLDP-SAQHGITKFSDLTASEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +NK + + H +K I +PT +P DWRE G + V+
Sbjct: 108 RRQFL--GLNKRLRLPAH---------AQKAPI-----LPTNNLPEDFDWREKGAVTPVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
+Q +CG+CWAFST E + L G L+ LS Q+++DC G+ + GC+GG
Sbjct: 152 DQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ + V+ E +Y +D +CK S + +++ +L E I ++
Sbjct: 212 NNAFEYILQSGGVVS-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEDQIAANL 267
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA Q Y+ GV Y C + A ++H V ++G+
Sbjct: 268 VKNGPLAVAINAAWMQTYMSGVSCPYIC--AKARLDHGVLLLGF 309
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/285 (30%), Positives = 136/285 (47%), Gaps = 37/285 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ +Y+K+Y ++ EHD RF+ F+ +L +N+ SA +G+T+FSDL+ +EF
Sbjct: 55 FSLFKSKYEKTYATQEEHDHRFRVFKANL---RRARRNQLLDPSAVHGVTQFSDLTPKEF 111
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L L + T +PT +P + DWRE G + V+
Sbjct: 112 RRKFLGLKRRGFRLPT---------------DTQTAPILPTSDLPTEFDWREQGAVTPVK 156
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ CG+CW+FS + E H L L LS Q+++DC A + + GCSGG
Sbjct: 157 NQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLM 216
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
++ + L E +YP +D ACK + + + E I +
Sbjct: 217 NNAFEYA-LKAGGLMKEEDYPYTGRDNTACKFDKSK---IAASVSNFSVVSSDEDQIAAN 272
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ HGP+ A+NA+ Q Y+GGV Y C S +H V +VG+
Sbjct: 273 LVKHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ---DHGVLLVGF 314
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 141/296 (47%), Gaps = 38/296 (12%)
Query: 26 SKPNL-EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
S PNL + S F++++KKSY S+ EHD RF F+ +L ++++ +A +G
Sbjct: 47 SSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHG 103
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKK 142
+T+FSDL+ EF+ K VL N +PT +P
Sbjct: 104 VTQFSDLTSAEFR---------KQVLGLRKLRLPKDANKAP--------ILPTNDLPEDF 146
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------A 194
DWRE G +G V+NQ +CG+CW+FST E H L G L LS Q+++DC
Sbjct: 147 DWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEP 206
Query: 195 GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
G+ + GC+GG + ++ + L E +YP D + + +++ +
Sbjct: 207 GSCDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGMDRGACKFDKDKVAAGVANFSVVS 265
Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
L E I ++ +GP+ A NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 266 L--DEDQIAANLVKNGPLAVATNAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 316
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 142/298 (47%), Gaps = 28/298 (9%)
Query: 18 FLAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKN 73
F +I V S +L Q + LF + +Y+K+Y E +R F+ F+ +L I+E N
Sbjct: 51 FFSI-VGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDE--AN 107
Query: 74 RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
R+ S G+ F+DL+ +EFK +L + K ++ +
Sbjct: 108 RKEVTSYWLGLNAFADLTHDEFKATYL-GLLPKRTSGGRFRYGGVGDGGDEV-------- 158
Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
P DWR+ G + +V+NQ CG+CWAFSTV E ++ + G L+ LS Q+++DC
Sbjct: 159 -----PASVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDC 213
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
+ +GN GCSGG ++ L E YP L+++ C +A + S D
Sbjct: 214 STDGNNGCSGGVMDNAFSFI-ATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYED 272
Query: 254 TLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +A H PV A+ A +Q+Y GGV C + ++H V VGY
Sbjct: 273 VPANDEQALVKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCG---SELDHGVAAVGY 326
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K+Y S EHD R F+ ++ ++++ +A +G+T+FSDL+ EF
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDLTPTEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG+CW+FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 152 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D R + K+ +++ +L E I ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/312 (29%), Positives = 142/312 (45%), Gaps = 38/312 (12%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
++ V + F+V C A+ V P+ + EL+ F++ Y KSY+ + + RF F
Sbjct: 3 LYTVSCLTFLVG----CVFAVST-VQVPDSAR--ELYEQFKRDYGKSYANDDDEKRFAIF 55
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ +L + N Q +ARYG+T+FSDL+ EEF + L + V
Sbjct: 56 KDNL--VRAQNYQLQEQGTARYGVTQFSDLTPEEFAAKFLSSRFDDQV-------ERVQL 106
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
N +K P DWRE G + V +Q +CG+CWAFS E LK G
Sbjct: 107 NDLK------------AAPESVDWRELGAVAPVEDQGSCGSCWAFSVAGNVEGQWFLKTG 154
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LS Q+++DC + GC GG + + LE + +YP + ++ CK +
Sbjct: 155 QLVSLSKQQLVDCDVQ-DSGCDGG-YPPTTYGEIIRMGGLEAQRDYPYVGREQPCKLDES 212
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSL 297
+ K + L +E IA HGP+ + +NA+T Q+Y G+ + C
Sbjct: 213 K---LLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPSKSQCQPDW 269
Query: 298 ANINHAVQIVGY 309
+NH V VGY
Sbjct: 270 --LNHGVLSVGY 279
>gi|330801846|ref|XP_003288934.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
gi|325081026|gb|EGC34558.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
Length = 334
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 159/316 (50%), Gaps = 34/316 (10%)
Query: 8 LFIVALIALCFLAIPV----KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
L + +++L FL+I + +V PN Q F + + + K+YS E +++ F+ +
Sbjct: 3 LSFILVLSLLFLSINIIASSRVFTPN--QYQSSFVQWMKSHGKAYSHDEFARKYRTFQDN 60
Query: 64 LDIIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+D + + N KN ++ G+ F+D++ E++ L S+ +
Sbjct: 61 MDYVHQWNSKNSETV----LGLNNFADMNNVEYRNTLLGASIEVEPFRT----------- 105
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
R+ + I +PT + DWRE G + +++Q CG+C++FS + AES + + NG +
Sbjct: 106 --PRTFSR-IQLPTSV----DWREKGAVHDIKDQGHCGSCYSFSAIGAAESAYYIANGEM 158
Query: 183 SLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS Q ++DC+ + GN GC+GG ++ +++ E+ YP KDA+C+ +
Sbjct: 159 LTLSEQNILDCSRSYGNEGCNGGYMLESFQFL-LDQGGAVSEASYPYEAKDASCRFDSVK 217
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
V + T + E + IATHGPV A++A +++Q Y GV Y S +
Sbjct: 218 TPIVATFNGTVEIRRGDEGDLQQAIATHGPVAVAIDAGHISFQLYKTGVY-YEPYCSSYS 276
Query: 300 INHAVQIVGYDNYSRT 315
++HAV VGYD S T
Sbjct: 277 LSHAVLAVGYDTDSVT 292
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 141/298 (47%), Gaps = 28/298 (9%)
Query: 18 FLAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKN 73
F +I V S +L Q + LF + +Y+K+Y E +R F+ F+ +L I+E N
Sbjct: 65 FFSI-VGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDE--AN 121
Query: 74 RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
R+ S G+ F+DL+ +EFK +L + K ++
Sbjct: 122 RKEVTSYWLGLNAFADLTHDEFKATYL-GLLPKRTSGGRFRY-------------GGVGD 167
Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
+P DWR+ G + +V+NQ CG+CWAFSTV E ++ + G L+ LS Q+++DC
Sbjct: 168 GGDEVPASVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDC 227
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
+ +GN GCSGG ++ L E YP L+++ C +A + S D
Sbjct: 228 STDGNNGCSGGVMDNAFSFI-ATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYED 286
Query: 254 TLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +A H PV A+ A +Q+Y GGV C L +H V VGY
Sbjct: 287 VPANDEQALVKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCGSEL---DHGVAAVGY 340
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 30/312 (9%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDI 66
I ALI L A E + +Y + Y ++E +RF+ F ++
Sbjct: 28 FMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
IEE NK+ + +S + + EF+D + EEF+ R+ V + + +V
Sbjct: 88 IEEFNKDGR--QSYKLAVNEFADQTNEEFQAS--RNGYKMAVSSRPSQTTLFRYENV--- 140
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
T +P DWR+ G + V++Q CG+CWAFST+ E + LK G L LS
Sbjct: 141 ---------TAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLS 191
Query: 187 VQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
QE++DC G + GC GG +++ NK + E+ YP D C K +
Sbjct: 192 EQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIA-LEASYPYTAADGTCNSKEEASRAA 250
Query: 246 KIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
KI Y +P SE+++L +A PV +++A + +Q+Y GV C +++
Sbjct: 251 KISGY---EKVPANSETALLKAVANQ-PVSVSIDASGVAFQFYSSGVFTGECG---TDLD 303
Query: 302 HAVQIVGYDNYS 313
H V VGY S
Sbjct: 304 HGVTAVGYGKTS 315
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K+Y S EHD R F+ ++ ++++ +A +G+T+FSDL+ EF
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDLTPTEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG+CW+FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 152 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D R + K+ +++ +L E I ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 150/311 (48%), Gaps = 31/311 (9%)
Query: 8 LFIVALIALCFLA----IPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEK 62
+ + ++AL F+ IP E+ L L+ ++ + S SE + RF F++
Sbjct: 6 MLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHHTVSRDLSEKNKRFNVFKE 65
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ I E NK + +P + G+ +F+D++ +EF++ + + HHH
Sbjct: 66 NAKFIHEFNK-KDAP--YKLGLNKFADMTNQEFRSTYAGSKI-------------HHHRT 109
Query: 123 VKKRSITTGITIPT---GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ TG + IP DWR G + V++Q CG+CWAFST+ + E ++ +K
Sbjct: 110 QRGTPRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKT 169
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q+++DC + N GC+GG +++ N + ES YP + +C ++
Sbjct: 170 NQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITS-ESAYPYTAEQGSCASES 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLA 298
++P V I Y D +E++++ +A +A + + +Q+Y GV +C L
Sbjct: 229 SAPV-VTIDGYE-DVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNEL- 285
Query: 299 NINHAVQIVGY 309
+H V +VGY
Sbjct: 286 --DHGVAVVGY 294
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 143/284 (50%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS+F+ ++ K+Y +K EHD RF F+ ++ + Q SA +G+T+FSDL+ EF
Sbjct: 51 FSTFKAKFGKTYATKEEHDHRFGVFKSNM---RRARLHAQLDPSAVHGVTKFSDLTPAEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ L + L +H +K I +PT +P DWR+ G + V+
Sbjct: 108 HRKFL--GLKPLRLPAH----------AQKAPI-----LPTNNLPKDFDWRDKGAVTNVK 150
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
+Q +CG+CW+FST E H L G L LS Q+++DC G+ + GC+GG
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ + ++ E +YP +D CK S + +Y+ +L E I ++
Sbjct: 211 NNAFEYL-IGSGGVQREKDYPYTGRDGTCKFD-KSKIAASVSNYSVISL--DEEQIAANL 266
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 267 VKNGPLAVAINAVYMQTYVGGVSCPYICG---KHLDHGVLLVGY 307
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 147/313 (46%), Gaps = 32/313 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSY-SKSEHDIRFKN 59
K FI + A P K + L Q + ++ +Q +Y + Y +E + R+
Sbjct: 4 TKQSQFICLALLFVLGAWPSKSAARTL-QDVSMYERHEQWMAQYGRVYKDDAEKETRYNI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ I+ N Q+ +S + G+ +F+DLS EEFK R+ H M +
Sbjct: 63 FKENVARIDAFNS--QTGKSYKLGVNQFADLSNEEFKAS--RNRFKGH--MCSPQAGPFR 116
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +V + +P DWR+ G + V++Q CG CWAFS V E ++ L
Sbjct: 117 YENV------------SAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTT 164
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS QEV+DC G + GC+GG +++ NK L E+ YP D C +
Sbjct: 165 GKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNK-GLTTEANYPYTGTDGTCNTQ 223
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
+ + KI + D SE++++ +A PV A++A +Q+Y G+ +C
Sbjct: 224 KEATHAAKITGFE-DVPANSEAALMKAVAKQ-PVSVAIDAGGFEFQFYSSGIFTGSCGTQ 281
Query: 297 LANINHAVQIVGY 309
L +H V VGY
Sbjct: 282 L---DHGVTAVGY 291
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 127/265 (47%), Gaps = 27/265 (10%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
E + R+ F+++++ IE N S + G+ +F+DL+ EEF+ H + LMS
Sbjct: 21 EKEKRYLIFKENIERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMHHGYKRQSSKLMS 78
Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
H++ + IP DWR+AG + V++Q TCG CWAFS V
Sbjct: 79 SSFRHEN----------------LSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAI 122
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E + LK G L LS Q+++DC G + GC GG ++ N L E+ YP
Sbjct: 123 EGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNG-GLTSEATYPYQG 181
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGV 288
D CK K T+ KI Y D + +E+++L +A PV AV +Q+Y GV
Sbjct: 182 VDGTCKSKKTASIEAKITGYE-DVPVNNENALLQAVAKQ-PVSVAVEGGGYDFQFYKSGV 239
Query: 289 IQYNCDGSLANINHAVQIVGYDNYS 313
+ +C L +HAV +GY S
Sbjct: 240 FKGDCGTYL---DHAVTAIGYGTNS 261
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 140/291 (48%), Gaps = 50/291 (17%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FSSF RY KSY+ ++EH RF F+ +L ++++ +A +G+T F+DL+ EF
Sbjct: 45 FSSFLSRYGKSYADEAEHAYRFSVFKSNL---RRARRHQRLDPTAVHGVTRFADLTPSEF 101
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT-----IPTG-IPVKKDWREAGI 149
+ +L +++R T G T +PT +P DWR+ G
Sbjct: 102 RRTYL---------------------GLRRRPRTAGSTHDAPILPTNELPADFDWRDHGA 140
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGC 201
+ V+NQ +CG+CW+FS E + L G L LS Q+++DC + + GC
Sbjct: 141 VTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGC 200
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--E 259
+GG +++ + LE E++YP D R N KI + + + S E
Sbjct: 201 NGGLMTTAFEYI-LKSGGLEREADYPYTGTD----RGTCKFNKAKISAVASNFSVVSIDE 255
Query: 260 SSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
I ++ HGP+ +NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 256 DQIAANLVKHGPLAVGINAVFMQTYVGGVSCPYICG---KHLDHGVLLVGY 303
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 156/313 (49%), Gaps = 29/313 (9%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNL-EQKLELFSSFQQRYKKSY-SKSEHDIRFKN 59
F ++LF + F AI K+S ++ + L+ S+ +Y KSY S E ++R +
Sbjct: 7 FISMSLLFFSTFLIFSF-AIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEI 65
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+++L I+E N + S G+ +F+DL++EE+++ +L K L S K + +
Sbjct: 66 FKENLRFIDEHNADPN--RSYTVGLNQFADLTDEEYRSTYLGF---KSSLKS--KVSNRY 118
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
V + +P DWR G + V+NQ C +CWAF+T+ T ES++ +
Sbjct: 119 MPQVGEV-----------LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIIT 167
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS QE++DC N GC GG +++ +N + E YP + +D C
Sbjct: 168 GDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFI-INNGGINTEENYPYIGQDDQCDEP 226
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
+ N V I SY + + P++ + + PV A++A L +++Y G+ G+
Sbjct: 227 KKNQNYVTIDSY--EQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGT 284
Query: 297 LANINHAVQIVGY 309
+NHAV I+GY
Sbjct: 285 --TLNHAVTIIGY 295
>gi|10946820|ref|NP_067420.1| cathepsin 6 precursor [Mus musculus]
gi|9931384|gb|AAG02172.1|AF223401_1 cathepsin-6 [Mus musculus]
gi|12838129|dbj|BAB24093.1| unnamed protein product [Mus musculus]
gi|16445021|gb|AAK00510.1| cathepsin 6 precursor [Mus musculus]
gi|68534635|gb|AAH99455.1| Cathepsin 6 [Mus musculus]
gi|148709368|gb|EDL41314.1| cathepsin 6 [Mus musculus]
Length = 334
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 144/287 (50%), Gaps = 30/287 (10%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITE 86
PNL + + ++++Y+KSY+ E +R +E+++ +I+ N +N + + E
Sbjct: 23 PNLNAE---WHDWKKQYEKSYTMEEEGLRRAIWEENMRMIKLHNWENSLGKNNFTLKMNE 79
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F DL+ EE LR +N + SH KKR I + +P DWR+
Sbjct: 80 FGDLTPEE-----LRKMMNNFPIWSH-----------KKRKIIRKRAVGDVLPKFVDWRK 123
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGD 205
G + +VR Q+ C +CWAF+ E K G L+ LSVQ ++DC GN GC GD
Sbjct: 124 KGYVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKLTPLSVQNLVDCTKTQGNDGCQWGD 183
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+++ +N LE E+ YP K+ C+ +P K + +L SE ++
Sbjct: 184 PYIAYEYV-LNNGGLEAEATYPYEGKEGPCR---YNPKNSKAEITGFVSLPESEDILMEA 239
Query: 266 IATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
+AT GP+ AAV+A + +Y GG+ Q NC S +NHAV +VGY
Sbjct: 240 VATIGPISAAVDASFNRFSFYDGGIYHQPNC--SNNTVNHAVLVVGY 284
>gi|305434754|gb|ADM53739.1| cathepsin L2 precursor [Lepeophtheirus salmonis]
Length = 382
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 158/318 (49%), Gaps = 34/318 (10%)
Query: 9 FIVALIALCFL----AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
F + + +CFL A+ S P +++++ F SF + Y KSY +++ ++ K F +
Sbjct: 5 FKMKFLGVCFLFGLAALAAGTSSPT-QREIQEFESFVKEYSKSYHNRALRSLKLKVFVDN 63
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
L IEE N N + + GI EFSDL++EEF+++++ +S MS
Sbjct: 64 LREIEEHNANPK--RTWDMGINEFSDLTDEEFESKYMGYSP-----MSSSAGLVTRTVAP 116
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
K+ +I +P DWRE G+I V+NQ +CG+CW FS VE ES A++N S
Sbjct: 117 KQGNIKD-------LPESVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTS 169
Query: 184 --LLSVQEVIDCAGNGNMGCSGGDFCALLD---WMDVNKVVLEPESEYP----LLLKDAA 234
LLS Q++ C+ N G ++ +M +E E EYP +
Sbjct: 170 PPLLSTQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSGFTEESGE 229
Query: 235 CKRKATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
C A+S G + L P++ S++ +A GP+ +V A ++ Y G++ C
Sbjct: 230 CLYNASSVTGKMAHVRGYEVLPPNDMYSVMEHLANKGPLGVSVYAGRFKSYKSGILN-GC 288
Query: 294 DGSLAN--INHAVQIVGY 309
D + AN INHA+Q++GY
Sbjct: 289 DFN-ANIVINHAIQMIGY 305
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 140/285 (49%), Gaps = 20/285 (7%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
P E +E+F ++ R++K+Y +E + RF NF+++L I E +++ R G+ +
Sbjct: 34 PPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIE-KTGKETTLRHRVGLNK 92
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F+DLS EEFK +L V K + + D +++ P DWR+
Sbjct: 93 FADLSNEEFKQLYLSK-VKKPINKTRIDAEDRSRRNLQS----------CDAPSSLDWRK 141
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G++ V++Q CG+CW+FST E ++A+ L LS QE++DC N GC GG
Sbjct: 142 KGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDC-DTTNYGCEGGYM 200
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+W+ +N ++ E+ YP D C T+ +K+ S + S L
Sbjct: 201 DYAFEWV-INNGGIDTEANYPYTGVDGTCN---TAKEEIKVVSIDGYKDVDETDSALLCA 256
Query: 267 ATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A P+ ++ A+ +Q Y GG+ +C +I+HAV IVGY
Sbjct: 257 AAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGY 301
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 29/281 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ FSSFQ Y KSY ++ E R+ F+ +L I N Q S + F DLS +
Sbjct: 115 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHN---QQGYSYSLKMNHFGDLSRD 171
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI--TIPTGIPVKKDWREAGIIG 151
EF+ ++L +++ L SHH + T + +P+ +P DWR G +
Sbjct: 172 EFRRKYLGFKKSRN-LKSHH------------LGVATELLNVLPSELPAGVDWRSRGCVT 218
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALL 210
V++Q+ CG+CWAFST E H K G L LS QE++DC+ GN CSGG+
Sbjct: 219 PVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAF 278
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ ++ + E YP L +D C R + VKI + D SE+++ +A
Sbjct: 279 QYV-LDSGGICSEDAYPYLARDEEC-RAQSCEKVVKILGFK-DVPRRSEAAMKAALAK-S 334
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A + +Q+Y GV +C +++H V +VGY
Sbjct: 335 PVSIAIEADQMPFQFYHEGVFDASCG---TDLDHGVLLVGY 372
>gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
Length = 346
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 140/297 (47%), Gaps = 41/297 (13%)
Query: 36 LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F+ Y KSY+ +E + RF F +L E LN R + A +G+T+F DL+E E
Sbjct: 19 LFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEALNAQRVDEDDAEFGVTQFMDLTEAE 78
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR--EAGIIGK 152
FK ++L + ++ VL D + + G P + DWR ++G++
Sbjct: 79 FKAQYLNYVPSEQVLA-----EDVY-------AAPEGFAAPGSL----DWRTKQSGVVSD 122
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CWAFS E ES L + + Q+++ C + GC+GG+ +
Sbjct: 123 VKDQGQCGSCWAFSATEQIESEWVLAGNDPLVFAPQQIVSC-DKVDQGCNGGNTETAYAY 181
Query: 213 MDVNKVVLEPESEYPLLLKDAA----CKRKATSPNGVKIKSYTCDTLIP----------S 258
++ + ES YP + CK+ T+ V+ SY ++P
Sbjct: 182 VE-KAGGMALESAYPYKSGTSGNTGRCKKFETAGGDVESFSY----VVPECKKGKCNDQD 236
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLAN-INHAVQIVGYDNYS 313
E + +A+HGP VNA WQ Y GV+ C AN ++H VQ+VGY Y+
Sbjct: 237 EDKMAAALASHGPASICVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYT 293
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 30/281 (10%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
F ++++ + KSYS + +I + ++ ++ + + N S G+ F+DL+ EEFK
Sbjct: 30 FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAH-NGAGIHSYTLGMNIFADLTHEEFK 88
Query: 97 TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----IPVKKDWREAGIIGK 152
+L V+ + + RS + IPT +P DWR AGI+
Sbjct: 89 RFYLGTKVDLN----------------RPRSNFSSTFIPTANVGALPDSVDWRTAGIVTP 132
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLD 211
V++Q CG+CW+FST + E HA K G L LS Q ++DC+ GN GC+GG
Sbjct: 133 VKDQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQ 192
Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
++ NK + + E+ YP KD CK A + G + S+ D SES + +AT GP
Sbjct: 193 YIITNKGI-DTEASYPYTAKDGTCKFNAANV-GATLSSFQ-DITRGSESDLQNAVATVGP 249
Query: 272 VIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
V A++A ++Q Y GV + C S +++H V GY
Sbjct: 250 VSVAIDASKNSFQLYTSGVYNEKKC--SSTSLDHGVLAAGY 288
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 144/299 (48%), Gaps = 35/299 (11%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
LA+P K+ + LF+S+ ++ K Y+ + + R++ F+++L I E N+ S
Sbjct: 45 LALPNKL--------VGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGS- 95
Query: 78 ESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
G+ F+D++ EEFK +L + + + H + N V
Sbjct: 96 --YWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVN----------- 142
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
+P DWR+ G + V+NQ CG+CWAFSTV E ++ + G L LS QE++DC
Sbjct: 143 --LPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN 200
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
N GC GG ++ N+ + E +YP L+++ C+ K + I Y D
Sbjct: 201 TFNHGCRGGLMDFAFAYIMGNQGIYT-EEDYPYLMEEGYCREKQPHSKVITITGYE-DVP 258
Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDNY 312
SE+S+L +A H PV + A + +Q+Y GG+ C +HA+ VGY +Y
Sbjct: 259 ANSETSLLKALA-HQPVSVGIAAGSRDFQFYKGGIFDGECG---IQPDHALTAVGYGSY 313
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 28/304 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
+ L + LA N E + RY + Y + +E + R F+++L I+
Sbjct: 12 LALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQ 71
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK P + G+ EF+DL+ EEF T R+ HV + N + ++
Sbjct: 72 TFNKANNKPY--KLGVNEFADLTNEEFTTS--RNKFKSHVCATVT-------NVFRYENV 120
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + ++NQ CG CWAFS V E + LK G L LS Q
Sbjct: 121 TA-------VPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQ 173
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
E++DC NG + GC GG D++ N L E+ YP D C + + I
Sbjct: 174 ELVDCDTNGEDQGCEGGLMDYAFDFIQQNH-GLSTETNYPYSGTDGTCNANKEANHAATI 232
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQ 305
+ D SES++L +A P+ A++A +Q+Y GV C L +H V
Sbjct: 233 TGHE-DVPANSESALLKAVANQ-PISVAIDASGSDFQFYSSGVFTGECGTEL---DHGVT 287
Query: 306 IVGY 309
VGY
Sbjct: 288 AVGY 291
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 142/308 (46%), Gaps = 38/308 (12%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKNFEKSL 64
L AL L + + S+ + L S +R+++ ++ +E RF+ F ++
Sbjct: 10 LPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANV 69
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ IE N + G+ +F+DL+ EEFKTR+ K M+ K + +
Sbjct: 70 ERIESFNAENHK---FKLGVNQFADLTNEEFKTRNTL----KPSKMASTKSFKYEN---- 118
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
T +P DWR G + +++Q CG+CWAFS V E + L G L
Sbjct: 119 ----------VTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLIS 168
Query: 185 LSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QEV+DC + + GC+GG+ +++ NK + E+ YP D C K + +
Sbjct: 169 LSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGIT-TEANYPYKAADGTCNTKKAASH 227
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
I Y D + SE+++L A P+ A++A +Q Y GV +C +++
Sbjct: 228 AASITGYE-DVTVNSEAALLKAAANQ-PIAVAIDAGDFAFQMYSSGVFTGDCG---TDLD 282
Query: 302 HAVQIVGY 309
H V +VGY
Sbjct: 283 HGVTLVGY 290
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 29/281 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ FSSFQ Y KSY ++ E R+ F+ +L I N Q S + F DLS +
Sbjct: 114 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHN---QQGYSYSLKMNHFGDLSRD 170
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI--TIPTGIPVKKDWREAGIIG 151
EF+ ++L +++ L SHH + T + +P+ +P DWR G +
Sbjct: 171 EFRRKYLGFKKSRN-LKSHH------------LGVATELLNVLPSELPAGVDWRSRGCVT 217
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALL 210
V++Q+ CG+CWAFST E H K G L LS QE++DC+ GN CSGG+
Sbjct: 218 PVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAF 277
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ ++ + E YP L +D C R + VKI + D SE+++ +A
Sbjct: 278 QYV-LDSGGICSEDAYPYLARDEEC-RAQSCEKVVKILGFK-DVPRRSEAAMKAALAK-S 333
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A + +Q+Y GV +C +++H V +VGY
Sbjct: 334 PVSIAIEADQMPFQFYHEGVFDASCG---TDLDHGVLLVGY 371
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 134/285 (47%), Gaps = 28/285 (9%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ ++LF S+ ++ K Y + I RF+ F +L I+E NK S G+ F
Sbjct: 40 SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK +++ + H + D + HV T P DWR
Sbjct: 97 ADLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ +CG+CWAFST+ T E ++ + G L LS QE++DC N + GC GG
Sbjct: 145 GAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKNSH-GCKGGYQT 203
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V YP K C +AT G K+K T +PS E+S L
Sbjct: 204 TSLQYVADNGV--HTSKVYPYQAKAMQC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258
Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A V+ +Q Y GV C L +HAV VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300
>gi|118365744|ref|XP_001016092.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297859|gb|EAR95847.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 156/315 (49%), Gaps = 35/315 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LC LA + V +KL ++ + ++++ Y +EH+ F+ F ++L
Sbjct: 6 LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM----SHHKHHDHHH 120
I+E N N + S + +FSD+++EEF + L S LM H+D ++
Sbjct: 58 QKIQEHNNNPNNTYSVH--LNQFSDMTKEEFAEKILMKSDFVDHLMKGISQEATHNDTNN 115
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
N + S +T+ I DWR G + V+NQ CG+CW+FS ES + ++N
Sbjct: 116 NETQLSS--NSLTLADSI----DWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFIQNK 169
Query: 181 TLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
L S Q+++DC G + GC+GG LD+ +KV + +YP + C
Sbjct: 170 ALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDY--ASKVGITTLDKYPYVAVQNNCN 227
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDG 295
T+ NG K KS+ IP+ S+ L PV V+A TW Y G+ YN CD
Sbjct: 228 VTGTN-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTWGNYYSGI--YNGCDQ 281
Query: 296 SLANINHAVQIVGYD 310
++NHAV VGYD
Sbjct: 282 LHISLNHAVLAVGYD 296
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 149/302 (49%), Gaps = 38/302 (12%)
Query: 22 PVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
P+++ EQ L++ F+ F +Y K Y S E RF+ F ++L++I+ N
Sbjct: 29 PIRLVSDLEEQVLQVIGQTRHAVSFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTN 88
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITT 130
K R S + G+ F+DLS +EF+T+ L + N L+ +HK D
Sbjct: 89 KKRLS---YKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKLTD------------- 132
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+P +KDWR+ I+ +V++Q CG+CW FST E+ +A +G LS Q++
Sbjct: 133 -----AVLPAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQL 187
Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DCAG N GC+GG +++ N + E EYP KD ACK A + V++
Sbjct: 188 VDCAGAFNNFGCNGGLPSQAFEYIKYNGGI-ALEKEYPYTAKDEACKFTAENV-AVRVLD 245
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIV 307
+ + + +E + +A PV A + ++ Y GV + C + ++NHAV V
Sbjct: 246 -SVNITLGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAV 304
Query: 308 GY 309
GY
Sbjct: 305 GY 306
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 145/309 (46%), Gaps = 31/309 (10%)
Query: 7 VLFIVALIALC---FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
+L I+ I LC L+ +E+ + + F + YK S K++ RFK F+ +
Sbjct: 8 LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQ---RFKAFKAN 64
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
+ IE N G+ +F+DL+ +EF+ + ++ + + +N+V
Sbjct: 65 VAFIESFNTGNHK---FWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTRFK---YNNV 118
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
++ P DWR G++ +++Q CG CWAFS V E + L G L
Sbjct: 119 STDAL----------PAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLV 168
Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS QE++DC +G + GC GG+ ++ + L E+ YP +D CK TS
Sbjct: 169 SLSEQELVDCDVHGVDQGCEGGEMDNAFKFI-IKNGGLTTEANYPYTAQDGQCKTSTTSN 227
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
+ IK Y D ESS++ +A PV AV+ + +Q+Y GGV+ +C ++
Sbjct: 228 SVATIKGYE-DVPANDESSLMKAVANQ-PVSVAVDGGDVIFQHYSGGVMTGSCG---TDL 282
Query: 301 NHAVQIVGY 309
+H + +GY
Sbjct: 283 DHGIVAIGY 291
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 141/289 (48%), Gaps = 47/289 (16%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F++F+ ++ K+Y ++ EHD RFK F+ +L K++ SA +G+T+FSDL+ EF
Sbjct: 51 FTAFKAKFGKNYATQEEHDYRFKVFKANL---RRAQKHQLMDPSAVHGVTKFSDLTPREF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ ++L + K L + D H + +PT GIP DWR+ G + V+
Sbjct: 108 RRQYL--GLKKLRLPA-----DAHEAPI----------LPTDGIPEDFDWRDHGAVTNVK 150
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L LS Q+++DC G + GC+GG
Sbjct: 151 NQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLM 210
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACK----RKATSPNGVKIKSYTCDTLIPSESS 261
+++ + LE E +YP D CK + A S N + S E
Sbjct: 211 TNAFEYI-LKAGGLEREEDYPYTGSDRGPCKFERAKIAASVNNFSVVSV-------DEDQ 262
Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
I ++ +GP+ +NA+ Q Y+GGV Y C +H V +VGY
Sbjct: 263 IAANLVQNGPLAVGINAVFMQTYIGGVSCPYICS---KRQDHGVVLVGY 308
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 145/305 (47%), Gaps = 42/305 (13%)
Query: 22 PVK--VSKPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQ 75
P++ V + N EQ L F+ F+ +Y+K+Y ++ EHD RF+ F+ +L +N+
Sbjct: 35 PIRQVVPEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANL---RRARRNQL 91
Query: 76 SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
SA +G+T+FSDL+ +EF+ + L L + T +P
Sbjct: 92 LDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPT---------------DTQTAPILP 136
Query: 136 TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC- 193
T +P + DWRE G + V+NQ CG+CW+FS + E H L L LS Q+++DC
Sbjct: 137 TSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCD 196
Query: 194 -------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGV 245
A + + GCSGG ++ + L E +YP +D ACK + +
Sbjct: 197 HECDPAQANSCDSGCSGGLMNNAFEYA-LKAGGLMKEEDYPYTGRDHTACKFDKSK---I 252
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAV 304
+ E I ++ HGP+ A+NA+ Q Y+GGV Y C S +H V
Sbjct: 253 VASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ---DHGV 309
Query: 305 QIVGY 309
+VG+
Sbjct: 310 LLVGF 314
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 87/316 (27%), Positives = 150/316 (47%), Gaps = 31/316 (9%)
Query: 4 VKNVLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFK 58
+ ++ I L+ L F L+ +K S E+ + +++ R++K Y++ + D RF+
Sbjct: 1 MASMTMIYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQ 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F+ +L I+E N N + + + G+ +F+D++ EE++ +L N + K H
Sbjct: 61 VFKDNLGFIQEHNNNLNN--TYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGH 118
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ + + PV DWR G + +++Q +CG+CWAFSTV T E+++ +
Sbjct: 119 RYAFSARDRL----------PVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIV 168
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAAC 235
G LS QE++DC N GC+GG L+D+ + ++ + +YP D C
Sbjct: 169 TGKFVSLSEQELVDCDRAYNEGCNGG----LMDYAFEFIIQNGGIDTDKDYPYRGFDGIC 224
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
+ V I Y + + P + + L H PV A+ A Q Y GV C
Sbjct: 225 DPTKKNAKVVNIDGY--EDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKC 282
Query: 294 DGSLANINHAVQIVGY 309
SL +H V +VGY
Sbjct: 283 GTSL---DHGVVVVGY 295
>gi|326434958|gb|EGD80528.1| hypothetical protein PTSG_01119 [Salpingoeca sp. ATCC 50818]
Length = 389
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 148/302 (49%), Gaps = 33/302 (10%)
Query: 31 EQKLE-LFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNR---QSPESARYGI 84
E +L+ LF+SF + + + Y+ +EH R + F +++ + ++ + + + +A +
Sbjct: 48 EAQLDALFTSFVKDFGRLYASNATEHAFRRRVFARNVQLYQQRSASAATVSAGHTAVFKP 107
Query: 85 TEFSDLSEEEFK----TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
+FSD + EEF+ TR + ++ S + + N + T + IP
Sbjct: 108 DKFSDWTVEEFRALLGTRPVSTAIGNPRCASSPVNCELSTN------MNTNAALGLAIPD 161
Query: 141 KKDWR--EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS--LLSVQEVIDCAGN 196
DWR G+I VR+Q CG CWAFS VET E+ L TL LSVQ+++ C
Sbjct: 162 AFDWRNDSRGVITAVRDQGQCGGCWAFSAVETVEASWVLSGHTLPEPKLSVQQILSCDTQ 221
Query: 197 GNMGCSGGD----FCALLDWMDVNKVVLEPESEYPLLLKDAACKRK-----ATSPNGVKI 247
N GC GG F +LD + K LEP++ +P D CK A S V I
Sbjct: 222 AN-GCHGGSISGAFTYVLDKSEQGKG-LEPDTAFPFKC-DKGCKNSLPQCPALSRPFVTI 278
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+ TC E +L +A +GP+ V+A W Y G+++Y+C A+ NHAVQIV
Sbjct: 279 NA-TCRCPKMKEKDMLAFVANYGPLAIQVDAEPWHGYSSGIMRYHCSSQPASANHAVQIV 337
Query: 308 GY 309
GY
Sbjct: 338 GY 339
>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
Length = 353
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)
Query: 4 VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
V +LF+V ALIA L + ++ + + F++R+ K + + +E RF
Sbjct: 11 VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 63
Query: 59 NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F++++ LN + A Y ++ +F+DL+ +EF +L N + H K +
Sbjct: 64 AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 116
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
H HV S+ +G+ + DWRE G++ V+NQ CG+CWAF+T E AL
Sbjct: 117 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 167
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
KN +L LS Q ++ C N + GC+GG + W+ D N V E YP A
Sbjct: 168 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWIINDHNGTV-PTEDSYP--YTSAGG 223
Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
R N G KI Y +L E I + +GPV AV+A TWQ Y GGV+
Sbjct: 224 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 280
Query: 293 CDGSLANINHAVQIVGYDNYSR 314
C G ++NH V +VG++ ++
Sbjct: 281 CFG--LSLNHGVLVVGFNRQAK 300
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 141/287 (49%), Gaps = 36/287 (12%)
Query: 32 QKLELFSSFQQRYKKSYSKSEHDIR--FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
Q LF F YK Y H +R F+ F++++ + ELN + + +A YG+T F+D
Sbjct: 2366 QAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERG--TATYGVTRFAD 2423
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAG 148
L+ EEF T+H+ K N V+ ++++ +T P DWR+ G
Sbjct: 2424 LTYEEFSTKHM-----------GMKASLRDPNQVQFRKAVIPNVTAPDSF----DWRDHG 2468
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ V++Q +CG+CWAFS E +K G L LS QE++DC + GC+GG
Sbjct: 2469 AVTGVKDQGSCGSCWAFSVTGNIEGQWKMKTGDLVSLSEQELVDC-DKLDQGCNGG---- 2523
Query: 209 LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L D + + ++ LE E +YP D C T +++ + +E+ +
Sbjct: 2524 LPDNAYRAIEQLGGLESEDDYPYEGSDDKCSFNKTL---ARVQISGAVNITSNETDMAKW 2580
Query: 266 IATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
+ HGP+ +NA Q+Y+GG+ + C+ S N++H V IVGY
Sbjct: 2581 LVKHGPISIGINANAMQFYMGGISHPWRMLCNPS--NLDHGVLIVGY 2625
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L L F A V E + RY K Y E + RFK F+++++ IE
Sbjct: 12 LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N P + GI +F+DL+ EEF + +K H + + R+
Sbjct: 72 AFNNAADKP--YKLGINQFADLTNEEF-------------IAPRNKFKGHMCSSIT-RTT 115
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T T +P DWR+ G + +++Q CG CWAFS V E +HAL +G L LS Q
Sbjct: 116 TFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQ 175
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
EV+DC G + GC+GG ++ N L E+ YP D C + + I
Sbjct: 176 EVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNTEANYPYKAVDGKCNANEAANHAATI 234
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
Y D + +E ++ +A PV A++A +Q+Y GV +C L +H V
Sbjct: 235 TGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKTGVFTGSCGTQL---DHGVT 289
Query: 306 IVGY 309
VGY
Sbjct: 290 AVGY 293
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 146/310 (47%), Gaps = 36/310 (11%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSY-SKSEHDIRFKNFEKSL 64
+I+AL L + I +S+ E + L +Q +Y K Y +E + RF F+ ++
Sbjct: 10 YILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNV 69
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHNH 122
+ IE N P + G+ +DL+ EEFK L+ S + V + K+ +
Sbjct: 70 EFIESFNAAGNKP--YKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYEN----- 122
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
T IP DWR+ G + +++Q CG+CWAFSTV E +H + G L
Sbjct: 123 ------------VTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKL 170
Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC G + GC GG +++ N + E+ YP D +CK AT+
Sbjct: 171 VSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGGIT-TEANYPYKAVDGSCKN-ATA 228
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLAN 299
P +IK Y + SE ++L +A PV +++A ++ +Y G+ C L
Sbjct: 229 P-AAQIKGYE-KVPVNSEKALLKAVANQ-PVSVSIDAADGSFMFYSSGIFTGECGTEL-- 283
Query: 300 INHAVQIVGY 309
+H V VGY
Sbjct: 284 -DHGVTAVGY 292
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 142/278 (51%), Gaps = 25/278 (8%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F+ Y K Y+ EH RF ++++ ++I + N Q S + + F D++ EEF
Sbjct: 227 FEQFKTTYDKVYAHDEEHSERFATYKQNREMI--IAHNTQE-SSYKLAMNHFGDMTAEEF 283
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + ++ V + H HD+ R+I +P DWR+ G + +V++
Sbjct: 284 ELK-IKPRVPRPDTNGAHDVHDN------DRTIN--------LPATVDWRQQGCVTRVKD 328
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMD 214
Q CG+CW F + + E + L G L LS Q+++DCA G + GC+GG ++
Sbjct: 329 QGVCGSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYI- 387
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
+N + ES YP L+++ CK ++ + +K+KSY T SE ++ +AT GPV
Sbjct: 388 MNFGGIAYESTYPYLMQNGYCKDSSSQLSNIKVKSYVNVTSF-SEPALQNAVATVGPVAI 446
Query: 275 AVNALT--WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A++A +++Y GV + C L +++H V VGY
Sbjct: 447 AIDASAPDFRFYSSGVYYSSVCKNGLDDLDHEVLAVGY 484
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + ES +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + I + V DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L L F A V E + RY K Y E + RFK F+++++ IE
Sbjct: 12 LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N P + GI +F+DL+ EEF R+ H+ S + + +V
Sbjct: 72 AFNNAANKP--YKLGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFKYENV----- 122
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + +++Q CG CWAFS V E +HAL +G L LS Q
Sbjct: 123 -------TALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQ 175
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
EV+DC G + GC+GG ++ N L E+ YP D C + + I
Sbjct: 176 EVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNTEANYPYKAVDGKCNANEAANHAATI 234
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
Y D + +E ++ +A PV A++A +Q+Y GV +C L +H V
Sbjct: 235 TGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKTGVFTGSCGTQL---DHGVT 289
Query: 306 IVGY 309
VGY
Sbjct: 290 AVGY 293
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 138/287 (48%), Gaps = 21/287 (7%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITE 86
PNL +L+ F+ ++++Y ++E R + F +L I+ N + Q R GI +
Sbjct: 34 PNLVPFEKLWQDFKTVHERTYGETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQ 93
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F+D+ EF + +N + H H ++ IP +P + DWR+
Sbjct: 94 FADMEANEFASIMNGFRMNNRTEVRDHLHANY-----------ISPAIPVSVPAEVDWRK 142
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGD 205
G + V+NQ CG+CWAFST + E H K G L LS Q ++DC+ + GN GC+GG
Sbjct: 143 EGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGI 202
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
++ N + E+ YP D C+ K+ G YT D E+ +
Sbjct: 203 VDYAFQYIKDNDGD-DTEACYPYEAVDGTCRFKSVCV-GATCTGYT-DLPKGDEAKMKEA 259
Query: 266 IATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+A GPV A++A ++Q Y G+ ++ C S ++HAV +VGY
Sbjct: 260 VALVGPVSVAIDASHSSFQMYQSGIYVEQEC--SPKQLDHAVLVVGY 304
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 132/273 (48%), Gaps = 32/273 (11%)
Query: 43 RYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
++ KSY E + RF+ F+ +L I+E NK S G+ EF+DLS EEFK ++L
Sbjct: 3 KHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSS---YWLGLNEFADLSHEEFKRKYL- 58
Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
+ + K D K +P DWR+ G + V+NQ CG+
Sbjct: 59 -----GLKIELPKRRDSPEEFSYKDVAD--------LPKSVDWRKKGAVAHVKNQGACGS 105
Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKV 218
CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG L+D+ ++
Sbjct: 106 CWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGG----LMDYAFAFIISNG 161
Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
L E +YP ++++ C K V I Y D +E S L +A P+ A+ A
Sbjct: 162 GLRKEEDYPYVMEEGTCGEKKEELEVVTISGYH-DVPEDNEQSFLKALANQ-PLSVAIEA 219
Query: 279 LT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ +Q+Y GG+ +C L +H V VGY
Sbjct: 220 SSRGFQFYSGGIFNGHCGTEL---DHGVAAVGY 249
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 139/302 (46%), Gaps = 30/302 (9%)
Query: 13 LIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + + + P L Q L EL+S ++ + K Y E R + ++K++ +I + N
Sbjct: 7 LAALC---LGIASAAPQLNQSLDELWSQWKATHGKLYGMDEEGWRREVWKKNMKMIRQHN 63
Query: 72 -KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
++ Q S + F D++ EEFK + KH K+
Sbjct: 64 WEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKH-----------------KKGKMF 106
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ IP DWRE G + V++Q CG+CWAFS E K G L LS Q +
Sbjct: 107 QAPLFAKIPSSVDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ GN GC+GG ++ N L+ E YP +D +CK K P
Sbjct: 167 VDCSQAEGNEGCNGGLMNNAFQYVKDNG-GLDSEESYPYHAQDESCKYK---PQDSAAND 222
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+ E +++ +AT GP+ ++A T+Q+Y G I Y+ D S +++H V ++
Sbjct: 223 TGFFDIPQQEKALMVAVATKGPISVGIDASHFTFQFYHEG-IYYDPDCSSEDLDHGVLVI 281
Query: 308 GY 309
GY
Sbjct: 282 GY 283
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + ES +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + I + V DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 145/322 (45%), Gaps = 40/322 (12%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P KDWRE G + V+NQ CG+CWAF+ + E L
Sbjct: 105 FAAAEERARVPVDVEV-VGAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYP--------LL 229
L+ LS Q ++ C N N GC GG +W+ N + E YP L
Sbjct: 164 GNPLTRLSEQMLVSC-DNTNSGCGGGLSSKAFEWIVQENNGAVYTEDSYPYHSCIGIKLP 222
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
KD+ AT V++ E+ I A GP+ AV+A +W +Y GGV+
Sbjct: 223 CKDSDRTVGATITGHVELPQ--------DEAQIAASGAVKGPLSVAVDASSWFFYTGGVL 274
Query: 290 QYNCDGSLANINHAVQIVGYDN 311
NC ++HAV +VGY++
Sbjct: 275 T-NCVSK--RLSHAVLLVGYND 293
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 155/309 (50%), Gaps = 31/309 (10%)
Query: 9 FIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
+V L++LC+ LA+ + L++ EL+ ++ Q KSY K+E R +E++L +I
Sbjct: 53 LLVCLLSLCWGLAVSAPLGDSELDKHWELWKNWHQ---KSYHKAEEGWRRMVWEENLKVI 109
Query: 68 EELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
E N + + + G+ +F DL+ EEF+ +L+S + H N +
Sbjct: 110 ELHNLEQSLGLHTYQLGMNQFGDLTNEEFQ----------QMLIS--ERHFSEGNRINGS 157
Query: 127 SI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ + +PT + DWR+ G + V+NQ CG+CWAFST E K+G L
Sbjct: 158 AFLEVNYVQVPTSV----DWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVS 213
Query: 185 LSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA-CKRKATSP 242
LS Q ++DC+ GN GC+GG ++ N+ + + E YP KD A C K
Sbjct: 214 LSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGI-DSEDCYPYTAKDTAQCAFKPECA 272
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
++ + D SE +++ +AT GPV A++A ++++Y G+ Y S +
Sbjct: 273 T-ARVTGFV-DIPPHSEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIF-YEPKCSSERL 329
Query: 301 NHAVQIVGY 309
NHAV +VGY
Sbjct: 330 NHAVLVVGY 338
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 138/285 (48%), Gaps = 38/285 (13%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FSSF++R+ K+Y+ EHD RF F+ +L +N+ SA +G+T+F DL+ EF
Sbjct: 58 FSSFKKRFGKAYTSCDEHDRRFGVFKANL---RRAKRNQILDPSAVHGVTQFFDLTPAEF 114
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L L D H + +PT +P DWR+ G + V+
Sbjct: 115 RRTYLG-------LKRLRLPADTHEAPI----------LPTNDLPADFDWRDHGAVTPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E + L G L LS Q+++DC + + GC+GG
Sbjct: 158 NQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLM 217
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+ ++ + LE E +YP D + CK T + + + + E+ I +
Sbjct: 218 TSAFEYT-LKAGGLEREEDYPYTGTDHSKCKFDKTK---IAVSASNFSVVSLDENQIAAN 273
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ T+GP+ +NA+ Q Y+GGV Y C L ++H V +VGY
Sbjct: 274 LVTNGPLAIGINAMFMQTYIGGVSCPYICSKRL--LDHGVLLVGY 316
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 53/286 (18%)
Query: 30 LEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+++ + F S+ ++ K Y E + RF+ F ++L+ I+E NK S G+ EF+
Sbjct: 42 IDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSS---YWLGLNEFA 98
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EEFK++ + +P DWR+ G
Sbjct: 99 DLSHEEFKSKDV-----------------------------------ADLPESVDWRKKG 123
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ V+NQ CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG
Sbjct: 124 AVTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGG---- 179
Query: 209 LLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D+ + L E +YP L+++ C+ + + V I Y D E S+L
Sbjct: 180 LMDYAFAFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYE-DVPEKDEESLLKA 238
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A H P+ A+ A +Q+Y GGV C L +H V VGY
Sbjct: 239 LA-HQPLSVAIEASGRDFQFYSGGVFNGPCGTEL---DHGVAAVGY 280
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 96/325 (29%), Positives = 144/325 (44%), Gaps = 55/325 (16%)
Query: 14 IALCFLAIPVKVSKPNLEQ----KLEL-----------FSSFQQRYKKSYSKSE-HDIRF 57
I LC L + + L Q KLEL F F + Y K YS +E + +R
Sbjct: 17 IFLCALTLSSSLHHETLIQDVARKLELKDNDLLTTEKKFKLFMKDYSKKYSTTEEYLLRL 76
Query: 58 KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F K ++++ P +A +G+T+FSDLSEEEF+ + +
Sbjct: 77 GIFAK--NMVKAAEHQALDP-TAIHGVTQFSDLSEEEFE-----------------RFYT 116
Query: 118 HHHNHVKKRSITTGITIP---TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+ G+ P G P DWRE G + ++ Q CG+CWAF+T + E
Sbjct: 117 GFKGGFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGA 176
Query: 175 HALKNGTLSLLSVQEVIDCAGNGNM-------GCSGGDFCALLDWMDVNKVVLEPESEYP 227
+ L G L LS Q+++DC ++ GC+GG D++ + LE E+ YP
Sbjct: 177 NFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDYL-MEAGGLEEETSYP 235
Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
CK PN V ++ + E+ I + HGP+ AVNA+ Q Y+GG
Sbjct: 236 YTGAQGECK---FDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVGG 292
Query: 288 VIQYNCD--GSLANINHAVQIVGYD 310
V +C S +NH V +VGY+
Sbjct: 293 V---SCPLICSKRRLNHGVLLVGYN 314
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 142/281 (50%), Gaps = 33/281 (11%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F+ +++ Y+ S EH++R+ F +L I++LN++ + + +YG+T+F+D++ E+
Sbjct: 1478 FEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERG--TGKYGVTKFADMTTAEY 1535
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ H V K H NH++ I T T T +P DWR+ G + V+N
Sbjct: 1536 RA-HTGLIVPKQ-----------HSNHIRN-PIATVSTERTSLPTSFDWRDHGAVTGVKN 1582
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD- 214
Q CG+CWAFS + E +H +K L S QE+IDC N GC+GG +MD
Sbjct: 1583 QGNCGSCWAFSAIGNIEGLHQIKTKKLEAYSEQELIDCDTVDN-GCNGG-------YMDD 1634
Query: 215 ----VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+ K+ LE E EYP K + + V++K + +E+ I + +
Sbjct: 1635 AFKAIEKLGGLELEDEYPYQAKAQKTCHFNKTLSHVRVKGAV--DMPKNETFIAQYLIEN 1692
Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GP+ +NA Q+Y GG+ ++ S I+H V IVGY
Sbjct: 1693 GPIAIGLNANAMQFYRGGISHPWHLLCSHKQIDHGVLIVGY 1733
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 142/285 (49%), Gaps = 39/285 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F++R+ KSY S+ EHD RFK F+ +L +++Q SA +G+T+FSDL+ EF
Sbjct: 62 FSIFKRRFGKSYASQEEHDYRFKVFKANL---RRARRHQQLDPSATHGVTQFSDLTPAEF 118
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L L HD +K I +PT +P DWR+ G + V+
Sbjct: 119 RGTYLG-------LRPLKLPHD-----AQKAPI-----LPTNDLPEDFDWRDHGAVTAVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E + L G L LS Q++++C G+ + GC+GG
Sbjct: 162 NQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLM 221
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
++ + L E +YP D +CK T + +++ +L E I +
Sbjct: 222 NTAFEYT-LKAGGLMKEEDYPYTGTDRGSCKFDKTKI-AASVSNFSVISL--DEDQIAAN 277
Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+ GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 278 LVKIGPLAVAINAVFMQTYVGGVSCPYICSKRL---DHGVLLVGY 319
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/296 (32%), Positives = 143/296 (48%), Gaps = 37/296 (12%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
PV+ S ++E L F F RY ++YS + E D R + F ++L E+L Q +A
Sbjct: 162 PVEESVDSVEL-LGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQG--TA 218
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IP 139
YG+T+FSDL+EEEF+T +L +++ L K +P G P
Sbjct: 219 EYGVTKFSDLTEEEFRTLYLNPLLSQQNLQQSMKP----------------AAMPRGPAP 262
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
DWRE G + V+NQ CG+CWAFS E K G L LS QE++DC +
Sbjct: 263 PSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFAKTGKLVSLSEQELVDC-DTVDQ 321
Query: 200 GCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT--LI 256
C GG + + K+ LE E++Y K +C K+ +Y + L
Sbjct: 322 ACGGG--LPSNAYEAIEKLGGLETETDYSYTGKKQSCDFTTD-----KVIAYINSSVELS 374
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
E+ I +A +GPV A+NA Q+Y GV ++ C+ + I+HAV +VGY
Sbjct: 375 TDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGY 428
>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
Length = 354
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)
Query: 4 VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
V +LF+V ALIA L + ++ + + F++R+ K + + +E RF
Sbjct: 12 VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 64
Query: 59 NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F++++ LN + A Y ++ +F+DL+ +EF +L N + H K +
Sbjct: 65 AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 117
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
H HV S+ +G+ + DWRE G++ V+NQ CG+CWAF+T E AL
Sbjct: 118 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 168
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
KN +L LS Q ++ C N + GC+GG + W+ D N V E YP A
Sbjct: 169 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMEQAMQWIINDHNGTV-PTEDSYP--YTSAGG 224
Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
R N G KI Y +L E I + +GPV AV+A TWQ Y GGV+
Sbjct: 225 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 281
Query: 293 CDGSLANINHAVQIVGYDNYSR 314
C G ++NH V +VG++ ++
Sbjct: 282 CFG--LSLNHGVLVVGFNRQAK 301
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 143/316 (45%), Gaps = 38/316 (12%)
Query: 6 NVLFIVAL-IALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSK-SEHDIRFKNFE 61
N L+ ++L + C ++V+ L+ E + Y K Y E + RFK F
Sbjct: 5 NQLYHISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFT 64
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ IE N N + ES + GI +F+DL+ EEF + S +K H +
Sbjct: 65 ENMKYIEAFN-NGDNNESYKLGINQFADLTNEEF-------------VASRNKFKGHMCS 110
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ R+ T + IP DWR+ G + V+NQ CG CWAFS V E +H L G
Sbjct: 111 SII-RTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGK 169
Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAAC 235
L LS QE++DC G + GC GG L+D D K + L E++YP D C
Sbjct: 170 LVSLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTC 223
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
S I Y D +E ++ +A P+ A++A +Q+Y GV +C
Sbjct: 224 NANKASIQATTITGYE-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 281
Query: 294 DGSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 282 GTEL---DHGVTAVGY 294
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 144/299 (48%), Gaps = 35/299 (11%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
LA+P K+ + LF+S+ ++ K Y+ + + R++ F+++L I E N+ S
Sbjct: 36 LALPNKL--------VGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGS- 86
Query: 78 ESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
G+ F+D++ EEFK +L + + + H + N V
Sbjct: 87 --YWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVN----------- 133
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
+P DWR+ G + V+NQ CG+CWAFSTV E ++ + G L LS QE++DC
Sbjct: 134 --LPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN 191
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
N GC GG ++ N+ + E +YP L+++ C+ K + I Y D
Sbjct: 192 TFNHGCRGGLMDFAFAYIMGNQGIYT-EEDYPYLMEEGYCREKQPHSKVITITGYE-DVP 249
Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDNY 312
SE+S+L +A H PV + A + +Q+Y GG+ C +HA+ VGY +Y
Sbjct: 250 ENSETSLLKALA-HQPVSVGIAAGSRDFQFYKGGIFDGECG---IQPDHALTAVGYGSY 304
>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
Length = 353
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)
Query: 4 VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
V +LF+V ALIA L + ++ + + F++R+ K + + +E RF
Sbjct: 11 VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 63
Query: 59 NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F++++ LN + A Y ++ +F+DL+ +EF +L N + H K +
Sbjct: 64 AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 116
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
H HV S+ +G+ + DWRE G++ V+NQ CG+CWAF+T E AL
Sbjct: 117 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 167
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
KN +L LS Q ++ C N + GC+GG + W+ D N V E YP A
Sbjct: 168 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMEQAMQWIINDHNGTV-PTEDSYP--YTSAGG 223
Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
R N G KI Y +L E I + +GPV AV+A TWQ Y GGV+
Sbjct: 224 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 280
Query: 293 CDGSLANINHAVQIVGYDNYSR 314
C G ++NH V +VG++ ++
Sbjct: 281 CFG--LSLNHGVLVVGFNRQAK 300
>gi|155966155|gb|ABU41032.1| cysteine proteinase [Lepeophtheirus salmonis]
Length = 372
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 156/313 (49%), Gaps = 34/313 (10%)
Query: 14 IALCFL----AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
+ +CFL A+ S P +++++ F SF + Y KSY +++ ++ K F +L IE
Sbjct: 1 LGVCFLFGLAALAAGTSSPT-QREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIE 59
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
E N N + + GI EFSDL++EEF+++++ +S MS K+ +I
Sbjct: 60 EHNANPK--RTWDMGINEFSDLTDEEFESKYMGYSP-----MSSSAGLVTRTAAPKQGNI 112
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS--LLS 186
+P DWRE G+I V+NQ +CG+CW FS VE ES A++N S LLS
Sbjct: 113 KD-------LPESVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLS 165
Query: 187 VQEVIDCAGNGNMGCSGGDFCALLD---WMDVNKVVLEPESEYP----LLLKDAACKRKA 239
Q++ C+ N G ++ +M +E E EYP + C A
Sbjct: 166 TQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSGFTEESGECLYNA 225
Query: 240 TSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
+S G + L P++ S++ +A GP+ +V A ++ Y G++ CD + A
Sbjct: 226 SSVTGKMAHVRGYEVLPPNDMYSVMEHLANKGPLGVSVYAGRFKSYKSGILN-GCDFN-A 283
Query: 299 N--INHAVQIVGY 309
N INHA+Q++GY
Sbjct: 284 NIVINHAIQMIGY 296
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 134/278 (48%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY +E RF F SL +I NK S G+ EF+DL+ EEF
Sbjct: 60 FARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLS---YTLGVNEFADLTWEEF 116
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +HK +T G+ +P+KKDWRE GI+ V+
Sbjct: 117 RKHRLGAAQNCSATLKGNHK-------------LTNGL-----LPLKKDWREVGIVTPVK 158
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q+++DCA N GC+GG +++
Sbjct: 159 NQGHCGSCWTFSTTGALEAAYVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYI 218
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP D CK + + GV++ + + + +E + +A PV
Sbjct: 219 KANG-GLDTEEAYPYTGVDGVCKFSSENI-GVQVLD-SVNITLGAEDELKDAVAFVRPVS 275
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A ++ ++ Y GV + C + ++NHAV VGY
Sbjct: 276 VAFEVVSGFRLYKSGVYTSDTCGNTPMDVNHAVVAVGY 313
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 142/292 (48%), Gaps = 38/292 (13%)
Query: 29 NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
NL F+SF+ ++ K+Y +K EHD RF F+ +L + + SA +G+T+F
Sbjct: 48 NLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNL---RRARLHAKLDPSAVHGVTKF 104
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
SDL+ EF+ + L + H +K I +PT +P DWR+
Sbjct: 105 SDLTPAEFRRQFLGLKPLRFPA------------HAQKAPI-----LPTKDLPKDFDWRD 147
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGN 198
G + V++Q CG+CW+FST E H L G L LS Q+++DC G +
Sbjct: 148 KGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACD 207
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GC+GG +++ + ++ E +YP +D CK T + +Y+ +L
Sbjct: 208 SGCNGGLMNNAFEYI-LQSGGVQKEKDYPYTGRDGTCKFDKTKV-AATVSNYSVVSL--D 263
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ +GP+ A+NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 264 EEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICG---KHLDHGVLLVGY 312
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + ES +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + I + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAI--DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 143/280 (51%), Gaps = 28/280 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ S+ + KSY+ E D RF+ F+ +L I+E +N +S + G+T+F+DL+ EE
Sbjct: 48 LYESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDE--QNSVPNQSYKLGLTKFADLTNEE 105
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+++ +L S N + G ++P I DWRE G++ V+
Sbjct: 106 YRSIYLG-------TKSSGDRKKLSKNKSDRYLPKVGDSLPESI----DWREKGVLVGVK 154
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-- 212
+Q +CG+CWAFS V ES++A+ G L LS QE++DC + N GC GG L+D+
Sbjct: 155 DQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGG----LMDYAF 210
Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
+ ++ E +YP ++ C + + VKI SY D + +E ++ +A H P
Sbjct: 211 EFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYE-DVPVNNEKALQKAVA-HQP 268
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V A+ A +Q+Y G+ C + ++H V I GY
Sbjct: 269 VSIALEAGGRDFQHYKSGIFTGKCGTA---VDHGVVIAGY 305
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 134/279 (48%), Gaps = 29/279 (10%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++F++F ++Y K+YS +E RF F+ S++ I N + S G+ EF+DLS EE
Sbjct: 40 DMFTAFMKQYSKAYSHAEFSSRFNQFKASVETIRL--HNTLANASYTMGLNEFADLSFEE 97
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FK ++ KHV + ++ H P DWR + + ++
Sbjct: 98 FKGKYFG---CKHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141
Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
+Q CG+CWAFS + E L+ TL+ LS Q+++DC+ + GN GC+GG ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEY 201
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ NK + ES YP C++ T V I + D E+S L + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGHK-DVASGDEASSLNAVGTVGPV 257
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y GV C N++H V VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 156/316 (49%), Gaps = 30/316 (9%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFEK 62
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F++
Sbjct: 87 RTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFKQ 146
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
S+ E + + A +G+T+FSD+S EEF+ +L + K++
Sbjct: 147 SM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALKR 193
Query: 123 VKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+K + + TG P DWR+ G + V++Q +CG+CWAF+ + E +
Sbjct: 194 PRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIGNIEGQWKIAGHE 248
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK--RK 238
L+ LS Q ++ C + C GG W+ NK + E YP D K
Sbjct: 249 LTSLSEQMLVSCDTTED-NCGGGFADRAFKWIVSSNKGNVFTERSYPYASIDGYVPPCNK 307
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
+ G KI + L E++I +A +GPV AV+A T+ Y GGV+ +C S
Sbjct: 308 SGKVVGAKISGHI--NLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT-SC--SSK 362
Query: 299 NINHAVQIVGYDNYSR 314
++NH V +VGY++ S+
Sbjct: 363 HVNHEVLLVGYNDTSK 378
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 91/332 (27%), Positives = 146/332 (43%), Gaps = 47/332 (14%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
K L V L +C + + + +E + ++ + Y +E RF+ F +
Sbjct: 5 KVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNN 64
Query: 64 LDIIEELNK--NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+ IE N NR+ G+ +F+DL+ +EF+ +
Sbjct: 65 VVFIESFNAAGNRRK---FWLGVNQFTDLTNDEFRATKT------------------NKG 103
Query: 122 HVKKRSITTGITIPTG-----------IPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+K+ + PTG +P DWR G + ++NQ CG CWAFS V
Sbjct: 104 FIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAA 163
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E + L G L LS QE++DC NG + GC GG+ +++ + L E+ YP
Sbjct: 164 TEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFI-IKNGGLTSETNYPYT 222
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
+D CK K T + IK Y D E+S++ +A PV AV+ + +Q+Y GG
Sbjct: 223 AQDGQCKAKNTINSVATIKGYE-DVPANDEASLMKAVAAQ-PVSVAVDGGDMVFQHYAGG 280
Query: 288 VIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
V+ +C SL +H + VGY D+ ++ W
Sbjct: 281 VLSGSCGTSL---DHGIVAVGYGAADDGTKFW 309
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDSS 298
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 143/303 (47%), Gaps = 39/303 (12%)
Query: 18 FLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSKSEH-DIRFKNFEKSLDIIEELNKN 73
FL++ E +EL F++ RY ++YS E D R + F ++L E+L
Sbjct: 155 FLSLSTSKPVEETEDFVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKL--- 211
Query: 74 RQSPE--SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
QS + +A YG+T+FSDL+EEEF+T +L +++ L K H
Sbjct: 212 -QSLDLGTAEYGVTKFSDLTEEEFRTLYLNPLLSQQKLQRSMKPAAMPHGPA-------- 262
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
P DWRE G + V+NQ CG+CWAFS E +K G L LS QE++
Sbjct: 263 -------PPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFVKTGKLVSLSEQELV 315
Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
DC + C GG + ++ V E E++Y K +C K+ +Y
Sbjct: 316 DC-DTADQACGGGLPSNAYEAIEKLGGV-ETETDYSYTGKKQSCDFTTD-----KVTAYI 368
Query: 252 CDT--LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQI 306
+ L E+ I +A +GPV A+NA Q+Y GV ++ C+ + I+HAV +
Sbjct: 369 NSSVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLL 426
Query: 307 VGY 309
VGY
Sbjct: 427 VGY 429
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 151/322 (46%), Gaps = 38/322 (11%)
Query: 1 MFDVKNVLFIVALIALCFLAIP------VKVSKPNL---EQKLELFSSFQQRYKKSYSKS 51
+F + ++F+V ++L L + V S+ +L E + LF S+ ++ K Y
Sbjct: 4 IFSISKLIFVVTCLSL-HLGLSSADFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTI 62
Query: 52 EHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+ I RF+ F+ +L I+E NK S G+ EF+DL+ +EFK +++ +++
Sbjct: 63 DEKIYRFETFKDNLMYIDETNKKNNS---YWLGLNEFADLTHDEFKEKYVGSIPEDSMII 119
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ + HV + P I DWR+ G + V+NQ CG+CWAFSTV T
Sbjct: 120 EQSDDVEFPNKHV--------VDYPESI----DWRQKGAVTPVKNQNPCGSCWAFSTVAT 167
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E ++ + G L LS QE++DC + GC GG L ++ N V E EYP
Sbjct: 168 VEGINKIVTGNLISLSEQELLDCDRRSH-GCKGGYQTTSLKYVVDNGV--HTEKEYPYEK 224
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIATHG-PVIAAVNALTWQYYLGG 287
K C+ K V I Y +PS E S++ I+ V+ +Q+Y GG
Sbjct: 225 KQGNCRAKNKKGLKVYINGY---KRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGG 281
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
V C L +HAV VGY
Sbjct: 282 VFGGPCGTKL---DHAVTAVGY 300
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 45/323 (13%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
++L L+ C + + KP E +++ LF F ++Y K Y EH+ R++ F+ +
Sbjct: 1 SLLIAAVLLIACVGVVLAQEYKPLAESEMKKLFIKFSRKYAKVYGTEEHNNRYQIFKAN- 59
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSHHKHHDHHH 120
+E+ + +GIT+FSDL+ EEFK L + K +L + H
Sbjct: 60 --VEKSRYYNHVGKRENFGITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQ------H 111
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ ++ + T P DWR+ G + +V+NQ CG+CW FST E A+K G
Sbjct: 112 AVLSEKEVQTA-------PTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKG 164
Query: 181 TLSLLSVQEVIDCAGN---------GNMGCSGGDFCALLDWMDVNKVV----LEPESEYP 227
L LS Q+++DC N + GC+GG L W V+ L+ E YP
Sbjct: 165 KLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGG-----LMWSAFQYVIKNGGLDTEDSYP 219
Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
D C+ ++ V + ++ E+ + +A +GP+ A+NA QYY G
Sbjct: 220 YEGVDDTCRFNKSN---VAATISSWTSISSDENQMAAWLAANGPISIAINAEWLQYYTSG 276
Query: 288 VIQ-YNCDGSLANINHAVQIVGY 309
+ + C+ +++H V IVGY
Sbjct: 277 ISDPWFCNPQ--DLDHGVLIVGY 297
>gi|44844206|emb|CAF32699.1| cathepsin L-like cysteine proteinase [Leishmania infantum]
Length = 381
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 131/270 (48%), Gaps = 26/270 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y+++Y +E R NFE++L+++ E ++P AR+GIT+F DLSE E
Sbjct: 37 LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
CG+CWAFS V ES A L LS Q+++ C N GC+GG +W+
Sbjct: 143 XXGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201
Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
+ ++ E YP + A C + G +I Y +IPS +++ +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258
Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
GP+ AV+A ++ Y GV + YN G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 139/285 (48%), Gaps = 30/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ + + K Y+ E + RF+ F+ +L I+E N ++ + G+ F+D
Sbjct: 46 DEVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRT---YKLGLNGFAD 102
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE+++ +L N ++K S + +P DWR+ G
Sbjct: 103 LTNEEYRSTYL------------GARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGA 150
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 151 VAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGG----L 206
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ E +YP L +D C + V I Y D + SE+++ +
Sbjct: 207 MDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYE-DVPVNSETALQKAV 265
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A PV A+ A +Q+Y G+ C L +H V VGY
Sbjct: 266 ANQ-PVSVAIEAGGRDFQFYASGIFSGRCGTQL---DHGVAAVGY 306
>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
Length = 396
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 159/312 (50%), Gaps = 37/312 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+ L+A F K+ L+Q+ + F++ QR K+ E+ +RF+ F+K+L IEE
Sbjct: 64 MTILMASIFRIRAEKLKFFGLQQQFKDFNAKFQREHKTLE--EYKMRFEIFQKNLRDIEE 121
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKT-----RHLRHSVNKHVLMSHHKHHDHHHNHVK 124
LN ++P S +YGI +FSD +E E K + L S++ L + + +
Sbjct: 122 LN--LKNP-SVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRN------- 171
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
R+I + P I DWR G + V++Q CG+CWAF+TV ES +A++ GTL
Sbjct: 172 PRNIIKNVQRPDYI----DWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWS 227
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS QE++DC G + GC GG + L ++ N LE E +YP +A + NG
Sbjct: 228 LSEQELVDCDG-ASYGCGGGFLTSALGFILGNG--LETEDDYPY----SATRHDQCWING 280
Query: 245 VKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI---QYNC-DGSL 297
K + + + L SE + +A GPV A++ ++ YY G+ ++ C D SL
Sbjct: 281 DKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPYYHDGIYSPSEHECKDESL 340
Query: 298 ANINHAVQIVGY 309
HA+ I+GY
Sbjct: 341 G--YHAMAIIGY 350
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRVRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 146/307 (47%), Gaps = 30/307 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+L I LIA+ + + EQ F ++ R++K Y SE RF F+ ++D
Sbjct: 157 LLLIFGLIAISNALLFSE------EQYKNEFENWIDRFEKKYDVSEFKKRFSIFKSNMDF 210
Query: 67 IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
+ N KN Q+ G+ +DL+ E++ +L + K VL + H + V
Sbjct: 211 VHSWNSKNSQTV----LGLNHLADLTNLEYRQFYL-GTHKKAVLGTPGNHEVSNLQSVFG 265
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
S T DWR+ G + +++Q CG+CW+FST + E H +K+G + L
Sbjct: 266 DSATV------------DWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVEL 313
Query: 186 SVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
S Q ++DC+ GNMGC+GG +++ N + + ES YP + + +G
Sbjct: 314 SEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGI-DTESSYPYTASSGTTCKYNKANSG 372
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
I SY + SES + + GPV A++A ++Q Y G I Y+ S N++H
Sbjct: 373 ATISSYK-NITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHG-IYYDASCSSVNLDH 430
Query: 303 AVQIVGY 309
V +VGY
Sbjct: 431 GVLVVGY 437
>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
Length = 781
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 29/283 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F Y ++YS E ++R + F ++L IIE L K Q+ + RYG+ F+D+S EE
Sbjct: 523 LFDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQA--TGRYGVNMFADMSREE 580
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+TR+L + ++ K +I +P DWR+ G++ V+
Sbjct: 581 FRTRYL------GLRPDLQSENEIPLQEAKFPNIE--------LPPTFDWRKKGVVTPVK 626
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFS E +A+K+G L LS QE++DC + G A +
Sbjct: 627 NQGGCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDDLDDGCGGGLPDNA---YRA 683
Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ K+ LE ES+YP ++ C K N VK++ + + E+ + + +GP+
Sbjct: 684 IEKLGGLELESDYPYEAENEKCHFKK---NLVKVELTSAVNVTSDETQMAQWLVQNGPIS 740
Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGYDNYS 313
+NA Q+Y+GGV ++ C+ N++H V IVGY S
Sbjct: 741 IGINANAMQFYMGGVSHPFKFLCNPK--NLDHGVLIVGYGTSS 781
>gi|118401108|ref|XP_001032875.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89287220|gb|EAR85212.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 360
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 148/318 (46%), Gaps = 34/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVS-KPNLEQKLEL-------FSSFQQRYKKSY-SKSEHD 54
+ +L VA++ + L + + K + E K L F +F+ +Y K+Y +E
Sbjct: 4 TQKILVSVAVLGVFLLTLNYVIDHKTDDEIKFMLRKSIERAFKNFKVKYAKTYKDDTEEQ 63
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF F + E+ ++ + ++ G+ +F+DL+ EEFK + H KH
Sbjct: 64 YRFSVFTNNY---VEIYRHNKFLVFSKVGVNQFADLTHEEFKALYTGH---KHSKDDDDD 117
Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
+ + H +PT +P DWR+ G I V+ Q CG CWAFSTV++ E
Sbjct: 118 DNKNKQPH-----------LPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTVQSIEG 166
Query: 174 MHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
++ LK G L LS Q+VIDC GC GGD + N ++ E+EYP + K
Sbjct: 167 LYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAFRCIQNNGGIMT-ETEYPYIAKQQ 225
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
+CK P +I Y +PS+ S + P+ +N+ +++YY GVI
Sbjct: 226 SCKFDEDKPT-FQIGGY---IDVPSDQSQVKAALLIQPLSICLNSSDTSFKYYKSGVITE 281
Query: 292 NCDGSLANINHAVQIVGY 309
DG +H + +VGY
Sbjct: 282 CEDGPYDGPDHCLLLVGY 299
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 128/264 (48%), Gaps = 28/264 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
N ++ LELF S+ + K+Y E + RF+ F ++L I++ N S G+ EF
Sbjct: 43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK R+L + + + R IT +P DWR+
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V++Q CG+CWAFSTV E ++ + G LS LS QE+IDC N GC+GG
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203
Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ ++ L E +YP L+++ C+ + V I Y + + ++ L
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260
Query: 265 DIATHGPVIAAVNA--LTWQYYLG 286
H PV A+ A +Q+Y G
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKG 284
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 145/307 (47%), Gaps = 20/307 (6%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLD 65
L +V L A L S + LE F ++Q Y ++Y+ E RF + ++L
Sbjct: 10 LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I+ +N+ + S G +F+DL+EEEFK +L + + +
Sbjct: 70 FIKTMNQ-LSTGSSYELGENQFTDLTEEEFKDTYL-------MKLDEQPPAAEAMPPIVG 121
Query: 126 RSITTGITIP--TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
T G++ TG P DWR G + V+NQQ CG+CWAF+TV + E +H +K G L
Sbjct: 122 TMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRL 181
Query: 183 SLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC GN GC GG + ++W+ N L ES+YP + C
Sbjct: 182 VSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLG 240
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANI 300
+ +I+ Y +E+ + +A PV ++A +Q+Y GV C+ + +
Sbjct: 241 HHAARIRGYQA-VQRKNEAELERAVAGR-PVAVVIDASRAFQFYKRGVFSGPCNTT--TV 296
Query: 301 NHAVQIV 307
NHAV +V
Sbjct: 297 NHAVTVV 303
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + T + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280
>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
Length = 396
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/276 (33%), Positives = 142/276 (51%), Gaps = 24/276 (8%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+ F+ L+A F K+ L+Q+ F F +++ + + S E+ +RF+ F+K+L
Sbjct: 61 LFFMTILMASTFKIRAEKLKFFGLQQQ---FKDFNKKFGREHKSLEEYKMRFEVFQKNLR 117
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVK 124
IEELN ++P S +YGI FSD +E E K + + ++ + S K + N
Sbjct: 118 DIEELN--LKNP-SVQYGINRFSDKTESELKNLLMDKKFMDSSLSNSSLKTLSSYRN--- 171
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
R+I + P I DWR G + V++Q CG+CWAF+TV ES +A++ GTL
Sbjct: 172 PRNIIKNVQRPDYI----DWRNVGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWS 227
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS QE++DC G + GCSGG + L+++ N LE E +YP A K NG
Sbjct: 228 LSEQELVDCDG-ASYGCSGGFLTSALEFILGNG--LETEDDYPY----TATKHDQCWING 280
Query: 245 VKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA 278
K + + + L +E I +A GPV A+ A
Sbjct: 281 DKTRVWIDEGYQLTMNEDDIAEWVANVGPVSFAMRA 316
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 140/278 (50%), Gaps = 27/278 (9%)
Query: 36 LFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
LF F ++Y ++YS S E++ RF+ F+ + +++ LN+ + +A YGIT+F D+SEE
Sbjct: 168 LFDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERG--TAVYGITKFMDMSEE 225
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
E+ R L + + V +++ + T IP DWR+ G + +V
Sbjct: 226 EYH-RTLAPGFTRPL--------------VPIQTLNSAELDTTNIPDSMDWRKHGAVTEV 270
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
+NQ +CG+CWAFST E LK+ L LS QE++DC + GC GG +
Sbjct: 271 KNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCD-TLDSGCGGG--LPSNAYK 327
Query: 214 DVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ K+ LEPE +YP + + C K + K+ L E + +A +GP+
Sbjct: 328 SIEKLGGLEPEKDYPYVGEGEKCAIKQSD---FKVFVNNSVALPKDEVKLAAWLAQNGPI 384
Query: 273 IAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+NA Q+Y GG+ + + +++H V IVGY
Sbjct: 385 SIGINANLMQFYWGGISHPWKIFCNPKSLDHGVLIVGY 422
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/69 (46%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
T IP DWR+ G + +V+NQ +CG+CWAFST E LK+ L LS QE++DC
Sbjct: 473 TNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCD- 531
Query: 196 NGNMGCSGG 204
+ GC GG
Sbjct: 532 TLDSGCGGG 540
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + T + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGY 280
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 145/307 (47%), Gaps = 20/307 (6%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLD 65
L +V L A L S + LE F ++Q Y ++Y+ E RF + ++L
Sbjct: 10 LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I+ +N+ + S G +F+DL+EEEFK +L + + +
Sbjct: 70 FIKTMNQ-LSTGSSYELGENQFTDLTEEEFKDTYL-------MKLDEQPPAAEAMPPIVG 121
Query: 126 RSITTGITIP--TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
T G++ TG P DWR G + V+NQQ CG+CWAF+TV + E +H +K G L
Sbjct: 122 TMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRL 181
Query: 183 SLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC GN GC GG + ++W+ N L ES+YP + C
Sbjct: 182 VSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLG 240
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANI 300
+ +I+ Y +E+ + +A PV ++A +Q+Y GV C+ + +
Sbjct: 241 HHAARIRGYQA-VQRKNEAELERAVAGR-PVAVVIDASRAFQFYKRGVFSGPCNTT--TV 296
Query: 301 NHAVQIV 307
NHAV +V
Sbjct: 297 NHAVTVV 303
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 149/311 (47%), Gaps = 22/311 (7%)
Query: 7 VLFIVALIA-LCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
+ IV+L++ CF ++ L + + + + ++Y+ +E + R+ F++++
Sbjct: 8 IFLIVSLVSSFCFSTTLSRLLDDELIMQKK-HDEWMAEHGRTYADMNEKNNRYVVFKRNV 66
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ IE LN N + + + + +F+DL+ +EF+ + + VL S + K
Sbjct: 67 ERIERLN-NVPAGRTFKLAVNQFADLTNDEFRFMYTGYK-GDFVLFSQ--------SQTK 116
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
S +P+ DWR+ G + ++NQ +CG CWAFS V E +K G L
Sbjct: 117 STSFRYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS Q+++DC N + GCSGG + + + L ES YP +DA CK K+T P+
Sbjct: 177 LSEQQLVDCDTN-DFGCSGGLMDTAFEHI-MATGGLTTESNYPYKGEDANCKIKSTKPSA 234
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINH 302
I Y D + E++++ +A H PV + +Q+Y GV C L +H
Sbjct: 235 ASITGYE-DVPVNDENALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTYL---DH 289
Query: 303 AVQIVGYDNYS 313
AV VGY S
Sbjct: 290 AVTAVGYSQSS 300
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 132/276 (47%), Gaps = 25/276 (9%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
F+ F +RY K Y E +I+ + F+ LD +E +N + S + G+ EFSDL+ +EF+
Sbjct: 59 FARFARRYGKRYDSVE-EIK-QRFDIFLDNLEMINSHNDKGLSYKLGVNEFSDLTWDEFR 116
Query: 97 TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
L + N ++K R +P KDWREAGI+ V+NQ
Sbjct: 117 RDRLGAAQNCSATT---------KGNLKLRDAV--------LPETKDWREAGIVSPVKNQ 159
Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDV 215
CG+CW FST E+ + K G LS Q+++DCAG N GC+GG +++
Sbjct: 160 GKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKS 219
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
N LE E YP K+ CK + + GVK+ + + + +E + +A PV A
Sbjct: 220 NG-GLETEEAYPYTGKNGLCKFSSQNV-GVKVTD-SVNITLGAEDELKYAVALVRPVSVA 276
Query: 276 VNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ QY G C + ++NHAV VGY
Sbjct: 277 FEVVKGFKQYKSGVYTSTECGTTPMDVNHAVLAVGY 312
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 150/317 (47%), Gaps = 31/317 (9%)
Query: 14 IALCFLAIPVKV--SKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
+ LC LA+ ++ + P+L+ L+ + +++ + K Y + E R +EK+L +I+
Sbjct: 3 VYLCALALFLEACFAAPSLDSALDDHWQAWKTWHSKKYHQQEEGWRRMIWEKNLKMIQLH 62
Query: 71 NKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
N + S R G+ F D++ EEF+ +M+ +KH + +
Sbjct: 63 NLDHSLGKHSYRLGMNHFGDMTNEEFRQ-----------VMNGYKHSKTEKKYRGSEFLE 111
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
+ +P DWRE G + V++Q CG+CWAFST + E H K G L LS Q
Sbjct: 112 PNFLV---VPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQN 168
Query: 190 VIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
++DC+ GN GC+GG +++ N + + E YP + KD + N
Sbjct: 169 LVDCSRPEGNQGCNGGLMDQAFEYIADNGGI-DSEESYPYIAKDDEDCLYKSEFNAANDT 227
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
+ D E +++ +A GPV A++A T+Q+Y G I Y+ D S ++H V +
Sbjct: 228 GFV-DVPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESG-IYYDPDCSSEELDHGVLV 285
Query: 307 VGY-------DNYSRTW 316
VGY DN + W
Sbjct: 286 VGYGFEGTDDDNKKKYW 302
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 156/311 (50%), Gaps = 34/311 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQK-LELFSSFQQRYKK-SYSKSEHDIRFKNFEKSL 64
+L+ + L L L++ + +S ++ + ++ + +++K Y E + RF+ F+ +L
Sbjct: 4 ILYSLILFGLITLSLSLDMSSGRSNKEVMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNL 63
Query: 65 DIIEELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHV---LMSHHKHHDHHH 120
I+E N +P S R G+ EFSD++ +E++ +L N ++ + S + H
Sbjct: 64 IFIDEHN----APNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGH 119
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
N+ +PV DWR G + ++NQ +CGACWAFS V E+++ + G
Sbjct: 120 NN--------------KLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTG 163
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
+L LS QE++DC N GC+GG+ ++ V L+ + +YP L + + C +
Sbjct: 164 SLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFI-VENGGLDSQIDYPYLGRQSTCNQAKK 222
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLA 298
+ V I Y + SES+++ +A PV + A +Q Y GV +C SL
Sbjct: 223 NTKVVSINGYK-NVQRNSESALMEAVANQ-PVSVGIEAYGKDFQLYQSGVFTGSCGTSL- 279
Query: 299 NINHAVQIVGY 309
+HAV +VGY
Sbjct: 280 --DHAVVVVGY 288
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 138/281 (49%), Gaps = 19/281 (6%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+LE+ LF F + Y K Y +SE + RFK F +L I +N + +A YGI +FS
Sbjct: 811 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 867
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS++EF + + + S+ H KK + + P + DWR+ G
Sbjct: 868 DLSKDEFVKFYT--GLKREESPSNEDH--------KKTDLPKSFNVTA--PDQFDWRKKG 915
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ V+ Q C +CWAFS ES++A+K G L +S Q+++DC N GCSGG C+
Sbjct: 916 VVSSVKFQGHCVSCWAFSVAGNVESINAIKTGKLIDVSEQQLVDC-DEWNFGCSGGIACS 974
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ +K YP + K+ C R +S +++K Y + SE I +
Sbjct: 975 KSHFSYFHKKGAMSLESYPYVGKEGQC-RYNSSKVVIRLKDYQY-FIALSEDEIKEYLYN 1032
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GP+ +++ +Y GG++ C + NHAV +VGY
Sbjct: 1033 IGPLSIDIDSSQIHHYKGGIVIKECQ-EVKKTNHAVLLVGY 1072
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 139/281 (49%), Gaps = 21/281 (7%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+LE+ LF F + Y K Y +SE + RFK F +L I +N + +A YGI +FS
Sbjct: 511 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 567
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS+EEF + + + S+ H KK + + P + DWR+ G
Sbjct: 568 DLSKEEFIKYYT--GLKREESPSNEDH--------KKTDLPESFNVTA--PDQFDWRKKG 615
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ ++NQ+ CG+CWAFS ES+HA+K G L +S Q+++DC + GCSGG
Sbjct: 616 VVSSIKNQKHCGSCWAFSAAGNVESIHAIKTGKLVHVSEQQLVDCDSQ-DSGCSGGLTWN 674
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ + N V YP + ++ C R ++ +++K Y T + SE I +
Sbjct: 675 AMRYFRTNGAV--SLKSYPYVAQNENC-RYDSNKVVIRLKDYKHITQL-SEDQIKEHLYN 730
Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
G + + + +Y GG++ C S ++HAV +V Y
Sbjct: 731 IGLLSIDITSTQLTWYEGGILIEECRRSDL-VDHAVLLVEY 770
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 103/190 (54%), Gaps = 21/190 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
+LE+ LF F + Y K Y +SE + RFK F +L I +N + +A YGI +FS
Sbjct: 294 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 350
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS+EEF + ++ HHK D K +IT P + DWR+ G
Sbjct: 351 DLSKEEFIKYYTGLKRDRCTTTEHHKSTDL----PKSFNITA--------PDQFDWRKKG 398
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ V+NQ+ CG+CWAFS ES+HA+K G L +S Q+++DC + GCSGG
Sbjct: 399 VVSSVKNQRHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYDSGCSGG---- 453
Query: 209 LLDWMDVNKV 218
L+W+ + ++
Sbjct: 454 -LEWIAMREL 462
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 75/231 (32%), Positives = 115/231 (49%), Gaps = 18/231 (7%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+A YGI +FSDLS+EEF + + + S+ H KK + +
Sbjct: 7 NAVYGINKFSDLSKEEFVKYYT--GLKREESPSNEDH--------KKTDLPESFNVTA-- 54
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P + DWR+ G++ ++NQ+ CG+CWAFS ES+HA+K G L +S Q+++DC +
Sbjct: 55 PDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYD 113
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GCSGG L + N + YP + K+ C R +S +++K Y + S
Sbjct: 114 SGCSGGLPWDALRYFVANGAM--SLKSYPYVAKEGKC-RYDSSKVEIRLKEYKHKEKL-S 169
Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E I + GP+ A+ + Y GG++ C S INHAV +VGY
Sbjct: 170 EDQIKEHLYNIGPLSIAITSSPLASYNGGILIEECHRSYL-INHAVLLVGY 219
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + T + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGY 280
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 148/314 (47%), Gaps = 40/314 (12%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE +NK S + G+ EF+D++ +EF + ++ L S +D
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYL-SPSPINDLS 119
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 120 DDD---------------MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 164
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 165 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQE 222
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
+ V+I SY ++P + L T PV IAA L Q+Y GG DGS
Sbjct: 223 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 272
Query: 297 LAN-INHAVQIVGY 309
AN INHAV +GY
Sbjct: 273 CANRINHAVTAIGY 286
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 99/325 (30%), Positives = 154/325 (47%), Gaps = 38/325 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
S A+ INHAV +GY +N + W
Sbjct: 279 SCADRINHAVTAIGYGTDENGQKYW 303
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 148/314 (47%), Gaps = 40/314 (12%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE +NK S + G+ EF+D++ +EF + ++ L S +D
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYL-SPSPINDLS 119
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+ +P DWRE+G + +V+NQ CG CWAFS V + E + +
Sbjct: 120 DDD---------------MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 164
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 165 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQE 222
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
+ V+I SY ++P + L T PV IAA L Q+Y GG DGS
Sbjct: 223 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 272
Query: 297 LAN-INHAVQIVGY 309
AN INHAV +GY
Sbjct: 273 CANRINHAVTAIGY 286
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KTNDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYSGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + T + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGY 280
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 138/287 (48%), Gaps = 32/287 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF + + K Y E RF+ F+ +L I+E NK S G+ EF
Sbjct: 37 SMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS---YWLGVNEF 93
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ +EFK +L V + + + V +P DWR+
Sbjct: 94 ADLTHQEFKNMYLGLKVESS--RTRQSPEEFTYKDV------------VDLPKSVDWRKK 139
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC GG
Sbjct: 140 GAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGG--- 196
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ V+ L E +YP L ++ C K V I Y D +E+S++
Sbjct: 197 -LMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYK-DVPENNEASLIK 254
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A H P+ A+ A +Q+Y GGV C ++H V VGY
Sbjct: 255 ALA-HQPLSVAIEASGRDFQFYSGGVFDGPCG---TQLDHGVTAVGY 297
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 133/281 (47%), Gaps = 21/281 (7%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E L+ ++ + S S E RF F+++++ + E NK E + + +F+D+
Sbjct: 32 ESLWNLYERWRSHHTVSRSLDEKHKRFNVFKENVNFVHEFNKK---DEPYKLKLNKFADM 88
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EF++ + VN H + +H + K +S+ P DWR+ G +
Sbjct: 89 TNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSV----------PPSVDWRKKGAV 138
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+++Q CG+CWAFSTV E ++ +K L LS QE++DC + N GC+GG
Sbjct: 139 TPIKDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAF 198
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ K + E YP +D C + V I + +T+ P+ L A +
Sbjct: 199 EFIK-EKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGH--ETVPPNNEDALLKAAANQ 255
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A++A +Q+Y GV C +++H V IVGY
Sbjct: 256 PISVAIDAGGSAFQFYSEGVFAGRCG---TDLDHGVAIVGY 293
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 137/286 (47%), Gaps = 22/286 (7%)
Query: 31 EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ +ELF + +++ K Y E + +F+NF +L + E N R + G+ +F+D
Sbjct: 45 ERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFAD 104
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR----SITTGITIPTGIPVKKDWR 145
+S EEF+ V +S K +++R + P DWR
Sbjct: 105 MSNEEFR----------EVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWR 154
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
+ GI+ V++Q CG+CWAFS+ E ++AL NG L LS QE++DC N GC GG
Sbjct: 155 KYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-NDGCEGGY 213
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+W+ N + + E++YP +D C V I Y + + ES++
Sbjct: 214 MDYAFEWVMSNGGI-DTETDYPYTGEDGTCNTTKEETKAVSIDGY--EDVAEEESALFCA 270
Query: 266 IATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ P+ ++ A+ +Q Y GG+ +C +I+HAV +VGY
Sbjct: 271 VLKQ-PISVGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGY 315
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 139/290 (47%), Gaps = 32/290 (11%)
Query: 34 LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+E ++ +Y ++Y E + R F+ +++ IE NK + P + + EF+DL+
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKP--YKLSVNEFADLTN 58
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF+ + ++ H+ S K + + + +P DWR+ G +
Sbjct: 59 EEFQASRNGYKMSAHLSSSSTKPFRYEN--------------VSAVPSTMDWRKKGAVTP 104
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLD 211
+++Q CG CWAFS V E + L G L LS QE++DC +G + GC+GG D
Sbjct: 105 IKDQGQCGCCWAFSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFD 164
Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
++ NK L E+ YP D AC + KI Y D SE+++L +A P
Sbjct: 165 FIIQNK-GLTTEANYPYQGADGACNSGKAA---AKITGYE-DVPANSEAALLKAVANQ-P 218
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
V A++A +Q+Y GV +C +++H V VGY D+ ++ W
Sbjct: 219 VSVAIDAGGSAFQFYSSGVFTGDCG---TDLDHGVTAVGYGMSDDGTKYW 265
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + T + + DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGY 280
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 138/287 (48%), Gaps = 32/287 (11%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++++ +ELF + + K Y E RF+ F+ +L I+E NK S G+ EF
Sbjct: 40 SMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS---YWLGVNEF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ +EFK +L V + + + V +P DWR+
Sbjct: 97 ADLTHQEFKNMYLGLKVESS--RTRQSPEEFTYKDV------------VDLPKSVDWRKK 142
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC GG
Sbjct: 143 GAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGG--- 199
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ V+ L E +YP L ++ C K V I Y D +E+S++
Sbjct: 200 -LMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYK-DVPENNEASLIK 257
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A H P+ A+ A +Q+Y GGV C ++H V VGY
Sbjct: 258 ALA-HQPLSVAIEASGRDFQFYSGGVFDGPCG---TQLDHGVTAVGY 300
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 157/325 (48%), Gaps = 42/325 (12%)
Query: 1 MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
+F V +++F+ +++C + V ++P + + F+ F++++ K Y S EH
Sbjct: 8 LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF F+ +L + + + P SAR+G+T+FSDL+ EF+ +HL V
Sbjct: 67 YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117
Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
D + + +PT +P + DWR+ G + V+NQ +CG+CW+FST E
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167
Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
H L G L LS Q+++DC G+ + GC+G + ++ + L E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYT-LKTGGLMREKD 226
Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
YP D + S + +++ ++ +E I ++ +GP+ A+NA Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284
Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
GGV Y C L NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306
>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
Length = 357
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 143/319 (44%), Gaps = 35/319 (10%)
Query: 7 VLFIVALIALCFLA---IPVKVS--KPNLE------QKLELFSSFQQRYKKSYSK-SEHD 54
+ F + + +CF + PV+ S PNL+ + ++LF +++ + Y E
Sbjct: 11 IFFFICITLICFSSSSNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVYKDLKEMA 70
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF+ F +L+ I E N R SP G+ F+D S EF+ +L HS+
Sbjct: 71 KRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYL-HSL---------- 119
Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
D + K G + P DWR + ++NQ +CG+CWAFS E +
Sbjct: 120 --DMPTDSAPK---LNGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGI 174
Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
HA+ G L LS QE+++C + GC+GG DW+ N + E+EYP KD
Sbjct: 175 HAITTGELISLSEQELVNCD-RVSKGCNGGWVNKAFDWVISNGGI-TLEAEYPYTGKDGG 232
Query: 235 -CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YN 292
C P I Y + + S++ +L I P+ +NA +Q Y G+
Sbjct: 233 NCNSDKQVPIKATIDGY--EQVEQSDNGLLCSIVKQ-PISICLNATDFQLYESGIFDGQQ 289
Query: 293 CDGSLANINHAVQIVGYDN 311
C S NH V IVGYD+
Sbjct: 290 CSSSSKYTNHCVLIVGYDS 308
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 138/284 (48%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K+Y S EHD R F+ ++ ++++ +A +G+T+FSD + EF
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDSTPTEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDRGAVTPVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG CW+FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 152 NQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D R + K+ +++ +L E I ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 146/287 (50%), Gaps = 27/287 (9%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ S+ +++K+Y+ E + RF F+ +L+ I++ N + ++ + G+ +F+D
Sbjct: 47 DEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSD--DSQTFKVGLNKFAD 104
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK--KRSITTGITIPTGIPVKKDWREA 147
L+ EEF++ +L + S + VK + G +P + DWR+
Sbjct: 105 LTNEEFRSVYL----GRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAV----DWRKN 156
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + KV++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC GG
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGG--- 213
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ +N ++ +++YP KD C + + V I + + + ++ L
Sbjct: 214 -LMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDF--EDVPENDEKALQ 270
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A T+Q+Y GV C A+++H V VGY
Sbjct: 271 KAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCG---ADLDHGVVAVGY 314
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/325 (30%), Positives = 153/325 (47%), Gaps = 38/325 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
S A+ INHAV +GY +N + W
Sbjct: 279 SCADRINHAVTAIGYGTDENGQKYW 303
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIE 68
+ L FLA V E + RY K Y E + RF+ F+++++ IE
Sbjct: 12 LALFFCLGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIE 71
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N P + GI +F+DL+ EEF R+ N H S+ + + +V
Sbjct: 72 AFNNAANKP--YKLGINQFADLTSEEFIVP--RNRFNGHTRSSNTRTTTFKYENV----- 122
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + ++NQ +CG CWAFS + E +H + G L LS Q
Sbjct: 123 -------TVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQ 175
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
EV+DC G + GC GG ++ N + E+ YP D C K + + I
Sbjct: 176 EVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGI-NTEASYPYKGVDGKCNIKEEAVHAATI 234
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
Y D I +E ++ +A PV A++A +Q+Y G+ +C L +H V
Sbjct: 235 TGYE-DVPINNEKALQKAVANQ-PVSVAIDASGADFQFYKSGIFTGSCGTEL---DHGVT 289
Query: 306 IVGY 309
VGY
Sbjct: 290 AVGY 293
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/287 (33%), Positives = 131/287 (45%), Gaps = 33/287 (11%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F R++K YS K E RF+ F+K+ I EL KN Q +A YG T+FSD++ EF
Sbjct: 172 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQG--TAVYGFTKFSDMTTMEF 229
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTGIPVKKDWREAGIIGKVR 154
K L + + V + GITI +P DWR+ G + +V+
Sbjct: 230 KQTMLPYQWEQPVYPMDQADFEKE-----------GITISEEDLPESFDWRDKGAVTQVK 278
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
NQ CG+CWAFST E L L LS QE++DC G + GC+GG +
Sbjct: 279 NQGNCGSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCDGV-DQGCNGGLPSNAYKEI- 336
Query: 215 VNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+ LEPE YP K C K A NG L E + + T G
Sbjct: 337 IRMGGLEPEDAYPYDGKGETCHLVRKDIAVYING-------SIELPHDEVEMQKWLVTKG 389
Query: 271 PVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
P+ +NA T Q+Y GV+ + C+ + +NH V IVGY R
Sbjct: 390 PISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 434
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 135/295 (45%), Gaps = 26/295 (8%)
Query: 21 IPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
+P + + E+ L F+ F+QRY + Y S +E R F K+L ++ +P
Sbjct: 21 VPAATASLHAEETLASQFADFKQRYGRVYKSAAEEAFRLSVFRKNL--LDAKLHAAANPH 78
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG- 137
A +G+T FSDL+ EEF++RH H H ++ + + G
Sbjct: 79 -ATFGVTPFSDLTREEFRSRH---------------HSGAAHFAAGRKRARVPVDVGVGD 122
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
P DWR+ G + V++Q CG+CWAFS + E L L+ LS Q ++ C
Sbjct: 123 APAAVDWRDRGAVTPVKDQGQCGSCWAFSAIGNVEGQWFLAGNALTSLSEQMLVSC-DTM 181
Query: 198 NMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
+ GC GG + +W+ + + + E Y D + TS V L
Sbjct: 182 DSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLP 241
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
P E+ + T +A +GP+ AV+A +W +Y GGV+ L +H V +VGY++
Sbjct: 242 PDEAKMATWLAANGPLAVAVDASSWMFYTGGVLTSCVSNEL---DHGVLLVGYND 293
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 153/318 (48%), Gaps = 43/318 (13%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ N + L LCF A + + N + + S+ +Y +SY +E D +F+ F+
Sbjct: 3 IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62
Query: 62 KSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFK-TRHLRHSVNKHVLMSHHKHHDHH 119
+ I+ N KN + GI +F+D++ EEFK T+ + ++ V S +++
Sbjct: 63 ANAAFIDSFNAKNHK----FWLGINQFADITNEEFKVTKTNKGFISNKVRASTGFSYEN- 117
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
++I +P DWR G + V++Q CG CWAFS V E + L
Sbjct: 118 ------------VSIDA-LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLST 164
Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDA 233
G L LS QE++DC +G + GC GG L+D D K + L ES YP +D
Sbjct: 165 GKLVSLSEQELVDCDVHGEDQGCEGG----LMD--DAFKFIITNGGLTQESSYPYDAEDG 218
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
CK + S IKSY D +E +++ +A PV AV+ +T+Q+Y GGV+
Sbjct: 219 KCKSGSKSAG--TIKSYE-DVPANNEGALMKAVANQ-PVSVAVDGGDMTFQFYSGGVMTG 274
Query: 292 NCDGSLANINHAVQIVGY 309
+C +++H + +GY
Sbjct: 275 SCG---TDLDHGIAAIGY 289
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F RY K Y S E +RF F+++LD+I NK S + + +F+DL+ +EF
Sbjct: 59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK IT T +P KDWRE GI+ V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
Q CG+CW FST E+ + G LS Q+++DCAG N GC GG +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV+++ + + + +E + + PV
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + +++Y GV N C + ++NHAV VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/319 (29%), Positives = 157/319 (49%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
L+ LS Q ++ C + GC GG L W+ NK + YP K C
Sbjct: 168 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G KI + L E++I +A +GPV AV+A ++ Y GGV+ +C
Sbjct: 227 -NKSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 282
Query: 296 SLANINHAVQIVGYDNYSR 314
++H V +VGYD+ S+
Sbjct: 283 K--GLDHDVLLVGYDDTSK 299
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 137/284 (48%), Gaps = 37/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF + + K Y S E++ RF F+ +L ++ L P +A +G+T FSDL+EEEF
Sbjct: 56 FESFMKDFGKVYHSVEEYEHRFGVFKSNL--LKALKHQALDP-TASHGVTMFSDLTEEEF 112
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+++L + + ++S S +PT +P DWRE G +G V+
Sbjct: 113 TSKYL--GLKRPSVLS---------------SAPQAPPLPTEDLPPNFDWREKGAVGPVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
+Q CG+CWAFST E H L +G L LS Q+++DC A + GC+GG
Sbjct: 156 DQGGCGSCWAFSTTGAVEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFM 215
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ LE ES+YP +D CK + N V +K + E + +
Sbjct: 216 TNAYQYVEAAG-GLELESDYPYEGRDGKCKFDS---NKVAVKVSNFTNIPVDEDQVAAYL 271
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
GP+ +NA Q Y+ GV C+ N++H V +VGY
Sbjct: 272 IKSGPLAIGINAEFMQTYIAGVSCPIFCNKR--NLDHGVLLVGY 313
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + V +K Q F+ + ++KSY+ E R+ F ++D +++
Sbjct: 4 LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYVQQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ EE++ +L + L+ + H ++
Sbjct: 64 NS--KGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVHTNSSA------- 113
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
KDWR G + V+NQ CG CW+FST + E H G L LS Q +
Sbjct: 114 ---------ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
IDC+ N GC GG +++ +N ++ ES YP ++ C+ K+ + G + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSENS-GATLSSY 221
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
T SESS+ + + + PV A++A ++Q Y G I Y + S N++H V VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278
Query: 309 Y 309
Y
Sbjct: 279 Y 279
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 144/287 (50%), Gaps = 40/287 (13%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF++R+ ++Y + E R F +L ++++ +A +G+T+FSDL+ EF
Sbjct: 58 FASFERRFGRTYRDAGERAYRMSVFAANL---RRARRHQRLDPTATHGVTKFSDLTPGEF 114
Query: 96 KTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIG 151
+ R L R S+ V H+ +PT G+P DWRE G +G
Sbjct: 115 RDRFLGLRRPSLEGLVGGEPHE----------------APILPTDGLPDDFDWREHGAVG 158
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSG 203
V++Q +CG+CW+FST E H L G L +LS Q+++DC + + GC+G
Sbjct: 159 PVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNG 218
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G ++ + L+ E +YP ++ CK S ++K+++ ++ +E I
Sbjct: 219 GLMTTAFSYL-MKSGGLQSEKDYPYAGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIA 274
Query: 264 TDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ HGP+ A+NA Q Y+GGV + C +++H V +VGY
Sbjct: 275 ANLVKHGPLAIAINAAYMQTYIGGVSCPFICG---RHLDHGVLLVGY 318
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +LR + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A ++ Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
++HAV IVGY
Sbjct: 286 --VDHAVTIVGY 295
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + V +K Q F+ + ++KSY+ E R+ F+ ++D +++
Sbjct: 4 LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ EE++ +L + L+ +
Sbjct: 64 NS--KGSETVL-GLNNFADITNEEYRNTYLGTKFDASSLIGTQEEK-------------- 106
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
T KDWR G + V+NQ CG CW+FST + E H G L LS Q +
Sbjct: 107 --VFTTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
IDC+ N GC GG +++ +N ++ ES YP ++ C+ K+ + +G + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSEN-SGATLSSY 221
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
T SESS+ + + + PV A++A ++Q Y G I Y + S N++H V VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278
Query: 309 Y 309
Y
Sbjct: 279 Y 279
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 130/281 (46%), Gaps = 27/281 (9%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + ES +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S+ H D+ + I + V DWRE G +
Sbjct: 81 EEFLDLLKLQGV--PALPSNAVHFDNSED----------IDMEEKDAV--DWREEGAVTP 126
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
++Q CG+CWAFS V E KNGTL LS QE++DCA GN GC GG
Sbjct: 127 AKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ V ++ E YP + ++CK+ VK + D E + +A G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A+ A +Y G++ C S ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 21/281 (7%)
Query: 32 QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
Q F+ + ++ + YS SE R+ F+ ++D ++ N N + G+ F+D++
Sbjct: 31 QYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVD--NWNSKGDSQTVLGLNNFADIT 88
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EE++ +L VN H +N R + + T P DWR +
Sbjct: 89 NEEYRKTYLGTRVNAH-----------SYNGYDGREVLNVEDLQTN-PKSIDWRTKNAVT 136
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALL 210
+++Q CG+CW+FST + E HALK L LS Q ++DC+G N GC GG
Sbjct: 137 PIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAF 196
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D++ NK + + ES YP + + S G IK Y + SE S L + A HG
Sbjct: 197 DYIIKNKGI-DTESSYPYTAETGSTCLFNKSDIGATIKGY-VNITAGSEIS-LENGAQHG 253
Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A ++Q Y G I Y S ++H V +VGY
Sbjct: 254 PVSVAIDASHNSFQLYTSG-IYYEPKCSPTELDHGVLVVGY 293
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 389
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 140/324 (43%), Gaps = 57/324 (17%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
NL Q +LFS F+ +KK Y+ E RF+ F ++LDII ELN+ + +A YGIT+FS
Sbjct: 32 NLTQVKQLFSKFKAEHKKFYNFLEEQRRFEIFRQNLDIISELNQVEEG--TAEYGITQFS 89
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
D++ EEFK++ L S + + +H K I P DWR+ G
Sbjct: 90 DMTTEEFKSQILIPST-----YARNFTGSRYHGFQK---------ISQDAPTSYDWRDHG 135
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------AGNGNMGC 201
+ V+NQ T G CW FST E L L LS ++++DC G+ + G
Sbjct: 136 AVTPVKNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEPSTGHADCGV 195
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK------------------------- 236
GG D++ +N L E YP + + C
Sbjct: 196 FGGWPYLAFDYV-INAGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYP 254
Query: 237 -RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
R+ P KI+ + L E SI + GP+ A++A Q+Y G+ C
Sbjct: 255 CRQGQVPIAAKIEDW--KALSKDEDSIKQQLFEIGPLSVALDASYLQFYKKGISAPKFC- 311
Query: 295 GSLANINHAVQIVGY--DNYSRTW 316
S +NHAV + GY DN W
Sbjct: 312 -SKTTLNHAVLLTGYGIDNGVEFW 334
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 146/331 (44%), Gaps = 52/331 (15%)
Query: 8 LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
LF+++L+A AI P + Q + FS F+ ++ K Y S+ E
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63
Query: 53 HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
HD RFK F+ +L N+ SA +GIT+FSDL+ EF+ +L
Sbjct: 64 HDHRFKVFKANL---RRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110
Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
H K + +PT +P DWR+ G + V+NQ +CG+CW+FST
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163
Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
E H L G L LS Q+++DC + GC GG + ++ + L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYT-LKAGGLQLE 222
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
+YP KD C S + +++ L E I ++ HGP+ +NA Q
Sbjct: 223 KDYPYTGKDGKCHFD-KSKICAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 284 YLGGVIQYNCDG-SLANINHAVQIVGYDNYS 313
Y+GGV +C +H V +VGY ++
Sbjct: 280 YVGGV---SCPLICFKRQDHGVLLVGYGSHG 307
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 133/285 (46%), Gaps = 28/285 (9%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ ++LF S+ ++ K Y + I RF+ F +L I+E NK S G+ F
Sbjct: 40 SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK +++ + H + D + HV T P DWR
Sbjct: 97 ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFST+ T E ++ + G L LS QE++DC + + GC GG
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V YP K C +AT G K+K T +PS E+S L
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258
Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A V+ +Q Y GV C L +HAV VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F RY K Y S E +RF F+++LD+I NK S + + +F+DL+ +EF
Sbjct: 59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK IT T +P KDWRE GI+ V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
Q CG+CW FST E+ + G LS Q+++DCAG N GC GG +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV+++ + + + +E + + PV
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + +++Y GV N C + ++NHAV VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 155/308 (50%), Gaps = 30/308 (9%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F + L +LC + + + P +Q L+ + ++ +++++Y+ +E R +EK+L +I
Sbjct: 3 FYLCLASLC---LGLVAATPEFDQTLDSQWHQWKAQHRRTYAANEDGWRRATWEKNLKMI 59
Query: 68 EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
E N + + S + G+ +F D++ EEFK V+ + ++ N +KR
Sbjct: 60 EMHNLEYSAGKHSFQLGMNKFGDMTTEEFK----------QVM------NGYNSNGSQKR 103
Query: 127 SITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ + P +P DWRE G + V+NQ CG+CWAFS + E K L
Sbjct: 104 TKGSLYREPLLAQLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVS 163
Query: 185 LSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q ++DC+ GN GCSGG +++ N ++ E YP L +D CK +A +
Sbjct: 164 LSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK-NNGGIDTEQAYPYLGQDNECKYRAEC-S 221
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
G + + D +E +++ +A GP+ A++A ++Q+Y GV Y S + ++
Sbjct: 222 GANVTGFV-DIPSMNERALMKAVANVGPISVAIDAGNPSFQFYESGVY-YEPQCSSSQLD 279
Query: 302 HAVQIVGY 309
H V +VGY
Sbjct: 280 HGVLVVGY 287
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/264 (32%), Positives = 131/264 (49%), Gaps = 32/264 (12%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+E D RF+ F+ +L I+E N S + G+T F+DL+ EE+++ +L K VL
Sbjct: 69 AEKDQRFEIFKDNLRFIDEHNTKNLS---YKLGLTRFADLTNEEYRSMYLGAKPTKRVL- 124
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
K D + V G +P + DWR+ G + V++Q +CG+CWAFST+
Sbjct: 125 ---KTSDRYQARV-------GDALPDSV----DWRKEGAVADVKDQGSCGSCWAFSTIGA 170
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYP 227
E ++ + G L LS QE++DC + N GC+GG L+D+ + ++ E++YP
Sbjct: 171 VEGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEADYP 226
Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYL 285
D C + + V I SY D SE+S+ +A H P+ A+ A +Q Y
Sbjct: 227 YKAADGRCDQNRKNAKVVTIDSYE-DVPENSEASLKKALA-HQPISVAIEAGGRAFQLYS 284
Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
GV C L +H V VGY
Sbjct: 285 SGVFDGLCGTEL---DHGVVAVGY 305
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 146/305 (47%), Gaps = 32/305 (10%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+ L A C + + + P +Q L+ + ++ +++ Y +E R +EK++ +IE
Sbjct: 5 LVLAAFC---LGIASAAPKFDQNLDTQWYQWKATHRRLYGTNEEGWRRAVWEKNMKMIEL 61
Query: 70 LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N Q + F D++ EEF+ V + KH K R +
Sbjct: 62 HNGEYSQGKHGFTMAMNAFGDMTNEEFR--------QVMVCFRNQKH--------KNRKV 105
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
G + +P DWR+ G + V+NQ+ CG+CWAFS E K G L LS Q
Sbjct: 106 FRGPLL-LNLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC+GG ++ N L+ E+ YP + KD +CK K + +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMNNAFQYVKENG-GLDSEASYPYVAKDGSCKYKPEN----SV 219
Query: 248 KSYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
+ T +IP+ E ++ +AT GP+ AV+A ++Q+Y G I + D S N++H V
Sbjct: 220 ANDTGFVVIPAHEKELMKAVATVGPISVAVDASHSSFQFYKSG-IYFEQDCSSKNLDHGV 278
Query: 305 QIVGY 309
+VGY
Sbjct: 279 LVVGY 283
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y +E +RF F+++LD+I NK R S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK + +P KDWRE GI+ V+
Sbjct: 116 QRNKLGAAQNCSATLKGSHKLTE------------------AALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + +E + + PV
Sbjct: 218 KSNG-GLDTEEAYPYTGKDGTCKYSAENV-GVQVLD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVVKSFRLYKSGVYTDSHCGNTPMDVNHAVLAVGY 312
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 139/284 (48%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K Y S EHD R F+ ++ ++++ +A +G+T+FSDL+ EF
Sbjct: 49 FTVFKRRFGKVYASDEEHDYRLSVFKANM---RRAKQHQELDPAAVHGVTQFSDLTPTEF 105
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 149
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG+CW+FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D R + K+ +++ +L E I ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFVQTYIGGVSCPYICSKRL---DHGVLLVGY 307
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 147/309 (47%), Gaps = 41/309 (13%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
LF + L LC + + P E +L+ F+Q+YKK+Y + + RF F+++L
Sbjct: 283 LFTLELWCLC-----ARTTTPEPENARQLYEEFKQKYKKTYVNDDDEYRFSVFKENLLRA 337
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
+L Q +A YG+T+F DL+ +EF+ ++L K+ D ++ S
Sbjct: 338 HQLQTMEQG--TAEYGVTQFFDLTSQEFQIQYL-----------GFKYEDMQD--TEEMS 382
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+T + + DWR+ G +G V +Q CG+CWAFST+ E LK G L LS
Sbjct: 383 PSTRVVMDED---SFDWRDHGAVGPVLDQGKCGSCWAFSTIGNIEGQWFLKTGELLSLSE 439
Query: 188 QEVIDCAGNGNMGCSGG----DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
Q++IDC N + GC+GG + A+ + LE S+YP C
Sbjct: 440 QQLIDCD-NVDEGCNGGYPPKTYGAV-----IKMGGLELNSDYPYKALAEKCHMDRQ--- 490
Query: 244 GVKIKSYTCDTLI-PSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN-I 300
K+K Y D+++ P + + + GP+ +A+NA ++Y G++ +
Sbjct: 491 --KLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLPVASCFPRAL 548
Query: 301 NHAVQIVGY 309
NHAV VGY
Sbjct: 549 NHAVLTVGY 557
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 90/171 (52%), Gaps = 12/171 (7%)
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
DWR+ G +G V NQ CG+CWAFS V E LK+G L LSVQ+V+DC + + GC+
Sbjct: 44 DWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCD-HVDHGCN 102
Query: 203 GGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
GG + + VN++ L+ +++Y C + K ++Y ++I S++
Sbjct: 103 GGYPPQV--YRQVNQMGGLQLDADYSYKAAVGKCHTDRS-----KFRAYVNSSVILSQNE 155
Query: 262 IL--TDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGY 309
+ T GP+ + +NA T Q+Y G++ + +NHAV VGY
Sbjct: 156 QFQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPGQLNHAVLTVGY 206
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 141/280 (50%), Gaps = 28/280 (10%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ S+ + KSY+ E D RF+ F+ +L I+E +N +S + G+T+F+DL+ EE
Sbjct: 48 LYESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDE--QNSVPNQSYKLGLTKFADLTNEE 105
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+++ +L K K +S + +P DWR+ G++ V+
Sbjct: 106 YRSIYL-----------GTKSSGDRRKLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVK 154
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-- 212
+Q +CG+CWAFS V ES++A+ G L LS QE++DC + N GC GG L+D+
Sbjct: 155 DQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDGG----LMDYAF 210
Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
+N ++ E +YP ++ C + + VKI SY D + +E ++ +A H P
Sbjct: 211 EFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYE-DVPVNNEKALQKAVA-HQP 268
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V A+ A Q+Y G+ C + ++H V GY
Sbjct: 269 VSIAIEAGGRDLQHYKSGIFTGKCGTA---VDHGVVAAGY 305
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 141/285 (49%), Gaps = 31/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ S+ ++ KSY+ E + RF+ F+ +L I+E N + S + G+ F+D
Sbjct: 44 DEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDE--HNAEENLSYKVGLNRFAD 101
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE+++ +L + K +S + +P DWR G
Sbjct: 102 LTNEEYRSTYLGAKSKPKL--------------SKVKSDRYAPRVGDSLPESVDWRAKGA 147
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +++Q +CG+CWAFSTV E ++ + G L LS QE++DC + N GC GG L
Sbjct: 148 VAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGG----L 203
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ + +YP L +DA C + + V I SY D + +E ++ +
Sbjct: 204 MDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYE-DVPVNNEEALKKAV 262
Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ PV + +Q+Y G+ C +L +H V +VGY
Sbjct: 263 ASQ-PVSVGIEGGGRAFQFYDSGIFTGKCGTAL---DHGVNVVGY 303
>gi|118365724|ref|XP_001016082.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297849|gb|EAR95837.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 158/317 (49%), Gaps = 39/317 (12%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD------IRFKNF 60
+L I+ L+ LC LA + V +KL ++ + + ++ Y +EH+ + F+NF
Sbjct: 6 LLSIIMLMPLC-LAQNINV------EKLLAYNQWSSQNQRVY-LNEHEKLFRQMVFFENF 57
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS-VNKHVL--MSHHKHHD 117
+K I+E N + + S + +FSD+++EEF + L S + H++ +S H+
Sbjct: 58 QK----IQEHNSDPNNTYSVH--LNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHN 111
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+N+ + S + +T+ I DWR G + V+NQ CG+CW+FS ES + +
Sbjct: 112 DTNNNETQLS-SNSLTLADSI----DWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFI 166
Query: 178 KNGTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
+N L S Q+++DC G + GC+GG LD+ +KV + +YP +
Sbjct: 167 QNKALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDY--ASKVGITTLDKYPYVAVQK 224
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
C T NG K KS+ IP+ S+ L PV V+A TW Y G+ C
Sbjct: 225 NCNVTGTD-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTWGNYYSGIFN-GC 279
Query: 294 DGSLANINHAVQIVGYD 310
D + ++NHAV VGYD
Sbjct: 280 DQTHISLNHAVLAVGYD 296
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/319 (29%), Positives = 157/319 (49%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 2 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 61
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 62 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 105
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 106 ALKRPRKV---VNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 162
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
L+ LS Q ++ C + GC GG L W+ NK + YP K C
Sbjct: 163 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 221
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G KI + L E++I +A +GPV AV+A ++ Y GGV+ +C
Sbjct: 222 -NKSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 277
Query: 296 SLANINHAVQIVGYDNYSR 314
++H V +VGYD+ S+
Sbjct: 278 K--GLDHDVLLVGYDDTSK 294
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 147/314 (46%), Gaps = 33/314 (10%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDYM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGLMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSR 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG NC
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTYDGNC-- 280
Query: 296 SLANINHAVQIVGY 309
INHAV +GY
Sbjct: 281 -ADQINHAVTAIGY 293
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F RY K Y S E +RF F+++LD+I NK S + + +F+DL+ +EF
Sbjct: 59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK IT T +P KDWRE GI+ V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
Q CG+CW FST E+ + G LS Q+++DCAG N GC GG +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV+++ + + + +E + + PV
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + +++Y GV N C + ++NHAV VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 134/288 (46%), Gaps = 35/288 (12%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F R++K YS K E RF+ F+K+ +I EL KN Q SA YG T+FSD++ EF
Sbjct: 174 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQG--SAVYGFTKFSDMTTMEF 231
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
K L + + V + G+TI +P DWR+ G + +V+
Sbjct: 232 KQTMLPYQWEQPVYPMAEADFEKE-----------GVTISEDDLPDSFDWRDHGAVTQVK 280
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALLDWM 213
NQ CG+CWAFST E L L LS QE++DC + + GC+GG A + M
Sbjct: 281 NQGNCGSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDC-DSVDQGCNGGLPSNAYKEIM 339
Query: 214 DVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+ LEPE YP K C K A NG L E I + T
Sbjct: 340 RMGG--LEPEDAYPYDGKGETCHIVRKDIAVYING-------SVELPHDEVKIQKWLVTK 390
Query: 270 GPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
GP+ +NA T Q+Y GV+ + C+ + +NH V IVGY R
Sbjct: 391 GPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 436
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|41323856|gb|AAS00027.1| cathepsin L-like cysteine proteinase [Taenia solium]
Length = 339
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 144/297 (48%), Gaps = 25/297 (8%)
Query: 19 LAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
LA+ V+ S E++L ++ ++ ++ + YS E R F ++L I+ N+ +
Sbjct: 16 LAVVVETSALLTERELSRQWAGWKLQHGRVYSGKEEAYRRGVFARNLLYIKGQNRRFNAG 75
Query: 78 -ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
ES G+ +F+DL EF R L V ++ I +
Sbjct: 76 LESYSTGLNQFADLESSEFSERFLGTRPESRVAG-------------RRGRIWKALASAA 122
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-G 195
G+P DWR+ ++ +V+NQ CG+CWAFS+ E A K G L LS Q+++DC+
Sbjct: 123 GLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLK 182
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
NGN GC+GG +++ + +EPES YP D C+ + GV + D
Sbjct: 183 NGNDGCNGGYMSYAFKYLEEH--FIEPESAYPYRATDGPCRYNESL--GVGTVTDIGDIP 238
Query: 256 IPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
+E++++ +AT GP+ A++A L + +Y G+ + + C +NH V +GY
Sbjct: 239 EGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNHGVLAIGY 293
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 144/304 (47%), Gaps = 22/304 (7%)
Query: 12 ALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
A++ + + P + + PN +L+ F+ ++++Y ++E R + F +L IE
Sbjct: 18 AMVPMTNILRPDTILRFPNQVPFEKLWQDFKTVHERNYGETEEMQRKEVFRNNLKKIEMH 77
Query: 71 NK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
N + Q S R GI +F+D+ +EF + VN + + K DH H+H
Sbjct: 78 NYLHSQGKSSYRMGINQFADMEVKEFAS-----VVNGFRMNNRTKVRDHLHSHY------ 126
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
IP +P + DWR+ G + +++Q CG+CW+FST E H K G L LS Q
Sbjct: 127 ISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQN 186
Query: 190 VIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
+IDC+ + GN GC+GG ++ N + E YP D C+ K G
Sbjct: 187 LIDCSTSYGNNGCNGGVMDYAFQYIKDNDGD-DTEDSYPYEAADGPCRFKKEYV-GATDT 244
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQ 305
YT D E + +A GPV A++A ++Q Y GV + CD ++H V
Sbjct: 245 GYT-DLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECD--PEGLDHGVL 301
Query: 306 IVGY 309
+VGY
Sbjct: 302 VVGY 305
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 150/309 (48%), Gaps = 29/309 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
++ + +V L A+C + P +F+ + + KSYS E R+ + ++
Sbjct: 1 MRAITILVLLAAICVASTLATTHDP----LTGVFAEWMRDNSKSYSNEEFVFRWNVWREN 56
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
+IEE N+ S +++ + +F DL+ EF + + K + + H
Sbjct: 57 QQLIEEHNR---SNKTSFLAMNKFGDLTNAEF------NKLFKGLAFDYSFH-------A 100
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
K + + P G+ DWR+ G + V+NQ CG+CW+FST + E + LK G L+
Sbjct: 101 NKAAAEKAVPAP-GLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLT 159
Query: 184 LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q +IDC+G+ GN GC+GG +++ +N ++ E+ YP C+ +
Sbjct: 160 SLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI-INNKGIDTEASYPYQTAQYTCQYNPANS 218
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANI 300
G + SYT D E+++L +AT P A++A ++Q+Y GGV Y S +
Sbjct: 219 GG-SLTSYT-DVSSGDENALLNAVATE-PTSVAIDASHNSFQFYSGGVY-YESACSSTQL 274
Query: 301 NHAVQIVGY 309
+H V VG+
Sbjct: 275 DHGVLAVGW 283
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
Length = 334
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 147/306 (48%), Gaps = 28/306 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
+ A++ + FL + V P L+ L+ + ++ +Y KSYS E ++ +E+ L +I+
Sbjct: 1 MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60
Query: 69 ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N+ N + EF D ++EEF+ + SV H + KR
Sbjct: 61 LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
G +P + DWR+ G + VR Q C ACWAF+ E+ + G L+ LSV
Sbjct: 111 --AGSILPKFV----DWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSV 164
Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC GGD ++ ++ LE E+ YP KD C+ +P K
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCR---YNPKNSK 220
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
+ +L SE ++ +AT GP+ A ++A +++ Y GG+ + NC S + H
Sbjct: 221 AEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/313 (30%), Positives = 148/313 (47%), Gaps = 30/313 (9%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE +NK S + G+ EF+D++ +EF L ++ S+
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEF----LAKFTGLNIPNSYLSPSPMS 116
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
KK + + +P+ + DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 117 STEFKKINDLSDDYMPSNL----DWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 172
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 173 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQE 230
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
+ V+I SY ++P + L T PV IAA L Q+Y GG NC
Sbjct: 231 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTYDGNC--- 281
Query: 297 LANINHAVQIVGY 309
INHAV +GY
Sbjct: 282 ADRINHAVTAIGY 294
>gi|405966497|gb|EKC31775.1| Cathepsin L1 [Crassostrea gigas]
Length = 305
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 151/308 (49%), Gaps = 33/308 (10%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIE 68
+++ I L L P L+ + L+ +Q Y+K Y ++ + ++ +E +LD I
Sbjct: 4 LISYIYLAALIFSSLARVPELDTEWALY---KQEYRKQYLTADEETERRDIWEANLDYIN 60
Query: 69 ELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
+ N ++ S G+ EF+DLS EEF H+ + D +
Sbjct: 61 QHNDEFKRGEHSYTLGLNEFADLSHEEFL----------HLYGGGIRPRDSGSS-----D 105
Query: 128 ITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
T I + T G+P + DWR+ G +G V NQ CG+CWAF+ E K G L +LS
Sbjct: 106 PDTDIVVDTSGLPSEVDWRKEGWVGPVGNQFACGSCWAFTATGALEGQVRNKTGKLIVLS 165
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
VQ+++DC+ GN GC GG A ++ DV + E + YP + CK ++
Sbjct: 166 VQQMMDCSEKWGNHGCEGGLMDAAFKYIHDVGGI--ESNASYPYKPAEEKCKFNESAVV- 222
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANIN 301
K+K Y L SE S++ +AT GP+ AA++A ++Q Y GV NC S ++
Sbjct: 223 AKVKGYK--DLPKSEESLMVAVATVGPISAALDASHSSFQLYKSGVYDDPNC--SSGQVD 278
Query: 302 HAVQIVGY 309
H++ +VGY
Sbjct: 279 HSLVVVGY 286
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/285 (31%), Positives = 131/285 (45%), Gaps = 37/285 (12%)
Query: 37 FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F ++Y K YS E + R F K +++ P +A +G+T FSDLSEEEF
Sbjct: 61 FRMFMEKYGKEYSSREEYVHRLGIFAK--NMVRAAEHQALDP-TALHGVTPFSDLSEEEF 117
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ R V + H+K T + G+P DWRE G + +V+
Sbjct: 118 E-RMFTGVVGR--------------PHMKGGVAETAAALEVDGLPESFDWREKGAVTEVK 162
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM--------GCSGGDF 206
Q TCG+CWAFST E H + L LS Q+++DC ++ GC GG
Sbjct: 163 MQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLM 222
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + LE ES YP K CK K P+ V ++ + +E+ I ++
Sbjct: 223 TNAYKYL-IEAGGLEEESSYPYTGKHGECKFK---PDRVAVRVVNFTEVPINENQIAANL 278
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN--INHAVQIVGY 309
HGP+ +NA+ Q Y+GGV +C INH V +VGY
Sbjct: 279 VCHGPLAVGLNAIFMQTYIGGV---SCPLICPKRWINHGVLLVGY 320
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAIAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V++Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 142/301 (47%), Gaps = 33/301 (10%)
Query: 12 ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEEL 70
A I L + V S LE F SF+ ++ KSYS + E R F ++L IEE
Sbjct: 3 AFILASLLIVAVGAS---LENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEH 59
Query: 71 NKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
N + S + +F+DL+ +EFK HS K L N V +
Sbjct: 60 NALYAAGLVSYNKSVNQFTDLTIDEFKAYLTLHS--KPTL-----------NTVPY--VR 104
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
TG+ +PT + DWR G + V++Q CG+CWAFS V + E + G L LS Q+
Sbjct: 105 TGLQVPTTL----DWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQ 160
Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+IDC N N GC GG + V + L ES YP +D C R + S K+
Sbjct: 161 LIDCTTNVNDGCDGGYLEETFPY--VQQTGLVSESSYPYTGRDGNC-RISESDVVTKVSK 217
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVG 308
Y L+ E+ +L + + GPV A++A Y GV + + C SL ++NH V +VG
Sbjct: 218 Y---VLLGGEADLLEAVGSVGPVSVAMDATYIYSYASGVYESSLC--SLYSLNHGVLVVG 272
Query: 309 Y 309
Y
Sbjct: 273 Y 273
>gi|449512065|ref|XP_002196301.2| PREDICTED: cathepsin O-like, partial [Taeniopygia guttata]
Length = 193
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 86/152 (56%), Gaps = 3/152 (1%)
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV 218
CG CWAFS V ES +A+K TL LSVQ+VIDC+ N N GC+GG + L W++ KV
Sbjct: 1 CGGCWAFSVVGGIESAYAIKRNTLEELSVQQVIDCSYN-NYGCNGGSTVSALSWLNQTKV 59
Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
L +SEY + C S GV I + E ++ + + GP+ V+A
Sbjct: 60 KLVRDSEYTFKAQTGLCHYFERSDFGVSITGFASYDFSGQEEEMMRMLVSWGPLAVTVDA 119
Query: 279 LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
++WQ YLGG+IQY+C A NHAV I G+D
Sbjct: 120 VSWQDYLGGIIQYHCSSGRA--NHAVLITGFD 149
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
EH+ RF F +L ++ N R G+ F+DL+ EEF+ L V +
Sbjct: 68 GEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 127
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ ++ H+ V++ +P DWRE G + V+NQ CG+CWAFS V T
Sbjct: 128 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 172
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
ES++ L G + LS QE+++C+ NG N GC+GG D++ + ++ E +YP
Sbjct: 173 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFI-IKNGGIDTEDDYPYK 231
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
D C + V I + D E S+ +A H PV A+ A +Q Y G
Sbjct: 232 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 289
Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
V C SL +H V VGY DN W
Sbjct: 290 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 317
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 151/314 (48%), Gaps = 32/314 (10%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE +NK S + G+ EF+D++ +EF L ++ S+
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEF----LAKFTGLNIPNSYLSPSPMS 116
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
KK + + +P+ + DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 117 STEFKKINDLSDDDMPSNL----DWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIAT 172
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 173 GKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQE 230
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
+ V+I SY ++P + L T PV IAA L Q+Y GG DGS
Sbjct: 231 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 280
Query: 297 LAN-INHAVQIVGY 309
A+ INHAV +GY
Sbjct: 281 CADRINHAVTAIGY 294
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 146/303 (48%), Gaps = 32/303 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + + + P L+Q L+ + ++ + + Y +E R +EK+L +IE N
Sbjct: 7 LAALC---LGIVSALPKLDQTLDAQWDQWKAAHGRLYGLNEEGWRRAVWEKNLRMIELHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
Q S G+ F D++ EEF+ +M+ +H H + + +
Sbjct: 64 GEYSQGRHSFTLGMNHFGDMTNEEFRQ-----------VMNGFQHQKHKTGKMYQEPLL- 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ +P + DWRE G + +V+NQ CG+CWAFS + E K G L LS Q +
Sbjct: 112 -LQLPKSV----DWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ GN GC+GG ++ NK LE E YP + KD CK K + +
Sbjct: 167 VDCSRPQGNQGCNGGLMDFAFQYVKDNK-GLEAEKSYPYVGKDGECKYKPE----LSAAN 221
Query: 250 YTCDTLIPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
T +P ++ +AT GP+ A++A ++Q+Y G I Y+ S ++NH V +
Sbjct: 222 DTGFVDVPQREKVVQKALATVGPLSVAIDAGLQSFQFYKEG-IYYDPGCSSRDLNHGVLL 280
Query: 307 VGY 309
VGY
Sbjct: 281 VGY 283
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 141/284 (49%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K+Y S EH RF F+ +L ++++ SA +G+T+FSD++ +EF
Sbjct: 54 FTLFKKRFGKTYASDEEHHYRFSVFKANL---RRAMRHQKLDPSAVHGVTQFSDMTPDEF 110
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ L VN+ + + K I +PT +P DWRE G + V+
Sbjct: 111 SQKFL--GVNRRLRFP---------SDANKAPI-----LPTEDLPSDFDWREHGAVTPVK 154
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E + L G L LS Q+++DC + + GCSGG
Sbjct: 155 NQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLM 214
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E +YP D A + + K+ +++ +L E I ++
Sbjct: 215 NSAFEYT-LKAGGLMREEDYPYTGTDKATCKFDNTKVAAKVANFSVVSL--DEEQIAANL 271
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 272 VKNGPLAVAINAVFMQTYVGGVSCPYICSKQL---DHGVLLVGY 312
>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 135/282 (47%), Gaps = 39/282 (13%)
Query: 36 LFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+S+F+ Y K Y+ + + R + F ++L II+ +N +GIT+F DL++EE
Sbjct: 42 LWSTFKNSYNKKYADPDFEQYRIEVFTENLKIIDSNCQN--------FGITKFMDLTQEE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
FK +L K++ I + + ++ DW G + V+
Sbjct: 94 FKQTYLTLKTKKYI-----------------EEIPETVFNDSNGDIEIDWTMKGAVTPVK 136
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CW+FST E H L + L LS Q +IDC+ NGN GC+GG D++
Sbjct: 137 DQGKCGSCWSFSTTGAVEGAHFLSSNELVSLSEQYLIDCSKNGNEGCNGGLMDTAFDFIA 196
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N + E+ YP D CK T P KI SY I S + +L+ + P+
Sbjct: 197 QNGI--PTENAYPYKALDGTCKM-TTGP--YKISSY---QNIISCNDLLSKLQKQ-PIAI 247
Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRTW 316
AV+A +Q+Y G+ C N++H V +VGY + + W
Sbjct: 248 AVDANNFQFYTKGIFS-KCG---KNLDHGVLLVGYSSKDKFW 285
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 134/277 (48%), Gaps = 27/277 (9%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEF 95
F + +++ ++YS E R++ F++++D I + N S ES G+T+F+DL+ EE+
Sbjct: 33 FIGWMRKHDRAYSHEEFTDRYQAFKENMDFIHKWN----SQESDTVLGLTKFADLTNEEY 88
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
K +L VN + N +K T P I DWRE G + +V++
Sbjct: 89 KKHYLGIKVNVKKNL----------NAAQKGLKFFKFTGPDSI----DWREKGAVSQVKD 134
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW+FST E H +K+G + LS Q ++DC+G GN GC GG +++
Sbjct: 135 QGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI- 193
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
++ + ES YP CK S NG I Y + E LT PV
Sbjct: 194 IDNGGIATESSYPYTAAQGRCKF-TKSMNGANIIGYK--EIPQGEEDSLTAALAKQPVSV 250
Query: 275 AVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +++Q Y GV S A ++H V VGY
Sbjct: 251 AIDASHMSFQLYSSGVYDEPACSSEA-LDHGVLAVGY 286
>gi|391328516|ref|XP_003738734.1| PREDICTED: cathepsin O-like [Metaseiulus occidentalis]
Length = 247
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 101/184 (54%), Gaps = 10/184 (5%)
Query: 131 GITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSV 187
G+ I T GIP D+R + V+ Q CGACWAF+ +E E + L+ S SV
Sbjct: 25 GLEIATLGIPKVVDYRNVSSV--VKEQGACGACWAFAPLEAVELLSTLQGRAPSRASFSV 82
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEY-PLLLKDAACKRKATSPNGVK 246
Q VIDC+ + + GCSGGD C +D++ +K E+ Y P C+++A + +
Sbjct: 83 QHVIDCS-DISYGCSGGDICDAVDYLQTSKYHFVAEAAYFPYTEDKLECRKEAKYTSDIS 141
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
I C+ E +L +A GPVIA V+A W+ YLGG+I++NCD NHAV I
Sbjct: 142 ITRSWCENYAGREGDLLRLVA-KGPVIATVDATVWRDYLGGIIRFNCDA--GEKNHAVVI 198
Query: 307 VGYD 310
VGYD
Sbjct: 199 VGYD 202
>gi|29789900|gb|AAF21457.2|U56958_1 cysteine proteinase [Paragonimus westermani]
Length = 272
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 114/234 (48%), Gaps = 26/234 (11%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+ARYG+T+FSDL+ EEF ++L VN N KR TG+
Sbjct: 13 TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQVKRVRPTGLK---AA 54
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P + DWR G + V NQ +CG+CWAFST E +K G L LS Q+++DC +
Sbjct: 55 PERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD 114
Query: 199 MGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
GC+GG + L+ M + LE + +YP C + + K L P
Sbjct: 115 -GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALXP 168
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
SE +A HGP+ +NA+T QYY G+I + S ++NHAV VGYD
Sbjct: 169 SEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYXXCSPVDLNHAVLTVGYD 222
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 147/316 (46%), Gaps = 39/316 (12%)
Query: 6 NVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
N L+ V+L + C + ++V+ L+ E + Y K Y + E + R + F
Sbjct: 5 NQLYHVSLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFT 64
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++L IE N N + + + GI +F+DL+ EEF + S +K H +
Sbjct: 65 ENLKYIEASN-NAGNKKPYKLGINQFADLTNEEF-------------IASRNKFKGHMCS 110
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ + TT T +P DWR+ G + V+NQ CG CWAFS + E +H + G
Sbjct: 111 SIIR--TTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGK 168
Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAAC 235
L LS QE++DC NG + GC GG L+D D K +++ E+ YP D C
Sbjct: 169 LVSLSEQELVDCDTNGVDQGCEGG----LMD--DAFKFIIQNNGISTEAGYPYQGVDGTC 222
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
K S + I Y D +E+++ +A P+ A++A +Q+Y GV +C
Sbjct: 223 KANEASTSAATITGYE-DVPANNENALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 280
Query: 294 DGSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 281 GTEL---DHGVTAVGY 293
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 131/282 (46%), Gaps = 30/282 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E+ + + + + YK + KS+ R+K F+ ++ IE NK +S + I EF+DL
Sbjct: 37 ERHEDWMAQYGRVYKDAGEKSK---RYKIFKDNVARIESFNKAMN--KSYKLSINEFADL 91
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EEF R R+ H+ + + H +P DWR+ G +
Sbjct: 92 TNEEF--RASRNRFKAHICSTEATSFKYEH--------------VXAVPSTVDWRKKGAV 135
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCAL 209
+++Q CG+CWAFS V E + L G L LS QE++DC +G + GCSGG
Sbjct: 136 TPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDA 195
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+++ N L E+ YP D C RK + KI Y D +E ++ +A H
Sbjct: 196 FKFIEQNH-GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-H 252
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ A++A +Q+Y GV C L +H V VGY
Sbjct: 253 QPIAVAIDAGGFEFQFYSSGVFTGQCGTEL---DHGVSAVGY 291
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 122/241 (50%), Gaps = 24/241 (9%)
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
Q +S R G+T+F+D+ EE+K V++ L H N R +T +
Sbjct: 73 QGLKSYRLGMTQFADMENEEYK-----RLVSQGCL--------HSFNSSLPRRGSTFFRL 119
Query: 135 PTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
P G +P DWR+ G + V+NQ CG+CWAFS + E H K G L LS Q+++D
Sbjct: 120 PKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAFSATGSLEGQHFRKTGKLVSLSKQQLVD 179
Query: 193 CAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
C+G GN GC+GG + ++ N + + E YP +D C+ S G Y
Sbjct: 180 CSGEFGNEGCNGGLMDSAFQYIQANGGI-DTEESYPYEAEDGKCRYNPKS-TGATCTGYV 237
Query: 252 CDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
D +E ++ +AT GP+ A++A ++Q+Y GV + +C ++ ++HAV VG
Sbjct: 238 -DVQPANEETLKEAVATIGPISVAIDAFHPSFQFYESGVYDEPDCSSTM--LDHAVLAVG 294
Query: 309 Y 309
Y
Sbjct: 295 Y 295
>gi|42516556|gb|AAS17989.1| cysteine proteinase CP2 [Paragonimus westermani]
Length = 272
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 115/234 (49%), Gaps = 26/234 (11%)
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
+ARYG+T+FSDL+ EEF ++L VN N KR TG+
Sbjct: 13 TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQVKRVRPTGLK---AA 54
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P + DWR G + V NQ +CG+CWAFST E +K G L LS Q+++DC +
Sbjct: 55 PERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD 114
Query: 199 MGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
GC+GG + L+ M + LE + +YP C + + K L P
Sbjct: 115 -GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALGP 168
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
SE +A HGP+ +NA+T QYY G+I + + S ++NHAV VGYD
Sbjct: 169 SEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEECSPVDLNHAVLTVGYD 222
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 139/309 (44%), Gaps = 36/309 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L+ + FLA V E + RY K Y E + RF+ F+++++ IE
Sbjct: 30 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 89
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N + + + I +F+DL+ EEF R+ H+ S + + +V
Sbjct: 90 AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 140
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + +++Q CG CWAFS V E +HAL +G L LS Q
Sbjct: 141 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 193
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
E++DC G + GC GG L+D D K V L E+ YP D C +
Sbjct: 194 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNANEAAN 247
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
+ V I Y D +E ++ +A PV A++A +Q+Y GV +C L
Sbjct: 248 DVVTITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 302
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 303 DHGVTAVGY 311
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 128/286 (44%), Gaps = 31/286 (10%)
Query: 32 QKLELFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
Q ++ + R+ K+ S + EHD RF+ F +L ++ N R R GI F+D
Sbjct: 47 QVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNA-RAGARGYRLGINRFAD 105
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EF+ +L + + H+ V+ +P DWR+ G
Sbjct: 106 LTNAEFRAAYLSAGARNGTATAATGER-YRHDGVEA------------LPEFVDWRQKGA 152
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGG---D 205
+ V+NQ CG+CWAFS V E ++ + G L LS QE++DC+ NG N GC GG D
Sbjct: 153 VAPVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDD 212
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
A + V ++ + +YP +D C S + V I + + + ++ L
Sbjct: 213 AFAFI----VGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGF--EGVPRNDEKSLQK 266
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A +Q Y GV C SL +H V VGY
Sbjct: 267 AVAHQPVAVAIEAGGREFQLYQSGVFTGRCGTSL---DHGVVAVGY 309
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L V + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 138/299 (46%), Gaps = 36/299 (12%)
Query: 21 IPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPE 78
++V+ L+ + E + +Y K Y S E + RFK F ++++ IE NK + +
Sbjct: 21 FAIQVTSRTLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNN-K 79
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
G+ +F+DL+ +EF + S +K H + + R+ T + I
Sbjct: 80 LYTLGVNQFADLTNDEFTS-------------SRNKFKGHMCSSIT-RTSTFKYENASAI 125
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG- 197
P DWR+ G + V+NQ CG CWAFS V E +H L G L LS QE++DC G
Sbjct: 126 PSSVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 185
Query: 198 NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GC GG L+D D K + L E+ YP D C S N V I Y
Sbjct: 186 DQGCEGG----LMD--DAFKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYE- 238
Query: 253 DTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
D +E ++ +A P+ A++A +Q+Y GV +C L +H V VGY
Sbjct: 239 DVPTNNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHGVTAVGY 293
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 133/283 (46%), Gaps = 38/283 (13%)
Query: 37 FSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESAR------YGITEFSD 89
F F+ ++K Y E + R F F +L I R + E+AR G+ +F+D
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIA-----RHNAEAARGLHTHTVGVNQFAD 74
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE++ +LR + L+ + + P V DWR+ G
Sbjct: 75 LTNEEYRQLYLRPYPTE--LLGRERQE-------------VWLDGPNAGSV--DWRQKGA 117
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCA 208
+ ++NQ CG+CW+FST + E HA+ G L LS Q+++DC+G+ GN GC+GG
Sbjct: 118 VTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDN 177
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
++ ++ L+ E +YP +D C + S + V I Y D +E + +
Sbjct: 178 AFKYI-ISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYK-DVPQNNEDQLAAAV-E 234
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GPV A+ A ++Q Y GV C N++H V +VGY
Sbjct: 235 KGPVSVAIEADQQSFQMYSSGVFSGPCG---TNLDHGVLVVGY 274
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 128/258 (49%), Gaps = 33/258 (12%)
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
K FE ++ ++ K +A+YG T FSDLSEEEF+ + + K + ++
Sbjct: 1 MKIFESNMRKAAKMQKMDSG--TAQYGPTIFSDLSEEEFRKQKMMPGWGKPL----YEMK 54
Query: 117 DHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
D IP G IP DWR+ G++ V+NQ +CG+CWAFST E +
Sbjct: 55 DAE--------------IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQY 100
Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAA 234
A+K G L LS QE++DC + GC GG + + K+ LE ES+YP D+
Sbjct: 101 AIKTGKLVSLSEQELVDCD-TIDKGCEGG--LPSNAYKQIEKLGGLESESDYPYKGADSK 157
Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QY 291
CK VK+ + + E I +A +GP+ +NA Q+Y+GG+ +
Sbjct: 158 CKFNKAE---VKVTINSSVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKI 214
Query: 292 NCDGSLANINHAVQIVGY 309
C+ S ++NH V IVGY
Sbjct: 215 FCNPS--SLNHGVLIVGY 230
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENIKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 151/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 143/308 (46%), Gaps = 58/308 (18%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
LA+P K+ ++LFSS+ ++ K Y E + R++ F+++L I E N+ S
Sbjct: 38 LALPYKL--------VDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRRNGS- 88
Query: 78 ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT- 136
G+ +F+D++ EEFK+ +L + TG+ P
Sbjct: 89 --YWLGLNQFADVAHEEFKSTYL--------------------------GLKTGMDGPAR 120
Query: 137 -----------GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+P DWR+ G + V+NQ CG+CWAFSTV E ++ + G L L
Sbjct: 121 APTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIATGKLESL 180
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
S QE++DC + GC GG F + + + + +YP L+++ CK K V
Sbjct: 181 SEQELMDCDTTFDHGCGGG-FMDFAFAYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKVV 239
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHA 303
I Y D SE S+L +A H P+ + A + +Q+Y GV + +C ++HA
Sbjct: 240 TISGYE-DVPENSEVSLLKALA-HQPISVGIAAGSKDFQFYKRGVFEGSCG---TELDHA 294
Query: 304 VQIVGYDN 311
+ VGY +
Sbjct: 295 LTAVGYGS 302
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 137/280 (48%), Gaps = 30/280 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F S++ + SY+ E R + +LD IE+ N S + A + +F+DL+ EF
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLA---VNKFADLTYPEF 78
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
++L + N K + +T + +P DWR AGI+ +++
Sbjct: 79 AAKYLGLRFDAT-------------NATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKD 125
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMD 214
Q CG+CW+FST + E HA K G L LS Q ++DC + GN GC+GG ++
Sbjct: 126 QGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYII 185
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N + + ES YP +D C+ + + G + SY D SES + +AT GP+
Sbjct: 186 SNNGI-DTESSYPYTAQDGTCQFNSANV-GATVASYQ-DIASGSESDLQNAVATVGPISV 242
Query: 275 AVNAL--TWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
A++A ++Q+Y GV YN C S + ++H V VGY
Sbjct: 243 AIDASQPSFQFYSSGV--YNEPAC--SSSQLDHGVLAVGY 278
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 150/308 (48%), Gaps = 33/308 (10%)
Query: 11 VALIALCFLAIPVK-VSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+AL++ FL+I +S+ + + E++ + ++ K+Y+ E + RF+ F+++L I+
Sbjct: 8 LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFID 67
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ N ++ + G+ F+DL+ EE++ +L R + V+ + + N++ +
Sbjct: 68 DHNSENRT---YKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDR- 123
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+P DWR G + V+NQ +CG+CWAFST+ E ++ + G L LS
Sbjct: 124 -----------LPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLS 172
Query: 187 VQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
QE++ C N GC+GG L+D+ ++ L+ E +YP D C +
Sbjct: 173 EQELVSCDKKYNSGCNGG----LMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAK 228
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
V I +Y D E S+ +A H PV A+ A L Q Y GV C +L +
Sbjct: 229 VVSIDAYE-DVPANDEESLKKAVA-HQPVSVAIEASGLALQLYQSGVFTGKCGSAL---D 283
Query: 302 HAVQIVGY 309
H V VGY
Sbjct: 284 HGVVAVGY 291
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 149/324 (45%), Gaps = 42/324 (12%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNL---EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEK 62
++ +V L+A LAI N + +LF+ F++++ K Y +K D R++ F++
Sbjct: 4 LILVVLLVASFILAIEAAKGPFNALPESEMQQLFTQFRRKHVKLYGTKQVQDRRYQIFKQ 63
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHH 119
+ +E E G+T FSDL+ +EFK+ L S L+S + + +
Sbjct: 64 N---VERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTPKQARELLSGMRQYPAN 120
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
K+ + P + DWRE + V++Q CG+CW FST E M+A K
Sbjct: 121 AKLTMKQV--------SDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGNVEGMYAAKT 172
Query: 180 GTLSLLSVQEVIDCAGN---------GNMGCSGGDFCALLDWMDVNKVV----LEPESEY 226
G L LS Q+++DC N N GC+GG L W ++ L E Y
Sbjct: 173 GKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGG-----LMWSSFEHIIKTGGLVTEESY 227
Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLG 286
P D C R S VKI ++T + +E + +A +GP+ A+NA QYY
Sbjct: 228 PYEAVDNRC-RFNVSNAVVKISNWT--FVSSNEDEMAAWLANNGPIAIAINADYLQYYRK 284
Query: 287 GVIQ-YNCDGSLANINHAVQIVGY 309
G++ CD +NH V IVGY
Sbjct: 285 GILNPSRCDPE--ELNHGVLIVGY 306
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 143/323 (44%), Gaps = 46/323 (14%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNL------EQKLELFSSFQQRYKKSYSKSEHD 54
M V +I + A + + NL E+ + + + + YK + KS+
Sbjct: 1 MASVNQYQYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSK-- 58
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
R+K F+ ++ IE NK +S + I EF+DL+ EEF R R+ H+ +
Sbjct: 59 -RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RASRNRFKAHICSTEAT 113
Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+ H +P DWR+ G + +++Q CG+CWAFS V E +
Sbjct: 114 SFKYEH--------------VAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGI 159
Query: 175 HALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPL 228
L G L LS QE++DC +G + GC+GG L+D D K + L E+ YP
Sbjct: 160 TQLSTGKLISLSEQELVDCDTSGEDQGCNGG----LMD--DAFKFIEQNHGLATEANYPY 213
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
D C RK + KI Y D +E ++ +A H P+ A++A +Q+Y
Sbjct: 214 AGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDAGGFEFQFYSS 271
Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
GV C L +H V VGY
Sbjct: 272 GVFTGQCGTEL---DHGVAAVGY 291
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 139/315 (44%), Gaps = 30/315 (9%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSKS-EHDIRF 57
M V +I + A + + NL + E + +Y + Y + E R+
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRY 60
Query: 58 KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
K F+ ++ IE NK +S + I EF+DL+ EEF R R+ H+ +
Sbjct: 61 KIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RASRNRFKAHICSTEATSFK 116
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + T +P DWR+ G + +++Q CG+CWAFS V E + L
Sbjct: 117 YEN--------------VTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQL 162
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
G L LS QE++DC +G + GCSGG +++ N L E+ YP D C
Sbjct: 163 STGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH-GLTTEANYPYAGTDGTCN 221
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
RK + KI Y D +E ++ +A H P+ A++A +Q+Y GV C
Sbjct: 222 RKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDAGGSEFQFYSSGVFTGQCG 279
Query: 295 GSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 280 TEL---DHGVSAVGY 291
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 134/284 (47%), Gaps = 29/284 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFS 88
E+ + ++ + ++ K Y+ E + RF+ F+ +L+ IEE N NR + + G+ FS
Sbjct: 46 EEVMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNR----TYKVGLNRFS 101
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EE+++++L ++ +M+ + +P DWR+ G
Sbjct: 102 DLSNEEYRSKYLGTKIDPSRMMARPSRRYSPR-------------VADNLPESVDWRKEG 148
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ +V+NQ C CWAFS + E ++ + G L+ LS QE++DC N GCSGG
Sbjct: 149 AVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDY 208
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIA 267
+++ +N ++ E +YP D C + + V I Y +P+ + L
Sbjct: 209 AFEFI-INNGGIDTEEDYPFQGADGICDQYKINARAVTIDGY---ERVPAYDELALKKAV 264
Query: 268 THGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ PV A+ A +Q Y G+ C S I+H V VGY
Sbjct: 265 ANQPVSVAIEAYGKEFQLYESGIFTGTCGTS---IDHGVTAVGY 305
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 145/315 (46%), Gaps = 29/315 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKK-------SYSKSEHDIR 56
K +AL+AL FL+I + P E+ L S Y+K + E + R
Sbjct: 2 AKPKFIALALVALSFLSIAQSI--PFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRR 59
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
F F++++ I E N+ + +P + + +F D++ +EF++++ + HH+
Sbjct: 60 FNVFKENVKFIHEFNQKKDAP--YKLALNKFGDMTNQEFRSKYAGSKIQ------HHRSQ 111
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+ ++P DWR G + V++Q CG+CWAFST+ + E ++
Sbjct: 112 RGIQKNTGSFMYENVGSLPA---ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQ 168
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
+K G L LS QE++DC + N GC+GG +++ N + E YP +D C
Sbjct: 169 IKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKNGITT--EDSYPYAEQDGTCA 226
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
+ V I + D +E++++ +A P+ ++ A +Q+Y GV C
Sbjct: 227 SNLLNSPVVSIDGHQ-DVPANNENALMQAVANQ-PISVSIEASGYGFQFYSEGVFTGRCG 284
Query: 295 GSLANINHAVQIVGY 309
L +H V IVGY
Sbjct: 285 TEL---DHGVAIVGY 296
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 143/297 (48%), Gaps = 34/297 (11%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V ++P + + FS F++++ K Y S EHD RF F+ +L ++++ SA
Sbjct: 37 VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
+G+T+FSDL+ EF+ +HL + S K + K + I +P
Sbjct: 94 HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
DWR+ G + V+NQ +CG+CW+FS E + L G L LS Q+++DC
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
A + + GC+GG + ++ + L E +YP KD + S + +++
Sbjct: 199 ADSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257
Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ E I ++ +GP+ A+NA Q Y+GGV Y C +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 141/312 (45%), Gaps = 29/312 (9%)
Query: 5 KNVLF---IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNF 60
KN L+ + L + FLA V E + + RY K Y E + RF+ F
Sbjct: 4 KNQLYHISLALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIF 63
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++++ IE N +S + I +F+DL+ EEF R+ H+ S + +
Sbjct: 64 KENVNYIETFNS--ADNKSYKLDINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFKY 119
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+V T IP DWR+ G + +++Q CG CWAFS V E +HAL G
Sbjct: 120 ENV------------TVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAG 167
Query: 181 TLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QEV+DC G + GC+GG ++ N L E YP D C KA
Sbjct: 168 KLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNH-GLNTEPNYPYKAADGKCNAKA 226
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
+ + I Y D + +E ++ +A PV A++A +Q+Y GV +C L
Sbjct: 227 AANHAATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL 284
Query: 298 ANINHAVQIVGY 309
+H V VGY
Sbjct: 285 ---DHGVTAVGY 293
>gi|118397782|ref|XP_001031222.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285547|gb|EAR83559.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 331
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 147/312 (47%), Gaps = 31/312 (9%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
N ++A+I F+ + +L + L+ ++ F + Y + Y +++E D R F ++
Sbjct: 2 NTKLLLAIIFSAFIC-SAYADQVSLVEALQAYNKFTRNYPRIYLNEAESDYRLAIFLENY 60
Query: 65 DIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
I++ N N PE+ + G+ FSD++++EF + L+ N ++L + N+V
Sbjct: 61 QKIQDHNNN---PENTYQIGVNRFSDMTQQEFSQKILQ---NPNIL-------SNGKNYV 107
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+K+ + P DWR G++ V+NQ CG+CWAFS ES +A+ N L
Sbjct: 108 QKQQASVNDVQPA---TSIDWRTKGVVTPVKNQGECGSCWAFSATAAMESYNAIHNKVLL 164
Query: 184 LLSVQEVIDCAGNGN-----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
S QE +DC N GC GG + + + V + E+EYP + +C
Sbjct: 165 RFSEQEFVDCTTEKNGGFYSFGCEGGVPGEAIRYASLYGV--KTEAEYPYVGIQGSCNTT 222
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
++ K SY +P + L + PV +++A Y+ GV Y+C
Sbjct: 223 NSTTTNFKPVSYYS---LPETTEALKVALNNAPVSVSIDATLLGDYVSGV--YDCKNQTI 277
Query: 299 NINHAVQIVGYD 310
INHAV VGYD
Sbjct: 278 EINHAVLAVGYD 289
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 148/310 (47%), Gaps = 32/310 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+F V ++ALC A +S P+L+ +L E ++ ++ + K Y + E R +EK+L
Sbjct: 1 MFPVVVLALCVTAA---LSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRMVWEKNLKK 57
Query: 67 IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE N ++ + G+ F D++ EEF R +N + L S +
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEF-----RQIMNGYKLKS-------------Q 99
Query: 126 RSITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
R + + + P DWR+ G + V++Q CG+CWAFST E H K GTL
Sbjct: 100 RKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTL 159
Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS Q ++DC+ GN GC+GG ++ N L+ E YP L D S
Sbjct: 160 VSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNG-GLDSEESYPYLGTDEGPCHYDPS 218
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
N + D SE +++ +A+ GPV A++A ++Q+Y G I Y+ + S
Sbjct: 219 YNSANDTGFV-DVPSGSERALMKAVASVGPVSVAIDAGHESFQFYHSG-IYYDKECSSEE 276
Query: 300 INHAVQIVGY 309
++H V +VGY
Sbjct: 277 LDHGVLVVGY 286
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 139/293 (47%), Gaps = 31/293 (10%)
Query: 24 KVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
++ K + + ++ ++ ++ KSY+ E + RF+ F+ +L IEE N NR + +
Sbjct: 41 RLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNR----TYK 96
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
G+ F+DL+ EE+++R+L + + D + S G +P +
Sbjct: 97 VGLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRY-------SFRAGEDLPESV--- 146
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
DWRE G + V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC
Sbjct: 147 -DWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGC 205
Query: 202 SGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
+GG L+D+ +N ++ E +YP D C + V I Y D
Sbjct: 206 NGG----LMDYAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYE-DVPQND 260
Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E S+ +A PV A+ A +Q Y GV C L +H V VGY
Sbjct: 261 ERSLKKAVANQ-PVSVAIEAGGRAFQLYQSGVFTGQCGTQL---DHGVVAVGY 309
>gi|354504282|ref|XP_003514206.1| PREDICTED: cathepsin J-like [Cricetulus griseus]
gi|344250851|gb|EGW06955.1| Cathepsin J [Cricetulus griseus]
Length = 334
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 148/305 (48%), Gaps = 32/305 (10%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L LCF V ++ P L+ L+ + ++++Y+KSYS+ E + +EK++ +I
Sbjct: 5 VFLTILCF---GVALAAPVLDSSLDAEWQQWKKKYEKSYSQEEEVWKRAVWEKNMQMIRT 61
Query: 70 LN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N ++ Q + F D++ EE++T V V K +S+
Sbjct: 62 HNGEDGQGKHGFTVEMNAFGDMTGEEYRTFLTDIPVPAAV---------------KVKSV 106
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
+ +P +DW + G + VR Q CG+CWAF+ + E + G L+ LSVQ
Sbjct: 107 QNPLL--NDLPKSEDWTKKGFVTPVRKQGQCGSCWAFAAIGAIEGQMFWRTGNLTTLSVQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC GD + ++ ++ LE E YP KD C+ +PN +
Sbjct: 165 NLLDCSKPQGNNGCVRGDAYSAYQYV-LHNGGLEAEETYPYEAKDGPCRY---NPNNSRA 220
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
+L E +L ++ GPV AA++A ++++Y GG+ + NC L NHAV
Sbjct: 221 YITEVVSLPAHEDYLLVAVSMIGPVAAAIDASHDSFRFYRGGIYHEPNCSSYL--TNHAV 278
Query: 305 QIVGY 309
+VGY
Sbjct: 279 LVVGY 283
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/311 (29%), Positives = 140/311 (45%), Gaps = 31/311 (9%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
FIV +ALC L + + +K EL+ ++ + + S E RF F
Sbjct: 4 FIV--LALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLEEKAKRFNVF 61
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ ++ I E NK S + + +F D++ EEF+ + ++ HH+
Sbjct: 62 KHNVKHIHETNKKENS---YKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGER 112
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
K T+PT + DWR+ G + V+NQ CG+CWAFSTV E ++ ++
Sbjct: 113 QTTKSFMYANVDTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L+ LS QE++DC N N GC+GG +++ K L E YP D C
Sbjct: 169 KLTSLSEQELVDCDTNKNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
+ V I + D SE ++ +A H PV A++A +Q+Y GV C L
Sbjct: 228 NAPVVSIDGHE-DVPKNSEVDLMKAVA-HQPVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284
Query: 299 NINHAVQIVGY 309
NH V +VGY
Sbjct: 285 --NHGVAVVGY 293
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 140/304 (46%), Gaps = 34/304 (11%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + + + P L Q L+ +S ++ + K Y ++E R +EK+L +I++ N
Sbjct: 7 LAALC---LGIVSAAPKLYQSLDARWSQWKAAHGKLYDENEEGWRRAVWEKNLKVIKQHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
+ Q S + F DL+ EEFK +M+ K +V
Sbjct: 64 QEYSQGKHSFTMAMNAFGDLTNEEFKQ-----------VMNGLKSQKRKEGNV------- 105
Query: 131 GITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
P P DWR+ G + V+NQ CG+CWAFS E K L LS Q
Sbjct: 106 -FQAPPFAETPSSVDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GCSGG ++ N L+ E YP +D +CK K P
Sbjct: 165 NLVDCSQAEGNEGCSGGLMDYAFQYVKDNG-GLDSEESYPYRAQDESCKYK---PEQSAA 220
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
+ P E S+ +AT GP+ AA++A T+Q+Y G I Y+ D S N++H +
Sbjct: 221 NDTGFMDIHPEEESLKLAVATVGPISAAIDASLSTFQFYHKG-IYYDPDCSSENLDHGIL 279
Query: 306 IVGY 309
+VGY
Sbjct: 280 VVGY 283
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRF- 123
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 124 ----------GQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A +++Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 147/316 (46%), Gaps = 39/316 (12%)
Query: 6 NVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
N L+ V+L + C + ++V+ L+ E + Y K Y + E + R + F
Sbjct: 5 NQLYHVSLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFT 64
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++L IE N N + + + GI +F+DL+ EEF + S +K H +
Sbjct: 65 ENLKYIEASN-NAGNNKPYKLGINQFADLTNEEF-------------IASRNKFKGHMCS 110
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ + TT T +P DWR+ G + V+NQ CG CWAFS + E +H + G
Sbjct: 111 SIIR--TTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGK 168
Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAAC 235
L LS QE++DC NG + GC GG L+D D K +++ E+ YP D C
Sbjct: 169 LVSLSEQELVDCDTNGVDQGCEGG----LMD--DAFKFIIQNNGISTEAGYPYQGVDGTC 222
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
K S + I Y D +E+++ +A P+ A++A +Q+Y GV +C
Sbjct: 223 KANEASTSAATITGYE-DVPANNENALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 280
Query: 294 DGSLANINHAVQIVGY 309
L +H V VGY
Sbjct: 281 GTEL---DHGVTAVGY 293
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 139/275 (50%), Gaps = 23/275 (8%)
Query: 43 RYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH 102
R++KSY + RF F+ ++D + E N +++ P + +F+D++ +E++ +L
Sbjct: 45 RFEKSYESFDFLQRFAVFKTNMDYVHEWN-SKKLPTVLE--LNQFADITNQEYRRLYLGT 101
Query: 103 SVNKHVLMSHHKHHDHHHNHVK----KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
+N L+ H+ +N K S ++G T+ DWR G + ++NQ
Sbjct: 102 RINARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATV--------DWRAKGAVSPIKNQGQ 153
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNK 217
CG+CW+FST + E H + G + LS Q ++DC+G+ GNMGC GG D++ N+
Sbjct: 154 CGSCWSFSTTGSVEGAHYISTGKMVPLSEQNLVDCSGSEGNMGCQGGLMNLAFDYIIKNE 213
Query: 218 VVLEPESEYPLLLKDA-ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
+ + E YP + C T+ G I SY + ES++ + GPV A+
Sbjct: 214 GI-DTEDSYPYSAETGKKCLFNKTNV-GATISSYK-NITSGDESNLADAVKNAGPVSVAI 270
Query: 277 NAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A ++Q Y G I Y D S N++H V +VGY
Sbjct: 271 DASHNSFQLYSHG-IYYEKDCSSVNLDHGVLVVGY 304
>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
Length = 396
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 155/313 (49%), Gaps = 39/313 (12%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
+ L+A F K+ L+Q+ F F +++ + + S E+ +RF+ F+K+L E
Sbjct: 64 MTILMASVFRIRAEKLKSFGLQQQ---FKDFNKKFGREHKSLEEYKMRFEVFQKNLREFE 120
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKT-----RHLRHSVNKHVLMSHHKHHDHHHNHV 123
ELN Q S +YGI +FSD +E E K + L S++ L + + +
Sbjct: 121 ELN---QKNPSVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRN------ 171
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
R+I + P I DWR G + V++Q CG+CWAF+TV ES +A++ GTL
Sbjct: 172 -PRNIIKNVQRPDYI----DWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLW 226
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE++DC G + GC GG + L ++ N LE E +YP +A K N
Sbjct: 227 SLSEQELVDCDG-ASYGCGGGFLTSALGFILGNG--LETEDDYPY----SATKHDQCWIN 279
Query: 244 GVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI---QYNC-DGS 296
G K + + + L SE + +A GPV A++ ++ Y G+ ++ C D S
Sbjct: 280 GDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPAYHDGIYSPSEHECKDES 339
Query: 297 LANINHAVQIVGY 309
L HA+ I+GY
Sbjct: 340 LG--YHAMAIIGY 350
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 131/288 (45%), Gaps = 38/288 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ ++ K Y S+ EHD RFK F+ +L N+ SA +GIT+FSDL+ EF
Sbjct: 49 FSLFKSKFGKIYASEEEHDHRFKVFKANL---RRARLNQLLDPSAEHGITKFSDLTPSEF 105
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L H K + +PT +P DWR+ G + V+
Sbjct: 106 RRTYLGL-----------------HKPKPKVNAEKAPILPTSDLPADYDWRDHGAVTGVK 148
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC + + GC GG
Sbjct: 149 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLM 208
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L+ E +YP KD C S + +++ L E I ++
Sbjct: 209 TTAFEYT-LKAGGLQLEKDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANL 264
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
HGP+ +NA Q Y+GGV C +H V +VGY ++
Sbjct: 265 VKHGPLAVGINAAWMQTYVGGVSCPLIC---FKRQDHGVLLVGYGSHG 309
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 129/277 (46%), Gaps = 28/277 (10%)
Query: 46 KSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
+SY+ EH+ RF+ F +L + N R R G+ F+DL+ EEF+ L V
Sbjct: 63 RSYNALGEHERRFRVFWDNLRFADAHNA-RADDHGFRLGMNRFADLTNEEFRATFLGAKV 121
Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
V S + H+ V++ +P DWRE G + V+NQ CG+CWA
Sbjct: 122 ---VERSRAAGERYRHDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPE 223
FS V T ES++ L G + LS QE+++C+ NG N GC+GG D++ + ++ E
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTE 225
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTW 281
+YP D C + V I + D E S+ +A H PV A+ A +
Sbjct: 226 DDYPYKAVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREF 283
Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
Q Y GV C SL +H V VGY DN W
Sbjct: 284 QLYHSGVFSGRCGTSL---DHGVVAVGYGTDNGKDYW 317
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 148/307 (48%), Gaps = 27/307 (8%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+LF L+ L ++K ++ ++ S+ +Y KSY S E + RF+ F+++L
Sbjct: 12 LLFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLR 71
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I+E N + + S R G+ +F+D + EEF++ +L + + + +++
Sbjct: 72 FIDEHNAD--TNRSYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEPR------- 122
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ +P DWR AG + +++Q CG+CWAFS + T E ++ + G L L
Sbjct: 123 --------VGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISL 174
Query: 186 SVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
S QE++DC N GC GG ++ +N + E+ YP +D C +
Sbjct: 175 SEQELVDCGRTQNTRGCDGGSITDGFQFI-INNGGINTEANYPYTAEDGQCNLDLQNEKY 233
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
I +Y + +E ++ T +A + PV A+ A +Q+Y G+ C + ++H
Sbjct: 234 ASIDTYE-NVPYNNEWALQTAVA-YQPVSVALEAAGDAFQHYSSGIFTGPCGTA---VDH 288
Query: 303 AVQIVGY 309
AV IVGY
Sbjct: 289 AVTIVGY 295
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 154/317 (48%), Gaps = 30/317 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+S+ E + + A +G+T+FSD+S EEF+ +L + K++
Sbjct: 67 QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALE 113
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+K + + TG P DWR+ G + V++Q +CG+CWAF+ E +
Sbjct: 114 RPRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATGNIEGQWKIAGH 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK--R 237
L+ LS Q ++ C + C GG W+ NK + E YP D
Sbjct: 169 ELTSLSEQMLVSCDTTED-NCRGGFADRAFKWIVSSNKGNVFTEESYPYASTDGYVPPCN 227
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
K+ G KI + L E++I +A +GPV AV+A T+ Y GGV+ +C S
Sbjct: 228 KSGKVVGAKISGHI--NLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT-SC--SS 282
Query: 298 ANINHAVQIVGYDNYSR 314
++H V +VGY++ S+
Sbjct: 283 EGLSHDVLLVGYNDTSK 299
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 132/283 (46%), Gaps = 33/283 (11%)
Query: 37 FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F +++ K YS +E H RF F+K+L + ++++ A +GI +FSDL+EEEF
Sbjct: 75 FAHFVKKFNKEYSGAEEHARRFSIFKKNL---HKALRHQKLDRDAIHGINKFSDLTEEEF 131
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
++L + L +R+ I +P DWRE G + V+N
Sbjct: 132 HEQYLGLTTPPRSL--------------SQRTQPAPILPTDDLPPDFDWRELGAVTPVKN 177
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q CG+CW FST E + +K G L LS Q+++DC + GC+GG
Sbjct: 178 QGACGSCWTFSTTGAMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMT 237
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+ + L+ E +YP D +CK T V T+ E I ++
Sbjct: 238 TAYQYA-LKAGGLQREEDYPYTGIDGSCKFDNTK---VAAMVANFSTVSIDEDQIAANLV 293
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA Q Y+GGV Y C+ N++H V +VGY
Sbjct: 294 KNGPLAVGINAAFMQTYVGGVSCPYVCNKQ--NLDHGVLLVGY 334
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ K+Y ++ EHD RF F+ +L K++ +A +G+T+FSDL+ +EF
Sbjct: 51 FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGVTKFSDLTPKEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + +K I T +P DWR+ G + +V++
Sbjct: 108 RRQFLGLKRRLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
Q +CG+CW+FS E H L G L+ LS Q+++DC G + GC GG
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE E +YP D + S + +++ ++ E I ++
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ A+NA Q Y+GGV Y C +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/331 (28%), Positives = 145/331 (43%), Gaps = 52/331 (15%)
Query: 8 LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
LF+++L+A AI P + Q + FS F+ ++ K Y S+ E
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63
Query: 53 HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
HD RFK F+ +L +++ SA +GIT+FSDL+ EF+ +L
Sbjct: 64 HDHRFKVFKANL---RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110
Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
H K + +PT +P DWR+ G + V+NQ +CG+CW+FST
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163
Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
E H L G L LS Q+++DC + GC GG ++ + L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYT-LKAGGLQLE 222
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
+YP KD C S + +++ L E I ++ HGP+ +NA Q
Sbjct: 223 KDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 284 YLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
Y+GGV C +H V +VGY ++
Sbjct: 280 YVGGVSCPLIC---FKRQDHGVLLVGYGSHG 307
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A +++Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F +++K+ YS +E RFK + ++L +E+L + +A YG+T+FSD+S EEF
Sbjct: 170 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKG--TAIYGVTQFSDMSPEEF 227
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L S+ ++S+ +D +KK ++T +P + DWR G++ V+N
Sbjct: 228 QKTML-PSLWWDRVVSNGVEYD-----LKKFNLTF-----NNLPEQFDWRTKGVVTPVKN 276
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q +CG+CWAFS E + A+K G L LS QE+IDC + GC+GG + + ++
Sbjct: 277 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDC-DRIDKGCNGG--LPINAFREI 333
Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-----IPSESSILTD-IAT 268
++ LEPE +YP ++ C I+S T+ IP +++ I
Sbjct: 334 QRMGGLEPEDQYPYKARNGTCHL---------IRSAIAVTIDDAVEIPRNETVMKAWIVQ 384
Query: 269 HGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
GP+ ++A YY G++ + C S I+H V I GY
Sbjct: 385 RGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITGY 426
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 135/286 (47%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ ++LF S+ ++ K Y + I RF+ F +L I+E NK S G+ F
Sbjct: 40 SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK +++ + H + D + HV T P DWR
Sbjct: 97 ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFST+ T E ++ + G L LS QE++DC + + GC GG
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V YP K C +AT G K+K T +PS E+S L
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ V A +Q Y GV C L +HAV VGY
Sbjct: 259 LANQ-PLSFLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 151/308 (49%), Gaps = 30/308 (9%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F + L +LC + + + P ++ L+ + ++ ++ KSY +E +R +EK+L +I
Sbjct: 3 FYLCLASLC---LGLAAAIPPFDRALDSQWHQWKAQHGKSYEANEDSLRRATWEKNLKMI 59
Query: 68 EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
E N+ + + S + + +F D+S EEFK +M+ +K + + +
Sbjct: 60 ERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQ-----------VMNGYKSNGSQR---RTK 105
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
++ +P DWRE G + V+ Q CGACW+FS V E K G L LS
Sbjct: 106 GSLYRESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLS 165
Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
+Q +IDC GN GC GG ++ N + + E YP + +D CK K +G
Sbjct: 166 IQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGI-DTEECYPYVAQDTECKYKPEC-SGA 223
Query: 246 KIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANIN 301
I + IPS E +++ +AT GP+ +++ ++++Y GV Y D S + ++
Sbjct: 224 NITGF---VDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGVY-YEPDCSSSQLD 279
Query: 302 HAVQIVGY 309
H V +VGY
Sbjct: 280 HGVLVVGY 287
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 149/310 (48%), Gaps = 33/310 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPN---LEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
+L IV I LC A+ + +E+ + + F + YK K++ RF+ F+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQ---RFEVFKAN 64
Query: 64 LDIIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ IE N +NR+ G+ +F+DL+ +EF+ NK + MS +
Sbjct: 65 VAFIESFNAENRK----FWLGVNQFTDLTNDEFRATK----TNKGLKMSGGRAPTGF--- 113
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K S + +PT + DWR G++ +++Q CG CWAFS V E + L G L
Sbjct: 114 --KYSNVSIDALPTAV----DWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKL 167
Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE++DC +G + GC GG+ ++ + L E+ YP +D CK S
Sbjct: 168 ISLSEQELVDCDVHGVDQGCEGGEMDDAFKFI-IKNGGLTTEANYPYTAQDGQCKTSIAS 226
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
+ IK Y D ESS++ +A PV AV+ + +Q+Y GGV+ +C +
Sbjct: 227 NSVATIKGYE-DVPANDESSLMKAVANQ-PVSVAVDGGDVIFQHYSGGVMTGSCG---TD 281
Query: 300 INHAVQIVGY 309
++H + +GY
Sbjct: 282 LDHGIAAIGY 291
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 139/306 (45%), Gaps = 22/306 (7%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLD 65
+ F++A+I + +E + R+ + YS SE RF+ F+K+L
Sbjct: 5 IFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLK 64
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
+E N N + ++ + EFSDL++EEFK R+ V + M+ D H V
Sbjct: 65 FVESFNMN--TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEG--MTRMSTTDSHET-VSF 119
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
R G T + DWRE G + V++QQ CG CWAFS V E M + G L L
Sbjct: 120 RYENVGETGES-----MDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSL 174
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
S Q+++DC+ N GC GG D++ N+ + E YP C+ +
Sbjct: 175 SEQQLLDCSTE-NDGCDGGIMWKAFDYIVENQGIT-AEDNYPYQGAQQTCESNHVAA--A 230
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLANINHA 303
I Y +T+ ++ L + PV A+ +++ Y GG+ C ++NHA
Sbjct: 231 TISGY--ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECG---THLNHA 285
Query: 304 VQIVGY 309
V IVGY
Sbjct: 286 VTIVGY 291
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ K+Y ++ EHD RF F+ +L K++ +A +G+T+FSDL+ +EF
Sbjct: 51 FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGVTKFSDLTPKEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + +K I T +P DWR+ G + +V++
Sbjct: 108 RRQFLGLKRRLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
Q +CG+CW+FS E H L G L+ LS Q+++DC G + GC GG
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE E +YP D + S + +++ ++ E I ++
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ A+NA Q Y+GGV Y C +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 147/304 (48%), Gaps = 30/304 (9%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
+ + FL V+ + NL +LF F Q+Y KSYS + E I+F NF+ + I +N
Sbjct: 1 MFYILFLIGLVQGALYNLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNN---IRSIN 57
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
+ SA Y I +SD+++ E + +N + D N + + G
Sbjct: 58 EKNSLSNSAVYDINFYSDMNKNELLRKQTGFKINLK-----KNNLDLSWNIKCNKKLING 112
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
+P DWR+ +I V+NQ+ CG+CWAFST+ ES++A+K L LS Q+++
Sbjct: 113 -NPAVLLPDSFDWRDRHVITSVKNQRDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLV 171
Query: 192 DCAGNGNMGCSGGDFCALLDW-MD--VNKVVLEPESEYPLLLKDAACKRKA--TSPNGVK 246
+C N GC+GG L+ W M+ + + + E+++P D CKRK + NG
Sbjct: 172 NCDEQNN-GCNGG----LMHWAMEEIIRQGGVSNETDFPYTASDGFCKRKQGFVNING-- 224
Query: 247 IKSYTCDTLIPSESSILTDIAT-HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
C+ I S L ++ +GP+ A++ + Y G I C +NHAV
Sbjct: 225 -----CNQFILSNEDRLRELLIFNGPISIAIDVIDVIDYSQG-ISSTCRNDNG-LNHAVL 277
Query: 306 IVGY 309
+VGY
Sbjct: 278 LVGY 281
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 133/283 (46%), Gaps = 39/283 (13%)
Query: 39 SFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
+ Q R +S EH RF+ F++++ I+ +NK + P + G+ +F+DLS EEFK
Sbjct: 49 ALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNK-KDGP--YKLGLNKFADLSNEEFKAM 105
Query: 99 HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI---PTGIPVKKDWREAGIIGKVRN 155
H+ + KH + R + +G + +P DWR+ G + V+N
Sbjct: 106 HMTTKMEKHKSLR------------GDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKN 153
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q CG+CWAFST+ + E ++ +K G L LS Q+++DC+ N GC+GG ++
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI-------PSESSILTDIAT 268
N ++ E EYP + C KI+S + T+I + L
Sbjct: 213 NGGIV-TEDEYPYTAEAGECST-------TKIESKSIATIIDGFEDVPANNEGALKKAVA 264
Query: 269 HGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A +Q+Y GV C L +H V +VGY
Sbjct: 265 HQPVSIAIEASGHDFQFYSTGVFTGKCGTEL---DHGVVVVGY 304
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F +++K+ YS +E RFK + ++L +E+L + +A YG+T+FSD+S EEF
Sbjct: 135 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKG--TAIYGVTQFSDMSPEEF 192
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L S+ ++S+ +D +KK ++T +P + DWR G++ V+N
Sbjct: 193 QKTML-PSLWWDRVVSNGVEYD-----LKKFNLTF-----NNLPEQFDWRTKGVVTPVKN 241
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q +CG+CWAFS E + A+K G L LS QE+IDC + GC+GG + + ++
Sbjct: 242 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDC-DRIDKGCNGG--LPINAFREI 298
Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-----IPSESSILTD-IAT 268
++ LEPE +YP ++ C I+S T+ IP +++ I
Sbjct: 299 QRMGGLEPEDQYPYKARNGTCHL---------IRSAIAVTIDDAVEIPRNETVMKAWIVQ 349
Query: 269 HGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
GP+ ++A YY G++ + C S I+H V I GY
Sbjct: 350 RGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITGY 391
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 146/313 (46%), Gaps = 31/313 (9%)
Query: 7 VLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFE 61
L I L+ L F L+ + S E+ + +++ +++K Y+ E D RF+ F+
Sbjct: 6 TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFK 65
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+L I+E N N+ + + + G+ +F+D++ EE++ + + + K H +
Sbjct: 66 DNLGFIQEHNNNQNN--TYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYA 123
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ + +PV DWR G + +++Q +CG+CWAFSTV T E+++ + G
Sbjct: 124 Y----------SAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGK 173
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
LS QE++DC N GC+GG L+D+ + ++ + +YP D C
Sbjct: 174 FVSLSEQELVDCDRAYNQGCNGG----LMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPT 229
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
+ V I Y + + P + + L PV A+ A Q Y GV C S
Sbjct: 230 KKNAKAVNIDGY--EDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTS 287
Query: 297 LANINHAVQIVGY 309
L +H V +VGY
Sbjct: 288 L---DHGVVVVGY 297
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 148/315 (46%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 150/318 (47%), Gaps = 33/318 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKS------EHDIR- 56
VL V+L AL LA P + P E+ L E + ++++ Y S E D +
Sbjct: 6 VLAAVSL-ALLVLAPPARAGIPFTEKDLASEESLRALYEQWRSHYMVSRPAGLQEQDDKA 64
Query: 57 --FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
F F++++ I E NK +S R + +F+D++ +EF+ + S +
Sbjct: 65 RWFNVFKENVRYIHEANKKGRS---FRLALNKFADMTTDEFR--------RAYAAGSRTR 113
Query: 115 HHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
HH + +++ + + G +P+ DWR+ G + +++Q CG+CWAFST+ E
Sbjct: 114 HHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEG 173
Query: 174 MHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
++ ++ G L LS QE++DC N GC+GG ++ N + ES YP L +
Sbjct: 174 INKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQYIKRNGGIT-TESNYPYLAEQR 232
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
+C + + V I Y D +E ++ +A PV A+ A +Q+Y GV
Sbjct: 233 SCNKAKERSHDVTIDGYE-DVPANNEDALQKAVANQ-PVSIAIEASGQDFQFYSEGVFTG 290
Query: 292 NCDGSLANINHAVQIVGY 309
+C L +H V VGY
Sbjct: 291 SCGTEL---DHGVAAVGY 305
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 136/286 (47%), Gaps = 33/286 (11%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
+ ++ +RY + Y + E ++RF ++ ++ IE N S + F+D++ EEF
Sbjct: 39 YETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYSYKLID---NRFADITNEEF 95
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
K+ +L + + + + + ++H H +P DWR+ G + V++
Sbjct: 96 KSTYLGY-LPRFRVQTEFRYHKHGE-----------------LPKSIDWRKKGAVTHVKD 137
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMD 214
Q CG+CWAFS V E ++ +K L LS Q++IDC +GN GC GGD +++
Sbjct: 138 QGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIK 197
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
+ + + EYP +D C + N V I Y +++ +L H PV
Sbjct: 198 KHGGIATAK-EYPYKGRDGNCNKSKAKNNAVTISGY--ESVPARNEKMLKAAVAHQPVSI 254
Query: 275 AVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
A +A +Q+Y G+ +C N+NH + IVGY +N + W
Sbjct: 255 ATDAGGYAFQFYSKGIFSGSCG---KNLNHGMTIVGYGEENGDKYW 297
>gi|440798540|gb|ELR19607.1| papain family cysteine protease subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 368
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 90/319 (28%), Positives = 149/319 (46%), Gaps = 30/319 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL----------FSSFQQRYKKSYSKSEHDIR 56
+L A + LC + +S E+ L F+++ ++ +SY+ E R
Sbjct: 11 MLMAAACVVLCLATLGSAISPRFDERGYTLLADTHAARSEFNAWARQNGRSYAAQEFGYR 70
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHK 114
+ + + +E N N + S G+ + +D++ +E + L + N S
Sbjct: 71 YNVWRDNAAYVEHFNANANA--SFTVGLNDLADMTLDEVARVYTGLAPAANPFTDASSPA 128
Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+R + +P DWR AG + V+NQ +CGAC+ FS E M
Sbjct: 129 APVVDDETELER-------VARQLPASYDWRNAGAVTPVKNQGSCGACYTFSANAAIEGM 181
Query: 175 HALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
+ + G L+ LS Q ++DCA G GN+GC+GG+ W+ N + + YP +
Sbjct: 182 YKIAAGQLTSLSEQMLLDCAQGTGNLGCNGGNMEITYSWILNNGGGVNTLASYPWSGFRS 241
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG-VIQ 290
C+ A S NG IK+Y T SE+ +LT +A+ GPV +NA ++ YY G +I
Sbjct: 242 TCRYSA-SNNGAVIKAYRRAT-SGSEAGLLT-LASRGPVSVGINASPRSFTYYRSGTLID 298
Query: 291 YNCDGSLANINHAVQIVGY 309
+C + A +NHAV +VG+
Sbjct: 299 SSC--TAAGMNHAVTVVGW 315
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 135/304 (44%), Gaps = 51/304 (16%)
Query: 27 KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
+PNL E K +F S Y K+YS E I R F K ++++ P +A
Sbjct: 39 RPNLLGTHTESKFRVFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-TA 92
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
+G+T+FSDL+EEEFK + + + R G P
Sbjct: 93 VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGHAVGAEAPMVEVD 135
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--- 193
G+P DWRE G + +V+NQ CG+CWAFST AE H + G L LS Q+++DC
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195
Query: 194 ------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
+ GC GG +++ + LE E YP K CK P V +
Sbjct: 196 VCDPKDKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAV 251
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQ 305
+ T+ E I ++ GP+ +NA+ Q Y+GGV +C S +NH V
Sbjct: 252 RVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGV---SCPLICSKRKVNHGVL 308
Query: 306 IVGY 309
+VGY
Sbjct: 309 LVGY 312
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L+ + FLA V E + RY K Y E + RF+ F+++++ IE
Sbjct: 12 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 71
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N + + + I +F+DL+ EEF R+ H+ S + + +V
Sbjct: 72 AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 122
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + +++Q CG CWAFS V E +HAL +G L LS Q
Sbjct: 123 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 175
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
E++DC G + GC GG L+D D K V L E+ YP D C +
Sbjct: 176 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAAN 229
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
+ I Y D +E ++ +A PV A++A +Q+Y GV +C L
Sbjct: 230 DAATITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 284
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 285 DHGVTAVGY 293
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 135/284 (47%), Gaps = 37/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F++F+ ++ K+Y ++ EHD RFK F+ +L K++ +A +G+T FSDL+ EF
Sbjct: 52 FTTFKAKFGKTYATQEEHDYRFKLFKANL---RRARKHQMMDPTAVHGVTMFSDLTPREF 108
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ ++L L D H + +PT +P DWR+ G + V+
Sbjct: 109 RRQYLG-------LRRLRLPADAHEAPI----------LPTNDLPTDFDWRDHGAVTNVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L LS Q+++DC G + GC+GG
Sbjct: 152 NQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPEEYGACDSGCNGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + LE E +YP D + + + +++ ++ E I ++
Sbjct: 212 TTAFEYT-LKAGGLEREEDYPYTGNDRGPCKFDRNKIVASVSNFSVVSI--DEDQIAANL 268
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA+ Q Y+GGV Y C +H V +VGY
Sbjct: 269 VKHGPLAVGINAVFMQTYMGGVSCPYICS---KRQDHGVLLVGY 309
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
EH+ RF F +L ++ N R G+ F+DL+ EEF+ L V +
Sbjct: 69 GEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 128
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ ++ H+ V++ +P DWRE G + V+NQ CG+CWAFS V T
Sbjct: 129 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 173
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
ES++ L G + LS QE+++C+ NG N GC+GG D++ + ++ E +YP
Sbjct: 174 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTEDDYPYK 232
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
D C + V I + D E S+ +A H PV A+ A +Q Y G
Sbjct: 233 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 290
Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
V C SL +H V VGY DN W
Sbjct: 291 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 318
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 138/320 (43%), Gaps = 40/320 (12%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSKS-EHDIRF 57
M V +I + A + + NL + E + +Y + Y + E R+
Sbjct: 1 MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRY 60
Query: 58 KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
K F+ ++ IE NK +S + I EF+DL+ EEF T R+ H+ +
Sbjct: 61 KIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEFGTS--RNRFKAHICSTEATSFK 116
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + T +P DWR+ G + +++Q CG+CWAFS V E + L
Sbjct: 117 YEN--------------VTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQL 162
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
G L LS QE++DC +G + GC+GG L+D D K + L E+ YP
Sbjct: 163 STGKLISLSEQELVDCDTSGEDQGCNGG----LMD--DAFKFIKQNHGLTTEANYPYAGT 216
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
D C RK + KI Y + + + L H P+ A++A +Q+Y GV
Sbjct: 217 DGTCNRKKAAHPAAKINGY--EDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVF 274
Query: 290 QYNCDGSLANINHAVQIVGY 309
C L +H V VGY
Sbjct: 275 TGQCGTEL---DHGVAAVGY 291
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 141/285 (49%), Gaps = 34/285 (11%)
Query: 32 QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
Q L F F +YKK YS + E + R + F+++L E+L Q SA YG+T+FSDL
Sbjct: 170 QLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQG--SAEYGVTKFSDL 227
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+EEEF++ +L + L+S H R + T P DWR+ G +
Sbjct: 228 TEEEFRSTYL------NPLLSQWTLH---------RGMKPAPPAKTPAPDSWDWRDHGAV 272
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V+NQ CG+CWAFS E LKNGTL LS QE++DC G + C GG
Sbjct: 273 SPVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGL-DQACRGGLPSNAY 331
Query: 211 DWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT--LIPSESSILTDIA 267
+ + K+ LE E++Y K+K N K+ +Y + L E I +A
Sbjct: 332 E--AIEKLGGLESETDYSY----TGHKQKCDFTN-RKVAAYINSSVELPKDEREIAAWLA 384
Query: 268 THGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA Q+Y GV + C+ + I+HAV +VGY
Sbjct: 385 ENGPISVALNAFAMQFYKKGVSHPWKIFCNPWM--IDHAVLLVGY 427
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 145/314 (46%), Gaps = 35/314 (11%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDI 66
L I L A + + +E + ++ K Y E +R F+ F+ +++
Sbjct: 10 LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKT--RHLRHSVNKHVLMSHHKHHDHHHNHVK 124
IE + N S GI F+DL+ EEF+ + ++ +++ K+ +
Sbjct: 70 IE--SSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVTPFKYEN------- 120
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
T +P DWR G + +++Q+ CG+CWAFS V E +H L+ G L
Sbjct: 121 ----------VTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVS 170
Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE++DC G + GC GG ++ N + E+ Y +D C K + +
Sbjct: 171 LSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGIT-TEANYAYRGRDGKCDTKKEASH 229
Query: 244 GVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
KI Y ++P SE+++L +A H PV +++A +++Q+Y G+ +C ++
Sbjct: 230 VAKITGY---QVVPENSEAALLKAVA-HQPVSVSIDAGSMSFQFYQSGIYAGSCG---SD 282
Query: 300 INHAVQIVGYDNYS 313
+NH V VGY S
Sbjct: 283 LNHGVAAVGYGTSS 296
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 133/279 (47%), Gaps = 31/279 (11%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y S E +RF F ++L++I N+ R P + GI ++D+S EEF
Sbjct: 58 FARFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNR-RGLP--YKLGINRYADMSWEEF 114
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +HK D +P KDWRE GI+ V+
Sbjct: 115 RASRLGAAQNCSATLKGNHKMTDEL------------------LPKTKDWREDGIVSPVK 156
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q +CG+CW FST E+ + G LS Q+++DCA N GC+GG +++
Sbjct: 157 DQGSCGSCWTFSTTGALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYI 216
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P V +K + + + +E +L + PV
Sbjct: 217 KYNG-GLDTEESYPYAGVNGFCHFK---PENVGVKVVESVNITLGAEDELLHAVGLVRPV 272
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A ++ +++Y GGV + C + ++NHAV VGY
Sbjct: 273 SIAFEVVSGFRFYKGGVYTSDTCGRTQMDVNHAVLAVGY 311
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 157/319 (49%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q CG+CWAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
L+ LS Q ++ C + GC GG L W+ NK + YP K C
Sbjct: 168 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G KI + L E++I +A +GPV AV+A ++ Y GGV+ +C
Sbjct: 227 N-KSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 282
Query: 296 SLANINHAVQIVGYDNYSR 314
++H V +VGY++ S+
Sbjct: 283 K--GLDHDVLLVGYNDTSK 299
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 39/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS+F+ ++ K+Y +K EHD RF F+ + + + + SA +G+T+FSDL+ EF
Sbjct: 22 FSTFKSKFSKTYATKEEHDYRFGVFKSN---VRRAKLHAKLDPSAVHGVTKFSDLTPSEF 78
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ + L K + + H +K I +PT +P DWR+ G + V+
Sbjct: 79 RRQFLGL---KPLRLPEH---------AQKAPI-----LPTHDLPEDFDWRDKGAVTHVK 121
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
NQ +CG+CWAFST E H L G L LS Q+++DC G + GC+GG
Sbjct: 122 NQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGGLM 181
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ + ++ E +YP +D N + +++ +L E I ++
Sbjct: 182 NNAFEYI-LESGGVQREEDYPYTGRDRG--PAIDEANAASVSNFSVVSL--DEDQISANL 236
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA+ Q Y+GGV Y C N++H V +VGY
Sbjct: 237 VKNGPLAIGINAVFMQTYIGGVSCPYICG---KNLDHGVLLVGY 277
>gi|118365756|ref|XP_001016098.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|161754|gb|AAA30114.1| cysteine protease [Tetrahymena thermophila]
gi|89297865|gb|EAR95853.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 92/312 (29%), Positives = 152/312 (48%), Gaps = 30/312 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+L I+ L+ LC LA + V +KL ++ + + +++Y ++ E R F ++L
Sbjct: 7 ILSIIMLMPLC-LAQDISV------EKLLAYNKWSSQNQRAYLNEDEKLYRQIVFFENLQ 59
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS--VNKHVL-MSHHKHHDHHHNH 122
I+E N N + S + +FSD++ EEF + L +N ++ + H++ +N
Sbjct: 60 KIKEHNSNPNNTYSIH--LNQFSDMTREEFAEKILMKQDLINDYMKGIGQQATHNNANNE 117
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
+ S T+ I DWR G + V++Q CG+CW+FS ES + ++N L
Sbjct: 118 TQMNS--QNHTLAASI----DWRTKGAVTSVKDQGQCGSCWSFSAAALMESFNFIQNKAL 171
Query: 183 SLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
S Q+++DC G + GC GG LD+ +KV + +YP + C
Sbjct: 172 VNFSEQQLVDCVTPENGYPSYGCKGGWPATCLDY--ASKVGITTLDKYPYVAVQKNCTVT 229
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
T+ NG K+K + +IP+ S+ L PV V+A W YY G+ C+ +
Sbjct: 230 GTN-NGFKLKKW---IVIPNTSNDLKSALNFSPVSVLVDATNWDYYSSGIFN-GCNQTNI 284
Query: 299 NINHAVQIVGYD 310
N+NHAV VGYD
Sbjct: 285 NLNHAVLAVGYD 296
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 152/318 (47%), Gaps = 35/318 (11%)
Query: 6 NVLFIVALIALCFLAI--------PVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIR 56
+ LF+V ++L ++I P++ ++ ++++ + ++ K+Y+ E + R
Sbjct: 13 SFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNAIGEKERR 72
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
F+ F+ +L ++E +N + + G+T+F+DL+ EE++ +L + K + +
Sbjct: 73 FEIFKDNLRFVDE--QNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQ 130
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+ H + P DWRE G + +V++Q CG+CWAFSTV + E ++
Sbjct: 131 RYLHKAGNDDDL----------PSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQ 180
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDA 233
+ G L LS QE++DC N GC+GG L+D+ + ++ E++YP D
Sbjct: 181 IVTGDLISLSEQELVDCDKAYNQGCNGG----LMDYAFEFIIKNGGIDSEADYPYRASDN 236
Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
C + + V I Y D E S+ +A PV A+ A +Q Y GV
Sbjct: 237 MCDSNRKNAHVVTIDGYE-DVPENDEESLKKAVANQ-PVSVAIEAGGREFQLYQSGVFTG 294
Query: 292 NCDGSLANINHAVQIVGY 309
C N++H V VGY
Sbjct: 295 RCG---TNLDHGVVAVGY 309
>gi|118369234|ref|XP_001017822.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|18913076|gb|AAL79510.1| granule-biosynthesis induced protease Gip1p [Tetrahymena
thermophila]
gi|89299589|gb|EAR97577.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 345
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 144/320 (45%), Gaps = 31/320 (9%)
Query: 6 NVLFIVALIALCFLAIPV--------KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRF 57
N L I ALI L +A P +S+ Q L ++ ++ YK+ Y E I
Sbjct: 2 NKLLISALICL-MIATPSVFCQDVENNISEDIKVQDLLAYNKWRFNYKRVYLNEEEQIY- 59
Query: 58 KNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
+ + E L + P Y G+ +FSD+++EEFK R L ++K
Sbjct: 60 ----RQIVFFENLASVNKHPSHKSYSKGLNQFSDMTKEEFKQRVLNKKISKKA------S 109
Query: 116 HDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
+ ++ + + PT +P+ DWR+ G++ V+NQ TCG+CW F+T ES
Sbjct: 110 SNKGGRNLAADPAVSNLVFPTNNLPLSVDWRKRGVLNPVKNQGTCGSCWTFATAGILESF 169
Query: 175 HALKNGTLSLLSVQEVIDC---AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
+ +KN L S Q+++DC AG + GC GG + + +V +YP +
Sbjct: 170 NQIKNKQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGVRYAIEYGIV--QSYKYPYVGY 227
Query: 232 DAACKRKATSPNGVKIKSYTCD-TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ 290
C K TSP + Y L+ + L PV +VNA TW+ Y GGV
Sbjct: 228 QGRC--KVTSPTSRSVGFYPQKFQLVNKTEADLKAALVFSPVSISVNADTWKEYYGGVFD 285
Query: 291 YNCDGSLANINHAVQIVGYD 310
+ ++NHAV VGYD
Sbjct: 286 ECGYTTEEDLNHAVIAVGYD 305
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 143/312 (45%), Gaps = 37/312 (11%)
Query: 7 VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
L++VA ALC + + P L+ L+ ++ +KKSY E R +EK+L
Sbjct: 2 ALYLVA-AALCLTTVFAAPTTDPALDDHWHLWKNW---HKKSYLPKEEGWRRVLWEKNLR 57
Query: 66 IIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
IE N + S R G+ +F D++ EEF+ LM N K
Sbjct: 58 TIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQ-----------LM----------NGYK 96
Query: 125 KRSITTGITI--PTGIPVKK--DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ + G T P K DWRE G + V++Q CG+CWAFST E H K G
Sbjct: 97 NQKMIKGSTFLAPNNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAG 156
Query: 181 TLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++DC+ GN GC+GG ++ N + + E YP KD
Sbjct: 157 KLISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYD 215
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
+ N + D SE ++ +A+ GPV AV+A ++Q+Y G I Y+ + S
Sbjct: 216 PNYNSANDTGFV-DVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSG-IYYDPECSS 273
Query: 298 ANINHAVQIVGY 309
+++H V +VGY
Sbjct: 274 EDLDHGVLVVGY 285
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C +
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVEL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A ++ Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 151/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 135/283 (47%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ K+Y ++ EHD RF F+ +L K++ +A +GIT+FSDL+ +EF
Sbjct: 51 FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGITKFSDLTPKEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + +K I T +P DWR+ G + +V++
Sbjct: 108 RRQFLGLKRWLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
Q +CG+CW+FS E H L G L+ LS Q+++DC G + GC GG
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE E++YP D + S + +++ ++ E I ++
Sbjct: 213 NAFEYA-LKAGGLEREADYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ A+NA Q Y+GGV Y C +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 135/286 (47%), Gaps = 30/286 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ ++LF S+ ++ K Y + I RF+ F +L I+E NK S G+ F
Sbjct: 40 SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK +++ + H + D + HV T P DWR
Sbjct: 97 ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFST+ T E ++ + G L LS QE++DC + + GC GG
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V YP K C +AT G K+K T +PS E+S L
Sbjct: 204 TSLQYVANNGV--HTSKVYPCQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A P+ V A +Q Y GV C L +HAV VGY
Sbjct: 259 LANQ-PLSFLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 140/284 (49%), Gaps = 37/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS+F+ ++ K+Y ++ EHD RF+ F+ +L + + + P SA +G+T FSDL+ EF
Sbjct: 51 FSAFKTKFGKTYATQEEHDHRFRIFKNNL--LRAKSHQKLDP-SAVHGVTRFSDLTPAEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + L S +K I +PT +P DWRE G + V+
Sbjct: 108 RRQFL--GLKPLRLPS----------DAQKAPI-----LPTNDLPTDFDWREHGAVTGVK 150
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS V E H L G L LS Q+++DC G + GC+GG
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L E +YP +D + S + +++ +L E I ++
Sbjct: 211 TTAFEYT-LQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVANFSVVSL--DEEQIAANL 267
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 268 VQNGPLAVGINAVFMQTYIGGVSCPYICG---KHLDHGVLLVGY 308
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 139/294 (47%), Gaps = 35/294 (11%)
Query: 30 LEQKLELFS---SFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
L+ LEL S F QR+ K+Y +E H R F+ +L +++ SA +G+T
Sbjct: 43 LDNDLELDSQLLGFVQRFGKTYRDAEEHAHRLSVFKANL---RRARRHQMLDPSAEHGVT 99
Query: 86 EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDW 144
+FSDL+ EF+ L + + H +PT G+P DW
Sbjct: 100 KFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAH-----------DAPVLPTDGLPEDFDW 148
Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGN 196
R+ G +G V+NQ +C +CW+FS E + L G + +LS Q+++DC +
Sbjct: 149 RDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDHECDPAEPDS 208
Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
+ GC+GG + ++ + LE E +YP KD CK + S +++++ +
Sbjct: 209 CDAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGKDGTCKFE-KSKIAASVQNFS--VVA 264
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
E I ++ +GP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 265 VDEEQIAANLVEYGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 315
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 140/285 (49%), Gaps = 30/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ + ++ K+Y S E + RF+ F+ +L I+E N ++ R G+ F+D
Sbjct: 36 DEVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRT---YRVGLNRFAD 92
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE+++ +L N ++K S + +P DWR+ G
Sbjct: 93 LTNEEYRSMYL------------GALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGA 140
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q +CG+CWAFS V E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 141 VVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGG----L 196
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ E +YP L +D C + V I SY D + +E+++ +
Sbjct: 197 MDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYE-DVPVNNEAALQKAV 255
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A PV A+ A +Q Y GV C +L +H V VGY
Sbjct: 256 ANQ-PVSVAIEAGGRDFQLYSSGVFSGRCGTAL---DHGVVAVGY 296
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/279 (31%), Positives = 128/279 (45%), Gaps = 29/279 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL ++ N+ S R GI FSD+S EEF
Sbjct: 62 FARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLS---YRLGINRFSDMSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ +P KDWRE GI+ V+
Sbjct: 119 RATRLGAAQNCSATLAGNHRMR----------------AAAVALPKTKDWREDGIVSPVK 162
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q+++DC N GC+GG +++
Sbjct: 163 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYI 222
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + C KA + GVK+ + + + +E + +A PV
Sbjct: 223 KYNG-GLDTEESYPYKGVNGICDFKAENV-GVKVLD-SVNITLGAEDELKDAVALVRPVS 279
Query: 274 AA---VNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A VN QY G +C + ++NHAV VGY
Sbjct: 280 VAFQVVNGFR-QYKSGVYTSDSCGNTPMDVNHAVLAVGY 317
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 156/320 (48%), Gaps = 36/320 (11%)
Query: 4 VKNVLFIVAL--IALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKN 59
+ + F V L +A CF +PV + + EQ L+ F++F+Q+Y +SY +E RF+
Sbjct: 7 TRTLRFSVGLHAVAACF--VPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRV 64
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 65 FKQNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYY 108
Query: 120 HNHVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+K+ R + + + TG P DWR+ G + V++Q C + WAFS + E +
Sbjct: 109 AAALKRPRKV---VNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIGNIEGQWKV 165
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK 236
L+ LS Q ++ C N + GC GG W+ NK + E YP
Sbjct: 166 AGHELTSLSEQMLVSCDTN-DFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVP 224
Query: 237 R--KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
K+ G KI+ L E++I +A +GPV AV+A ++Q Y GGV+ +C
Sbjct: 225 TCDKSGKVVGAKIRDRV--DLPRDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCI 281
Query: 295 GSLANINHAVQIVGYDNYSR 314
+N AV +VGYD+ S+
Sbjct: 282 SK--EMNSAVLLVGYDDTSK 299
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 145/308 (47%), Gaps = 22/308 (7%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS----EHDIRFKNFEKS 63
L +VA A + ++ +LE + L++ ++ R++ ++ S E RF F+++
Sbjct: 6 LILVASFLASVAATAIDIADKDLETEDSLWNLYE-RWRSHHTVSRDLDEKQKRFNVFKEN 64
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
I + NK + P R + +F+DL+ EF++ + +N HH+
Sbjct: 65 PRYIHDFNKRKDIPYKLR--LNKFADLTNHEFRSTYAGSRIN------HHRSLRGSRRGG 116
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
S +P DWR+ G + V++Q CG+CWAFSTV E ++ +K L
Sbjct: 117 ATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLL 176
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE+IDC + N GC+GG D++ N + E+EYP +D+ C + S +
Sbjct: 177 SLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGI-SSEAEYPYAAEDSYCATEKKS-H 234
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
V I + D E S+L +A PV A+ A +Q+Y GV S ++
Sbjct: 235 VVSIDGHE-DVPANDEDSLLKAVANQ-PVSIAIEASGYDFQFYSEGVF---TGRSGTELD 289
Query: 302 HAVQIVGY 309
H V IVGY
Sbjct: 290 HGVAIVGY 297
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 146/312 (46%), Gaps = 38/312 (12%)
Query: 9 FIVALIALCFL--AIPVK------VSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
F +AL F+ A P K + P E+ + + + + YK +E R+ F
Sbjct: 7 FQFVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDD---NERATRYSIF 63
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
++++ I+ N Q+ +S + G+ +F+DL+ EEFK R+ H M + +
Sbjct: 64 KENVARIDAFNS--QTGKSYKLGVNQFADLTNEEFKAS--RNRFKGH--MCSPQAGPFRY 117
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+V + +P DWR+ G + V++Q CG CWAFS V E ++ L G
Sbjct: 118 ENV------------SAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTG 165
Query: 181 TLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QEV+DC G + GC+GG +++ NK L E+ YP D C
Sbjct: 166 KLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNK-GLTTEANYPYKGTDGTCNTNK 224
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
+ + KI + D SE++++ +A PV A++A +Q+Y G+ +CD L
Sbjct: 225 AAIHAAKITGFE-DVPANSEAALMKAVAKQ-PVSVAIDAGGSDFQFYSSGIFTGSCDTQL 282
Query: 298 ANINHAVQIVGY 309
+H V VGY
Sbjct: 283 ---DHGVTAVGY 291
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 141/282 (50%), Gaps = 32/282 (11%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
E+++ F+ + K+Y+ D+R +E+ L++I + N + + G+ E+ DL++
Sbjct: 22 EMWTLFKTTHSKTYATEAEDMRRFIWERHLNMINQHNIEADLGKHTFSLGMNEYGDLTQH 81
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK--DWREAGIIG 151
E+ MS +K + K S+ + P + V K DWRE G +
Sbjct: 82 EY------------AAMSGYK--------MAKSSVGSSFLEPENLQVPKTVDWREKGYVT 121
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
V+NQ CG+CWAFS+ + E K G L +S Q ++DC+ + GNMGCSGG
Sbjct: 122 PVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAF 181
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
++ N + ++ E YP D C+ K + + V S D E+++ T +A+ G
Sbjct: 182 TYIKKN-MGIDSEKSYPYEAVDGECRYKKS--DSVTTDSGFVDIPHGDETALRTAVASVG 238
Query: 271 PVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
PV A++A ++Q+Y GV + NC S ++H V +VGY
Sbjct: 239 PVSVAIDASHTSFQFYKTGVYTEANC--SSTQLDHGVLVVGY 278
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 139/297 (46%), Gaps = 37/297 (12%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
LA+P ++ + LF S+ +++K Y S E R+ F+++L I E N+ S
Sbjct: 35 LALPNRL--------VNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRKNGS- 85
Query: 78 ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
G+ +F+D++ EEFK HL K + T
Sbjct: 86 --YWLGLNQFADITHEEFKANHL-----------GLKQGLSRMGAQTRTPTTFRYAAAAN 132
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P DWR G + V+NQ CG+CWAFS+V E ++ + G L LS QE++DC
Sbjct: 133 LPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTML 192
Query: 198 NMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
+ GC GG L+D+ + + E +YP L+++ CK K N V I Y D
Sbjct: 193 DHGCEGG----LMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITGYE-DV 247
Query: 255 LIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
SE S+L +A H PV + A + +Q+Y GGV +C L +HA+ VGY
Sbjct: 248 PENSEISLLKALA-HQPVSVGIAAGSRDFQFYKGGVFDGSCSDEL---DHALTAVGY 300
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 134/295 (45%), Gaps = 36/295 (12%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR----YGITEFSDLSE 92
F +F + K+Y+ K + K L+I E N R SAR YG T F+DL+E
Sbjct: 8 FDAFVLEHGKTYASDA-----KEYAKRLEIFAE-NMARAKEMSARDGAEYGATPFADLTE 61
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIG 151
+EF + L + K H+ S +PT IP+ DWR G +
Sbjct: 62 DEFASSLLMREPIDAARVERLKRHE---------SSRVLPHLPTENIPLNFDWRALGAVT 112
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------AGNG-NMGCSG 203
V+NQ CG+CW+FS E H +K+G L LS Q+++DC +G + GC G
Sbjct: 113 PVKNQGMCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDG 172
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLL--LKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
G + ++ V + L+ E+ YP L D CK K P I +Y+ + ES
Sbjct: 173 GLPANAMAYV-VKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYS--FVSADESQ 229
Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVGYDNYSRT 315
I + HGP+ ++A Q Y GV + CD + ++H V IVG+ R
Sbjct: 230 IAAALVKHGPLSVGIDARWMQLYRRGVACPWACDKT--RLDHGVLIVGFGAEGRA 282
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 146/312 (46%), Gaps = 36/312 (11%)
Query: 7 VLFIVALIA-LCFLAIPVKVSKPNL-----EQKLELFSSFQQRYKKSY-SKSEHDIRFKN 59
L IV +A + AI + NL Q F+++ +Y KSY +K E R++
Sbjct: 7 TLAIVGTVATVGLFAISEAPASTNLFAIEVTQDNVAFANYLAKYGKSYGTKEEFQFRYEQ 66
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
++K++ + + N Q+ + R GI +F+D + EE+K VL+ +
Sbjct: 67 YQKNMAKVAQYNG--QNGNTFRLGINKFTDYTPEEYK-----------VLLGYKPQS--- 110
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
K ++ P DWRE G + V++Q CG+CWAFS E + + N
Sbjct: 111 ----KPMTLEASYLSEENTPASIDWREKGAVTPVKDQGQCGSCWAFSATGALEGHYQISN 166
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L +S Q+++DC+ +GN GC+GG+ D+ NK +E ES+Y KD C +A
Sbjct: 167 NKLISISEQQLVDCSHDGNNGCNGGEMYLAFDYASKNK--MELESDYVYHAKDEKCSYEA 224
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
+ K+++ + + + L +GPV A+ A +Q Y GG++ N
Sbjct: 225 SKG---KMEADHFQRVPKNSPAQLKAALANGPVSVAIEADNEVFQAYDGGIL--NSKECG 279
Query: 298 ANINHAVQIVGY 309
N++H V VG+
Sbjct: 280 TNLDHGVLAVGF 291
>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
Length = 358
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 139/289 (48%), Gaps = 43/289 (14%)
Query: 35 ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E F +FQ +Y KSY + E + R K F +L ++L + Q A++G+T FSDL+EE
Sbjct: 42 ERFKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQG--LAQFGVTRFSDLTEE 99
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK---DWREAGII 150
EF+ + N++ R T G P +K DWR+A ++
Sbjct: 100 EFR----------------RLYQPSQPNYLGLRVKTEGGGYPRLQRLKTRSCDWRKARVL 143
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
VR+Q+ C +CWA S V E++ A+ L LSVQE++DC G GC GG F
Sbjct: 144 TPVRDQKNCNSCWAISAVGNVEALWAINYQQLFKLSVQELLDCRRCGQ-GCEGG-FVWDA 201
Query: 211 DWMDVNKVVLEPESEYPLLLK-DAACKRKATSPNGVKIKSYTCDTLI-------PSESSI 262
+N+ L E +YP + C++K K +++ D L+ PS +
Sbjct: 202 YMTILNQSGLAEEQDYPYRPQLSKGCQKK-------KKRAWIHDFLMLHKEENSPSPPDM 254
Query: 263 LTDIATHGPVIAAVNALTWQYYLGGVIQ--YNCDGSLANINHAVQIVGY 309
+A GP+ +N+ + Y+ GVI+ NCD ++H VQ+VG+
Sbjct: 255 AQYLAEKGPITVTINSRLLKSYIRGVIKPGNNCDPKY--VDHVVQLVGF 301
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 149/312 (47%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDE--HNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A ++ Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 137/281 (48%), Gaps = 36/281 (12%)
Query: 40 FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
F++++ K Y S EH RF F+ +L + + + P SAR+G+T+FSDL+ EF+ +
Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRK 59
Query: 99 HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQ 157
HL V D + + +PT +P + DWR+ G + V+NQ
Sbjct: 60 HL------GVKGGFKLPKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQG 103
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCAL 209
+CG+CW+FST E H L G L LS Q+++DC G+ + GC+GG +
Sbjct: 104 SCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSA 163
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
++ + L E +YP D + S + +++ ++ +E I ++ +
Sbjct: 164 FEYT-LKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKN 220
Query: 270 GPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
GP+ A+NA Q Y+GGV Y C L NH V +VGY
Sbjct: 221 GPLAVAINAAYMQTYIGGVSCPYICSRRL---NHGVLLVGY 258
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 129/279 (46%), Gaps = 32/279 (11%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ + + +Y + Y S+ E + RF ++ ++ I+ N S A F+DL+ E
Sbjct: 17 DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAE---NNFADLTNE 73
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK +L + +S + N V +P DWR+ G + +
Sbjct: 74 EFKATYLGYKT-----VSIPDTCFRYGNMVN-------------LPTNVDWRQEGAVTPI 115
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDW 212
+NQ CG+CWAFS V E ++ +K G L LS QE++DC +GN GC+GG ++
Sbjct: 116 KNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF 175
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ + L E EYP ++AC + V I Y + E S+ +A PV
Sbjct: 176 --IKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYE-KVPVNDEKSLKAAVANQ-PV 231
Query: 273 IAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y GG+ NC L NH V IVGY
Sbjct: 232 SVAIDAEGNNFQFYSGGIFSGNCGNQL---NHGVAIVGY 267
>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 315
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 150/318 (47%), Gaps = 46/318 (14%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKP----NLEQKLE-LFSSFQQRYKKSYSKSEHD-IRFK 58
KN+LF +A +AL A + ++K +Q ++ L+S+F+ +Y K Y+ + + R +
Sbjct: 3 KNILFAIAGLALLATATTILLTKTHHNTQEDQNIQALWSAFKTKYNKKYADPDFERYRIE 62
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F ++L ++E KN YGIT+F D++ EEFK +L + + S +
Sbjct: 63 IFTENLKVVESNTKN--------YGITQFMDITREEFKQTYLTLKMKNGLKASPFAKFND 114
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
G+ I DW G + V++Q CG+CW+FST E L
Sbjct: 115 -----------AGVEI--------DWTTKGAVTPVKDQGQCGSCWSFSTTGAVEGALFLS 155
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
L+ LS Q ++DC+ +GN GC+GG D+ +++ + E+ YP D CK
Sbjct: 156 TKKLTSLSEQYLVDCSKDGNEGCNGGLMDTAFDF--ISQHGIPTEAAYPYKAVDGTCKM- 212
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
+ P KI S+ T I + +L I P+ AV+A +QYY + +C L
Sbjct: 213 TSGP--YKISSH---TDIQDCNDLLNKIQKQ-PIAIAVDANNFQYYQKDIFS-DCGTEL- 264
Query: 299 NINHAVQIVGYDNYSRTW 316
+H V +VGY + W
Sbjct: 265 --DHGVLLVGYSASGKYW 280
>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 141/306 (46%), Gaps = 39/306 (12%)
Query: 14 IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNK 72
I A+ + V+ N E F+S+ +Y K+Y+ E R + F +L I+E N
Sbjct: 4 IFFVLFAVALSVNLRNSE-----FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNA 58
Query: 73 NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
+ P + G+ +F+D+S EEF + + + K T
Sbjct: 59 -KNLPWT--LGVNKFADVSAEEFAYKFCGCAKDP------------------KTRGTRQT 97
Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
T+ +P + DWRE G + V+NQ CG+CWAFST T E + LK G L LS Q+++D
Sbjct: 98 TLVGDVPARVDWREQGAVTPVKNQGMCGSCWAFSTTGTTEGAYFLKTGNLVSLSEQQLVD 157
Query: 193 CAGNG---NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
CA + N GCSGG + +D+ V K L E +YP DA CK + V ++S
Sbjct: 158 CARDPEYENFGCSGGWPWSAVDY--VTKHGLCTEEDYPYKGVDAECKESSCK---VAVQS 212
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
L + L + PV ++A Q Y G+I C S INHAV VGY
Sbjct: 213 VDKVQLPVGDEDSLAVAVSKTPVSIVLDATAMQLYDKGIIT-RCSES---INHAVLAVGY 268
Query: 310 DNYSRT 315
D + T
Sbjct: 269 DKDAET 274
>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
Length = 387
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 147/306 (48%), Gaps = 27/306 (8%)
Query: 11 VALIALCFLAIPVKVSKPNL--EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
V L+A + V + L Q + F S+ Q + Y+ E + R+ F+K+L+ +
Sbjct: 6 VYLLACTVFMLAVLSANATLTERQYQDSFVSWMQTHNVKYTTQEFNHRYGVFKKNLNFVN 65
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH--KHHDHHHNHVKKR 126
+ N S G+ F+DL+ E++ +L ++ +M+ + + D +N VK
Sbjct: 66 QWNA---KGSSTVLGMNVFADLTNAEYQRIYLGSKIDTSSMMNANAARLFDRTYN-VKAL 121
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
S T DWR+ G + ++NQQ CG+CW+FST + E H + G L LS
Sbjct: 122 SPTV------------DWRQKGAVTHIKNQQQCGSCWSFSTTGSIEGAHEIATGNLVSLS 169
Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q +IDC+ GN GC+GG +++ N + + E+ YP R + +G
Sbjct: 170 EQNLIDCSTAEGNQGCNGGLMTNAFEYVIKNGGI-DTEASYPYSATGPNKCRYNPANSGA 228
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
I SY + + SE++++ A GPV A++A ++Q Y G I Y S ++H
Sbjct: 229 TISSYV-NVTVGSETALMA-AANIGPVSVAIDASHNSFQLYDSG-IYYESKCSTTQLDHG 285
Query: 304 VQIVGY 309
V +VGY
Sbjct: 286 VLVVGY 291
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 152/307 (49%), Gaps = 41/307 (13%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
VA + LC LA+ + P+ + F+ +Y + Y ++ ++ R + F+++ +IE+
Sbjct: 2 VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 52
Query: 70 LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK ++ E + + + +F D++ EEF + +M +K R
Sbjct: 53 FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKK--------GSRGE 93
Query: 129 TTGITIPTGIPVKKD--WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+ G P+ +D WR ++ V++Q+ CG+CWAFS E H LKN L LS
Sbjct: 94 PKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLS 153
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+ + GN GC GG + D++ N + + ES YP +D +C+ A S +
Sbjct: 154 EQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAI 212
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINH 302
S + + +E ++ ++ GP+ A++A ++Q+Y GV + NC + ++H
Sbjct: 213 CTGS--VEIVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDH 268
Query: 303 AVQIVGY 309
V VGY
Sbjct: 269 GVLAVGY 275
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 138/294 (46%), Gaps = 35/294 (11%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
++ P Q + ++ Y++ Y +E + R +EK++ +IE N ++G
Sbjct: 16 LATPKFNQTFNAQWHKWKSTYRRLYGTNEEEWRRAVWEKNMKMIELHNGEYSE---GKHG 72
Query: 84 IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
T F D++ EEF+ L++ +KH H V + + + +P +
Sbjct: 73 YTMEMNAFGDMTNEEFRQ-----------LVNGYKHQKHRKGKVFQEPLM--LQLPKSV- 118
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN
Sbjct: 119 ---DWREKGCVTPVKNQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGN 175
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GC+GG ++ +N L+ E YP KD CK K + T IP
Sbjct: 176 QGCNGGLMDFAFQYV-LNNKGLDSEESYPYEAKDGTCKYKPE----FAAANDTGYVDIPQ 230
Query: 259 -ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +AT GP+ A++A ++Q+Y G I Y + S ++H V +VGY
Sbjct: 231 LEKALMKAVATVGPIAIAIDASHPSFQFYSSG-IYYEPNCSSKELDHGVLVVGY 283
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 129/279 (46%), Gaps = 32/279 (11%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ + + +Y + Y S+ E + RF ++ ++ I+ N S A F+DL+ E
Sbjct: 17 DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAE---NNFADLTNE 73
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK +L + +S + N V +P DWR+ G + +
Sbjct: 74 EFKATYLGYKT-----VSIPDTCFRYGNMVN-------------LPTNVDWRQEGAVTPI 115
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDW 212
+NQ CG+CWAFS V E ++ +K G L LS QE++DC +GN GC+GG ++
Sbjct: 116 KNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF 175
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ + L E EYP ++AC + V I Y + E S+ +A PV
Sbjct: 176 --IKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYE-KVPVNDEKSLKAAVANQ-PV 231
Query: 273 IAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y GG+ NC L NH V IVGY
Sbjct: 232 SVAIDAEGNNFQFYSGGIFSGNCGNQL---NHGVAIVGY 267
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 137/300 (45%), Gaps = 34/300 (11%)
Query: 24 KVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
K+ KP F F R++K Y +K E RF+ F+++ +I EL KN Q +A Y
Sbjct: 163 KIIKPRDYVVWNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQG--TAVY 220
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVK 141
G T+FSD++ EFK L + + V M G+TI +P
Sbjct: 221 GFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKE------------GVTISEEDLPDS 268
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
DWRE G + +V+NQ +CG+CWAFST E L L LS QE++DC + + GC
Sbjct: 269 FDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD-SVDQGC 327
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIP 257
+GG + + LEPE YP + C K A NG L
Sbjct: 328 NGGLPSNAYKEI-IRMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELPH 379
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
E + + T GP+ +NA T Q+Y GV+ + C+ + +NH V IVGY R
Sbjct: 380 DEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 437
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/285 (31%), Positives = 127/285 (44%), Gaps = 37/285 (12%)
Query: 37 FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F ++Y K YS E + R F K +++ P A +G+T FSDLSEEEF
Sbjct: 7 FRMFMEKYGKEYSSREEYVHRLGIFAK--NMVRAAEHQALDP-XALHGVTPFSDLSEEEF 63
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ R V + H+K T + G+P DWRE G + +V+
Sbjct: 64 E-RMFTGVVGR--------------PHMKGGVAETAAALEVDGLPESFDWREKGAVTEVK 108
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
Q TCG+CWAFST E H + L LS Q+++DC + GC GG
Sbjct: 109 MQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLM 168
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + LE ES YP K CK K P+ V ++ + E+ I ++
Sbjct: 169 TNAYKYL-IEAGGLEEESSYPYTGKHGECKFK---PDRVAVRVVNFTEVPIBENQIAANL 224
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN--INHAVQIVGY 309
HGP+ +NA Q Y+GGV +C INH V +VGY
Sbjct: 225 VCHGPLAVGLNAXFMQTYIGGV---SCPLICPKRWINHGVLLVGY 266
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F+K++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKKNMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y G DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAEGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 132/279 (47%), Gaps = 21/279 (7%)
Query: 34 LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
L+ F + R+ ++Y+ S E RF+ + ++++++E N + A +F+DL+
Sbjct: 29 LDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLAD---NKFADLTN 85
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF+ + L HV + N G + +P DWR+ G + +
Sbjct: 86 EEFRAKML--GFRPHVTIPQIS------NTCSADIAMPGESSDDILPKSVDWRKKGAVVE 137
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFS V E ++ +KNG L LS QE++DC +GC GG ++
Sbjct: 138 VKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEF 196
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E+ YP + AC+ + + V I Y + PS L A PV
Sbjct: 197 V-VGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYR--NVTPSSEPDLARAAAAQPV 253
Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
AV+ + +Q Y GV C A++NH V +VGY
Sbjct: 254 SVAVDGGSFMFQLYGSGVYTGPC---TADVNHGVTVVGY 289
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ K+Y ++ EHD RF F+ +L K++ +A +GIT+FSDL+ +EF
Sbjct: 51 FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGITKFSDLTPKEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + +K I T +P DWR+ G + +V++
Sbjct: 108 RRQFLGLKRWLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
Q +CG+CW+FS E H L G L+ LS Q+++DC G + GC GG
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE E +YP D + S + +++ ++ E I ++
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ A+NA Q Y+GGV Y C +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 40/323 (12%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKN 59
LF+ + CF + +S+P L+ +L + Q+R+ + +K E + R+
Sbjct: 10 LFVAIFSSFCF---SITLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEENNRYVV 61
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR-HSVNKHVLMSHHKHHDH 118
F+ +++ IE LN + + + + + +F+DL+ +EF++ + V+ S K
Sbjct: 62 FKNNVERIEHLN-SIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ +V ++ PV DWR+ G + ++NQ +CG CWAFS V E +K
Sbjct: 121 RYQNVSSGAL----------PVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIK 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS Q+++DC N + GC GG + + L ES YP +DA C K
Sbjct: 171 KGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATG-GLTTESNYPYKGEDATCNSK 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
T+P I Y D + E +++ +A H PV + +Q+Y GV C
Sbjct: 229 KTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTY 286
Query: 297 LANINHAVQIVGYD---NYSRTW 316
L +HAV +GY N S+ W
Sbjct: 287 L---DHAVTAIGYGESTNGSKYW 306
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 125/262 (47%), Gaps = 28/262 (10%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVL 109
E + RF+ F + + IEE NRQ ++ G+ F+D++ +EFK + V + +
Sbjct: 49 GEKERRFQIFRDNAEYIEE--HNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPLSNTI 106
Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
S ++ D T +P+ DWR G + V+NQ CG+CWAFSTV
Sbjct: 107 KSGFRYED-----------------ATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149
Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E ++ + G L LS QE++DC N GC+GG + +++ + L+ E++YP
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFI-IQNGGLDSEADYPYK 208
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
+C + + V I + D SE+ +L +A PV A+ A +Q Y GG
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFE-DVPAESEADLLKAVANQ-PVSVAIEASGRNFQLYSGG 266
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
V +C L +H V VGY
Sbjct: 267 VYTGHCGYEL---DHGVVAVGY 285
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 139/309 (44%), Gaps = 36/309 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L+ + FLA V E + RY K Y E + RF+ F+++++ IE
Sbjct: 559 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 618
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N + + + I +F+DL+ EEF R+ H+ S + + +V
Sbjct: 619 AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 669
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +P DWR+ G + +++Q CG CWAFS V E +HAL +G L LS Q
Sbjct: 670 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 722
Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
E++DC G + GC GG L+D D K V L E+ YP D C +
Sbjct: 723 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNANEAAN 776
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
+ V I Y D +E ++ +A PV A++A +Q+Y GV +C L
Sbjct: 777 DVVTITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 831
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 832 DHGVTAVGY 840
>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
Length = 343
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 148/315 (46%), Gaps = 35/315 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
++F+ ++A + + V+ NL Q E + +F+ Y KSY+ E + NF + +
Sbjct: 5 LVFVATVVAFAKSQLSIGVTLENLLQ--EEWMAFKLTYNKSYASPEEE----NFRREI-F 57
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
IE N+++ + + YG ++S + + +N M HH+ H + +
Sbjct: 58 IE--NRHKIARFNQEYGRGQWSFVQQ-----------LNNFADMLHHEFHRTLNGFNRTL 104
Query: 127 SITTGIT-----IPTG---IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
S GI IP+ P DWRE G + V+NQ +C CWAFS E + K
Sbjct: 105 SARVGIPQSSTFIPSANVIFPDYVDWREVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRK 164
Query: 179 NGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
G L LS Q +IDC+ N GN GCSGG +++ N + + E YP ++ C+
Sbjct: 165 TGRLVELSPQNLIDCSTNYGNDGCSGGLMNPAYEYVRTNPGI-DTEDSYPYEARNGPCRF 223
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCD 294
+ + G Y D E + IAT GPV AA++A ++Q+Y G+ C
Sbjct: 224 RPETV-GAYCTGYV-DIAEGDEQGLEAAIATLGPVSAAMDAGRQSFQFYSDGIYYDPQCG 281
Query: 295 GSLANINHAVQIVGY 309
++NHAV +VGY
Sbjct: 282 NRPDDVNHAVLVVGY 296
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 128/276 (46%), Gaps = 27/276 (9%)
Query: 40 FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKT 97
F+ + K+Y S E RF F+K+L I+E NK + ES +T+F+D++ EEF
Sbjct: 26 FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85
Query: 98 RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
V L S+ H D+ + I + V DWRE G + V++Q
Sbjct: 86 LLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTPVKDQA 131
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALLDWMDV 215
CG+CWAFS V E KNGTL LS QE++DCA GN GC GG D+ V
Sbjct: 132 NCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDF--V 189
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
++ E YP + ++CK+ VK + D E + +A GPV A
Sbjct: 190 QDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKGPVAVA 244
Query: 276 VNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
+ A +Y G++ C S ++N V +VGY
Sbjct: 245 IEASQLSFYDKGIVDERCRCSNKREDLNPGVLVVGY 280
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 132/288 (45%), Gaps = 38/288 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ ++ K Y S+ EHD RFK F+ +L +++ SA +GIT+FSDL+ EF
Sbjct: 49 FSLFKSKFGKIYASEEEHDHRFKVFKANL---RRARRHQLLDPSAEHGITKFSDLTPSEF 105
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L H K + +PT +P DWR+ G + V+
Sbjct: 106 RRTYLGL-----------------HKPKPKLNAEKAPILPTSDLPADYDWRDHGAVTGVK 148
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC + + GCSGG
Sbjct: 149 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLM 208
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L+ E +YP K C S + +++ L E I ++
Sbjct: 209 TTAFEYT-LKAGGLQREKDYPYTGKXGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANL 264
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
HGP+ +NA Q Y+GGV C +H V +VGY ++
Sbjct: 265 VKHGPLAVGINAAWMQTYVGGVSCPLIC---FKRQDHGVLLVGYGSHG 309
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 147/310 (47%), Gaps = 33/310 (10%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
++ALI CFL I + QK F ++ +++KSY+ E R+ F+ ++DI
Sbjct: 3 LVLALI-FCFLIINCCSAARIFSQKQYQTAFQNWMVKHQKSYTNDEFGSRYSVFQDNMDI 61
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ + N Q + G+ +DL+ EEFK +L N K+
Sbjct: 62 VAKWN---QKGSNTILGLNVMADLTNEEFKKLYLGTKANV----------------TYKK 102
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
G++ G+P DWR G + V+NQ CG C+AFST + E +H + + L LS
Sbjct: 103 KTLVGVS---GLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLS 159
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+G+ GN GC GG +++ + L+ E+ YP + CK + G
Sbjct: 160 EQQILDCSGSEGNNGCDGGLMTNSFEYI-IAVGGLDTEASYPYTGEVGKCKFNKKNI-GA 217
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
I Y + SES + T +A PV A++A ++Q Y GV Y + S ++H
Sbjct: 218 TITGYK-NVESGSESDLQTAVAAQ-PVSVAIDASQSSFQLYASGVY-YEPECSSTQLDHG 274
Query: 304 VQIVGYDNYS 313
V VGY + S
Sbjct: 275 VLAVGYGSQS 284
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V+ Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 KQLDHGVLLVGYNDNS 298
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYQGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 127/262 (48%), Gaps = 22/262 (8%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
SE D RF+ F+ +L I+E N ++ + G+ F+DLS EE+++R+L ++ +M
Sbjct: 70 SEKDKRFEIFKDNLKFIDEHNAENRT---YKVGLNRFADLSNEEYRSRYLGTKIDPIGMM 126
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
K RS ++ +P DWR G + +V++Q +CG+CWAFST+
Sbjct: 127 ---------MARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAA 177
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E ++ + G L LS QE++DC N GC GG +++ +N ++ + +YP
Sbjct: 178 VEGINKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFI-INNGGIDSDEDYPYRG 236
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGG 287
D C + + V I Y +P+ + L + P+ A+ A +Q Y+ G
Sbjct: 237 VDGKCDQYKKNARVVSIDDY---EQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSG 293
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
+ C +L +H V VGY
Sbjct: 294 IFTGKCGTAL---DHGVTAVGY 312
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 135/276 (48%), Gaps = 29/276 (10%)
Query: 45 KKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS 103
++SY +E RF+ F ++ + L K Q +A+YG+T FSD+S +EFK
Sbjct: 508 QRSYKTTEELKKRFRIFRANMKKADYLQKTEQG--TAKYGVTIFSDISSKEFK------- 558
Query: 104 VNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACW 163
KH L + D +K + I T +P + DWR + V+NQ CG+CW
Sbjct: 559 --KHYLGLKKRTPD-----IKFKQEMAQIPNIT-LPEEYDWRNYNAVTPVKNQGMCGSCW 610
Query: 164 AFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
AFS E +A+K G L LS QE++DC + GC GG F ++ LE E
Sbjct: 611 AFSVTGNIEGQYAIKTGNLVSLSEQELVDCDKYDD-GCEGGLFETAYHAIE-ELGGLELE 668
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
S+YP +D C ++ V++ + + E+ + + +GP+ +NA Q+
Sbjct: 669 SDYPYSGRDNTCHFNSSE---VRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQF 725
Query: 284 YLGGV---IQYNCDGSLANINHAVQIVGYDNYSRTW 316
YLGGV +++ CD ++H V IVGY RTW
Sbjct: 726 YLGGVSHPLKFLCDPK--TLDHGVLIVGY-GIHRTW 758
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 143/286 (50%), Gaps = 36/286 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F++F+ ++ KSY +K EHD RF F+ +L ++ +++ SA +G+T+FSDL+ EF
Sbjct: 47 FTTFKSKFSKSYATKEEHDYRFGVFKSNL---KKAKLHQKLDPSAEHGVTKFSDLTASEF 103
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + K + + H +K I +PT +P DWRE G + V+
Sbjct: 104 RRQFL--GLKKRLRLPAH---------AQKAPI-----LPTNNLPEDFDWREKGAVTPVK 147
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
+Q +CG+CWAFST E + L G L LS Q+++DC + + GC+GG
Sbjct: 148 DQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLM 207
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+++ + V+ E +Y +D +CK + + + E I ++
Sbjct: 208 NNAFEYLLQSGGVVR-EQDYSYTGRDGSCKFDKSK---IAASVSNFSVVSVDEDQIAANL 263
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
+GP+ A+NA Q Y+ GV Y C + + ++H V +VG+ N
Sbjct: 264 VKNGPLAVAINAAWMQTYMSGVSCPYIC--AKSRLDHGVLLVGFGN 307
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 144/307 (46%), Gaps = 30/307 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+L ++A+I L P PNL Q E F + + KK S E +R FE++
Sbjct: 58 LLAVLAVIGLASALSP----NPNLNQHWENFKA--EHNKKYESFPEELMRRLIFEENHQF 111
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
IE+ N ++ G+ F DL+ +E++ R+L + + K
Sbjct: 112 IEDHNSKKEF--DFYLGMNHFGDLTNKEYRERYL-------------GYRRPENTPSKAS 156
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + +P + DWR+ G + V+NQ CG+CWAFS V + E H G L LS
Sbjct: 157 YIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLS 216
Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q ++DC+ GN GC+GG +++ N + + E YP + D +C K S G
Sbjct: 217 EQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGI-DTEDSYPYVGTDGSCHFKNKSI-GA 274
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG-SLANINH 302
+K + D E ++ + GPV A++A + +Q+Y GGV YN S + ++H
Sbjct: 275 TLKGFM-DVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGGV--YNVPWCSTSELDH 331
Query: 303 AVQIVGY 309
V +VGY
Sbjct: 332 GVLVVGY 338
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 132/286 (46%), Gaps = 31/286 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F R++K Y+ K E RF+ F+K+ +I EL KN Q +A YG T+FSD++ EF
Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQG--TAVYGFTKFSDMTTMEF 231
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
K L + + V + + H + + + P DWRE G + +V+N
Sbjct: 232 KKIMLPYQWEQPVYPMEQANFEKHDVTINEEDL----------PESFDWREKGAVTQVKN 281
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q CG+CWAFST E + L LS QE++DC + + GC+GG + +
Sbjct: 282 QGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDC-DSMDQGCNGGLPSNAYKEI-I 339
Query: 216 NKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
LEPE YP + C K A NG L E + + T GP
Sbjct: 340 RMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELPHDEVEMQKWLVTKGP 392
Query: 272 VIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
+ +NA T Q+Y GV+ + C+ + +NH V IVGY R
Sbjct: 393 ISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 436
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 142/304 (46%), Gaps = 30/304 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
+ I L F ++ K E + F+ R KSY E RF F+ SL IE
Sbjct: 3 VFVFILLAFASVHALSDK-------EEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIE 55
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N K + + G+T+F+DL+E+EF +++ S +
Sbjct: 56 NHNDKYDHGLSTFKLGVTKFADLTEKEFSDML---GISRSTKSSRPR------------- 99
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+ +T +P K DWRE G + +V++Q +CG+CW+FST T E + LK G L LS
Sbjct: 100 VIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSE 159
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
Q ++DCA GCSGG L++++ ++ E++YP D C R +S KI
Sbjct: 160 QNLVDCAKEDCYGCSGGYMDKALEYIETAGGIM-SENDYPYEGIDDKC-RFDSSKVAAKI 217
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQ 305
++T E + + GP+ A++A +Q Y G++ +C ++NH V
Sbjct: 218 SNFTY-IKKNDEDDLKNAVIAKGPISVAIDASFNFQLYDSGILDDSSCYSDFNSLNHGVL 276
Query: 306 IVGY 309
+VGY
Sbjct: 277 VVGY 280
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 129/277 (46%), Gaps = 27/277 (9%)
Query: 46 KSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
+SY+ E + RF+ F +L ++ N R G+ F+DL+ +EF++ L V
Sbjct: 58 RSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRSTFLGAKV 117
Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
V S + H+ V++ +P DWRE G + V+NQ CG+CWA
Sbjct: 118 ---VERSRAAGERYRHDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWA 162
Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPE 223
FS V T ES++ L G + LS QE+++C+ NG N GC+GG D++ + ++ E
Sbjct: 163 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTE 221
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTW 281
+YP D C + V I + D E S+ +A H PV A+ A +
Sbjct: 222 DDYPYKAVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREF 279
Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
Q Y GV C SL +H V VGY DN W
Sbjct: 280 QLYHSGVFSGRCGTSL---DHGVVAVGYGTDNGKDYW 313
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 37/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS+F+ ++ K+Y ++ EHD RF+ F+ +L + + + P SA +G+T FSDL+ EF
Sbjct: 51 FSAFKTKFAKTYATQEEHDHRFRIFKNNL--LRAKSHQKLDP-SAVHGVTRFSDLTPSEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + L S +K I +PT +P DWR+ G + V+
Sbjct: 108 RGQFL--GLKPLRLPS----------DAQKAPI-----LPTSDLPTDFDWRDHGAVTGVK 150
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS V E H L G L LS Q+++DC G + GC+GG
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L E +YP +D + S + +++ +L E I ++
Sbjct: 211 TTAFEYT-LKAGGLMREEDYPYTGRDRGPCKFDKSKIAASVANFSVVSL--DEEQIAANL 267
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ +NA+ Q Y+GGV Y C +++H V +VGY
Sbjct: 268 VKNGPLAVGINAVFMQTYIGGVSCPYICG---KHLDHGVLLVGY 308
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 128/273 (46%), Gaps = 30/273 (10%)
Query: 40 FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH 99
+ + YK + KS+ R+K F+ ++ IE NK +S + I EF+DL+ EEF R
Sbjct: 46 YGREYKDADEKSK---RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RA 98
Query: 100 LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTC 159
R+ H+ + + + T +P DWR+ G + +++Q C
Sbjct: 99 SRNRFKAHICSTEATSFKYEN--------------VTAVPSTVDWRKKGAVTPIKDQGQC 144
Query: 160 GACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKV 218
G+CWAFS V E + L G L LS QE++DC +G + GCSGG +++ N
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH- 203
Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
L E+ YP D C RK + KI Y D +E ++ +A H P+ A++A
Sbjct: 204 GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDA 261
Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q+Y GV C L +H V VGY
Sbjct: 262 SGSEFQFYSSGVFTGQCGTEL---DHGVAAVGY 291
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/313 (28%), Positives = 150/313 (47%), Gaps = 33/313 (10%)
Query: 8 LFIVALIALCFLAIPVKVSK----PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEK 62
FIV + C L++ + ++ + E+ +LF ++Q+ +K+ Y E RF+ F+
Sbjct: 12 FFIVLVSFTCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQS 71
Query: 63 SLDIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+L I E+N R+SP + R G+ +F+D+S EEF +L K + M + N
Sbjct: 72 NLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYL-----KEIEMPYS-------N 119
Query: 122 HVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
++ + G +P DWR+ G + +VR+Q C + WAFS E ++ + G
Sbjct: 120 LESRKKLQKGDDADCDNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTG 179
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LSVQ+V+DC + GC+GG + ++ N + + E+ YP ++ CK A
Sbjct: 180 NLVSLSVQQVVDC-DPASHGCAGGFYFNAFGYVIENGGI-DTEAHYPYTAQNGTCKANAN 237
Query: 241 SPNGVKIKSYTCDTL---IPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGS 296
K + D L + E ++L ++ PV +++A Q+Y GGV NC +
Sbjct: 238 -------KVVSIDNLLVVVGPEEALLCRVSKQ-PVSVSIDATGLQFYAGGVYGGENCSKN 289
Query: 297 LANINHAVQIVGY 309
IVGY
Sbjct: 290 STKATLVCLIVGY 302
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 125/262 (47%), Gaps = 28/262 (10%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVL 109
E + RF+ F + + IEE NRQ ++ G+ F+D++ +EFK + V + +
Sbjct: 49 GEKERRFQIFRDNAEYIEE--HNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPLSNTI 106
Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
S ++ D T +P+ DWR G + V+NQ CG+CWAFSTV
Sbjct: 107 KSGFRYKD-----------------ATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149
Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E ++ + G L LS QE++DC N GC+GG + +++ + L+ E++YP
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFI-IQNGGLDSEADYPYK 208
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
+C + + V I + D SE+ +L +A PV A+ A +Q Y GG
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFE-DVPAESEADLLKAVANQ-PVSVAIEASGRNFQLYSGG 266
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
V +C L +H V VGY
Sbjct: 267 VYTGHCGYEL---DHGVVAVGY 285
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 31/308 (10%)
Query: 7 VLFIVALIALCFL-AIPVKVSK--PNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEK 62
V + + LC + A P S+ PN + ++ F + Y + Y + +R F+ F+
Sbjct: 5 VQLVFLFLFLCAMWASPSAASRDEPN-DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKN 63
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
++ IE N ++ S GI +F+D+++ EF ++ S+ ++ D
Sbjct: 64 NVKHIETFNSRNEN--SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD---- 117
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
+ I + +P DWR+ G + +V+NQ CG+CW+F+ + T E ++ +K G L
Sbjct: 118 ---------VNI-SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYL 167
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS QEV+DCA + GC GG D++ N V E YP L C + P
Sbjct: 168 VSLSEQEVLDCA--VSYGCKGGWVNKAYDFIISNNGVTT-EENYPYLAYQGTCNANSF-P 223
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANIN 301
N I Y+ E S++ ++ P+ A ++A +QYY GGV C SL N
Sbjct: 224 NSAYITGYSY-VRRNDERSMMYAVSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---N 278
Query: 302 HAVQIVGY 309
HA+ I+GY
Sbjct: 279 HAITIIGY 286
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 137/301 (45%), Gaps = 34/301 (11%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
K+ KP F F R++K Y +K E RF+ F+++ +I EL KN Q +A
Sbjct: 162 AKIIKPRDYVIWNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQG--TAV 219
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTGIPV 140
YG T+FSD++ EFK L + + V M G+TI +P
Sbjct: 220 YGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKE------------GVTISEEDLPD 267
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWRE G + +V+NQ +CG+CWAFST E L L LS QE++DC + + G
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD-SVDQG 326
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLI 256
C+GG + + LEPE YP + C K A NG L
Sbjct: 327 CNGGLPSNAYKEI-IRMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELP 378
Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYS 313
E + + T GP+ +NA T Q+Y GV+ + C+ + +NH V IVGY
Sbjct: 379 HDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDG 436
Query: 314 R 314
R
Sbjct: 437 R 437
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F++F+ ++ K+Y ++ EHD RF F+ +L K++ +A +G+T+FSDL+ +EF
Sbjct: 51 FTTFKSKFGKNYATQEEHDYRFSVFKANL---LRAKKHQIMDPTAAHGVTKFSDLTPKEF 107
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L + +K +PTG +P DWR+ G + V+
Sbjct: 108 RRQLLGLKRRLRLPTDANK----------------APILPTGDLPTDFDWRDHGAVTSVK 151
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
+Q +CG+CW+FS E H L G L LS Q+++DC G + GCSGG
Sbjct: 152 DQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLM 211
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + LE E +YP D + S + +++ +L E I ++
Sbjct: 212 NNAFEYA-LKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSL--DEDQIAANL 268
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ A+NA+ Q Y+GGV Y C + +H V +VGY
Sbjct: 269 VKHGPLSVAINAVFMQTYIGGVSCPYICS---KHQDHGVLLVGY 309
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 128/273 (46%), Gaps = 30/273 (10%)
Query: 40 FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH 99
+ + YK + KS+ R+K F+ ++ IE NK +S + I EF+DL+ EEF R
Sbjct: 46 YGREYKDADEKSK---RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RA 98
Query: 100 LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTC 159
R+ H+ + + + T +P DWR+ G + +++Q C
Sbjct: 99 SRNRFKAHICSTEATSFKYEN--------------VTAVPSTVDWRKKGAVTPIKDQGQC 144
Query: 160 GACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKV 218
G+CWAFS V E + L G L LS QE++DC +G + GCSGG +++ N
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH- 203
Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
L E+ YP D C RK + KI Y D +E ++ +A H P+ A++A
Sbjct: 204 GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDA 261
Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q+Y GV C L +H V VGY
Sbjct: 262 SGSEFQFYSSGVFTGQCGTEL---DHGVAAVGY 291
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 129/278 (46%), Gaps = 27/278 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL ++ N+ S R GI F+D+S EEF
Sbjct: 59 FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ +P KDWRE GI+ V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q++IDC N GC+GG +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + CK K + GVK+ + + + +E + + PV
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A +T ++ Y GV + C + ++NHAV VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 142/298 (47%), Gaps = 28/298 (9%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
L + VK S L E + F+ + K Y E +I RF F +L+ IEE N+
Sbjct: 37 LKLQVKAS-TRLGPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMG 95
Query: 78 ESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
+ + Y G+ +FSD+S +E+ LRH+ + + + K +
Sbjct: 96 QKSYYMGVNQFSDMSHDEY----LRHNGLRR----------GNRKYSKGEGCDSYTKSGK 141
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
+ K DWR+ G + V+NQ CG+CW+FST + E H + G L LS Q+++DC+G
Sbjct: 142 QLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGT 201
Query: 197 -GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
GN GC+GG +++ + LE E +YP K C K + K C +
Sbjct: 202 FGNEGCNGGLMDNAFEYIK-SIGGLEGEDDYPYTAKQGKCHLKKSL---FKANDTGCTDV 257
Query: 256 IPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
+ L D +A+ GP+ A++A ++Q Y GGV + C S N++H V VGY
Sbjct: 258 ESGDEDALKDALASVGPISVAIDASHASFQSYDGGVYDEEEC--SSQNLDHGVLTVGY 313
>gi|357621272|gb|EHJ73161.1| putative C1A cysteine protease precursor [Danaus plexippus]
Length = 545
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 133/276 (48%), Gaps = 19/276 (6%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
+F+ F Q++ K+Y EH+ R K FE +L IEE N+ S ++ + I +F+DL+ +E
Sbjct: 242 VFAEFMQKHNKNYDGPEHEQRRKIFETNLRKIEEHNR---SNKNFKLAINKFADLTHKEM 298
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ R + + S + + + + S T +P + D R G++ V++
Sbjct: 299 EKRK---GLKRRGKSSGAIPFPYSKSKIAEMSDT--------LPKEYDARMYGLVTSVKD 347
Query: 156 QQTCGACWAFSTVETAE-SMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
QQ CG+CW F T E ++ + G L L+ Q +IDCA G N GC GG WM
Sbjct: 348 QQDCGSCWTFGTTSAVEGALARINGGRLMRLANQALIDCAWGYENFGCDGGTDTGAYHWM 407
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+N + E P + KD C R KIK +T T E ++ + HGP+
Sbjct: 408 -LNYGMPTEEEYGPYVNKDGFC-RIHNMTQTYKIKGFTNVTPYSVE-ALKVALVNHGPLS 464
Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++A Y G I + D S N+NH V +VGY
Sbjct: 465 VSIDATDMLTYYNGGIYSDSDCSTTNLNHEVTLVGY 500
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 148/325 (45%), Gaps = 34/325 (10%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
F+ KN I+ L + L + +S LE+ + + YK + +E + RF+ F+
Sbjct: 4 FNQKNQYNILTLFFILTLWTSLVISSRLLEKHEQWMEEHGKFYKDA---AEKEQRFQIFK 60
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++L+ IE N I +F D + +EFK +L K L+ +
Sbjct: 61 ENLEFIESFNA--AGDNGFNLSINQFGDQTNDEFKANYLNGK--KKPLIGVGIAAIEEES 116
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ ++T +P DWRE G + +++Q CG+CWAF+TV E +H + G
Sbjct: 117 VFRYENVTE-------VPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGR 169
Query: 182 LSLLSVQEVIDCA-GNGNMGCSGG---DFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
L LS QE++DC N GC+GG D C + V K + E+ YP D C
Sbjct: 170 LVSLSEQELVDCVKTNTTDGCNGGYVEDACDFI----VKKGGITSETNYPYTRVDGKCNV 225
Query: 238 KATSPNGVKIKSYTCDTLIPS--ESSILTDIATHG-PVIAAVNALTWQYYLGGVIQYNCD 294
+ + N KIK Y +P+ E ++L +A V A +Q+Y G+++ C
Sbjct: 226 RKGTYNVAKIKGY---EHVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCG 282
Query: 295 GSLANINHAVQIVGY---DNYSRTW 316
+++H V IVGY D+ + W
Sbjct: 283 ---IDLDHTVTIVGYGTSDDGVKYW 304
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 136/280 (48%), Gaps = 28/280 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ ++LF+S+ + K Y + + RF+ F+ +L+ I+E NK S R G+ EF+D
Sbjct: 42 ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YRLGLNEFAD 98
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS +EF +++ ++ + S+ + I I +P DWR+ G
Sbjct: 99 LSNDEFNEKYVGSLIDATIEQSYDEEF-----------INEDIV---NLPENVDWRKKGA 144
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ VR+Q +CG+CWAFS V T E ++ ++ G L LS QE++DC + GC GG
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
L+++ N + L S+YP K C+ K G +K+ + P+ L +
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
PV V + +Q Y GG+ + C ++HAV V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 139/294 (47%), Gaps = 21/294 (7%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V S+ +L LF S+ ++ K Y S +E R++ F+++L I E N+ S
Sbjct: 30 VGYSQEDLALPSSLFRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRKNGS---YW 86
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
G+ +F+D++ EEFK +L K L + + G +P
Sbjct: 87 LGLNQFADVAHEEFKASYLGL---KRALPRAGAPQTRTPTAFRYAAAAAG-----SLPWS 138
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
DWR G + V+NQ CG+CWAFS+V E ++ + G L LS QE++DC + GC
Sbjct: 139 VDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELVDCDTTLDHGC 198
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SE 259
GG +M + + E +YP L+++ CK K G+ + T +P SE
Sbjct: 199 EGGTMDLAFAYM-MGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITEQDLTGFEDVPENSE 257
Query: 260 SSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
S+L +A H PV + A + +Q+Y GGV C ++HA+ VGY +
Sbjct: 258 ISLLKALA-HQPVSVGIAAGSRDFQFYRGGVFDGACS---VELDHALTAVGYGS 307
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 128/263 (48%), Gaps = 29/263 (11%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
E + RF+ F+ +L I+E N +S + G+ F+DL+ EE+++ +L
Sbjct: 70 EKERRFQVFKDNLRFIDEHNSENRS---YKVGLNRFADLTNEEYRSMYL----------- 115
Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
N + + S + +P DWR+ G + +V++Q +CG+CWAFST+
Sbjct: 116 -GARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAV 174
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
E ++ + G L LS QE++DC + N GC+GG L+D+ +N ++ E +YP
Sbjct: 175 EGINKIVTGDLISLSEQELVDCDRSYNEGCNGG----LMDYAFQFIINNGGIDSEEDYPY 230
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
L +D C + V I +Y D + E ++ +A PV A+ A +Q+Y
Sbjct: 231 LARDGTCDTYRKNAKVVTIDNYE-DVPVNDEKALQKAVANQ-PVSVAIEAGGREFQFYQS 288
Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
G+ C +L +H V VGY
Sbjct: 289 GIFTGRCGTAL---DHGVAAVGY 308
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 155/313 (49%), Gaps = 32/313 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF--- 60
+K +L + A++A+ A +S +LE F +++ +++KSY + K
Sbjct: 1 MKLLLVVSAVLAVASCA---SISLEDLE-----FHAWKLKFEKSYDSESDEAHRKQVWLN 52
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ ++ + + Q +S R G+T F+D+ EE+K L+S H +
Sbjct: 53 NRKFVLMHNILAD-QGLKSYRLGMTHFADMDNEEYKQ-----------LVSQGCLHTFNA 100
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ ++ S G+ T +P DWR+ G + +V++Q+ CG+CWAFST E H K G
Sbjct: 101 SLPERGSAFLGLPEGTALPDTVDWRDKGYVTEVKDQKQCGSCWAFSTTGVLEGQHFRKTG 160
Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q+++DC+ + GN GC+GG L ++ N + + E+ YP K C+ K
Sbjct: 161 KLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGI-DTETSYPYKAKGQRCRYK- 218
Query: 240 TSPNGVKIKSYTCDTLIPS-ESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGS 296
P+G+ K + PS E ++ +AT GP+ ++A ++Q+Y GV + D S
Sbjct: 219 --PDGIGAKCTGYVHVKPSNEETLKKAVATLGPISVGIDASRHSFQFYQSGVYD-DPDCS 275
Query: 297 LANINHAVQIVGY 309
++H VGY
Sbjct: 276 KTVLDHGALAVGY 288
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 142/297 (47%), Gaps = 34/297 (11%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V ++P + + FS F++++ K Y S EHD RF F+ +L ++++ SA
Sbjct: 37 VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
+G+T+FSDL+ EF+ +HL + S K + K + I +P
Sbjct: 94 HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
DWR+ G + V+NQ +CG+CW+FS E + L G L LS Q+++DC
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
A + + GC+GG + + + L E +YP KD + S + +++
Sbjct: 199 ADSCDSGCNGGLMNSAFE-HTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257
Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ E I ++ +GP+ A+NA Q Y+GGV Y C +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 138/282 (48%), Gaps = 42/282 (14%)
Query: 40 FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT----EFSDLSEEEF 95
++ +++ Y +E + R +EK++ +IE N ++G T F D++ EEF
Sbjct: 32 WKSTHRRLYDTNEEEWRRAVWEKNMKMIELHNGEYSE---GKHGFTMEMNAFGDMTNEEF 88
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L++ +KH H + + + + +P + DWRE G + V+N
Sbjct: 89 RQ-----------LVNGYKHQKHRKGKLFQEPLM--LQLPKSV----DWREKGCVTPVKN 131
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMD 214
Q CG+CWAFS E LK G L LS Q ++DC+ G GN GC+GG L+D+
Sbjct: 132 QGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGG----LMDFAF 187
Query: 215 ---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS-ESSILTDIATHG 270
+N L+ E YP KD CK K + T IP E +++ +AT G
Sbjct: 188 QYVLNNKGLDSEESYPYEAKDGTCKYKPE----FAAANDTGYVDIPQLEKALMKAVATVG 243
Query: 271 PVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
P+ A++A ++Q+Y G+ + NC S +++H V ++GY
Sbjct: 244 PIAVAIDASHPSFQFYSSGIYFEPNC--SSKDLDHGVLVIGY 283
>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
Length = 326
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 147/311 (47%), Gaps = 34/311 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
LFI+A++A+ L +L+ +++ Y K Y+ ++ + R +E+++ I
Sbjct: 3 LFILAVLAVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51
Query: 68 EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+E N ++ + G+ +F+D++ EEFK ++L ++SH ++ ++ V
Sbjct: 52 QEHNLRHYLGFVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNRAV--- 108
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
P K DWRE+G + +V++Q CG+CWAFST T E + T S
Sbjct: 109 ------------PDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+G GNMGC GG +++ + LE ES YP + C+
Sbjct: 157 EQQLVDCSGPWGNMGCMGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
YT + SE + + GP AV+ + + Y GG+ Q SL +NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-RVNHAV 271
Query: 305 QIVGYDNYSRT 315
VGY S T
Sbjct: 272 LAVGYGTQSGT 282
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 148/314 (47%), Gaps = 37/314 (11%)
Query: 8 LFIVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
+ I L+ L F A + + + + ++++ + +++K Y+ E + RF+ F+ +
Sbjct: 4 MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHN 121
L I++ N + G+ +F+D++ EE++ +L R + V+ + + H + +N
Sbjct: 64 LGFIQDHNAQNNT---YTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ +PV DWR G +G +++Q CG+CWAFSTV E ++ + G
Sbjct: 121 SGDQ------------LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGE 168
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
LS QE++DC + GC+GG L+D+ + ++ E +YP D C +
Sbjct: 169 FVSLSEQELVDCDREYDEGCNGG----LMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQT 224
Query: 239 ATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
V+I Y +PS + + L +H PV A+ A Q Y GV C
Sbjct: 225 KKKTKVVQIDGYED---VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGT 281
Query: 296 SLANINHAVQIVGY 309
+L +H V +VGY
Sbjct: 282 AL---DHGVVVVGY 292
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 150/330 (45%), Gaps = 47/330 (14%)
Query: 4 VKNVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKN 59
KN + V+ + LC +VS L+ E + RY + Y E + RF
Sbjct: 3 TKNQFYQVSFALVLCLGLWAFQVSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F+++++ IE N P + G+ +F+DL+ EEF + + +K H
Sbjct: 63 FKENVNYIEASNNAGDKP--YKLGVNQFADLTNEEF-------------IATRNKFKGHM 107
Query: 120 HNHVKKRSI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + + + +T P+ + DWR+ G + V+NQ TCG CWAFS V E +H L
Sbjct: 108 SSSITRTTTFKYENVTAPSTV----DWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKL 163
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
G L LS QE++DC +G + GC GG L+D D K + L E++YP
Sbjct: 164 STGNLVSLSEQELVDCDTSGADQGCQGG----LMD--DAFKFIIQNGGLNTEAQYPYQGV 217
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
D C + + I Y D +E ++ +A P+ A++A +Q Y GV
Sbjct: 218 DGTCNTNEEATHVATITGYE-DVPSNNEQALQQAVANQ-PISIAIDASGSDFQNYQSGVF 275
Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
+C L +H V +VGY D+ ++ W
Sbjct: 276 TGSCGTQL---DHGVAVVGYGVSDDGTKYW 302
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
VA + LC LA+ + P+ + F+ +Y + Y ++ ++ R + F+++ +IE+
Sbjct: 2 VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 52
Query: 70 LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK ++ E + + + +F D++ EEF + +M +K + + +++
Sbjct: 53 FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 97
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T P V DWR ++ V++Q+ CG+CWAFS E H LKN L LS Q
Sbjct: 98 FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 155
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
+++DC+ + GN GC GG + D++ N + + ES YP +D +C+ A S +
Sbjct: 156 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 214
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
S + +E ++ ++ GP+ A++A ++Q+Y GV + NC + ++H V
Sbjct: 215 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 269
Query: 305 QIVGY 309
VGY
Sbjct: 270 LAVGY 274
>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
Length = 354
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/285 (32%), Positives = 141/285 (49%), Gaps = 32/285 (11%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEE 94
+ F++R+ K + + +E RF F++++ LN + A Y ++ +F+DL+ +E
Sbjct: 42 YGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQE 98
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F +L N + H K + H HV S+ +G+ + DWRE G++ V+
Sbjct: 99 FAKLYL----NPNYYARHGKDYKEH-VHVDD-SVRSGV-------MSVDWREKGVVTPVK 145
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
NQ CG+CWAF+T E ALKN +L LS Q ++ C N + GC+GG + W+
Sbjct: 146 NQGMCGSCWAFATTGNIEGQWALKNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWII 204
Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPN---GVKIKSYTCDTLIPSESSILTDIATH 269
D N V E YP A R N G KIK Y +L E I + +
Sbjct: 205 NDHNGTV-PTEDSYP--YTSAGGTRPPCHDNGTVGAKIKGYM--SLPHDEEEIAAYVGKN 259
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
GPV AV+A T Q Y GGV+ C G ++NH V +VG++ ++
Sbjct: 260 GPVAVAVDATTRQLYFGGVVTL-CFG--LSLNHGVLVVGFNRQAK 301
>gi|391328550|ref|XP_003738751.1| PREDICTED: cathepsin K-like [Metaseiulus occidentalis]
Length = 320
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 134/281 (47%), Gaps = 26/281 (9%)
Query: 34 LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
L +S F++++ K Y S S +I NF ++ I E NK +S Y + +SD
Sbjct: 15 LAEWSQFKEQFGKEYRSTSAEEIALLNFGRNSRTITEHNKRLHDGDSPSYRMAVNPWSDK 74
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
S EEF+ + + S+ D N+V +R G P DW +AG +
Sbjct: 75 SHEEFRQYYGLYGD------SYDFTSDRILNYVPER----------GTPANVDWNKAGFV 118
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
R+Q+ CG+CWAF+ V E+ + G L+ LSVQ +IDC+ + N GCSGG +L
Sbjct: 119 TPSRDQKGCGSCWAFAAVGAIEARVSKSTGNLTALSVQNLIDCS-DTNFGCSGG--SPIL 175
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+ + L YP L +D C R S +I + + SE + +A G
Sbjct: 176 ALRDLLSIGLHTADSYPYLARDGICHR-VNSSRLYQISGFYREEYYLSEERLKEMVAIIG 234
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A ++A + +Y G+ Y+ + + NHAV +VG+
Sbjct: 235 PVTATIDASPFGFMHYRDGIF-YDPACNPDSPNHAVLVVGF 274
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 131/264 (49%), Gaps = 32/264 (12%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+E D RF+ F+ +L I+E N S + G+T F+DL+ +E+++ +L K VL
Sbjct: 69 AEKDQRFEIFKDNLRYIDEHNTKNLS---YKLGLTRFADLTNDEYRSMYLGAKPVKRVL- 124
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
K D + V G +P + DWR+ G + V++Q +CG+CWAFST+
Sbjct: 125 ---KTSDRYEARV-------GDALPDSV----DWRKEGAVADVKDQGSCGSCWAFSTIGA 170
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYP 227
E ++ + G L LS QE++DC + N GC+GG L+D+ + ++ E++YP
Sbjct: 171 VEGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEADYP 226
Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYL 285
D C + + V I SY D SE+S+ +A H P+ A+ A +Q Y
Sbjct: 227 YKAADGRCDQNRKNAKVVTIDSYE-DVPENSEASLKKALA-HQPISVAIEAGGRAFQLYS 284
Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
GV C L +H V VGY
Sbjct: 285 SGVFDGICGTEL---DHGVVAVGY 305
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 147/319 (46%), Gaps = 38/319 (11%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
M +K L + L+ +C IP+ + P L + + F +Q+++ K YS + E R K
Sbjct: 1 MSAMKLFLGLCVLVHVCSAFIPLVLPIPGLYE--DYFKEWQEKHGKVYSTEEESQSRLKV 58
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F K++ I+ NK S E + E++D++ +EFK ++L +H +H D
Sbjct: 59 FMKNVIYIDNHNKQGHSYELE---VNEYADMTLDEFKDQYLMEP--QHCSATHSLKSDPP 113
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
++I DWR G + V+NQ CG+CW FST ES H LK
Sbjct: 114 KYRDPPKAI--------------DWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKT 159
Query: 180 GTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC--- 235
G L LS Q+++DCA N GC+GG +++ N L+ E YP D C
Sbjct: 160 GQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYIHYNG-GLDSEESYPYRAHDEKCHFV 218
Query: 236 --KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN-ALTWQYYLGGVIQY- 291
+ AT N V I S E + + T GPV A + + +++Y GV +
Sbjct: 219 PSEVSATVSNVVNITS-------KDEMQLYNAVGTVGPVSIAYDVSADFRFYKKGVYKSK 271
Query: 292 NCDGSLANINHAVQIVGYD 310
C ++NHAV VGY+
Sbjct: 272 ECKTDPEHVNHAVLAVGYN 290
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 22/270 (8%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+E + R+ F+++++ IE LN+ Q + + + +F+DL+ EEF++ + + N VL
Sbjct: 52 NEKNNRYVVFKRNVESIERLNE-VQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNS-VLS 109
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
S K + HV ++ P+ DWR+ G + +++Q +CG+CWAFS V
Sbjct: 110 SRTKPTSFRYQHVSSDAL----------PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E + +K G L LS QE++DC N + GC GG + ++ + L ES YP
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCDTNDD-GCMGGYMNSAFNYT-MTTGGLTSESNYPYKS 217
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVI 289
D C T IK + D E +++ +A H I T +Q+Y GV
Sbjct: 218 TDGTCNINKTKQIATSIKGFE-DVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVF 276
Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
C +++H V +VGY N S+ W
Sbjct: 277 SGECS---THLDHGVAVVGYGKSSNGSKYW 303
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 133/279 (47%), Gaps = 21/279 (7%)
Query: 34 LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
L+ F + R+ ++Y+ + E RF+ + ++++++E N + A +F+DL+
Sbjct: 28 LDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLAD---NKFADLTN 84
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF+ + L HV + N G + +P DWR+ G + +
Sbjct: 85 EEFRAKML--GFRPHVTIPQIS------NTCSADIAMPGESSDDILPKSVDWRKKGAVVE 136
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFS V E ++ +KNG L LS QE++DC + +GC GG ++
Sbjct: 137 VKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC-DDEAVGCGGGYMSWAFEF 195
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E+ YP + AC+ + + V I Y + PS L A PV
Sbjct: 196 V-VGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYR--NVTPSSEPDLARAAAAQPV 252
Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
AV+ + +Q Y GV C A++NH V +VGY
Sbjct: 253 SVAVDGGSFMFQLYGSGVYTGPC---TADVNHGVTVVGY 288
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 124/259 (47%), Gaps = 30/259 (11%)
Query: 56 RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
RF+ F+ ++ IE N +S GI +F+DL+ EEF+ + K L + K
Sbjct: 59 RFQIFKSNVVFIESFNT--AGNKSYMLGINKFADLTNEEFRAFWNGY---KRPLGASRKI 113
Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
+ +V T +P DWR G + +++Q CG+CWAFS V E +H
Sbjct: 114 TPFKYENV------------TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIH 161
Query: 176 ALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
L+ G L LS QE++DC G + GC GG ++ + + E+ YP +D
Sbjct: 162 KLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHG-GMTSEANYPYQGRDGK 220
Query: 235 CKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ 290
C K + VKI Y +P SE+++L +A PV A++A L++Q+Y G+
Sbjct: 221 CDTKKEASRAVKITGYQA---VPKNSEAALLKAVANQ-PVSVAIDAGSLSFQFYRSGIFT 276
Query: 291 YNCDGSLANINHAVQIVGY 309
C +INH V VGY
Sbjct: 277 GICG---KDINHGVAAVGY 292
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 33/283 (11%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
+ S++ +Y KSY + E +R + +E +L I+++ N Q + R G+ ++DL EE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F L+ S +L + K + G+T+P+ + DWR G + V+
Sbjct: 79 FMA--LKGS--GGLLQAKDKSSTQTFKPL------VGVTLPSSV----DWRNQGYVTPVK 124
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FS + E H K G L LS Q+++DCAG GN GC+GG + D++
Sbjct: 125 DQGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYI 184
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD--TLIP--SESSILTDIATH 269
V E ES YP +D CK + K+ + TC +IP E +++ + T
Sbjct: 185 KGVGGV-ELESAYPYTARDGRCKFDRS-----KVVA-TCKGYVVIPVGDEQALMQAVGTI 237
Query: 270 GPVIAAVNA--LTWQYYLGGVIQY-NCDGSLANINHAVQIVGY 309
GPV +++A ++Q Y GV + C S N++H V VGY
Sbjct: 238 GPVAVSIDASGYSFQLYESGVYDFRRC--SSTNLDHGVLAVGY 278
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 27/278 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL ++ N+ S R GI F+D+S EEF
Sbjct: 59 FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ +P KDWRE GI+ V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q+++DC N GC+GG +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + CK K + GVK+ + + + +E + + PV
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A +T ++ Y GV + C + ++NHAV VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 157/319 (49%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q C + WAF+ + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
L+ LS Q ++ C N ++GC G W+ N + E YP AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPAC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G I+ + ++ +E++I +A +GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 N-KSGKVVGANIRDHV--HILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT-SCIS 282
Query: 296 SLANINHAVQIVGYDNYSR 314
+N A +VGYD+ S+
Sbjct: 283 K--EVNSAALLVGYDDTSK 299
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/320 (27%), Positives = 148/320 (46%), Gaps = 39/320 (12%)
Query: 11 VALIALCFLAIP---------VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
+ LI LC L IP + + K+ +Q +K +K E+ +RF +
Sbjct: 12 LMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLRFGIYH 71
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++ IE +N S + +F+DL+ +EF + +L + + + ++ H H
Sbjct: 72 SNIQFIEYINSQNLS---FKLTDNKFADLTNDEFNSIYLGYQIRSY----KRRNLSHMHE 124
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ T +P DWRE G + +++Q CG+CWAFS V E ++ +K G
Sbjct: 125 N------------STDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGN 172
Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LS QE++DC NG N GC+GG ++ + L E++YP D +C++ T
Sbjct: 173 LVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIK-SIGGLTTENDYPYKGTDGSCEKAKT 231
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLA 298
+ V I Y +T+ + + L + PV A++A +++ Y GV C
Sbjct: 232 DNHAVIIGGY--ETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCG---I 286
Query: 299 NINHAVQIVGY--DNYSRTW 316
+NH V IVGY +N + W
Sbjct: 287 QLNHGVTIVGYGDNNGQKYW 306
>gi|37911662|gb|AAR05023.1| cathepsin L-like protein [Tenebrio molitor]
Length = 336
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 139/305 (45%), Gaps = 25/305 (8%)
Query: 16 LCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN- 71
LAI + + L E + +F+ Y +SY + E R + F+K L+ EE N
Sbjct: 4 FIILAIAIYGASAALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNE 63
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITT 130
K RQ S G+ F+D++ EE K H L+ D H N + K
Sbjct: 64 KYRQGLVSYTLGVNLFTDMTPEEMKAY-------THGLI---MPADLHKNGIPIKTREDL 113
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL--SLLSVQ 188
G+ P DWR+ G++ V+NQ +CG+CWAFS+ ES + NG S +S Q
Sbjct: 114 GLNASVRYPASFDWRDQGMVSPVKNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQ 173
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
+++DC N +GCSGG ++ N + + E YP + D C PN V +
Sbjct: 174 QLVDCVPNA-LGCSGGWMNDAFTYVAQNGGI-DSEGAYPYEMADGNCHY---DPNQVAAR 228
Query: 249 SYTCDTLIPSESSILTD-IATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQI 306
L + ++L D +AT GPV A +A + Y GGV YN HAV I
Sbjct: 229 LSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGGVY-YNPTCETNKFTHAVLI 287
Query: 307 VGYDN 311
VGY N
Sbjct: 288 VGYGN 292
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 148/309 (47%), Gaps = 27/309 (8%)
Query: 6 NVLFIVALIALCFLAIPVKV---SKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFE 61
N++ IV L ALC + V + NL++ + F SF + Y K+Y+ E + R+ F+
Sbjct: 2 NIIVIVTL-ALCAASSRAAVVAETAYNLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFK 60
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+L I N N +A YGI +FSDLS+ E + S+ + N
Sbjct: 61 DNLHEINAKNGNATDGPTATYGINKFSDLSKSELIAKFTGLSIPQRA-----------SN 109
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
K + P P+ DWRE + ++NQ CGACWAF+T+ + ES A+++
Sbjct: 110 FCKTIVLNQP---PDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHNR 166
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
L LS Q++IDC + +MGC+GG + + + ++ E +YP + +D C
Sbjct: 167 LVDLSEQQLIDC-DSVDMGCNGGLLHTAFEEI-IRMGGVQAELDYPFVGRDRRCGVDRHR 224
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
P V + C + L D+ GP+ A++A Y GVI +C+ + +
Sbjct: 225 PYVVSLVG--CYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GL 279
Query: 301 NHAVQIVGY 309
NHAV +VGY
Sbjct: 280 NHAVLLVGY 288
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 32/270 (11%)
Query: 45 KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
K S +E D RF+ F+ +L I+E N S R G+T+F+DL+ +E+++ +L +
Sbjct: 57 KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 113
Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
+ S ++ + + IP DWR+ G + +V++Q +CG+CWA
Sbjct: 114 KRKATKSSLRYE---------------VRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 158
Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
FST+ E ++ + G L LS QE++DC + N GC+GG L+D+ +N ++
Sbjct: 159 FSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINNGGID 214
Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
E +YP D C + + V I Y D SE S L +H P+ A+
Sbjct: 215 TEEDYPYKGVDGRCDQTRKNAKVVTIDLYE-DVPANSEES-LKKALSHQPISVAIEGGGR 272
Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q Y G+ C +++H V VGY
Sbjct: 273 AFQLYDSGIFDGICG---TDLDHGVVAVGY 299
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
VA + LC LA+ + P+ + F+ +Y + Y ++ ++ R + F+++ +IE+
Sbjct: 3 VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 53
Query: 70 LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK ++ E + + + +F D++ EEF + +M +K + + +++
Sbjct: 54 FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 98
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T P V DWR ++ V++Q+ CG+CWAFS E H LKN L LS Q
Sbjct: 99 FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 156
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
+++DC+ + GN GC GG + D++ N + + ES YP +D +C+ A S +
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 215
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
S + +E ++ ++ GP+ A++A ++Q+Y GV + NC + ++H V
Sbjct: 216 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 270
Query: 305 QIVGY 309
VGY
Sbjct: 271 LAVGY 275
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 135/282 (47%), Gaps = 27/282 (9%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ ++ + +++K Y+ E D RF+ F+ +L I+E N N+ + + + G+ +F+D++
Sbjct: 37 MTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNN--TYKLGLNQFADMTN 94
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EE++ + + + K H + + + +PV DWR G +
Sbjct: 95 EEYRVMYFGTKSDAKRRLMKTKSTGHRYAY----------SAGDRLPVHVDWRVKGAVAP 144
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
+++Q +CG+CWAFSTV T E+++ + G LS QE++DC N GC+GG L+D+
Sbjct: 145 IKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGG----LMDY 200
Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
+ ++ + +YP D C + V I + + + P + + L H
Sbjct: 201 AFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGF--EDVPPYDENALKKAVAH 258
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A Q Y GV C SL +H V +VGY
Sbjct: 259 QPVSIAIEASGRDLQLYQSGVFTGKCGTSL---DHGVVVVGY 297
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 32/270 (11%)
Query: 45 KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
K S +E D RF+ F+ +L I+E N S R G+T+F+DL+ +E+++ +L +
Sbjct: 51 KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
+ S ++ + + IP DWR+ G + +V++Q +CG+CWA
Sbjct: 108 KRKATKSSLRYE---------------VRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 152
Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
FST+ E ++ + G L LS QE++DC + N GC+GG L+D+ +N ++
Sbjct: 153 FSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINNGGID 208
Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
E +YP D C + + V I Y D SE S L +H P+ A+
Sbjct: 209 TEEDYPYKGVDGRCDQTRKNAKVVTIDLYE-DVPANSEES-LKKALSHQPISVAIEGGGR 266
Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q Y G+ C +++H V VGY
Sbjct: 267 AFQLYDSGIFDGICG---TDLDHGVVAVGY 293
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 127/273 (46%), Gaps = 29/273 (10%)
Query: 34 LELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
LE F ++Q Y ++Y+ E RF + +++ I+ +N+ + S G +F+DL+E
Sbjct: 61 LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQ-LSTGSSYELGENQFTDLTE 119
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI---------PVKKD 143
EEFK +L D + T G G+ P D
Sbjct: 120 EEFKDTYL-------------MKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVD 166
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCS 202
WR G + +V++QQ CG+CWAF+TV + E +H +K G L LS QE++DC GN GC
Sbjct: 167 WRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCR 226
Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
GG + ++W+ N L ES+YP + C + +I+ Y +E+ +
Sbjct: 227 GGSPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQA-VQRNNEAEL 284
Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCD 294
+A PV V+A +Q+Y GV CD
Sbjct: 285 ERAVAGQ-PVAVFVDASRAFQFYKSGVFSGPCD 316
>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
Length = 401
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 133/296 (44%), Gaps = 23/296 (7%)
Query: 21 IPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
+P P + + F F+++Y K+YS E + RF+ ++++++ I+ N S
Sbjct: 70 VPGDYVDPATREYRKSFEEFKKKYNKTYSSMEEENQRFEIYKQNMNFIKTTNSQGFS--- 126
Query: 80 ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
+ EF DLS+EEF R ++ S + V + P I
Sbjct: 127 YVLEMNEFGDLSKEEFMARF-----TGYIKDSKDDERVFKSSRVSASELEEEFVPPNSI- 180
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH-ALKNGTLSLLSVQEVIDCAG-NG 197
+W EAG + +RNQ+ CG+CWAFS V E A N L LS Q+ +DC+ NG
Sbjct: 181 ---NWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNG 237
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
N GC GG + NK L +YP ++ C + N ++I + P
Sbjct: 238 NFGCDGGTMGLAFQYAIKNK-YLCTNDDYPYFAEEKTC-MDSFCENYIEIPVKAYKYVFP 295
Query: 258 SESSIL-TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+ L T +A +GP+ A+ A +Q+Y GV C +NH V +VGYD
Sbjct: 296 RNINTLKTALAKYGPISVAIQADQTPFQFYKSGVFDAPCG---TKVNHGVVLVGYD 348
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 131/284 (46%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ ++ K Y ++ EHD R K F+ +L +++ +A +GIT+FSDL+ EF
Sbjct: 50 FSLFKSKFGKIYATQEEHDHRLKVFKANL---RRARRHQLLDPTAEHGITKFSDLTPSEF 106
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L H K S T +PT +P DWRE G + V+
Sbjct: 107 RRTYLGL-----------------HKPKPKLSTTKAPILPTSDLPEDFDWREKGAVTGVK 149
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC + GC GG
Sbjct: 150 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLM 209
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L+ E +YP ++ C S + +Y+ L E I ++
Sbjct: 210 TTAFEYT-LKAGGLQREKDYPYTGRNGQCHFD-KSKIAASVTNYSVVGL--DEDQIAANL 265
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCD-GSLANINHAVQIVGY 309
HGP+ +N+ Q Y+GGV +C + +H V +VGY
Sbjct: 266 VKHGPLAVGINSAWMQTYIGGV---SCPLVCFKHQDHGVLLVGY 306
>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 308
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 130/289 (44%), Gaps = 27/289 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R F +L
Sbjct: 35 ALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLSVFRANL 94
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ L+ + A +G+T FSDL+ EEF++R+ ++ H +
Sbjct: 95 -FLARLHA--AANPHANFGVTPFSDLTREEFRSRY--------------QNGAAHFAAAQ 137
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+R+ G P KDWRE G + V+NQ CG+CWAF+ + E L L+
Sbjct: 138 ERARVPVDVEVVGAPAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIEGQWFLAGNPLTR 197
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPL---LLKDAACKRKAT 240
LS Q ++ C N N GC GG W+ D N + E YP + CK +
Sbjct: 198 LSEQMLVSC-DNTNSGCGGGSPFRAFKWIVDRNNGAVYTEDSYPYHSCIGIKLPCK-DSD 255
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
G I Y T+ E I +A GP+ AV+A +W +Y GGV
Sbjct: 256 RTVGATISGYV--TIPSDEKRIAAVLAVKGPLSVAVDASSWMHYTGGVF 302
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 22/270 (8%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+E + R+ F+++++ IE LN+ Q + + + +F+DL+ EEF++ + + N VL
Sbjct: 46 NEKNNRYVVFKRNVESIERLNE-VQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNS-VLS 103
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
S K + HV ++ P+ DWR+ G + +++Q +CG+CWAFS V
Sbjct: 104 SRTKPTSFRYQHVSSDAL----------PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E + +K G L LS QE++DC N + GC GG + ++ + L ES YP
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCDTNDD-GCMGGYMNSAFNYT-MTTGGLTSESNYPYKS 211
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVI 289
D C T IK + D E +++ +A H I T +Q+Y GV
Sbjct: 212 TDGTCNINKTKQIATSIKGFE-DVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVF 270
Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
C +++H V +VGY N S+ W
Sbjct: 271 SGECS---THLDHGVAVVGYGKSSNGSKYW 297
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------LTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + KD CK A + GV++ + + + +E + + PV
Sbjct: 218 KSNG-GLDTEEAYPYIGKDGTCKFSAENV-GVQVLD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
Length = 333
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 138/302 (45%), Gaps = 30/302 (9%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + + + P + L+ +S +++ + K Y K E R +E+++++IE+ N
Sbjct: 7 LAALC---LGIASAAPQQDHSLDAHWSQWKEAHGKLYDKDEEGWRRTVWERNMEMIEQHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
+ Q S + F D++ EEFK + KH K+
Sbjct: 64 QEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKH-----------------KKGKVF 106
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ +P DWRE G + V++Q C CWAFS E K G L LS Q +
Sbjct: 107 PAPLFAEVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ GN GC+GG ++ N L+ E YP L ++ CK + P
Sbjct: 167 VDCSWSQGNRGCNGGLMEYAFQYVKDNG-GLDSEESYPYLARNEPCKYR---PEKSAANV 222
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
++ E ++T +AT GPV AAV++ ++Q+Y G I Y+ S +NH V +V
Sbjct: 223 TAFWPILNEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKG-IYYDPKCSNKLLNHGVLVV 281
Query: 308 GY 309
GY
Sbjct: 282 GY 283
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 140/286 (48%), Gaps = 30/286 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + +++S+ ++ KSY+ E + RF+ F+ +L I+ N N S G+ F+D
Sbjct: 43 DEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYID--NHNADPDRSYELGLNRFAD 100
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE++ ++L S + + G +P I DWRE G
Sbjct: 101 LTNEEYRAKYLG-------TKSRESRPKLSKGPSDRYAPVEGEELPDSI----DWREKGA 149
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q +CG+CWAFS + E ++ + G L LS QE++DC + N GC GG L
Sbjct: 150 VAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGG----L 205
Query: 210 LDWMDVNKVV----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+D+ N ++ ++ + +YP +D C + + V I SY D + E + L
Sbjct: 206 MDYA-FNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYE-DVPVYDEKA-LQK 262
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + P+ A+ A + +Q Y+ G+ C + ++H V +VGY
Sbjct: 263 AAANQPISVAIEAGGMDFQLYVSGIFTGKCGTA---VDHGVVVVGY 305
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 135/279 (48%), Gaps = 28/279 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E + +Y + Y +E R+ F++++ I+ N Q+ +S + G+ +F+DL+ E
Sbjct: 3 ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNS--QTGKSYKLGVNQFADLTNE 60
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK R+ H M + + +V + +P DWR+ G + V
Sbjct: 61 EFKAS--RNRFKGH--MCSPQAGPFRYENV------------SAVPSTVDWRKEGAVTPV 104
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
++Q CG CWAFS V E ++ L G L LS QEV+DC G + GC+GG +
Sbjct: 105 KDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKF 164
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
++ NK L E+ YP D C K ++ + KI + D SE++++ +A PV
Sbjct: 165 IEQNK-GLTTEANYPYKGTDGTCNTKKSAIHAAKITGFE-DVPANSEAALMKAVAKQ-PV 221
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y G+ +CD L +H V VGY
Sbjct: 222 SVAIDAGGSDFQFYSSGIFTGSCDTQL---DHGVTAVGY 257
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 139/311 (44%), Gaps = 31/311 (9%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
FIV +ALC L + + K EL+ ++ + + S E RF F
Sbjct: 4 FIV--LALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVF 61
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ ++ I E NK +S + + +F D++ EEF+ + ++ HH+
Sbjct: 62 KHNVKHIHETNK---KDKSYKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGEK 112
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
K T+PT + DWR+ G + V+NQ CG+CWAFSTV E ++ ++
Sbjct: 113 KATKSFMYANVNTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L+ LS QE++DC N N GC+GG +++ K L E YP D C
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
+ V I + D SE ++ +A PV A++A +Q+Y GV C L
Sbjct: 228 NAPVVSIDGHE-DVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284
Query: 299 NINHAVQIVGY 309
NH V +VGY
Sbjct: 285 --NHGVAVVGY 293
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 40/323 (12%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKN 59
LF+ + CF + +S+P L+ +L + Q+R+ + +K E + R+
Sbjct: 10 LFVAIFSSFCF---SITLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEENNRYVV 61
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR-HSVNKHVLMSHHKHHDH 118
F+ +++ IE LN + + + + + +F+DL+ +EF + + V+ S K
Sbjct: 62 FKNNVERIEHLN-SIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ +V ++ PV DWR+ G + ++NQ +CG CWAFS V E +K
Sbjct: 121 RYQNVSSGAL----------PVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIK 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L LS Q+++DC N + GC GG + + L ES+YP +DA C K
Sbjct: 171 KGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATG-GLTTESDYPYKGEDATCNSK 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
T+P I Y D + E +++ +A H PV + +Q+Y GV C
Sbjct: 229 KTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTY 286
Query: 297 LANINHAVQIVGYD---NYSRTW 316
L +HAV +GY N S+ W
Sbjct: 287 L---DHAVTAIGYGESTNGSKYW 306
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 144/331 (43%), Gaps = 52/331 (15%)
Query: 8 LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
LF+++L+A AI P + Q + FS F+ ++ K Y S+ E
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63
Query: 53 HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
HD RFK F+ + +++ SA +GIT+FSDL+ EF+ +L
Sbjct: 64 HDHRFKVFKANR---RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110
Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
H K + +PT +P DWR+ G + V+NQ +CG+CW+FST
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163
Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
E H L G L LS Q+++DC + GC GG ++ + L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYT-LKAGGLQLE 222
Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
+YP KD C S + +++ L E I ++ HGP+ +NA Q
Sbjct: 223 KDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279
Query: 284 YLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
Y+GGV C +H V +VGY ++
Sbjct: 280 YVGGVSCPLIC---FKRQDHGVLLVGYGSHG 307
>gi|66816665|ref|XP_642342.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
gi|60470393|gb|EAL68373.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
Length = 337
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 146/305 (47%), Gaps = 32/305 (10%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L I V +K L Q + F+ + +KSYS SE R+ F+ + D IEE
Sbjct: 4 LSVLCALLITVATAKQELSESQYRDAFTDWMISNQKSYSSSEFITRYNIFKTNFDYIEEW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ + +D++ EE+++ +L + L+ +
Sbjct: 64 NS--KGSETV-LGLNKMADITNEEYRSLYLGKPFDASSLIGTKEE--------------- 105
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL-KNGTLSLLSV-- 187
I DWR+ G + V+NQQ+C CW+FS E H L NGT L+S+
Sbjct: 106 -ILFSNKFSSTVDWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSE 164
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q +IDC+ GN GC+GG +++ N + + E YP D C+ K+ + +G
Sbjct: 165 QNLIDCSTPFGNTGCNGGVITYAFEYIISNGGI-DTEKSYPFEGTDGTCRYKSEN-SGAT 222
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
I SY + SESS+ + + + PV +++A ++ +Y G I + S N++H V
Sbjct: 223 ISSYV-NVTFGSESSLESAVNVN-PVACSIDASHSSFLFYKSG-IYFEPACSRTNLDHGV 279
Query: 305 QIVGY 309
+VGY
Sbjct: 280 LVVGY 284
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 153/311 (49%), Gaps = 25/311 (8%)
Query: 7 VLFIVALIALCFLAIPVKV-----SKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNF 60
VLFI A +A ++P + + E+ ELF +++R+K+ Y +E RF+ F
Sbjct: 11 VLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEIF 70
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L + E N G+ +F+D+S EEFK ++L +++
Sbjct: 71 KENLKYVIERNSKGHR---HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRR---- 123
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
++++ T P+ + DWR+ G++ +++Q CG+CWAFS+ E ++A+ G
Sbjct: 124 -SMQQKKGTASCEAPSSL----DWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTG 178
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L LS QE++DC N GC GG +W+ ++ ++ ES+YP D C
Sbjct: 179 DLISLSEQELVDC-DTTNYGCEGGYMDYAFEWV-ISNGGIDSESDYPYTGTDGTCNTTKE 236
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLA 298
V I Y + S+S++L A + P+ ++ AL +Q Y G+ +C
Sbjct: 237 DTKVVSIDGYK--DVDESDSALLC-AAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPD 293
Query: 299 NINHAVQIVGY 309
+I+HAV IVGY
Sbjct: 294 DIDHAVLIVGY 304
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 143/323 (44%), Gaps = 39/323 (12%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
N + + L+ + FLA V E + RY K Y E + RF+ F++++
Sbjct: 8 NHISLAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENV 67
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ IE N + +S + GI +F+DL+ +EF R+ H+ S
Sbjct: 68 NYIEAFN--NAANKSYKLGINQFADLTNKEFIAP--RNGFKGHMCSS------------I 111
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
R+ T T P DWR+ G + +++Q CG CWAFS V E +HAL G L
Sbjct: 112 IRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLIS 171
Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRK 238
LS QE++DC G + GC GG L+D D K + L E+ YP D C
Sbjct: 172 LSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEANYPYKGVDGKCNAN 225
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
+ N I Y D +E ++ +A PV A++A +Q+Y GV +C
Sbjct: 226 EAAKNAATITGYE-DVPANNEMALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTE 283
Query: 297 LANINHAVQIVGY---DNYSRTW 316
L +H V VGY D+ + W
Sbjct: 284 L---DHGVTAVGYGVSDDGTEYW 303
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 142/302 (47%), Gaps = 44/302 (14%)
Query: 30 LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
L Q+ LFS F + Y K+Y K EH+ RF F+ +L I N+ + +A YG+TEFS
Sbjct: 27 LSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEG--TAHYGLTEFS 84
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DLS EF+ RH + ++ HK + + I G + +P DWR G
Sbjct: 85 DLSPSEFE----RHYLGLKKDLAEHK--------AEVKPIKVG-PVNEPLPDLFDWRTKG 131
Query: 149 IIGKVRNQQTCG------------------ACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ +V+NQ CG +CWAFS E L L LS QE+
Sbjct: 132 AVTEVKNQGMCGSCWAFSXXTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQEL 191
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
+DC +G+ GC GG + + + LE ESEYP D C+ T +++S+
Sbjct: 192 VDCD-HGDHGCKGGYMGQAMKAV-IEMGGLETESEYPYKGVDGTCEFNKTESK-ARVQSF 248
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIV 307
L +E+ + + HGPV +NA Q+Y GG+ ++ C S +++H V +V
Sbjct: 249 V--GLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLC--SPTDLDHGVLLV 304
Query: 308 GY 309
G+
Sbjct: 305 GF 306
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 133/268 (49%), Gaps = 36/268 (13%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E+ + +S+ + YK + +E +R+K F++++ I+ N +S +S + + +F+DL
Sbjct: 37 ERHEQWMASYARVYKDA---NEKQMRYKIFKENVQRIDSFNS--ESDKSYKLAVNQFADL 91
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EEFK+ LR+ H+ + H + + T +P DWR+ G +
Sbjct: 92 TNEEFKS--LRNGFKGHMCSAQAGHFRYEN--------------VTAVPASIDWRKKGAV 135
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCAL 209
+++ Q CG+CWAFS V E + +K G L LS QE++DC N + GC GG L
Sbjct: 136 TQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGG----L 191
Query: 210 LDWMDVNKVV----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
+D D K + L E+ YP D+ CK K + KI Y + + ++ + L +
Sbjct: 192 MD--DAFKFIEQHGLASEATYPYDAADSTCKTKEEAKPSAKITGY--EDVPANDEAALKN 247
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQY 291
+ PV A++A +Q+Y G+ Y
Sbjct: 248 AVANQPVSVAIDAGGFEFQFYSSGIEWY 275
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 147/318 (46%), Gaps = 46/318 (14%)
Query: 7 VLFIVALIALCFLAIPV------KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
+ ++ L C +A K ++E + F + +R+ + Y E ++RF
Sbjct: 10 IFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGI 69
Query: 60 FEKSLDIIEELNKNRQSPESARYGITE--FSDLSEEEFKTRHLRHSVNKHVLMSHHK--H 115
++ ++ I+ N + S Y +T+ F+DL+ EEF++ ++ S L SH+
Sbjct: 70 YQANVQYIQCKNAQKNS-----YNLTDNKFADLTNEEFQSTYMGLSTR---LRSHNTGFR 121
Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
+D H + +P KDWR+ G + ++ +Q CG CWAF+ V E ++
Sbjct: 122 YDEHGD----------------LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGIN 165
Query: 176 ALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
+K+G L LS QE+IDC +GN GC GG ++ + L E +YP D
Sbjct: 166 KIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFI-IENGGLTTEQDYPYEGVDGT 224
Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
CK + + I Y +P+++ L A H PV A++A ++Q+Y GV
Sbjct: 225 CKMEKAAHYAASISGY---EEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSG 281
Query: 292 NCDGSLANINHAVQIVGY 309
C L NH V +VGY
Sbjct: 282 ICGKQL---NHGVTVVGY 296
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 152/326 (46%), Gaps = 46/326 (14%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
+V+ ++ +AL +IP E+ L L+ ++ + S + D RF F++++
Sbjct: 10 SVVLVLGSVALA-QSIPFDEKDLASEESLWSLYEKWRAHHAVSRDLDDTDKRFNVFKENV 68
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM---------SHHKH 115
I E N+ + + + + + +F D++ +EF++ + ++ H+ + S+ K
Sbjct: 69 KFIHEFNQKKDA--TYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKF 126
Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
HD +P DWRE G + V++Q CG+CWAFSTV E ++
Sbjct: 127 HD--------------------LPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGIN 166
Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
+K L LS Q+++DC N GC+GG D++ N L E YP L + +C
Sbjct: 167 QIKTNELVSLSEQQLVDC-DTKNSGCNGGLMDYAFDFIK-NNGGLSSEDSYPYLAEQKSC 224
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
+A S V I Y D +E++++ +A PV A+ A +Q+Y GV +C
Sbjct: 225 GSEANSAV-VTIDGYQ-DVPRNNEAALMKAVANQ-PVSVAIEASGYAFQFYSQGVFSGHC 281
Query: 294 DGSLANINHAVQIVGY---DNYSRTW 316
L +H V VGY D+ + W
Sbjct: 282 GTEL---DHGVAAVGYGVDDDGKKYW 304
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 125/273 (45%), Gaps = 29/273 (10%)
Query: 34 LELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
LE F ++Q Y ++Y+ E RF + +++ I+ +N+ + S G +F+DL+E
Sbjct: 35 LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQ-LSTGSSYELGENQFTDLTE 93
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI---------PVKKD 143
EEFK +L D + T G G+ P D
Sbjct: 94 EEFKDTYL-------------MKLDEQPPAAEAMGPTVGTMSTAGMSNGNNTGEAPNSVD 140
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCS 202
WR G + +V++QQ CG+CWAF+TV + E +H +K G L LS QE++DC GN GC
Sbjct: 141 WRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCR 200
Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
GG + ++W+ N L ES+YP + C + +I+ Y + + +
Sbjct: 201 GGSPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLGHHAARIRGY--QAVQRNNEAE 257
Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCD 294
L PV ++A +Q+Y GV CD
Sbjct: 258 LERAVAERPVAVFIDASRAFQFYKSGVFSGPCD 290
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 139/308 (45%), Gaps = 42/308 (13%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
L + LIA CF S+ + +++ + F + K+Y+ E D+R + +L+I+
Sbjct: 8 LLVAVLIAQCF-------SELSQDRQWHAWKDF---HGKTYTGEEEDLRRAIWNDNLEIV 57
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
++ N S + + F+DL+ EFK R + + S
Sbjct: 58 KKHNAENHS---YKLDMNHFADLTVTEFKQRFMGY-------------------RAASNS 95
Query: 128 ITTGITIPTG---IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+P +P + DWR+ G + V+NQ CG+CWAFS+ + E H K G L
Sbjct: 96 TGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVS 155
Query: 185 LSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q ++DC+ GN GC GG ++ N + + E YP +D C K S
Sbjct: 156 LSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGI-DTEQSYPYTARDGQCHFKPGSV- 213
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
G + YT D SE + + +AT GP+ A++A ++Q Y GV D S ++
Sbjct: 214 GATVTGYT-DVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYS-EPDCSSTQLD 271
Query: 302 HAVQIVGY 309
H V VGY
Sbjct: 272 HGVLAVGY 279
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 145/289 (50%), Gaps = 37/289 (12%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ LF S+ + KSY+ E + RF+ F+ +L I+E +N + G+ +F+D
Sbjct: 39 DEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDE--QNLVEDRGFKLGLNKFAD 96
Query: 90 LSEEEFKTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
L+ EE+++++ + K V ++ + +G ++P + DWRE
Sbjct: 97 LTNEEYRSKYTGIKSKDLRKKVSAKSGRY-----------ATLSGESLPESV----DWRE 141
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
+G + V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG
Sbjct: 142 SGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGG-- 199
Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI- 262
L+D+ +N ++ + +YP +D C + + V I SY +P+ +
Sbjct: 200 --LMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSY---EDVPAYDELA 254
Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
L A + P+ A+ A +Q+Y G+ C +L +H V +VGY
Sbjct: 255 LKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIAL---DHGVVVVGY 300
>gi|145476403|ref|XP_001424224.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391287|emb|CAK56826.1| unnamed protein product [Paramecium tetraurelia]
Length = 312
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 126/274 (45%), Gaps = 29/274 (10%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
F S++ +Y KSY+ + RF NF+ +L+ + N + R + +FSDLSEEEF
Sbjct: 29 FQSWKTKYGKSYTGEQEVFRFLNFQINLNKVNSHNSDETKTYKMR--MNQFSDLSEEEFA 86
Query: 97 TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
+L H N ++ + D + +KK I DWR I +V++Q
Sbjct: 87 LLYLTH-YNSDEIIEQQQITDDKESSIKKND---------NIKTSVDWRS---ITQVKDQ 133
Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN 216
CG CWAF V E+ +KN T +LS Q++IDC + GC+GG L + V
Sbjct: 134 GKCGGCWAFGAVGAVEAWFQVKNKTQVVLSEQQLIDCDTQ-SFGCNGGYQNLALKY--VA 190
Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
L + YP K ++ + + P Y + SS + T P++ V
Sbjct: 191 NHGLNDANVYPYTQKQSSACQYNSGP-------YKTNGAQGVSSSNFKSLLTEYPLVVVV 243
Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+A WQ Y GGV C S +NHAV VG+D
Sbjct: 244 DASNWQLYGGGVFN-ECSKS---VNHAVLAVGFD 273
>gi|328875652|gb|EGG24016.1| counting factor associated protein [Dictyostelium fasciculatum]
Length = 529
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 150/288 (52%), Gaps = 26/288 (9%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F++++ K+Y + EH+ RF +++ L + N + S + + + F D+S+EEF
Sbjct: 222 FDQFKKQFGKTYENTLEHNTRFATYKQMLHRVATHNAHN-SESTYKLAMNHFGDMSDEEF 280
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + H V++ + HD+ + +P DWR +G + V++
Sbjct: 281 RKFIIPH-VDRDENNGASEVHDNED--------------VSALPASLDWRTSGCVTPVKD 325
Query: 156 QQTCGACWAFSTVETAESMHALK-NGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWM 213
Q CG+CW F ++ + E++ LK N L LS QE++DCA G +MGC+GG F +
Sbjct: 326 QGVCGSCWTFGSLASLETVACLKHNKDLISLSEQELVDCAYVGQSMGCNGG-FASNAYQY 384
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+N + ES+YP L+++A CK +GV+++SY T SE+++ +AT G V
Sbjct: 385 IMNAGGIATESDYPYLMQNAYCKASTVQNSGVRVQSYVNVTAF-SEAALQNAVATVGVVA 443
Query: 274 AAVNALT--WQYYLGGVIQYN-CDGSLANINHAVQIVGY--DNYSRTW 316
A++A ++YY GV C L ++H V ++GY DN + W
Sbjct: 444 VAIDASAPDFRYYSSGVYYSTVCQSGLDYLDHEVAVLGYGTDNGQQYW 491
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 145/303 (47%), Gaps = 31/303 (10%)
Query: 14 IALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN- 71
+ L L + + + P +Q L E ++ + + K YS E +R +EK+L +IE+ N
Sbjct: 5 LFLTILCLGIASAAPTHDQSLDEQWNQWTAEHGKVYSTGEESLRRAVWEKNLKMIEQHNL 64
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
+ Q + G+ F D++ E+F+ +M+ ++ ++ V +
Sbjct: 65 EYSQGKHTFTMGMNAFGDMTNEDFRQ-----------MMTGFQNQKYNKGEVFQPPQ--- 110
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
P +P DWRE G + V+NQ CG+CWAFS E K G L LS Q ++
Sbjct: 111 ---PLEVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 192 DCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
DC+ N GC GG ++ N L+ E YP ++ C+ SP G +
Sbjct: 168 DCSQPQHNSGCKGGLVIKAFQYVKDNG-GLDSEESYPYEEMESTCRY---SP-GNSAATV 222
Query: 251 TCDTLIPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQI 306
T IP+E L +A+ GP+ A++A ++Q+Y GG++ + NC S +NHAV +
Sbjct: 223 TGFKHIPAEEKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNC--SPKWLNHAVLV 280
Query: 307 VGY 309
VGY
Sbjct: 281 VGY 283
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 153/316 (48%), Gaps = 31/316 (9%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
+F + +++FIV+ AL I ++P+ ++ L+ ++ ++ K+Y+ E +RF
Sbjct: 8 IFLLFSIIFIVSSSALDLSIIDRAFNRPD-DEIASLYETWLVKHGKNYNGLGEKQLRFNI 66
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDH 118
F+ +L ++E N S + G+ F+DL+ EE+++ +L + V S D
Sbjct: 67 FKDNLRFVDERNSENLS---FKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDR 123
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
+ + G T+P + DWR+ G + +++Q +CG+CWAFS + E ++ +
Sbjct: 124 Y-------AFRAGDTLPESV----DWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIV 172
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAAC 235
G L LS QE+++C + N GC GG L+D+ + ++ + +YP +D C
Sbjct: 173 TGDLISLSEQELVECDTSYNDGCDGG----LMDYAFEFIIKNEGIDSDEDYPYTGRDGRC 228
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNC 293
+ V I Y D+ + E S+ +A PV A+ +Q Y GV C
Sbjct: 229 DTNRKNAKVVTIDDYE-DSPVYDEKSLQKAVANQ-PVSVAIEGGGRDFQLYDSGVFTGKC 286
Query: 294 DGSLANINHAVQIVGY 309
+L +H V +VGY
Sbjct: 287 GTAL---DHGVAVVGY 299
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 135/279 (48%), Gaps = 25/279 (8%)
Query: 35 ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
E + +F+ + KSY E RF F +L IEE N+N + + G+ +F+DL+
Sbjct: 21 EKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNKFADLTP 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF R K +S + + +P + DW + G + +
Sbjct: 81 EEFMERFRPLRKTKPKFLSEQAKFNFDGD----------------LPAEVDWTKQGAVTE 124
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q +CG+CWAFST + ES + +K G L LS Q+++DC N N GC+GG L++
Sbjct: 125 VKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN-NSGCAGGWMDIALEY 183
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
++ + ++ E +YP ++ C R S V+IKSY E + +A GPV
Sbjct: 184 IEADGIM--SEDDYPYEERNTTC-RFNNSKAAVQIKSYKA-IKKNDEIDLQKAVALEGPV 239
Query: 273 IAAVN-ALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
A+ + +Q Y G++ C + ++ HAV + GY
Sbjct: 240 SVAIEVTIAFQLYARGILNDPQCKNTEGDLTHAVLVTGY 278
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 137/282 (48%), Gaps = 32/282 (11%)
Query: 34 LELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
L LF F +Y K YS + E D R + F+++L E++ + SA YG+T+FSDL+E
Sbjct: 175 LGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEG--SAEYGVTKFSDLTE 232
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF+ +L +++ L +R + + P DWR+ G +
Sbjct: 233 EEFRLTYLNPLLSQWTL---------------RRPMKPASPARSPAPASWDWRDHGAVSP 277
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFS E LK+G L LS QE++DC G + C GG +
Sbjct: 278 VKNQGLCGSCWAFSVTGNIEGQWFLKHGKLLSLSEQELVDCDGL-DHACRGGLPSNAYEA 336
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-IPS-ESSILTDIATHG 270
++ LE E++Y C K+ +Y ++ +PS E+ + +A +G
Sbjct: 337 IE-GLGGLEAENDYTYSGHKQKCSFATE-----KVAAYINSSVELPSDENEMAAWLAENG 390
Query: 271 PVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
PV A+NA Q+Y GV C+ + I+HAV +VGY
Sbjct: 391 PVSVALNAFAMQFYKKGVSHPWMILCNPWM--IDHAVLLVGY 430
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 156/320 (48%), Gaps = 36/320 (11%)
Query: 4 VKNVLFIVAL--IALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKN 59
+ + F V L +A CF +PV + + EQ L+ F++F+Q+Y +SY +E RF+
Sbjct: 7 TRTLGFSVGLHAVAACF--VPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRV 64
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 65 FKQNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYY 108
Query: 120 HNHVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+K+ R + + + TG P DWR+ G + V++Q C + WAFS + E +
Sbjct: 109 AAALKRPRKV---VNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIGNIEGQWKI 165
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK 236
L+ LS Q ++ C N + GC GG W+ NK + E YP
Sbjct: 166 AGHELTSLSEQMLVSCDTN-DFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVP 224
Query: 237 R--KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
K+ G KI+ L E++I +A GPV AV+A ++Q Y GGV+ +C
Sbjct: 225 TCDKSGKVVGAKIRDRV--DLPRDENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCI 281
Query: 295 GSLANINHAVQIVGYDNYSR 314
+++H V +VGYD+ S+
Sbjct: 282 SE--HLDHGVLLVGYDDTSK 299
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 148/304 (48%), Gaps = 31/304 (10%)
Query: 22 PVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
P S + ++ + L+ S+ ++ K+Y+ E + RF+ F+ +L I+E N N + +
Sbjct: 30 PSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNT--TY 87
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
+ G+ +F+DL+ +E++ + L + + K + H G +P +
Sbjct: 88 KLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAH------RAGDNLPDSV-- 139
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR+ G + V++Q +CG+CWAFST+ T E ++ + +G L LS QE++DC + + G
Sbjct: 140 --DWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAG 197
Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
C+GG L+D+ ++ ++ E +YP L + C + V I Y +P
Sbjct: 198 CNGG----LMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYED---VP 250
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNY 312
+ + L H PV A+ A +Q Y GV C LA ++H V VGY DN
Sbjct: 251 NNENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGEC--GLA-LDHGVVAVGYGTDDNG 307
Query: 313 SRTW 316
W
Sbjct: 308 QDYW 311
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
+ + L +C+ + + + P ++ + + SF++R+ K++ +E RF F+++
Sbjct: 10 AIVVTILFVVCYGSALIAQTPPAVDNFVASAHYGSFKKRHSKAFGGDAEEGHRFNAFKQN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+DL+ +EF +L N SH K H
Sbjct: 70 MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYTSHLKDH------ 116
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + + P+G+ + DWR+ G + V+NQ CG+CWAFS + E A +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
LS Q ++ C N + GC+GG ++W M + + E+ YP C +
Sbjct: 174 VSLSEQMLVSC-DNVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
G KI + +L E I + GPV AV+A TWQ Y GGV+ SL
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIADWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287
Query: 299 NINHAVQIVGYDNYSR 314
NH V IVG++ ++
Sbjct: 288 --NHGVLIVGFNKNAK 301
>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 156/320 (48%), Gaps = 36/320 (11%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P+ DWR+ G + V++Q C + WAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDA---AC 235
L+ LS Q ++ C + + GC GG W + NK + E YP C
Sbjct: 168 HELTSLSEQMLVSCDTD-DFGCRGGFSDPAFKWILWSNKGNVFTEQSYPYASGGGNVPTC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCD 294
K G KI + +P + ++T+ +A GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 KMSGKV-VGAKISN---RLYLPEDEDMITEWLARKGPVAIAVDATSFQSYTGGVLT-SCI 281
Query: 295 GSLANINHAVQIVGYDNYSR 314
+N+ +VGYD+ S+
Sbjct: 282 SK--EMNYGALLVGYDDTSK 299
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ II+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 154/325 (47%), Gaps = 49/325 (15%)
Query: 4 VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
V +LF+V AL+A L + ++ + + F++R+ KS+ + ++ RF
Sbjct: 12 VVTILFVVCYGSALVAQTPLGVDNFIASAH-------YGRFKERHGKSFGEDADEGHRFN 64
Query: 59 NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F++++ LN + A Y ++ +F+DL+ +EF +L H + +H
Sbjct: 65 AFKQNMQTAYFLNTHN---PHAHYDVSGKFADLTPQEFAKLYLNPDYYAHRGKDYKEH-- 119
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
HV ++ +++ DWRE G + V+NQ CG+CWAFS + ES AL
Sbjct: 120 ---VHVDDSVLSGAMSV--------DWREKGAVTPVKNQGMCGSCWAFSAIGNIESQWAL 168
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACK 236
KN +L LS Q ++ C + + GC+GG ++W + + + E YP
Sbjct: 169 KNHSLVSLSEQMLVSC-DDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPY------AS 221
Query: 237 RKATSPN-------GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
TSP G +I Y +L E +I + GPV AV+A TWQ Y GGV+
Sbjct: 222 AGGTSPPCHDKGEFGARISGYM--SLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVV 279
Query: 290 QYNCDGSLANINHAVQIVGYDNYSR 314
C G ++NH V +VG++ ++
Sbjct: 280 TL-CFG--LSLNHGVLVVGFNKRAK 301
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 136/285 (47%), Gaps = 31/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ + ++ K+Y+ E + RF+ F+ +L I++ N ++ G+ F+D
Sbjct: 36 DEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRT---YTVGLNRFAD 92
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EEF++ +L H + K S + +P DWR+ G
Sbjct: 93 LTNEEFRSMYLGTRTG-------------HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGA 139
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 140 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGG----L 195
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ E +YP L +D C + V I SY + + ++ + L
Sbjct: 196 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY--EDVPENDETALKKA 253
Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ PV A+ +Q Y GV C SL +H V VGY
Sbjct: 254 VANQPVSVAIEGGGRNFQLYNSGVFTGECGTSL---DHGVAAVGY 295
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 156/315 (49%), Gaps = 34/315 (10%)
Query: 9 FIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
+V L++LC+ LA+ + L++ +L+ ++ Q KSY ++E R +E++L I
Sbjct: 3 LLVCLVSLCWGLAVSAPLGDSELDRHWKLWKNWHQ---KSYHEAEEGWRRTVWEENLKAI 59
Query: 68 EELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ N + + R G+ +F DL+ EEF+ +++ +H N +
Sbjct: 60 QLHNLEQSLGLHTYRLGMNQFGDLTNEEFQE-----------ILTGERHFSKG-NRINGS 107
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+ + +P DWR+ G + V+NQ CG+CWAFST E K+G L LS
Sbjct: 108 AFLEANFVQ--VPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLS 165
Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA-CKRK---ATS 241
Q ++DC+ GN GC GG ++ N+ + + E YP KD A C K AT+
Sbjct: 166 EQNLVDCSWQQGNQGCHGGIVDLAFQYILQNQGI-DSEDCYPYTAKDTAQCTFKPECATA 224
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
P + + D SE +++ +AT GPV ++A ++++Y G+ Y+ S +
Sbjct: 225 P----VTGFV-DIPPHSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIF-YDPKCSSES 278
Query: 300 INHAVQIVGYDNYSR 314
++HAV +VGY Y R
Sbjct: 279 LDHAVLVVGY-GYER 292
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 138/282 (48%), Gaps = 30/282 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E + +Y K Y +E + RF+ F+ ++ IE N P + I +F+DL +E
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFN--LSINQFADLHDE 90
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK L + K + + +V K IP DWR+ G + +
Sbjct: 91 EFKAL-LNNVQKKASRVETATETSFRYENVTK------------IPSTMDWRKRGAVTPI 137
Query: 154 RNQQ-TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
++Q TCG+CWAF+TV T ES+H + G L LS QE++DC + GC GG ++
Sbjct: 138 KDQGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEF 197
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
+ NK + E+ YP KD +CK K + +I Y +P SE ++L +A
Sbjct: 198 I-ANKGGITSEAYYPYKGKDRSCKVKKETHGVARIIGYES---VPSNSEKALLKAVANQ- 252
Query: 271 PVIAAVN--ALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
PV ++ A+ +++Y G+ + NC +++HAV +VGY
Sbjct: 253 PVSVYIDAGAIAFKFYSSGIFEARNCG---THLDHAVAVVGY 291
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 149/311 (47%), Gaps = 35/311 (11%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
+VL ++AL C LA K L Q +L+ ++ K YS +E +R +E +L
Sbjct: 5 SVLAVLALAFSCTLAFDAK-----LNQHWKLW---KEANNKRYSDAEEHVRRATWEGNLQ 56
Query: 66 IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++E N + + G+ +++D++ EF ++ + M + D H
Sbjct: 57 KVQEHNLQADLGVHTYWLGMNKYADMTVTEF----VKVMNGYNATMRGQRTQDRH----- 107
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
S + I +P + DWR+ G + V++Q CG+CWAFST E H + G L
Sbjct: 108 TFSFNSKIALPDTV----DWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVS 163
Query: 185 LSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q ++DC+G GNMGC+GG +++ N + + E YP D C+ KA +
Sbjct: 164 LSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGI-DTEDSYPYEAVDNQCRFKAANV- 221
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLA 298
G +T D ES++ +AT GP+ A++A ++Q Y GV YN C S
Sbjct: 222 GATDTGFT-DITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGV--YNEPFC--SQT 276
Query: 299 NINHAVQIVGY 309
++H V VGY
Sbjct: 277 RLDHGVLAVGY 287
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 137/293 (46%), Gaps = 23/293 (7%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
+ +P V K L F+++ ++ K YS +E R F D +E + ++ +
Sbjct: 29 IRMPTDVGKDQLLAGQ--FAAWAHKHGKVYSAAEE--RAHRFLVWKDNLEYIQRHSEKNL 84
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
S G+T+F+DL+ EEF+ ++ +++ + ++ + +
Sbjct: 85 SYWLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEA----------- 133
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWRE G + V++Q +CG+CWAFS V + E ++A++ G LSVQE++DC N
Sbjct: 134 PKSIDWREKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYN 193
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GC+GG D++ + ++ E +YP D C + V I SY D
Sbjct: 194 QGCNGGLMDYAFDFV-IQNGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYE-DVPEND 251
Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E ++ +A PV A+ A +Q Y GGV C +++H V VGY
Sbjct: 252 EEALKKAVAGQ-PVSVAIEAGGRDFQLYSGGVFTGRCG---TDLDHGVLAVGY 300
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 148/315 (46%), Gaps = 37/315 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEK 62
+L +L ++ + + P E+ + +++ +++K Y+ E D RF+ F+
Sbjct: 6 ILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKD 65
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHH 120
+L+ I+E N + G+ +F+D++ EE++ +L R + + ++ + H + +
Sbjct: 66 NLNFIDEHNAQNYT---YIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAY 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
N + +PV DWR G I +++Q +CG+CWAFST+ T E+++ + G
Sbjct: 123 NSGDR------------LPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTG 170
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKR 237
L LS QE++DC N GC+GG L+D+ + ++ + YP + C
Sbjct: 171 KLVSLSEQELVDCDRAFNEGCNGG----LMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDP 226
Query: 238 KATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
V I Y +PS + + L H PV A+ A Q Y GV C
Sbjct: 227 TRKKAKIVSIDGYED---VPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCG 283
Query: 295 GSLANINHAVQIVGY 309
SL +HAV IVGY
Sbjct: 284 TSL---DHAVVIVGY 295
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ N+L + + F S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DW E+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+Y GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F R+ K Y S+ E +RF F ++LD I N+ S A + +F+DL+ +EF
Sbjct: 59 FSRFVYRHGKRYQSEDEMKMRFAIFSENLDFIRSTNRKGLSYTLA---VNDFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N +HK TG+ +P KDWRE GI+ V+
Sbjct: 116 QKHRLGAAQNCSATTKGNHK--------------LTGVALPD----TKDWREVGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q+++DCAG N GC GG +++
Sbjct: 158 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N LE E YP +D ACK + + G+++ + + + +E + + PV
Sbjct: 218 KYNG-GLETEEAYPYTGEDGACKFSSENV-GIQVLD-SVNITLGAEDELKEAVGLVRPVS 274
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A ++ +++Y GV + C + ++NHAV VGY
Sbjct: 275 VAFEVVSGFRFYKSGVYTSDTCGSTPMDVNHAVLAVGY 312
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 147/287 (51%), Gaps = 37/287 (12%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E L LF+ F ++Y K Y K E+ RF F ++L I LN Q +A YGIT F+D+
Sbjct: 1417 EYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQG--TATYGITRFADM 1474
Query: 91 SEEEF-KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAG 148
+++EF ++ LR + + + IP +P + DWR+
Sbjct: 1475 TQKEFSRSLGLRTDLRN-----------------ENETPFAQAKIPNIELPKEFDWRKKN 1517
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ +V+NQ+ CG+CWAFS E +AL++G L S QE++DC + + GC+GG
Sbjct: 1518 VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD-DQGCNGG---- 1572
Query: 209 LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D + + K+ LE E +YP +D C T +++ + +E+ +
Sbjct: 1573 LMDTAYRSIEKIGGLETEQDYPYDAEDEKCHFNRTL---ARVQVTGALNISHNETDMAKW 1629
Query: 266 IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA Q+Y+GGV ++ C S N++H V IVGY
Sbjct: 1630 LVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVGY 1674
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 136/285 (47%), Gaps = 31/285 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ + ++ K+Y+ E + RF+ F+ +L I++ N ++ G+ F+D
Sbjct: 45 DEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRT---YTVGLNRFAD 101
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EEF++ +L H + K S + +P DWR+ G
Sbjct: 102 LTNEEFRSMYLGTRTG-------------HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGA 148
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ +V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 149 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGG----L 204
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ E +YP L +D C + V I SY + + ++ + L
Sbjct: 205 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY--EDVPENDETALKKA 262
Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ PV A+ +Q Y GV C SL +H V VGY
Sbjct: 263 VANQPVSVAIEGGGRNFQLYNSGVFTGECGTSL---DHGVAAVGY 304
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 126/280 (45%), Gaps = 31/280 (11%)
Query: 40 FQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQS--PESARYGITEFSDLSEEEFK 96
+ +Y + YS + E RF+ F+ ++ +IE +N E+ R F+DL+++EF+
Sbjct: 44 WMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANR-----FADLTDDEFR 98
Query: 97 TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGK 152
+ + + R+ TTG +P DWR G +
Sbjct: 99 A----------TWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTP 148
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLD 211
++NQ CG CWAFS V + E + L G L LS QE++DC NG + GC GG+ D
Sbjct: 149 IKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFD 208
Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
++ V L ES YP D C S + IK Y D E+S+ +A P
Sbjct: 209 FI-VGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE-DVPANDEASLRKAVANQ-P 265
Query: 272 VIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
V AV+ +++Y GGV+ C L +H + VGY
Sbjct: 266 VSVAVDGGDSHFRFYKGGVLSGACGTEL---DHGIAAVGY 302
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 143/310 (46%), Gaps = 36/310 (11%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
N L+I AL L I V + +LF ++ + + KSY S+ E R K FE +
Sbjct: 2 NFLYIFALTLL----ISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNY 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
D + + N S S + F+DL+ EFKT L ++ L H++ +
Sbjct: 58 DFVTKHNSKGNSSYS--LALNAFADLTHHEFKTSRL--GLSAAPLNLAHRNLE------- 106
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
TG+ IP DWR G++ V++Q +CGACW+FS E ++ + G+L
Sbjct: 107 ----ITGVV--GDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVS 160
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS QE+I+C + N GC GG L+D+ +N ++ E +YP +D C +
Sbjct: 161 LSEQELIECDKSYNDGCGGG----LMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMK 216
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLAN 299
V I Y D +E +L +A PV + + +Q Y G+ C SL
Sbjct: 217 RRVVTIDKYV-DVPENNEKQLLQAVAAQ-PVSVGICGSERAFQMYSKGIFTGPCSTSL-- 272
Query: 300 INHAVQIVGY 309
+HAV IVGY
Sbjct: 273 -DHAVLIVGY 281
>gi|145506497|ref|XP_001439209.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406393|emb|CAK71812.1| unnamed protein product [Paramecium tetraurelia]
Length = 349
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 139/284 (48%), Gaps = 32/284 (11%)
Query: 34 LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFSDLSE 92
++++ ++Q+ + K Y++ E+ RF F+K+ I+E + ++ E+ G+ +F+DLS
Sbjct: 37 MKVYQNWQKEHGKRYTQFENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSV 96
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF+ ++L++ N V +R TG +P + ++KD G++ +
Sbjct: 97 EEFEAKYLKYRSTPR----------EQTNQVYRR---TGKQVPIEVDLRKD----GVVSE 139
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLS--LLSVQEVIDCAGNGNM---GCSGGDFC 207
V+NQ +CG+CWAFS V E+ AL+ G + LS QE++DCA GC GG+
Sbjct: 140 VKNQGSCGSCWAFSAVAALET--ALRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMY 197
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+ +K + SEYP D C K T + Y + P + + A
Sbjct: 198 DGFQY--ASKYGIAIRSEYPYAGVDQKCAAKQTKTR-YQFAGYV--DVEPLSAQAYVEAA 252
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ + +NA + +Q Y G+ CDGS +NH V VGY
Sbjct: 253 SEHALSIGINASGINFQLYKKGIYSAKCDGSKPALNHGVTNVGY 296
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 143/313 (45%), Gaps = 48/313 (15%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKSEHDIRFKNF 60
+K++ F++ +AL NL +LF +F+ +Y K+Y SE + R K
Sbjct: 1 MKSIFFVLFAVALSL----------NLHSDAYYEKLFQTFEAKYGKNYLSSEREYRKKVL 50
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
++D IE+ N + S G+T F+D++ EF T L + K +
Sbjct: 51 AYNMDWIEKFNSDEHS---FTLGMTPFADMTNTEFATSKLCGCMKKPL------------ 95
Query: 121 NHVKKRSITTGITIPTGIPVKK-DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
NH + R + + V+ DWRE G + V+NQ +CG+CWAFS E + +
Sbjct: 96 NHKQARVLNN-------MAVESIDWREKGAVTPVKNQGSCGSCWAFSATGALEGGNFVAT 148
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
G L LS Q+++DC + GC GG ++ V K L E +YP KD CK
Sbjct: 149 GKLVSLSEQQLVDCDTE-DAGCGGGFMDTAFEY--VMKKGLCTEEDYPYHAKDEDCKDDQ 205
Query: 240 TSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
+ + S T +P+ + L T PV A+ A +Q Y GGV+ + G+
Sbjct: 206 CT----SVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGVLDSDMCGT 261
Query: 297 LANINHAVQIVGY 309
++NH V VGY
Sbjct: 262 --SLNHGVLAVGY 272
>gi|118365720|ref|XP_001016080.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297847|gb|EAR95835.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 335
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 152/311 (48%), Gaps = 28/311 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LC LA + V +KL ++ + +Y++ Y +EH+ F+ F + L
Sbjct: 6 LLSIIMLMPLC-LAQDINV------EKLLAYNKWSSQYQRVY-LNEHEKLFRQMIFFEKL 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
++E N N + S + +FSD+++EEF + L + + H+ + H+ +
Sbjct: 58 QKMKEHNSNPNNTYSIH--LNQFSDMTKEEFTQKILMKQDLVGHLTKGASQEATHNDVNS 115
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + + T+ I DWR G + V+NQ CG+CW+FS ES + +KN L
Sbjct: 116 EAQLNSKSPTLAASI----DWRTKGAVTSVKNQGNCGSCWSFSAAGLMESFNFIKNKALV 171
Query: 184 LLSVQEVIDC--AGNG-NM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
S Q+++DC A NG N+ GC GG +D+ +KV + YP + C
Sbjct: 172 DFSEQQLLDCVIAANGYNIHGCDGGWPAYCVDY--ASKVGITTLKNYPYVGVQNKCNVTG 229
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
T+ NG K K + +P+ S+ L PV V+A W Y G+ CD SL
Sbjct: 230 TN-NGFKPKQW---NQVPNTSNDLKMALNFSPVSVLVDANNWDGYQSGIFN-GCDQSLII 284
Query: 300 INHAVQIVGYD 310
+NHAV VGYD
Sbjct: 285 LNHAVLAVGYD 295
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 129/276 (46%), Gaps = 22/276 (7%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
L+ ++ R+ + + RF F++++ +I + N+ R P R + F D++ +EF
Sbjct: 46 LYERWRGRHAVARDLGDKARRFNVFKENVRLIHDFNQ-RDEPYKLR--LNRFGDMTADEF 102
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ +H S HH + + + +P DWR+ G + V++
Sbjct: 103 R---------RHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKD 153
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q CG+CWAFST+ E ++A+K L+ LS Q+++DC GN GC GG ++
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAK 213
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
+ V E YP + A+CK K+ +P V I Y + + ++ S L H PV A
Sbjct: 214 HGGVA-AEDAYPYKARQASCK-KSPAP-AVTIDGY--EDVPANDESALKKAVAHQPVSVA 268
Query: 276 VNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A +Q+Y GV C L +H V VGY
Sbjct: 269 IEASGSHFQFYSEGVFAGRCGTEL---DHGVTAVGY 301
>gi|395509415|ref|XP_003758993.1| PREDICTED: cathepsin L1-like, partial [Sarcophilus harrisii]
Length = 323
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 140/287 (48%), Gaps = 32/287 (11%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITE 86
P L+ + ELF S Y+K+Y++ E R + +EK++ I + N + + + Y G+
Sbjct: 2 PELDSEWELFKS---TYEKNYTEKEESFRKQVWEKNMKFINDQNLLYKEGKLSYYLGMNN 58
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
DL+++EFK +N +L V++ + T +I + +P DWRE
Sbjct: 59 LGDLTDKEFKIM-----LNPSMLQ-----------RVRRDTTTKNFSIFSHLPKSVDWRE 102
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G I VR Q CG+CWAFS E LK G L LS Q +IDC+ GC GG
Sbjct: 103 KGFITPVRQQGRCGSCWAFSATGAVEGQLFLKTGKLVELSKQNLIDCS--KFQGCHGGTV 160
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILT 264
+ ++ N+ ++ E YP + K + + VKI+ Y ++P +E ++
Sbjct: 161 TSAFKYIKKNEGIVSEEC-YPYVAKKNSLCSYRSECAAVKIRDY---VVLPYGNEEILME 216
Query: 265 DIATHGPVIAAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+A GPV ++NA + +Y GG+ ++ C NHA+ +VGY
Sbjct: 217 AVAIVGPVSVSLNAQKSLHFYKGGIYVEPKCKPRYT--NHALLLVGY 261
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 124/267 (46%), Gaps = 28/267 (10%)
Query: 49 SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV---N 105
+ E D RF F +L ++ N+ R R G+ +F+DL+ +EF+ +L V
Sbjct: 74 GEGERDRRFLVFWDNLRFVDAHNE-RAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAAR 132
Query: 106 KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAF 165
+ ++ HD +P DWRE G + V+NQ CG+CWAF
Sbjct: 133 RGAVVGERYRHDGAAEE---------------LPESVDWREKGAVAPVKNQGQCGSCWAF 177
Query: 166 STVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPES 224
S V + ES++ + G + LS QE+++C+ + GN GC+GG A D++ + ++ E
Sbjct: 178 SAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI-IKNGGIDTED 236
Query: 225 EYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQ 282
+YP D C + V I + D E S+ +A H PV A+ A +Q
Sbjct: 237 DYPYRAVDGKCDMNRKNARVVSIDGFE-DVPENDEKSLQKAVA-HQPVSVAIEAGGREFQ 294
Query: 283 YYLGGVIQYNCDGSLANINHAVQIVGY 309
Y GV +C N++H V VGY
Sbjct: 295 LYKSGVFSGSC---TTNLDHGVVAVGY 318
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 145/286 (50%), Gaps = 40/286 (13%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F+ ++ K+Y+ EHD RF+ F+ +L + ++++ A +G+T FSDL+E EF
Sbjct: 58 FQDFKLKFGKTYTTDEEHDYRFRVFKANL---RKAKRHQKLDPDAVHGVTRFSDLTESEF 114
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ + +N+ L + D H + +PT + DWR+ G + V+
Sbjct: 115 RENFV--GLNRLRLPA-----DAHQAPI----------LPTDNLASDFDWRDQGAVTPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
+Q +CG+CW+FS V E + L G L LS Q+++DC AG + GC+GG
Sbjct: 158 DQGSCGSCWSFSAVGALEGANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLM 217
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESS-ILT 264
+ +++ V LE E +YP D +CK + NG S ++I +++ I
Sbjct: 218 TSAFEYI-VKAGGLEREEDYPYTGTDRGSCKFQ----NGKIAASAANFSVISNDADQIAA 272
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ +GP+ +NA+ Q Y+ G+ Y C S N++H V +VGY
Sbjct: 273 NLVKNGPLAIGINAVFMQTYMKGISCPYIC--SKRNLDHGVLLVGY 316
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + + +E + + PV
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 147/287 (51%), Gaps = 37/287 (12%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E L LF+ F ++Y K Y K E+ RF F ++L I LN Q +A YGIT F+D+
Sbjct: 1452 EYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQG--TATYGITRFADM 1509
Query: 91 SEEEF-KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAG 148
+++EF ++ LR + + + IP +P + DWR+
Sbjct: 1510 TQKEFSRSLGLRTDLRN-----------------ENETPFAQAKIPNIELPKEFDWRKKN 1552
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
++ +V+NQ+ CG+CWAFS E +AL++G L S QE++DC + + GC+GG
Sbjct: 1553 VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD-DQGCNGG---- 1607
Query: 209 LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
L+D + + K+ LE E +YP +D C T +++ + +E+ +
Sbjct: 1608 LMDTAYRSIEKIGGLETEQDYPYDAEDEKCHFNRTL---ARVQVTGALNISHNETDMAKW 1664
Query: 266 IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ +GP+ A+NA Q+Y+GGV ++ C S N++H V IVGY
Sbjct: 1665 LVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVGY 1709
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 156/319 (48%), Gaps = 34/319 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q C + WAF+ + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
L+ LS Q ++ C N ++GC G W+ N + E YP AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPAC 226
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
K+ G I + ++ +E++I +A +GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 N-KSGKVVGANIDDHV--HILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT-SCIS 282
Query: 296 SLANINHAVQIVGYDNYSR 314
+N A +VGYD+ S+
Sbjct: 283 K--EVNSAALLVGYDDTSK 299
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/301 (28%), Positives = 131/301 (43%), Gaps = 27/301 (8%)
Query: 21 IPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDI---------RFKNFEKSLDIIEEL 70
IP S + E+ L L+ ++ RY S + + RF F ++ I E
Sbjct: 25 IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N+ P R + +F+D++ +EF+ + H +S + + S
Sbjct: 85 NRRGGRP--FRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGG-------EGGSFRY 135
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G +P DWRE G + +++Q CG+CWAFSTV E ++ +K G L LS QE+
Sbjct: 136 GGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQEL 195
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
+DC N GC GG ++ N + ES YP + C + S + V I Y
Sbjct: 196 VDCDTGDNQGCDGGLMDYAFQFIKRNGGIT-TESNYPYRAEQGRCNKAKASSHDVTIDGY 254
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
D ES++ +A PV AV A +Q+Y GV C +++H V VG
Sbjct: 255 E-DVPANDESALQKAVANQ-PVAVAVEASGQDFQFYSEGVFTGECG---TDLDHGVAAVG 309
Query: 309 Y 309
Y
Sbjct: 310 Y 310
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + + +E + + PV
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)
Query: 3 DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
D+ ++L + + F + S+P L ++ EL+ S R YK K E RF
Sbjct: 6 DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
F++++ IE +NK S + G+ EF+D++ +EF + ++ N ++ S +
Sbjct: 63 FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
N + + P DWRE+G + +V++Q CG CWAFS V + E + +
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G L S QE++DC N N GC+GG D++ N + ES+Y L + C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
+ V+I SY ++P + L T PV IAA L Q+ GG DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFCAGGTY----DG 278
Query: 296 SLAN-INHAVQIVGY 309
S A+ INHAV +GY
Sbjct: 279 SCADRINHAVTAIGY 293
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 145/310 (46%), Gaps = 32/310 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSF---QQRYKKSYSKSEHDIRFKNFEKSL 64
L + +++L L+I V + NL +SF +++ K+Y E + +++ F+ ++
Sbjct: 3 LAVFLIVSLVILSINV-CAATNLFSAQTYQTSFLGWMKKHNKAYHHHEFNDKYQTFKDNM 61
Query: 65 DIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
D I N S ES G+ F+DL+ EE+K +L S+N ++ N V
Sbjct: 62 DFIHNWN----SKESDTVLGLNRFADLTNEEYKKTYLGMSINVNL----------RANQV 107
Query: 124 KKRSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
+ T P+ I DWR+ G + V++Q CG+CWAF+T E H +K G +
Sbjct: 108 PMNGLNFERFTGPSSI----DWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNM 163
Query: 183 SLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
S Q ++DC+G GN GC GG + ++ ++ + E YP C T
Sbjct: 164 VTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYI-IDNDGIATEEAYPYTATQNRCVYNTTM 222
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
G I Y D SES++ I+ PV A++A +T+Q Y GV Q S
Sbjct: 223 L-GTAISGYK-DVPRGSESALTAAISKQ-PVAVAIDASPITFQLYKSGVYQ-EATCSSYR 278
Query: 300 INHAVQIVGY 309
+NH V VGY
Sbjct: 279 LNHGVLAVGY 288
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + + +E + + PV
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 133/285 (46%), Gaps = 29/285 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F ++Y KSY ++ E+ RF F K+L I P +A +G+T+FSDLSEEEF
Sbjct: 89 FVMFMEKYGKSYPTRKEYLHRFGIFVKNL--IRAAEHQALDP-TAVHGVTQFSDLSEEEF 145
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + M +++ G+P + DWR+ G + +V+
Sbjct: 146 E----------RMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKM 195
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q TCG+CWAFST E + + G L LS Q+++DC N GC+GG
Sbjct: 196 QGTCGSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMT 255
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE ES YP + C ++ VK+ ++T T+ E+ I +
Sbjct: 256 NAYKYL-IQSGGLEEESSYPYTGRSGQCNFQSDKI-AVKVSNFT--TIPIDENQIAAHLV 311
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
GP+ +NA+ Q Y+GGV C +NH V +VGY +
Sbjct: 312 RSGPLAVGLNAVFMQTYIGGVSCPLICGKRF--VNHGVLMVGYGD 354
>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
C1, cathepsin L-like, putative, partial [Trypanosoma
cruzi]
Length = 326
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 140/314 (44%), Gaps = 24/314 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 34 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 93
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 94 VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 137
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P KDWR G + V++Q CG+CWAFS + E L
Sbjct: 138 FAAAQERARVPVDVEV-VGAPAAKDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 196
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C + GC GG +W+ N + E YP +
Sbjct: 197 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPP 255
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
TS + V L E+ I +A +GPV AV+A +W Y GGV+ +C
Sbjct: 256 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 313
Query: 298 ANINHAVQIVGYDN 311
++H V +VGY++
Sbjct: 314 -QLDHGVLLVGYND 326
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + + +E + + PV
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 135/288 (46%), Gaps = 28/288 (9%)
Query: 30 LEQKLELFSSFQQ---RYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
L+ K ++FQQ +Y K+Y+ E + RF + ++L+ I N S +
Sbjct: 35 LDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTS---HWLHL 91
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDW 144
F+DL+ +EF+ R + + N ++ +P + DW
Sbjct: 92 NAFADLTTDEFRNR-----------LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDW 140
Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG 204
R+ G + +V+NQ CG+CWAF+T + E ++A+ G L+ LS QE++DC + + GCSGG
Sbjct: 141 RKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGG 200
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI-L 263
W+ + L+ E +YP +D C + V I Y IP + L
Sbjct: 201 LMDYAYQWI-IKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGY---VDIPENDEVAL 256
Query: 264 TDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A H P+ A+ +A ++Q Y GGV Y+ ++NH V +VGY
Sbjct: 257 KKAAAHQPIAVAIEADAKSFQLYGGGV--YDDPTCGTSLNHGVLVVGY 302
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 136/280 (48%), Gaps = 37/280 (13%)
Query: 40 FQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEEEFKT 97
F+ +Y + Y ++ ++ R + F+++ ++E NK ++ E + + + +F D++ EEF
Sbjct: 15 FKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF-- 72
Query: 98 RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD--WREAGIIGKVRN 155
+ +M +K R T + G P+ D WR G + V++
Sbjct: 73 ---------NAVMKGYKK--------GSRGEPTTVFTAEGRPMAADVDWRTKGAVTPVKD 115
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CWAFS + E H LKN L LS QE++DC+ GN GC GG + D++
Sbjct: 116 QGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIK 175
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP---SESSILTDIATHGP 271
N + + ES YP +D +C+ A S TC + +E ++ ++ GP
Sbjct: 176 DNGGI-DTESSYPYEAQDRSCRFDANSIGA------TCTGFVEVQHTEEALHEAVSDIGP 228
Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A++A ++Q+Y GV Y S N++H V VGY
Sbjct: 229 ISVAIDASHFSFQFYSSGVY-YEKKCSPTNLDHGVLAVGY 267
>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
Length = 335
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 147/307 (47%), Gaps = 29/307 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
+ A++ + FL + V P L+ L+ + ++ +Y KSYS E ++ +E+ L +I+
Sbjct: 1 MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60
Query: 69 ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N+ N + EF D ++EEF+ + SV H + KR
Sbjct: 61 LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110
Query: 128 ITTGITIPTGIPVKKDWR-EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
G +P + DWR + G + VR Q C ACWAF+ E+ + G L+ LS
Sbjct: 111 --AGSILPKFV----DWRTKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLS 164
Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
VQ ++DC+ GN GC GGD ++ ++ LE E+ YP KD C+ +P
Sbjct: 165 VQNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCRY---NPKNS 220
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINH 302
K + +L SE ++ +AT GP+ A ++A +++ Y GG+ + NC S + H
Sbjct: 221 KAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTH 278
Query: 303 AVQIVGY 309
V +VGY
Sbjct: 279 GVLVVGY 285
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 122/260 (46%), Gaps = 27/260 (10%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
E + R+ F+++++ IE N S + G+ +F+DL+ EEF+ + + LMS
Sbjct: 21 EKEKRYLIFKENIERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMS 78
Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
+++ + IP DWR G + V++Q TCG CWAFSTV
Sbjct: 79 SSFRYENLSD----------------IPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAI 122
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
E + L+ G L LS Q+++DC GN GC GG ++ + L E YP
Sbjct: 123 EGIIKLQTGNLISLSEQQLVDCTA-GNKGCQGGLMDTAFQYI-IRNGGLTSEDNYPYQGV 180
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVI 289
D C + + +I Y D +E+++L +A PV AV+ +++Y GV
Sbjct: 181 DGTCSSEKAASTEAQITGYE-DVPQNNENALLQAVAKQ-PVSVAVDGGGNDFRFYKSGVF 238
Query: 290 QYNCDGSLANINHAVQIVGY 309
+ +C N+NH V +GY
Sbjct: 239 EGDCG---TNLNHGVTAIGY 255
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 148/296 (50%), Gaps = 39/296 (13%)
Query: 26 SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
+ PNL FS F++++KK+Y S+ EHD RFK F+ +L E ++++ +A +G+
Sbjct: 47 ANPNLLGAEHHFSLFKKKFKKTYASQEEHDYRFKIFKSNLRRAE---RHQKLDPTATHGV 103
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKD 143
T+FSDL+ EF+ + L + + L + +PT +P D
Sbjct: 104 TQFSDLTHSEFRRQFL--GLRRLRL---------------PKDANEAPMLPTNDLPADFD 146
Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
WRE G + V+NQ +CG+CW+FST E + L G L LS Q+++DC G
Sbjct: 147 WREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLVSLSEQQLVDCDHECDPAEEG 206
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDT 254
+ GC+GG + ++ + L E +YP D AC+ T K+ +++ +
Sbjct: 207 ACDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGTDRGACQFDKTKI-AAKVANFSVVS 264
Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
L E I ++ +GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 265 L--DEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 315
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 138/281 (49%), Gaps = 21/281 (7%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E +L+ ++ + S S +E RF F++++ + NK + + + +F+D+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLTEKHKRFNVFKENVMHVHNTNK---MDKPYKLKLNKFADM 90
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EF++ + VN H + +H + + K S+ P DWR+ G +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSV----------PASVDWRKKGAV 140
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V++Q CG+CWAFSTV E ++ +K L LS QE++DC N GC+GG +
Sbjct: 141 TDVKDQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAF 200
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ K + ES YP ++ C + V I + + + E+++L +A
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A +Q+Y GV+ +C+ ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVLTGDCN---TDLNHGVAIVGY 295
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 139/306 (45%), Gaps = 39/306 (12%)
Query: 16 LCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
C ++V+ L+ ++ +Q Y K Y E + R K F+++++ IE N
Sbjct: 17 FCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASN 76
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
N + + + GI +F+DL+ EEF + S +K H + + K S T
Sbjct: 77 -NAGNNKLYKLGINQFADLTNEEF-------------IASRNKFKGHMCSSITKTS--TF 120
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
+P DWR+ G + V+NQ CG CWAFS V E +H L G L LS QE++
Sbjct: 121 KYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELV 180
Query: 192 DCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGV 245
DC G + GC GG L+D D K + L E++YP D C S + V
Sbjct: 181 DCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAV 234
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
I Y D +E ++ +A P+ A++A +Q+Y GV +C L +H
Sbjct: 235 TITGYE-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHG 289
Query: 304 VQIVGY 309
V VGY
Sbjct: 290 VTAVGY 295
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 133/285 (46%), Gaps = 29/285 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F ++Y KSY ++ E+ RF F K+L I P +A +G+T+FSDLSEEEF
Sbjct: 89 FVMFMEKYGKSYPTRKEYLHRFGIFVKNL--IRAAEHQALDP-TAVHGVTQFSDLSEEEF 145
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + M +++ G+P + DWR+ G + +V+
Sbjct: 146 E----------RMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKM 195
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q TCG+CWAFST E + + G L LS Q+++DC N GC+GG
Sbjct: 196 QGTCGSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMT 255
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
++ + LE ES YP + C ++ VK+ ++T T+ E+ I +
Sbjct: 256 NAYKYL-IQSGGLEEESSYPYTGRSGQCNFQSDKI-AVKVSNFT--TIPIDENQIAAHLV 311
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
GP+ +NA+ Q Y+GGV C +NH V +VGY +
Sbjct: 312 RSGPLAVGLNAVFMQTYIGGVSCPLICGKRF--VNHGVLMVGYGD 354
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 131/262 (50%), Gaps = 37/262 (14%)
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---RHSVNKHVLMSHHKHHD 117
+ LD + EL R P +A +G+T+FSDL+ EF+ R L R S+ V H+
Sbjct: 48 DAQLDGLRELRAARLDP-TATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHE--- 103
Query: 118 HHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+PT G+P DWRE G +G V++Q +CG+CW+FST E H
Sbjct: 104 -------------APILPTDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHF 150
Query: 177 LKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL 228
L G L +LS Q+++DC + + GC+GG ++ + L+ E +YP
Sbjct: 151 LATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYL-MKSGGLQSEKDYPY 209
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV 288
++ CK S ++K+++ ++ +E I ++ HGP+ A+NA Q Y+GGV
Sbjct: 210 AGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIAANLVKHGPLAIAINAAYMQTYIGGV 266
Query: 289 -IQYNCDGSLANINHAVQIVGY 309
+ C +++H V +VGY
Sbjct: 267 SCPFICG---RHLDHGVLLVGY 285
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 135/288 (46%), Gaps = 28/288 (9%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQ-SPESARYGITE 86
P L+ +L+ S+ ++K Y + E R +EK+L +IE N + S + G+ +
Sbjct: 128 PELDGHWQLWKSW---HRKDYHEREEGWRRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQ 184
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI--PVKKDW 144
F D++ EEF R +N +V H +++ + P + P DW
Sbjct: 185 FGDMTTEEF-----RQLMNGYV-----------HKKSERKYRGSQFLEPNFLEAPRSVDW 228
Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSG 203
RE G + V++Q CG+CWAFST E H K G L LS Q ++DC+ GN GC+G
Sbjct: 229 REKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNG 288
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G ++ N + + E YP KD R N + D E +++
Sbjct: 289 GLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEYNAANDTGFV-DIPQGHERALM 346
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A GPV A++A ++Q+Y G I Y D S +++H V +VGY
Sbjct: 347 KAVAAVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDLDHGVLVVGY 393
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 128/278 (46%), Gaps = 27/278 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL ++ N+ S R GI F+D+S EEF
Sbjct: 59 FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ +P KDWRE GI+ V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q++IDC N GC+GG +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + CK K + G K+ + + + +E + + PV
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GFKVLD-SVNITLGAEDELKDAVGLVRPVS 276
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A +T ++ Y GV + C + ++NHAV VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 149/309 (48%), Gaps = 40/309 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+ L A C + + + P +Q L+ + ++ +++ YS +E R +EK++ +IE
Sbjct: 5 LVLTAFC---LGIASAAPKFDQNLDTQWYQWKATHRRLYSTNEEGWRRAVWEKNMKMIEL 61
Query: 70 LNKNRQSPESARYGIT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
N ++G T F D++ EEF+ +M ++ H + V +
Sbjct: 62 HNGEY---SRGKHGFTMAMNAFGDMTNEEFRQ-----------VMVCFRNQKHKNGKVFR 107
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ + +P + DWR+ G + V+NQ+ CG+CWAFS E K G L L
Sbjct: 108 GPLL--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSL 161
Query: 186 SVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
S Q ++DC+ GN GC+GG ++ N L+ E+ YP KD CK K +
Sbjct: 162 SEQNLVDCSRPQGNQGCNGGFMNYAFRYVKENG-GLDSEASYPYEAKDGICKYKPEN--- 217
Query: 245 VKIKSYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANI 300
+ + T +IP+ E ++ +AT GP+ AV+A ++Q+Y G+ + C S N+
Sbjct: 218 -SVANDTGFVVIPTHEKELMKAVATVGPISVAVDASHSSFQFYKSGIYFEKKC--SSKNL 274
Query: 301 NHAVQIVGY 309
+H V +VGY
Sbjct: 275 DHGVLVVGY 283
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 129/277 (46%), Gaps = 26/277 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL EE+ + S R GI FSD+S EEF
Sbjct: 64 FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLSYRLGINRFSDMSWEEF 120
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L + ++ NH+ + + +P KDWRE GI+ V++
Sbjct: 121 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKD 164
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G LS Q+++DCAG N GCSGG +++
Sbjct: 165 QSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIK 224
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N + + E YP + C KA N V + + + +E + + PV
Sbjct: 225 YNGGI-DTEESYPYKGVNGVCHYKA--ENAVVQVLDSVNITLNAEDELKNAVGLVRPVSV 281
Query: 275 AVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C + ++NHAV VGY
Sbjct: 282 AFEVINGFRQYKSGVYSSDHCGTTPDDVNHAVLAVGY 318
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 137/284 (48%), Gaps = 28/284 (9%)
Query: 36 LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF ++ + + Y E + R + F+ +L+ I ++N NR+SP S R G+ +F+D++ +E
Sbjct: 43 LFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKSPHSHRLGLNKFADITPQE 102
Query: 95 FKTRHLR--HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
F ++L+ V++ + M++ K +KK + P DWR+ G+I +
Sbjct: 103 FSKKYLQAPKDVSQQIKMANKK--------MKKEQYSCDHP-----PASWDWRKKGVITQ 149
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+ Q CG+ WAFS E+ HA+ G L LS QE++DC GC G +W
Sbjct: 150 VKYQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEESE-GCYNGWHYQSFEW 208
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT---- 268
+ + + + +YP K+ CK + V I Y +TLI S+ S ++
Sbjct: 209 V-LEHGGIATDDDYPYRAKEGRCKANKIQ-DKVTIDGY--ETLIMSDESTESETEQAFLS 264
Query: 269 ---HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
P+ +++A + Y GG+ S INH V +VGY
Sbjct: 265 AILEQPISVSIDAKDFHLYTGGIYDGENCTSPYGINHFVLLVGY 308
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 137/289 (47%), Gaps = 40/289 (13%)
Query: 32 QKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
+ + ++ + + K+Y+ E + RF+ F+ +L ++E N S R G+ F+DL
Sbjct: 42 EAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNA---VAGSYRVGLNRFADL 98
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT-----GITIPTGIPVKKDWR 145
+ EE+++ L ++ +K+RS +T +P DWR
Sbjct: 99 TNEEYRSMFLGGNM-----------------EMKERSASTKSDRYAFRAGDKLPGSVDWR 141
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
E G + V++Q CG+CWAFST+ E ++ + G L LS QE++DC + NMGC+GG
Sbjct: 142 EKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGG- 200
Query: 206 FCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
L+D+ +N ++ E +YP D C + + V I Y D E+S+
Sbjct: 201 ---LMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYE-DVPEDDENSL 256
Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A PV A+ A +Q Y GV +C N++H V VGY
Sbjct: 257 KKAVANQ-PVSVAIEAGGRAFQLYESGVFTGHCG---TNLDHGVVAVGY 301
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ + + ++Y+ E D RF+ F +L ++ N+ R + R G+ +F+DL+ +E
Sbjct: 51 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 109
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ +L + + H + +P DWRE G + V+
Sbjct: 110 FRAAYLGARIPASRRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 158
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CWAFS V + ES++ + G + LS QE+++C+ + GN GC+GG A D++
Sbjct: 159 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 218
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ ++ E +YP D C + V I + D E S+ +A H PV
Sbjct: 219 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 275
Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q Y GV C N++H V VGY
Sbjct: 276 VAIEAGGREFQLYKAGVFTGTC---TTNLDHGVVAVGY 310
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 132/279 (47%), Gaps = 31/279 (11%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y +E +RF F +SL++I+ NK S + G+ +F+D + EEF
Sbjct: 57 FARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLS---YKLGVNQFADWTWEEF 113
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N HK D T +P KDWR+ GI+ V+
Sbjct: 114 RKHRLGAAQNCSATTKGSHKLTD------------------TALPESKDWRKDGIVSPVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ +A +G LS Q+++DC G N GC+GG +++
Sbjct: 156 DQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYI 215
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP D +CK P V ++ + + + +E + +A PV
Sbjct: 216 KYNG-GLDTEEAYPYTGVDGSCK---FVPENVGVQVIDSVNITLGAEDELKHAVAFVRPV 271
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A ++ ++ Y GV N C + ++NHAV VGY
Sbjct: 272 SVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 310
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 144/307 (46%), Gaps = 35/307 (11%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
+L L++ + S + E+ + ++ + ++ K Y+ E D RF+ F+ +L I+E N
Sbjct: 11 FFSLITLSLAMDTSMRSNEEVMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEHN 70
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
+ + G+ +F+D + EE++ +L +D N V K ITTG
Sbjct: 71 AQNYT---YKVGLNKFADTTNEEYRNMYL------------GTKNDAKRN-VMKIKITTG 114
Query: 132 --ITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
+G +PV DWR G + +++Q +CG+CWAFST+ T E+++ + G L LS
Sbjct: 115 HRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSE 174
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
QE++DC N GC+GG L+D+ V ++ E +YP + C +
Sbjct: 175 QELVDCDRAFNEGCNGG----LMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKV 230
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
V I Y + + + L H PV A+ A Q Y GV C N++H
Sbjct: 231 VSIDGY--EDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCG---TNLDH 285
Query: 303 AVQIVGY 309
V +VGY
Sbjct: 286 GVVVVGY 292
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
V + + E+ L++ ++ + KSY+ E + R+ F +L I+E N + S
Sbjct: 27 VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 86
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
R G+ F+DL+ EE++ +L ++ V R + +P
Sbjct: 87 RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 132
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR G + ++++Q CG+CWAFS + E ++ + G L LS QE++DC + N G
Sbjct: 133 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 192
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG D++ +N ++ E +YP KD C + V I SY D SE+
Sbjct: 193 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 250
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
S+ +A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 251 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 297
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 139/282 (49%), Gaps = 33/282 (11%)
Query: 44 YKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHL 100
Y ++Y +E + RFK F+++++ IE +N S + RY I EF+D + EEFK
Sbjct: 43 YGRTYKDIAEKERRFKIFKENVEYIESVN----SAGNRRYKLSINEFADQTNEEFKAS-- 96
Query: 101 RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCG 160
R+ N + + +V +P DWR+ G + +++Q CG
Sbjct: 97 RNGYNMSSRPRSSEITSFRYENV------------AAVPSSMDWRKKGAVTPIKDQGQCG 144
Query: 161 ACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV 219
CWAFS V E + LK G L LS QE++DC +G + GC GG + +++ +
Sbjct: 145 CCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFI-IGNGG 203
Query: 220 LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA- 278
L E+ YP DA C +K + + KIK+Y D SE+++L +A H PV A++A
Sbjct: 204 LTTEANYPYKGVDATCNKKKAASSAAKIKNYE-DVPANSEAALLKAVAQH-PVSVAIDAG 261
Query: 279 -LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
+Q+Y GV C L +H V VGY D+ ++ W
Sbjct: 262 GSDFQFYSSGVFTGQCGTEL---DHGVTAVGYGKTDDGTKYW 300
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ + + ++Y+ E D RF+ F +L ++ N+ R + R G+ +F+DL+ +E
Sbjct: 48 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 106
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ +L + + H + +P DWRE G + V+
Sbjct: 107 FRAAYLGARIPAARRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 155
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CWAFS V + ES++ + G + LS QE+++C+ + GN GC+GG A D++
Sbjct: 156 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 215
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ ++ E +YP D C + V I + D E S+ +A H PV
Sbjct: 216 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 272
Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q Y GV C N++H V VGY
Sbjct: 273 VAIEAGGREFQLYKAGVFSGTC---TTNLDHGVVAVGY 307
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 144/283 (50%), Gaps = 20/283 (7%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEE 93
++++F+ ++ KSY +K E +RF+ F + +IE+ N ++ + S + +F+D++
Sbjct: 42 VWTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNA 101
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EF+ R +N L + K +K+ + + IP DWR+ G + KV
Sbjct: 102 EFRQR-----MNGFKLPAKRKLAKSQP--LKEDGMIFEMPDNVTIPDSVDWRKEGYVTKV 154
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
++Q +CG+CWAFS + E H + G L LS Q ++DC NG + GC+GG +
Sbjct: 155 KDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQY 214
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD--IATHG 270
++ NK + + E+ YP +D C+ K+ T IP + L + IAT G
Sbjct: 215 VETNKGI-DTEASYPYKGRDGRCRFKSEDVGATD----TGFVDIPEGNETLLEAAIATVG 269
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
PV A++A +Q+Y GV Y+ S ++H V VGY++
Sbjct: 270 PVSVAIDAASFKFQFYSHGVY-YDRSCSPEYLDHGVLAVGYNS 311
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
V + + E+ L++ ++ + KSY+ E + R+ F +L I+E N + S
Sbjct: 26 VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
R G+ F+DL+ EE++ +L ++ V R + +P
Sbjct: 86 RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR G + ++++Q CG+CWAFS + E ++ + G L LS QE++DC + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG D++ +N ++ E +YP KD C + V I SY D SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
S+ +A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296
>gi|118365718|ref|XP_001016079.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297846|gb|EAR95834.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 336
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/312 (29%), Positives = 156/312 (50%), Gaps = 29/312 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LCF A + V +KL ++ + ++++ Y +EH+ F+ F +++
Sbjct: 6 LLSIIMLMPLCF-AQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENM 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
I+E N + + S + +FSD+++EEF + L + + H + ++ H ++
Sbjct: 58 QKIQEHNSDPNNTYSTH--LNQFSDMTKEEFVEKILMKQDLVDHFMKGINQETTHSDSNN 115
Query: 124 KKRSITT-GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K+ + + +T+ I DWR G + V+NQ CG+CW FS ES + +KN L
Sbjct: 116 KETQLNSKSLTLADSI----DWRTKGAVTSVKNQGDCGSCWTFSAAGLMESFNFIKNNVL 171
Query: 183 SLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
S Q+++DC G + GCSGG L++ +K+ + +YP + C+
Sbjct: 172 VDFSEQQLLDCVYFTRGYNSYGCSGGWPDQCLNY--ASKIGITTLDKYPYVGVMTNCRGS 229
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
T+ NG K KS+ IP+ S+ L PV V+A T Y G+ CD S
Sbjct: 230 GTN-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTLGIYKSGIFN-GCDQSNI 284
Query: 299 NINHAVQIVGYD 310
++NHAV VGYD
Sbjct: 285 SLNHAVLAVGYD 296
>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
Length = 363
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ K Y +E RF+ F +SL+++ N+ R P R GI F+D+S EEF
Sbjct: 63 FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 119
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ D +P KDWRE GI+ V+
Sbjct: 120 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 162
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q+++DCA N GCSGG +++
Sbjct: 163 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 222
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P V +K + + + +E + + PV
Sbjct: 223 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 278
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C S ++NHAV VGY
Sbjct: 279 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 317
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANGTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|62945374|ref|NP_001017509.1| uncharacterized protein LOC498688 precursor [Rattus norvegicus]
gi|60552853|gb|AAH91563.1| Similar to cathepsin R [Rattus norvegicus]
gi|149039732|gb|EDL93848.1| similar to cathepsin R [Rattus norvegicus]
Length = 334
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 147/306 (48%), Gaps = 28/306 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
+ + + L + V P L+ L+ + ++++Y KSYS E ++R +E++L +I+
Sbjct: 1 MTPAVFIAILCLGVASGAPILDPSLDAEWQEWKKKYDKSYSLEEEELRRAVWEENLKMIK 60
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N +N I EF D + EEF+ + V H + KR+
Sbjct: 61 LHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEFPVQTH----------REGKSIMKRA 110
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
G P + DWR+ G + VR Q C ACWAFS E+ ++G L LSV
Sbjct: 111 --AGSIFPKFV----DWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSV 164
Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC GGD ++ ++ L+ E+ YP KD C+ + + +
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLQSEATYPYEGKDGPCRYNPKN-SSAE 222
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
I + +L SE ++ +AT GP+ A ++A ++++Y G+ + NC S ++ H
Sbjct: 223 ITGFV--SLPESEDILMVAVATIGPISAGIDASHESFKFYKKGIYHEPNC--SSNSVTHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 144/298 (48%), Gaps = 34/298 (11%)
Query: 20 AIPVKVSKPNLE--QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQS 76
A+P+ SKP E + L +F +F Y ++YS + E + R + F++++ + L Q
Sbjct: 156 AVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQG 215
Query: 77 PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
SA YGIT+FSDL+E+EF+ +L +++ L K+ + I
Sbjct: 216 --SAEYGITKFSDLTEDEFRMMYLNPMLSQWSL---------------KKEMKPAIPASA 258
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
P DWR+ G + V+NQ CG+CWAFS E K G L LS QE++DC
Sbjct: 259 PAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDC-DK 317
Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT-- 254
+ C GG + ++ N LE E++Y +C K+ +Y +
Sbjct: 318 LDQACGGGLPSNAYEAIE-NLGGLETETDYSYTGHKQSCDFSTG-----KVAAYINSSVE 371
Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
L E I +A +GPV AA+NA Q+Y GV ++ C+ + I+HAV +VG+
Sbjct: 372 LPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGF 427
>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
Length = 326
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 145/305 (47%), Gaps = 34/305 (11%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
LFI+A++ + L +L+ +++ Y K Y+ ++ + R +E+++ I
Sbjct: 3 LFILAVLTVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51
Query: 68 EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+E N ++ + G+ +F+D++ EEFK ++L ++SH ++ ++ V
Sbjct: 52 QEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNRAV--- 108
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
P K DWRE+G + ++++Q CG+CWAFST T E + T S
Sbjct: 109 ------------PDKIDWRESGYVTELKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+G GNMGCSGG +++ + LE ES YP + C+
Sbjct: 157 EQQLVDCSGPWGNMGCSGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
YT + SE + + GP AV+ + + Y GG+ Q SL +NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-RVNHAV 271
Query: 305 QIVGY 309
VGY
Sbjct: 272 LAVGY 276
>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
Group]
gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 362
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ K Y +E RF+ F +SL+++ N+ R P R GI F+D+S EEF
Sbjct: 62 FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ D +P KDWRE GI+ V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q+++DCA N GCSGG +++
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P V +K + + + +E + + PV
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 277
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C S ++NHAV VGY
Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 129/263 (49%), Gaps = 31/263 (11%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
E D RF+ F+ +L I++ NK S R G+T F+DL+ +E+++++L + K
Sbjct: 61 EKDRRFEIFKDNLRFIDDHNKKNLS---YRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 117
Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
+ ++ G +P I DWR+ G + +V++Q +CG+CWAFST+
Sbjct: 118 TSQRYEAR----------VGDELPESI----DWRKKGAVAEVKDQGSCGSCWAFSTIGAV 163
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
E ++ + G L LS QE++DC + N GC+GG L+D+ + ++ + +YP
Sbjct: 164 EGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIIKNGGIDTDKDYPY 219
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
D C + + V I SY D SE S+ +A H PV A+ A +Q Y
Sbjct: 220 KGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA-HQPVSVAIEAGGRAFQLYDS 277
Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
G+ C L +H V VGY
Sbjct: 278 GIFDGTCGTQL---DHGVVAVGY 297
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 144/298 (48%), Gaps = 34/298 (11%)
Query: 20 AIPVKVSKPNLE--QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQS 76
A+P+ SKP E + L +F +F Y ++YS + E + R + F++++ + L Q
Sbjct: 156 AVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQG 215
Query: 77 PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
SA YGIT+FSDL+E+EF+ +L +++ L K+ + I
Sbjct: 216 --SAEYGITKFSDLTEDEFRMMYLNPMLSQWSL---------------KKEMKPAIPASA 258
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
P DWR+ G + V+NQ CG+CWAFS E K G L LS QE++DC
Sbjct: 259 PAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDC-DK 317
Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT-- 254
+ C GG + ++ N LE E++Y +C K+ +Y +
Sbjct: 318 LDQACGGGLPSNAYEAIE-NLGGLETETDYSYTGHKQSCDFSTG-----KVAAYINSSVE 371
Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
L E I +A +GPV AA+NA Q+Y GV ++ C+ + I+HAV +VG+
Sbjct: 372 LPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGF 427
>gi|27960477|gb|AAO27843.1|AF456459_1 cathepsin R [Rattus norvegicus]
Length = 334
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 145/306 (47%), Gaps = 28/306 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
+ + + L + V P L+ L+ + ++++Y KSYS E ++R +E++L +I+
Sbjct: 1 MTPAVFIAILCLGVASGAPILDPSLDAEWQEWKKKYDKSYSLEEEELRRAVWEENLKMIK 60
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N +N I EF D + EEF+ + V H + KR+
Sbjct: 61 LHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEFPVQTH----------REGKSIMKRA 110
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
G P + DWR+ G + VR Q C ACWAFS E+ + G L LSV
Sbjct: 111 --AGSIFPKFV----DWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQTGKLIPLSV 164
Query: 188 QEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC GD +++ +N LE E+ YP K+ C+ +P K
Sbjct: 165 QNLVDCSKSQGNEGCQWGDPHIAYEYV-LNNGGLEAEATYPYKGKEGVCRY---NPKHSK 220
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
+ +L SE ++ +AT GP+ AV+A ++ +Y G+ + NC S +NH+
Sbjct: 221 AEITGFVSLPESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNC--SNNTVNHS 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/276 (30%), Positives = 135/276 (48%), Gaps = 26/276 (9%)
Query: 40 FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFKT- 97
+++ + KSY E R + F KS+ I N ++ + R G+ +F+D++ EEF+
Sbjct: 22 YKKVHGKSYGHDEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRNF 81
Query: 98 RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
+ L+ K N + + G +PT + DWRE G + V+NQ
Sbjct: 82 KGLKFDATKT-----------KRNGTRFQKELLGEALPTQV----DWREKGYVTPVKNQG 126
Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVN 216
CG+CWAFST + E H G L LS Q ++DC+ GN GC+GG ++ N
Sbjct: 127 QCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQN 186
Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
+ + E YP KD C S G ++K + D E+++ +A+ GPV A+
Sbjct: 187 GGI-DTEESYPYTGKDGDCAFNENSV-GARVKGFV-DVPQRDEAALQAAVASVGPVSVAI 243
Query: 277 NAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
+A ++QYY GV + +C S + ++H V +VGY
Sbjct: 244 DASNDSFQYYKEGVYDEPSC--SFSQLDHGVLVVGY 277
>gi|47227478|emb|CAG04626.1| unnamed protein product [Tetraodon nigroviridis]
Length = 175
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 80/144 (55%), Gaps = 21/144 (14%)
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
LN P+SA+YGI +FSDLSE EFK +LR S ++ + + K
Sbjct: 11 LNSFSTEPQSAKYGINQFSDLSEREFKDLYLRASADRAPVFTGQKIK------------- 57
Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
G+P + DWR+ ++G V+NQQ CG+CWAFS V +S+HA+ + L LSVQ+
Sbjct: 58 -------GLPARFDWRDNAVVGPVQNQQACGSCWAFSVVGAVQSVHAIGSSPLVELSVQQ 110
Query: 190 VIDCAGNGNMGCSGGDFCALLDWM 213
V+DC+ N GC GG L W+
Sbjct: 111 VLDCSFQNN-GCDGGTPINALKWL 133
>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
Length = 521
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 145/306 (47%), Gaps = 26/306 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
F+ +A+ F+A+ + +Q + F+++ + +SY +E RF F+K++D +
Sbjct: 4 FLFVCLAV-FMALQAANAAFTEKQYRDAFTNWMIKNDRSYQSAEFGNRFNVFKKNMDYVN 62
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
E N + E+ +T F+D+S EE++ +L ++ + ++N
Sbjct: 63 EWNS--KGSETV-LDLTIFADISNEEYQRIYLGTKIDATQKLIDAARITMNNNFAAAPVF 119
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
+ DWR+ G + ++NQ CG+CW+FST + E H L G L LS Q
Sbjct: 120 NATV----------DWRQKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFLSTGNLVSLSEQ 169
Query: 189 EVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN--GV 245
++DC+G GN GC+GG ++ NK + + ES YP C A +P G
Sbjct: 170 NLVDCSGPEGNDGCNGGLMDQAFTYIIKNKGI-DTESSYPYKAVQGKC---AFNPKNIGA 225
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
+ YT D SES L A GPV A++A ++Q Y GV Y S ++H
Sbjct: 226 TLTGYT-DVKSGSESD-LEAKANTGPVSVAIDASHNSFQLYGSGVY-YEPKCSATQLDHG 282
Query: 304 VQIVGY 309
V +VGY
Sbjct: 283 VLVVGY 288
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDYAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEE 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
Length = 355
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ KSY S+ E R++ F ++L I NKNR P + + F+D + EEF
Sbjct: 55 FARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRL-PYT--LSVNHFADWTWEEF 111
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
K L + N L +HK +T + PT KDWR+ GI+ V+
Sbjct: 112 KRHRLGAAQNCSATLNGNHK-------------LTDAVLPPT-----KDWRKEGIVSDVK 153
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q +CG+CW FST E+ A G LS Q+++DCAG N GC+GG +++
Sbjct: 154 DQGSCGSCWTFSTTGALEAACAQAFGKSISLSEQQLVDCAGRFNNFGCNGGLPSQAFEYI 213
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N LE E YP KD CK A + I S + + +E+ + +A PV
Sbjct: 214 KYNG-GLETEEAYPYTGKDGVCKFSAENVAVQVIDS--VNITLGAENELKHAVAFVRPVS 270
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + + +Y GV + C + ++NHAV VGY
Sbjct: 271 VAFQVVNGFHFYENGVYTSDICGSTSQDVNHAVLAVGY 308
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 146/299 (48%), Gaps = 41/299 (13%)
Query: 21 IPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR----FKNFEKSLDIIEELNKNRQS 76
+P + + + L LF F Y K YS E R F K +I+E+++
Sbjct: 150 VPSSELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQG--- 206
Query: 77 PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
+A YG+T++SDL+E+EF++ +L ++ L + K++I ++ P
Sbjct: 207 --TAEYGVTKYSDLTEDEFRSLYLNPLLSSKPL------------YQMKKAIVPNMSAPD 252
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
+ DWR+ G + +V+NQ CG+CWAFS + E LK G+L LS QE++DC G
Sbjct: 253 ----QWDWRDHGAVTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCDGV 308
Query: 197 GNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ C+GG + + K+ +E E EY C + K+ +Y ++
Sbjct: 309 -DHACAGGLPSNAYE--AIEKLGGIETEQEYSYEGHKNTCSFSTS-----KVSAYINSSV 360
Query: 256 -IP-SESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
IP E+ I +A +GP+ A+NA Q+Y G+ + C+ + I+HAV +VGY
Sbjct: 361 EIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGISHPFRILCNPWM--IDHAVLLVGY 417
>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
Length = 367
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ K Y +E RF+ F +SL+++ N+ R P R GI F+D+S EEF
Sbjct: 67 FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 123
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ D +P KDWRE GI+ V+
Sbjct: 124 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 166
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q+++DCA N GCSGG +++
Sbjct: 167 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 226
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P V +K + + + +E + + PV
Sbjct: 227 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 282
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C S ++NHAV VGY
Sbjct: 283 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 321
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
Length = 330
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 144/317 (45%), Gaps = 32/317 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
++ L+A+ L VS+ Q++ F ++ ++ K YS E+ R + F ++
Sbjct: 1 MVLSATLLAIALLGGVCCVSEFTF-QEIVSFKTWMTQHNKHYSSEEYSYRLRTFIQNKRK 59
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+EE N R S R G+ +FSD++ EFK +L L NHV
Sbjct: 60 VEEHNSGRHS---YRMGLNQFSDMTFSEFKKLYL--------LREPQNCSATRGNHV--- 105
Query: 127 SITTGITIPTGIPVKKDWREAG-IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
++ G P DWR G + V+NQ CG+CW FST ES A+K G L L
Sbjct: 106 -LSMGP-----YPDFVDWRTKGNYVTPVKNQGGCGSCWTFSTTGCLESAIAIKTGKLLSL 159
Query: 186 SVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN- 243
+ Q+++DCAG N GC+GG +++ N LE E +YP +D C+ + PN
Sbjct: 160 AEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYNG-GLEAEKDYPYTAQDQHCQYQ---PNK 215
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANIN 301
V + E+ I+ +A PV A +QY G NCD + +N
Sbjct: 216 AVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDFFQYEGGVYSNSNCDSTPDKVN 275
Query: 302 HAVQIVGY--DNYSRTW 316
HAV VGY N ++ W
Sbjct: 276 HAVLAVGYGVQNGTKYW 292
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 144/303 (47%), Gaps = 32/303 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L A+C+ + + P +Q L+ + ++ +K+ Y +E R +EK++ +IE N
Sbjct: 7 LAAVCW---GIASAIPKFDQNLDTQWYQWKATHKRLYGLNEEGWRRAVWEKNMRMIELHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
Q G+ + D++ EEF+ +M+ ++ H + + +
Sbjct: 64 GEYSQGKHGFTMGMNAYGDMTNEEFRQ-----------VMNGFQNQKHKKGKMFRDPLL- 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ P + DWRE G + V+NQ CG+CWAFS E K G L LS Q +
Sbjct: 112 -LQYPKSV----DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ GN GC+GG ++ N L+ E YP D CK K + +
Sbjct: 167 VDCSHPQGNQGCNGGLMDYAFQYVKDNS-GLDSEESYPYEGMDGTCKYKPE----CSVAN 221
Query: 250 YTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
T IP E ++L +AT GP+ AA++A +++Q+Y G I Y+ D S +++H + +
Sbjct: 222 DTGFVDIPGHEKALLRAVATVGPISAAIDAGHMSFQFYKSG-IYYDPDCSSKDLDHGILV 280
Query: 307 VGY 309
VGY
Sbjct: 281 VGY 283
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 144/309 (46%), Gaps = 30/309 (9%)
Query: 8 LFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+ +A++ALC A+ P L+ EL+ S+ + K Y + E R +EK+L
Sbjct: 1 MLPLAVVALCLSAALSAPSLDPQLDDHWELWKSW---HSKKYHEKEEGWRRMVWEKNLKK 57
Query: 67 IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE N ++ S R G+ F D++ EEF+ LM+ +K +
Sbjct: 58 IELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQ-----------LMNGYK------RKAET 100
Query: 126 RSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
++ + P + P DWR+ G + V++Q CG+CWAFST E H K G L
Sbjct: 101 KARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLV 160
Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q ++DC+ GN GC+GG ++ N+ L+ E YP L D +
Sbjct: 161 SLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ-GLDSEDSYPYLGTDDQPCHYDPTY 219
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
N V + D E +++ +A GPV A++A ++Q+Y G I Y + S +
Sbjct: 220 NSVNDTGFV-DIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG-IYYEKECSSEEL 277
Query: 301 NHAVQIVGY 309
+H V +VGY
Sbjct: 278 DHGVLVVGY 286
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 147/302 (48%), Gaps = 38/302 (12%)
Query: 22 PVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
P+++ EQ L++ F+ F +Y K Y S E RF+ F ++L++I+ N
Sbjct: 29 PIRLVSDLEEQVLQVIGQTRHAASFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTN 88
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITT 130
K R S + G+ F+DLS +EF+T+ L + N L+ +HK D
Sbjct: 89 KKRLS---YKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKLTD------------- 132
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ +KDWR+ I+ +V++Q CG+CW FST E+ +A +G LS Q++
Sbjct: 133 -----AVLSAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQL 187
Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DCAG N GC+GG +++ N + E EYP KD A K A + V++
Sbjct: 188 VDCAGAFNNFGCNGGLPSQAFEYIKYNGGI-ALEKEYPYTAKDEASKFTAENV-AVRVLD 245
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIV 307
+ + + +E + +A PV A + ++ Y GV + C + ++NHAV V
Sbjct: 246 -SVNITLGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAV 304
Query: 308 GY 309
GY
Sbjct: 305 GY 306
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 126/279 (45%), Gaps = 30/279 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL EE+ Q S R GI +SD+S EEF
Sbjct: 62 FARFAVRYGKSYESAAEVQRRFRIFSESL---EEVRSTNQKGLSYRLGINRYSDMSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + L +H+ D + +P KDWRE GI+ V+
Sbjct: 119 QASRLGAAQTCSATLRGNHRMQDAN-----------------ALPETKDWREDGIVSPVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 162 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 221
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P ++ + + + +E + + PV
Sbjct: 222 KYNG-GLDTEESYPYKGVNGVCHYK---PENAAVQVLDSVNITLNAEDELQNAVGLVRPV 277
Query: 273 IAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + QY G +C + ++NHAV VGY
Sbjct: 278 SVAFEVINGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 316
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 148/312 (47%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+G ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGSYITDGFPFI-INNGGINTEENYPYTAQDGECNVDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A ++ Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>gi|66815893|ref|XP_641963.1| cysteine protease 4 [Dictyostelium discoideum AX4]
gi|166201984|sp|P54639.2|CYSP4_DICDI RecName: Full=Cysteine proteinase 4; Flags: Precursor
gi|60469981|gb|EAL67962.1| cysteine protease 4 [Dictyostelium discoideum AX4]
Length = 442
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + +K Q F+++ Q ++++YS E + R++ F+ ++D + +
Sbjct: 4 LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ +E++T +L + L+ + +K T
Sbjct: 64 NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
T+ DWR G + ++NQ CG CW+FST + E H + +GT L LS
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
Q +IDC+ + GN GC GG +++ +N ++ ES YP +D CK K TS G
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLAFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
+I SY + SE+S L + + PV A++A ++Q Y G I Y S ++H
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACSPTQLDHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 133/281 (47%), Gaps = 30/281 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
E + F+ + K+Y S E RF F+K+L I+E NK + E S +T+F+D++
Sbjct: 21 EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF V L S + + T I + DWR+ G +
Sbjct: 81 EEFLDLLKLQGV--PALPSDAVYFEE-----------TDIEEKDAV----DWRKEGAVTP 123
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V+NQ CG+CWAFS V E KNGTL LS QE++DCA GN GC+GG
Sbjct: 124 VKNQGHCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAF 183
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D+++ + + E YP K + C+ K+K+Y L+ +E I ++ G
Sbjct: 184 DFVEDEGI--QTEESYPYKAKRSICQMNGEYV--TKVKTY---HLLLNEQEIARAVSAKG 236
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
PV A++A +Y G++ C S ++NH V +VGY
Sbjct: 237 PVAVAIDASQLSFYDQGIVDEKCKCSKKREDLNHGVLVVGY 277
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 153/311 (49%), Gaps = 35/311 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQK-----LELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
I+AL+ F+A+ + Q+ + L+ ++ ++ K ++ +E + RF F+
Sbjct: 9 IMALLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKD 68
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+L I+E+N R G+ F+DL+ EE+++R+L S + + + +
Sbjct: 69 NLKFIDEINAQNLP---YRLGLNVFADLTNEEYRSRYLGGK-----FASGSRRNRTSNRY 120
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
+ + G +P I DWR G + V++Q +CG+CWAFSTV + E+++ + G L
Sbjct: 121 LPR----LGDDLPDSI----DWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDL 172
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKA 239
LS QE++DC + N GC+GG L+D+ + L+ E +YP D++C +
Sbjct: 173 IALSEQELVDCDRSYNEGCNGG----LMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYK 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLA 298
+ V I SY D + +E ++ ++ +A ++Q Y G+ C
Sbjct: 229 KNAKVVAIDSYE-DVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCG---T 284
Query: 299 NINHAVQIVGY 309
+++H V +VGY
Sbjct: 285 DLDHGVNVVGY 295
>gi|118373813|ref|XP_001020099.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89301866|gb|EAR99854.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 332
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 37/316 (11%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD-----IRFKN 59
K +L +AL+ +LA V + +KL ++ + + +++ E + F+N
Sbjct: 4 KFILLSIALLMPIYLAQNVSI------EKLLAYNKWSTQNLRAFLSDEEKLFRQLVFFEN 57
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+K D N Q + + +FSD++EEEF + L S HV+ H K +
Sbjct: 58 LQKVKD------HNSQDHHTYSLDLNQFSDMTEEEFVEKVLMKS---HVVDLHIKQATSN 108
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
++ S +T V DWR G + V+NQ CG+CW FS ES + +KN
Sbjct: 109 NSTSSASSNSTSNNAT----VTVDWRTKGAVTSVKNQGQCGSCWTFSAAGLMESFNFIKN 164
Query: 180 GTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
L+ S Q+++DC G G+ GC+GG + LD+ K + YP + C
Sbjct: 165 KNLTNFSEQQLVDCVNSANGYGSNGCNGGWPASCLDYSS--KFGITTLQNYPYVGVQKKC 222
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
T+ NG K KS+ IP+ S L + PV V+A TW +Y GV YN C+
Sbjct: 223 NITGTN-NGFKPKSW---KQIPNTSKDLQNALNFSPVSVVVDASTWSHYRSGV--YNGCN 276
Query: 295 GSLANINHAVQIVGYD 310
+ +NHAV VGYD
Sbjct: 277 QTKIQLNHAVLAVGYD 292
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 142/281 (50%), Gaps = 31/281 (11%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ S+ ++ K+Y+ E D RF+ F+ +L I+E N + + G+ +F+DL+ EE
Sbjct: 51 LYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHT---YKLGLNKFADLTNEE 107
Query: 95 FKTRHLR-HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
++ + +++ +S K + + +G ++P + DWRE G + V
Sbjct: 108 YRMTYTGIKTIDDKKKLSKMKSDRYAYR--------SGDSLPEYV----DWREQGAVTDV 155
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW- 212
++Q +CG+CWAFST + E ++ + G L +S QE+++C + N GC+GG L+D+
Sbjct: 156 KDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGG----LMDYA 211
Query: 213 --MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+ ++ E +YP KD C + + V I SY D + ESS+ ++
Sbjct: 212 FEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYE-DVPVNDESSLKKAVSNQ- 269
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A +Q+Y G+ +C +L +H V GY
Sbjct: 270 PVAVAIEAGGRDFQFYTSGIFTGSCGTAL---DHGVLAAGY 307
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 153/319 (47%), Gaps = 42/319 (13%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLD 65
L V+LI LCF I + KP E ++ + + K+YS +SE ++R+ ++ +++
Sbjct: 3 ALIFVSLITLCFGYI---IEKPIRESSWYVW---KMAHNKAYSHESEENVRYAIWKDNMN 56
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I E N ++ + F D++ EF+ + +N +L HKH +
Sbjct: 57 RITEYNSKSKN---VILRMNHFGDMTNTEFRAK-----MNGLLL---HKHQN-------- 97
Query: 126 RSITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ +P T P DWR G + V+NQ CG+CWAFS+ E H K G L
Sbjct: 98 ---GSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLV 154
Query: 184 LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q ++DC+ + GN GC+GG ++ N + + E+ YP +D C R + S
Sbjct: 155 SLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGI-DTETGYPYEGQDGTC-RYSKSS 212
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLAN 299
G + D E ++ +AT GPV A++A +++Q+Y GV + C S +
Sbjct: 213 IGADDTGFV-DIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQC--SPSA 269
Query: 300 INHAVQIVGY--DNYSRTW 316
++H V +VGY DN W
Sbjct: 270 LDHGVLVVGYGTDNGKDYW 288
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 159/325 (48%), Gaps = 47/325 (14%)
Query: 1 MFDVKNVLFIVAL-IALCFLAIPVKVSKPNL----EQKLELFSSFQQRYKKSYSK-SEHD 54
+F + +LF+ + A+ I K K + E+ E++ + ++ K YS E++
Sbjct: 4 LFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLVEYE 63
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR------HSVNKHV 108
RF+ F+ +L I+E N + + G+T ++DL+ EEF+ +L H + + +
Sbjct: 64 KRFEIFKDNLKFIDEHNSENHT---YKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLKRTI 120
Query: 109 LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTV 168
+S ++ N +P + DWR+ G + V+NQ CG+CWAFSTV
Sbjct: 121 NISERYAYEAGDN----------------LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTV 164
Query: 169 ETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL 228
T ES++ ++ G L LS Q+++DC N GC GG F ++ ++ ++ E+ YP
Sbjct: 165 STVESINQIRTGNLISLSEQQLVDC-NKKNHGCKGGAFVYAYQYI-IDNGGIDTEANYPY 222
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNALT--WQYY 284
C+ + V+I Y +P +E+++ +A+ P + A++A + +Q+Y
Sbjct: 223 KAVQGPCR---AAKKVVRIDGYKG---VPHCNENALKKAVASQ-PSVVAIDASSKQFQHY 275
Query: 285 LGGVIQYNCDGSLANINHAVQIVGY 309
G+ C L NH V IVGY
Sbjct: 276 KSGIFSGPCGTKL---NHGVVIVGY 297
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 151/329 (45%), Gaps = 39/329 (11%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEH 53
F + VA+ + + +I +S+P L+ +L + Q+R+ + +K E
Sbjct: 3 FKHMQIFLFVAIFSSFYFSI--SLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEK 55
Query: 54 DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH-SVNKHVLMSH 112
R+ F+ +++ IE LN N + + + + +F+DL+ +EF++ + V+ S
Sbjct: 56 SNRYVVFKSNVERIEHLN-NIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQ 114
Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
K + +V ++ P+ DWR G + ++NQ +CG CWAFS V E
Sbjct: 115 TKTTSFRYQNVSSGAL----------PISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIE 164
Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
+K G L LS Q+++DC N + GC GG + + + L ES YP +D
Sbjct: 165 GATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHI-MATGGLTTESNYPYKGED 222
Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQ 290
A C K T+P I Y D + E +++ +A H PV + +Q+Y GV
Sbjct: 223 ATCNSKKTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFT 280
Query: 291 YNCDGSLANINHAVQIVGYD---NYSRTW 316
C L +HAV +GY N S+ W
Sbjct: 281 GECTTYL---DHAVTAIGYGQSTNGSKYW 306
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 147/314 (46%), Gaps = 37/314 (11%)
Query: 8 LFIVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
+ I L+ L F A + + + + ++++ + +++K Y+ E + RF+ F+ +
Sbjct: 4 MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHN 121
L I++ N + G+ +F+D++ +E++ +L R + V+ + + H + +N
Sbjct: 64 LGFIQDHNAQNNT---YTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+ +PV DWR G +G +++Q CG+CWAFSTV E ++ + G
Sbjct: 121 SGDQ------------LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGE 168
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
LS QE++DC + GC+GG L+D+ + ++ E +YP D C
Sbjct: 169 FVSLSEQELVDCDREYDEGCNGG----LMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDET 224
Query: 239 ATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
V+I Y +PS + + L +H PV A+ A Q Y GV C
Sbjct: 225 KKKTKVVQIDGYED---VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGT 281
Query: 296 SLANINHAVQIVGY 309
+L +H V +VGY
Sbjct: 282 AL---DHGVVVVGY 292
>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 150/316 (47%), Gaps = 28/316 (8%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P+ DWR+ G + V++Q C + WAFS + E +
Sbjct: 111 ALKRPRKV---VNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRK 238
L+ LS Q ++ C N ++GC G W + NK + E YP
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCELGLKDPAFQWILWSNKGNVFTEQSYPYASGGGNVPTC 226
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
S V K L E +I +A GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 DMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVAIAVDATSFQRYTGGVLT-SCISR-- 283
Query: 299 NINHAVQIVGYDNYSR 314
+N+ +VGYD+ S+
Sbjct: 284 RLNYGALLVGYDDTSK 299
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 129/276 (46%), Gaps = 19/276 (6%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
L+ ++ R+ + + RF F+ ++ +I E N+ R P R + F D++ +EF
Sbjct: 155 LYERWRGRHALARDLGDKARRFNVFKANVRLIHEFNR-RDEPYKLR--LNRFGDMTADEF 211
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ RH V +HH+ + + +P DWR+ G + V++
Sbjct: 212 R----RHYAGSRV--AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 265
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q CG+CWAFST+ E ++A+K L+ LS Q+++DC N GC+GG ++
Sbjct: 266 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 325
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
+ V E YP + A+CK K+ +P V I Y + + ++ S L H PV A
Sbjct: 326 HGGVAA-EDAYPYRARQASCK-KSPAPV-VTIDGY--EDVPANDESALKKAVAHQPVSVA 380
Query: 276 VNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A +Q+Y GV C L +H V VGY
Sbjct: 381 IEASGSHFQFYSEGVFSGRCGTEL---DHGVAAVGY 413
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 145/308 (47%), Gaps = 32/308 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLD 65
V+F+ + + + + + + ++ F + Y + Y ++ +R F+ F+ +++
Sbjct: 7 VVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE N ++ S GI +F+D++ EF ++ + ++++
Sbjct: 67 HIETFNSRNEN--SYTLGINQFTDMTNNEF--------------IAQYTGGISRPLNIER 110
Query: 126 RSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ + + + +P DWR+ G + V+NQ CGACWAF+ + T ES++ +K G L
Sbjct: 111 EPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEP 170
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS Q+V+DCA GC GG +++ NK V + YP CK PN
Sbjct: 171 LSEQQVLDCA--KGYGCKGGWEFRAFEFIISNKGVAS-GAIYPYKAAKGTCKTNGV-PNS 226
Query: 245 VKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANIN 301
I Y +P +ESS++ ++ P+ AV+A +QYY GV C SL N
Sbjct: 227 AYITGY---ARVPRNNESSMMYAVSKQ-PITVAVDANANFQYYKSGVFNGPCGTSL---N 279
Query: 302 HAVQIVGY 309
HAV +GY
Sbjct: 280 HAVTAIGY 287
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 138/284 (48%), Gaps = 36/284 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F++R+ K Y S EHD R F+ ++ ++++ +A +G+T+FSDL+ EF
Sbjct: 49 FTVFKRRFGKVYASDEEHDYRLSEFKANM---RRAKQHQELDPAAVHGVTQFSDLTPTEF 105
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ + L +N+ + T +PT +P DWR+ G + V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 149
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ TCG+C +FST E + L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 NQGTCGSCCSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ ++ + L E ++P D R + K+ +++ +L E I ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDHPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA+ Q Y+GGV Y C L +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 307
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 140/303 (46%), Gaps = 33/303 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
L A+ L + V S +LF ++ ++Y K+YS E R K FE++ + +
Sbjct: 5 LWAVSILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQ-- 62
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
N + S + F+DL+ EFK L S + + RS+ T
Sbjct: 63 HNSMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSI---------------RSVGTP 107
Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
+ +P DWR++G + V++Q CG CW+FST E ++ + G+L LS QE++
Sbjct: 108 VQ-ELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELV 166
Query: 192 DCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
DC + N GC GG L+D+ + ++ E++YP + D C ++ + V I
Sbjct: 167 DCDRSYNSGCEGG----LMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTID 222
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQI 306
YT + P++ L + PV + + T+Q Y GV C +L +HAV I
Sbjct: 223 GYT--DIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTL---DHAVLI 277
Query: 307 VGY 309
VGY
Sbjct: 278 VGY 280
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 7 VLFIVALIA--LCFL-AIPVKVSKPNLEQKLELF--SSFQQRYKKSY-SKSEHDIRFKNF 60
V F VA A L F + P+++ EQ L++ S F RY K Y + E RFK F
Sbjct: 9 VFFCVATAAAGLSFHDSNPIRMVSDMEEQLLQVIGESRFANRYGKRYDTVDEMKRRFKIF 68
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHH 119
++L +I+ NK R G+ F+D + EEF++ L + N L +H+ D
Sbjct: 69 SENLQLIKSTNKKRLG---YTLGVNHFADWTWEEFRSHRLGAAQNCSATLKGNHRITD-- 123
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+P +KDWR+ GI+ +V++Q CG+CW FST ES +A
Sbjct: 124 ----------------VVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAF 167
Query: 180 GTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
G LS Q+++DCAG N GC+GG +++ N LE E YP ++ C K
Sbjct: 168 GKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNG-GLETEEVYPYTGQNGLC--K 224
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ-YNCDGS 296
TS N + + + +E + +A PV A + ++ Y GV C +
Sbjct: 225 FTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTGTTCGST 284
Query: 297 LANINHAVQIVGY 309
++NHAV VGY
Sbjct: 285 PMDVNHAVLAVGY 297
>gi|354504701|ref|XP_003514412.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
gi|344245862|gb|EGW01966.1| Cathepsin R [Cricetulus griseus]
Length = 333
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 147/311 (47%), Gaps = 29/311 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
++ + L FL + V + P + L+ + +++ Y K+YS+ E + +E ++ +I+
Sbjct: 1 MILAVLLGFLYLGVASAAPTPDYSLDAEWEEWKKSYDKTYSQEEERQKRAVWEDNVKMIK 60
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
L+ +N + + EF DL+ EE K + + VL + H VK
Sbjct: 61 LLSMENGLGMNNFTVEMNEFGDLTGEEMK----KMMTDSSVLTLRNGKHMQRLGDVK--- 113
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
IP DWR G +G VR Q CGACWAF+ + ES K G ++ LSV
Sbjct: 114 ----------IPKTLDWRTQGYVGPVRKQNGCGACWAFAVAASIESQLFKKTGKMTQLSV 163
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q +IDCA + GC GG ++ NK LE E+ YP K+ C+ +A + VK
Sbjct: 164 QNLIDCARSYSTYGCKGGLVYGAFLYVKNNK-GLEAEATYPYEAKEGRCRYRAER-SVVK 221
Query: 247 IKSYTCDTLIP-SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
I + ++P +E +++ + THGP+ ++A ++ Y GG I + N H
Sbjct: 222 ITRF---LVVPRNEEALMNALVTHGPIAVGIDAGHESFTNYAGG-IYHEPKCKTDNPTHG 277
Query: 304 VQIVGYDNYSR 314
+ +VG+ R
Sbjct: 278 LLLVGFGYEGR 288
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 138/294 (46%), Gaps = 39/294 (13%)
Query: 31 EQKLE-LFSSFQQRYKKSYSKS---------EHDIRFKNFEKSLDIIEELNKNRQSPESA 80
E++L+ LF S+ ++ KSY+ + E R+ F+ +L I N+ Q
Sbjct: 50 EERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQG---Y 106
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
G+ F+DL+ EEF+ + RH H + + V+ + + P
Sbjct: 107 FLGLNAFADLTNEEFRAQ--RHGGRFDRSRERTSHEEFRYGSVQLKDL----------PD 154
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWRE G + V++Q +CG+CWAFS V E ++ L G L LS QE++DC + G
Sbjct: 155 SIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEG 214
Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
C+GG L+D+ + L+ E++YP C R + V I Y D +
Sbjct: 215 CNGG----LMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYE-DVPVN 269
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E+++L +A H PV A++A + Q+Y G+ C +++H V VGY
Sbjct: 270 DETALLKAVA-HQPVSVAIDAGGSSMQFYRSGIFTGRCG---TDLDHGVTNVGY 319
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 136/283 (48%), Gaps = 30/283 (10%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ ++ + ++ K+Y+ E RF+ F+ +L I+E N + + G+T+F+DL+
Sbjct: 1 MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHT---YKVGLTKFADLTN 57
Query: 93 EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EE++ L S K LM + + + G +P + DWR G +
Sbjct: 58 EEYRAMFLGTRSDAKRRLMKSKSPSERY-------AFKAGDKLPESV----DWRAKGAVN 106
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
+++Q +CG+CWAFSTV E ++ + G L LS QE++DC N GC+GG L+D
Sbjct: 107 PIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGG----LMD 162
Query: 212 W---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ +N L+ E +YP + D C + V I + + ++P + L
Sbjct: 163 YAFQFIINNGGLDTEKDYPYVGDDDKCDKDKMKTKAVSIDGF--EDVLPYDEKALQKAVA 220
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A + Q+Y GV C +L +H V +VGY
Sbjct: 221 HQPVSVAIEASGMALQFYQSGVFTGECGTAL---DHGVVVVGY 260
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 32/301 (10%)
Query: 14 IALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
I LC L V + +L + F F ++ K+YS +SE RFK F+ +L+ E +N
Sbjct: 4 IMLCLLVCGVVHAATYDLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLE--EIIN 61
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS-HHKHHDHHHNHVKKRSITT 130
KN Q+ +A+Y I +FSDLS+EE +++K+ +S H+ + + R
Sbjct: 62 KN-QNDSTAQYEINKFSDLSKEE--------AISKYTGLSLPHQTQNFCEVVILDRPPDR 112
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G P++ DWR+ + V+NQ CGACWAF+T+ + ES A+K L LS Q+
Sbjct: 113 G-------PLEFDWRQFNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQF 165
Query: 191 IDCAGNGNMGCSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
IDC N GC GG A M++ V + ES+YP + C+ +PN +
Sbjct: 166 IDC-DRVNAGCDGGLLHTAFESAMEMGGVQM--ESDYPYETANGQCR---INPNRFVVGV 219
Query: 250 YTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+C I L D+ GP+ A++A Y G+++ + L NHAV +VG
Sbjct: 220 RSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMRQCANHGL---NHAVLLVG 276
Query: 309 Y 309
Y
Sbjct: 277 Y 277
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 133/281 (47%), Gaps = 29/281 (10%)
Query: 37 FSSFQQRYKK--SYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++S+ ++ K + S S D RF+ F+++ IEE NR S R G+ +FSDL+ EE
Sbjct: 13 YASWCAKFGKECASSNSLGDRRFETFKENFRYIEE--HNRAGKHSYRLGLNQFSDLTSEE 70
Query: 95 FKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
F+ R L R + ++ + D I G +P DWR+ G +
Sbjct: 71 FRQRFLGLRPDLIDSPVLKMPRDSD----------IEEGFQ-NVDLPASVDWRKHGAVTA 119
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
++Q +CG CWAF+T E ++ + G L LS QE+IDC + GC GG +
Sbjct: 120 PKDQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQF 179
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
+ V L+ E++YP ++ C K + V I Y IP E ++L +A
Sbjct: 180 I-VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGY---EAIPDGDEQALLRAVAKQ- 234
Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ + +Q+Y GV +C INH V IVGY
Sbjct: 235 PVSVAIEGASKDFQHYASGVFTGHCG---EEINHGVLIVGY 272
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 131/283 (46%), Gaps = 24/283 (8%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFS 88
E+ L++ ++ + KSY+ E + R+ F +L I+E N + S R G+ F+
Sbjct: 34 EEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DL+ EE++ +L ++ V R + +P DWR G
Sbjct: 94 DLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPESVDWRTKG 139
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ ++++Q CG+CWAFS + E ++ + G L LS QE++DC + N GC+GG
Sbjct: 140 AVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 199
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
D++ +N ++ E +YP KD C + V I SY + + P+ + L
Sbjct: 200 AFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSY--EDVTPNSETSLQKAVR 256
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 257 NQPVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 128/278 (46%), Gaps = 24/278 (8%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
L+ ++ R+ + + RF F+ ++ +I E N+ R P R + F D++ +EF
Sbjct: 48 LYERWRGRHALARDLGDKARRFNVFKANVRLIHEFNR-RDEPYKLR--LNRFGDMTADEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG--IPVKKDWREAGIIGKV 153
+ +H S HH + S + +P DWR+ G + V
Sbjct: 105 R---------RHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDV 155
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
++Q CG+CWAFST+ E ++A+K L+ LS Q+++DC N GC+GG ++
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ V E YP + A+CK K+ +P V I Y + + ++ S L H PV
Sbjct: 216 AKHGGVA-AEDAYPYRARQASCK-KSPAPV-VTIDGY--EDVPANDESALKKAVAHQPVS 270
Query: 274 AAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y GV C L +H V VGY
Sbjct: 271 VAIEASGSHFQFYSEGVFSGRCGTEL---DHGVTAVGY 305
>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 330
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 145/308 (47%), Gaps = 40/308 (12%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIE 68
I++ + L FL + + + +E F FQ ++ K+Y E ++ R F+ +L +IE
Sbjct: 4 IISFVLLSFLPLVKCLDEGTVELA---FMGFQHKFGKNYESKEEEVKRNAIFQANLHLIE 60
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEF---KTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
++N S + G+ E++DL+ EEF K L+ +H +S D
Sbjct: 61 QVNAKNLS---YKLGVNEYADLTHEEFAALKLGTLKMRPAEHASLSLFVSAD-------- 109
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
T +P DWR ++ V++Q +CG+CWAFS E+ +A+ G L L
Sbjct: 110 ---------TTQLPTSVDWRNKSVLSPVKDQGSCGSCWAFSAAGALEAQYAIATGKLRPL 160
Query: 186 SVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
S Q+++DC+ G GC GG + + L+ ES YP + C+ + +G
Sbjct: 161 SEQQLVDCSHKYGTNGCFGGFMADAYKY--IKSAGLDQESTYPYKGVNEPCRPREKKADG 218
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANIN 301
+ ++ + DT +E S++ +A PV A+ A + YL GV C+G I+
Sbjct: 219 IPVR-FVLDT--KTEQSLMKALA-DAPVSVAMYASDFLFHLYLSGVYSSTTCNG---EID 271
Query: 302 HAVQIVGY 309
HAV VGY
Sbjct: 272 HAVVAVGY 279
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 126/263 (47%), Gaps = 32/263 (12%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
E D RF+ F+ +L I+E N S + G+T F+DL+ EE+++ +L K VL +
Sbjct: 69 EKDQRFEIFKDNLRFIDEHNNKNLS---YKLGLTRFADLTNEEYRSIYLGAKSKKRVLKT 125
Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
++ + IP DWR+ G + V++Q +CG+CWAFST+
Sbjct: 126 SDRYQPR---------------VGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAV 170
Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
E ++ + G L LS QE++DC + N GC+GG L+D+ + ++ E +YP
Sbjct: 171 EGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEEDYPY 226
Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
D C + + V I +Y D +E+++ +A P+ A+ A +Q Y
Sbjct: 227 KAADGRCDQTRKNAKVVTIDAYE-DVPENNEAALKKTLANQ-PISVAIEAGGRAFQLYSS 284
Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
GV C L +H V VGY
Sbjct: 285 GVFDGICGTEL---DHGVVAVGY 304
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 128/284 (45%), Gaps = 38/284 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F+ +Y K Y S+ EHD R K F+ +L +++ +A +GIT+FSDL+ EF
Sbjct: 47 FSLFKSKYGKIYASQEEHDHRLKVFKANL---RRARRHQLLDPTAEHGITQFSDLTPSEF 103
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
+ +L H K + +PT +P DWRE G + V+
Sbjct: 104 RRTYLGL-----------------HKPRPKLNAQKAPILPTSDLPEDFDWREKGAVTGVK 146
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FST E H L G L LS Q+++DC + GC+GG
Sbjct: 147 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLM 206
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ + L+ E +YP +D C S + +++ L E I ++
Sbjct: 207 TTAFEYT-LKAGGLQREKDYPYTGRDGKCHFD-KSKIAASVANFSVIGL--DEDQIAANL 262
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA Q Y+ GV C +H V +VGY
Sbjct: 263 VKHGPLAVGINAAWMQTYMRGVSCPLIC---FKRQDHGVLLVGY 303
>gi|218478060|dbj|BAH03396.1| cathepsin L-like cysteine peptidase [Taenia saginata]
Length = 338
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 147/307 (47%), Gaps = 26/307 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F++ LI + LA V+ S E++L + ++ ++ + YS+ E R F ++L I
Sbjct: 6 FLLLLI-IHPLAAVVETSALLTERELSRQWIGWKLQHGRVYSEKEEAYRRGIFARNLLYI 64
Query: 68 EELNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ N+ + ES G+ +F+DL EF R L K+
Sbjct: 65 KGQNRRFNAGLESYSTGLNQFADLESSEFSERFLGTRPGSRAAG-------------KRG 111
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + +P DWR+ ++ +V+NQ CG+CWAFS+ E A K G L LS
Sbjct: 112 RIWKALASAADLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLS 171
Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+ NGN GC+GG +++ + + EPES YP D C+ + GV
Sbjct: 172 EQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHSI--EPESAYPYRATDGPCRYNESL--GV 227
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINH 302
+ D +E++++ +AT GP+ A++A L + +Y G+ + + C +NH
Sbjct: 228 GTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNH 285
Query: 303 AVQIVGY 309
V +GY
Sbjct: 286 GVLAIGY 292
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 134/294 (45%), Gaps = 39/294 (13%)
Query: 31 EQKLE-LFSSFQQRYKKSYSKS---------EHDIRFKNFEKSLDIIEELNKNRQSPESA 80
E++L+ LF S+ ++ KSY+++ E R+ F+ +L I N+ Q
Sbjct: 50 EERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQG---Y 106
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
G+ F+DL+ EEF+ + H D G +P
Sbjct: 107 FLGLNAFADLTNEEFRAQR------------HGGRFDRSRERTSYEEFRYGSVQLKDLPD 154
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWRE G + V++Q +CG+CWAFS V E ++ L G L LS QE++DC + G
Sbjct: 155 SIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEG 214
Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
C+GG L+D+ + L+ E++YP C R + V I Y D +
Sbjct: 215 CNGG----LMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYE-DVPVN 269
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E+++L +A H PV A++A + Q+Y G+ C +++H V VGY
Sbjct: 270 DETALLKAVA-HQPVSVAIDAGGSSMQFYRSGIFTGRCG---TDLDHGVTNVGY 319
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 137/286 (47%), Gaps = 55/286 (19%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
F+SF++R+ ++Y + R+ +A +G+T+FSDL+ EF+
Sbjct: 58 FASFERRFGRTYPGP-------------------RRARRLDPTATHGVTKFSDLTPGEFR 98
Query: 97 TRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGK 152
R L R S+ V H+ +PT G+P DWRE G +G
Sbjct: 99 DRFLGLRRPSLEGLVGGEPHE----------------APILPTDGLPDDFDWREHGAVGP 142
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
V++Q +CG+CW+FST E H L G L +LS Q+++DC + + GC+GG
Sbjct: 143 VKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGG 202
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
++ + L+ E +YP ++ CK S ++K+++ ++ +E I
Sbjct: 203 LMTTAFSYL-MKSGGLQSEKDYPYAGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIAA 258
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ HGP+ A+NA Q Y+GGV + C +++H V +VGY
Sbjct: 259 NLVKHGPLAIAINAAYMQTYIGGVSCPFICG---RHLDHGVLLVGY 301
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 148/330 (44%), Gaps = 47/330 (14%)
Query: 4 VKNVLFIVAL-IALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSK-SEHDIRFKN 59
KN + ++ + LC +VS L+ E + RY K Y E + RF
Sbjct: 3 TKNQFYQISFALVLCLGLWAFQVSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNI 62
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
F++++ IE N P + G+ +F+DL+ +EF + + +K H
Sbjct: 63 FQENVKYIEASNNAGNKP--YKLGVNQFTDLTNKEF-------------IATRNKFKGHM 107
Query: 120 HNHVKKRSI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ + + + +T P+ + DWR+ G + V+NQ TCG CWAFS V E +H L
Sbjct: 108 SSSITRTTTFKYENVTAPSTV----DWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKL 163
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
G L LS QE++DC +G + GC GG L+D D K + L E++YP
Sbjct: 164 STGNLVSLSEQELVDCDTSGADQGCQGG----LMD--DAFKFIIQNGGLNTEAQYPYQGV 217
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
D C + I Y D +E ++ +A P+ A++A +Q Y GV
Sbjct: 218 DGTCNTNEEVTHVATITGYE-DVPSNNEQALQQAVANQ-PISVAIDASGSDFQNYQSGVF 275
Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
+C L +H V +VGY D+ ++ W
Sbjct: 276 TGSCGTQL---DHGVAVVGYGVSDDGTKYW 302
>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
Length = 362
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 129/278 (46%), Gaps = 28/278 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ K Y +E RF+ F +SL+++ N+ R P R GI F+D+S EEF
Sbjct: 62 FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ D +P KDWRE GI+ V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------APALPETKDWREDGIVSPVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q++ DCA N GCSGG +++
Sbjct: 162 DQGHCGSCWPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYI 221
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + C K + GVK+ TL+ +E + + PV
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYKPENA-GVKVLDSVNITLV-AEDELKNAVGLVRPVS 278
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C S ++NHAV VGY
Sbjct: 279 VAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
L+ + + ++Y+ E D RF+ F +L ++ N+ R + R G+ +F+DL+ +E
Sbjct: 108 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 166
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ +L + + H + +P DWRE G + V+
Sbjct: 167 FRAAYLGARIPASRRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 215
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CWAFS V + ES++ + G + LS QE+++C+ + GN GC+GG A D++
Sbjct: 216 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 275
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
+ ++ E +YP D C + V I + D E S+ +A H PV
Sbjct: 276 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 332
Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q Y GV C N++H V VGY
Sbjct: 333 VAIEAGGREFQLYKAGVFTGTC---TTNLDHGVVAVGY 367
>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
Length = 339
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 133/285 (46%), Gaps = 19/285 (6%)
Query: 28 PNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
P ++ +E+F + + + + Y E +F F +L I E N R+S G+T
Sbjct: 9 PTQDKTIEIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSNGFLLGLTN 68
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
F+D S EEF+ R+L H+++ + K +D H + + P+ + DWR
Sbjct: 69 FTDWSSEEFQERYL-HNIDMPTDIDTMKVNDVH---------LSSCSAPSSL----DWRS 114
Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
G++ +++Q+ CG+CWAFS V E ++A+ G L LS QE++DC GC+ G
Sbjct: 115 KGVVSDIKDQKNCGSCWAFSAVGAIEGINAITTGKLINLSEQELLDCDPISG-GCNSGWV 173
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
DW+ NK V +++YP + CK PN T + S+ +L +
Sbjct: 174 NKAFDWVIRNKGV-ALDNDYPYTAEKGVCKASQI-PNSAISSINTYHHVEQSDQGLLCAV 231
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGYD 310
A + + +Y G+ NC + + NH V IVGYD
Sbjct: 232 AKQPVSVCLYAPQDFHHYSSGIYDGPNCPVNSKDTNHCVLIVGYD 276
>gi|391333957|ref|XP_003741376.1| PREDICTED: cathepsin S-like [Metaseiulus occidentalis]
Length = 333
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 148/311 (47%), Gaps = 36/311 (11%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
V+ L++ C + K+ K L K ++S++ +KKSYS +E +R N+ + +
Sbjct: 5 VVCAALLVSACQAEVSPKLMKAALRAK---WTSYKAAHKKSYSAAEESLRMANYLDNTRV 61
Query: 67 IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSV--NKHVLMSHHKHHDHHHNHV 123
IEE N + Q ES G E SDL+ EE K+ + + N + ++ H
Sbjct: 62 IEEHNARFHQGLESYELGHNELSDLTLEEIKSTRMGLVLPPNAAEIAANASRH------- 114
Query: 124 KKRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
P+ I P DWR + V+NQ +CG+C++FS + E+ + K+G
Sbjct: 115 ---------FAPSDIVAPGSVDWRSKRCVQYVKNQGSCGSCYSFSALGALETSYCNKHGQ 165
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
L L+ Q ++DCAG GCSGG + +++ N ++ + YP K CK+
Sbjct: 166 LPDLAEQHLVDCAGR---GCSGGWMHDMFNYLQSNGGAID-QRRYPYTGKVEQCKQDRM- 220
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQ--YYLGGVIQY-NCDGSLA 298
P + +Y +E+ ++ IAT G V A NA T Q YY GG++ NC +
Sbjct: 221 PKAAGVATYK-QISRGNENELMQAIATVGTVSIAYNAGTQQHSYYRGGILDVPNCGNT-- 277
Query: 299 NINHAVQIVGY 309
HAV +VGY
Sbjct: 278 -PTHAVLLVGY 287
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 140/284 (49%), Gaps = 34/284 (11%)
Query: 35 ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E+F+ FQ +Y +SYS +EH R F ++L + L + +A +G+T FSDL+EE
Sbjct: 40 EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG--TAEFGVTPFSDLTEE 97
Query: 94 EFKTRHLRH-SVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE-AGIIG 151
EF H H K M +K S +G T+P DWR+ G+I
Sbjct: 98 EFGQLHGHHWGAGKAPSMG-----------IKVGSEESGETVPQSC----DWRKKPGVIS 142
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALL 210
+++Q+ C CWA + V+ E+ A+K LSVQ+V+DC GN GC+GG + A L
Sbjct: 143 AIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN-GCNGGFVWDAFL 201
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
++ + + E + Y +K C K + ++ D L+ E SI +AT
Sbjct: 202 TVLNTSGLASEQDYPYKGTVKTHRCLAKQH-----RKVAWIQDFLMLQFCEQSIARYLAT 256
Query: 269 HGPVIAAVNALTWQYYLGGVIQ---YNCDGSLANINHAVQIVGY 309
GP+ +NA Q Y GVI+ CD L +NH+V +VG+
Sbjct: 257 EGPITVTINAGLLQQYKRGVIRATPATCDPHL--VNHSVLLVGF 298
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 148/312 (47%), Gaps = 30/312 (9%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNL---EQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
K LF V L + A+ +++++ +L E +L+ ++ + S SE RF F+
Sbjct: 5 KAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVSRDLSEKRKRFNVFK 64
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
++ I ++N Q + + + F+D++ EF R S KH M H +
Sbjct: 65 ANVHHIHKVN---QKDKPYKLKLNSFADMTNHEF--REFYSSKVKHYRMLHGSRANTGFM 119
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
H K S+ P DWR+ G + V+NQ CG+CWAFSTV E ++ +K G
Sbjct: 120 HGKTESL----------PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQ 169
Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
L LS QE++DC + N GC+GG +++ + + E YP +D +C +
Sbjct: 170 LVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKSGGIT-TERLYPYKARDGSCDSSKMN 227
Query: 242 PNGVKIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
V I + ++P+ E++++ +A PV A++A Q+Y GV Y D
Sbjct: 228 APAVTIDGH---EMVPANDENALMKAVANQ-PVSVAIDASGSDMQFYSEGV--YAGDSCG 281
Query: 298 ANINHAVQIVGY 309
++H V +VGY
Sbjct: 282 NELDHGVAVVGY 293
>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
Length = 326
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 145/305 (47%), Gaps = 34/305 (11%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
LFI+A++ + L +L+ +++ Y K Y+ ++ + R +E+++ I
Sbjct: 3 LFILAVLTVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51
Query: 68 EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+E N ++ + G+ +F+D++ EEFK ++L ++SH ++ ++
Sbjct: 52 QEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNR----- 106
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
+P K DWRE+G + +V++Q CG+CWAFST T E + T S
Sbjct: 107 ----------AVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+G GNMGC GG +++ + LE ES YP + C+
Sbjct: 157 EQQLVDCSGPWGNMGCMGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
YT + SE + + GP AV+ + + Y GG+ Q SL ++NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-HVNHAV 271
Query: 305 QIVGY 309
VGY
Sbjct: 272 LAVGY 276
>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
Length = 338
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 147/307 (47%), Gaps = 26/307 (8%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F++ LI + LA V+ S E++L + ++ ++ + YS+ E R F ++L I
Sbjct: 6 FLLLLI-IHPLAAVVETSALLTERELSRQWIGWKLQHGRVYSEKEEAYRRGIFARNLLYI 64
Query: 68 EELNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
+ N+ + ES G+ +F+DL EF R L K+
Sbjct: 65 KGQNRRFNAGLESYSTGLNQFADLESSEFSERFL-------------GTRPESRAAGKRG 111
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + +P DWR+ ++ +V+NQ CG+CWAFS+ E A K G L LS
Sbjct: 112 RIWKALASAADLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLS 171
Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
Q+++DC+ NGN GC+GG +++ + + EPES YP D C+ + GV
Sbjct: 172 EQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHSI--EPESAYPYRATDGPCRYNESL--GV 227
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINH 302
+ D +E++++ +AT GP+ A++A L + +Y G+ + + C +NH
Sbjct: 228 GTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNH 285
Query: 303 AVQIVGY 309
V +GY
Sbjct: 286 GVLAIGY 292
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/287 (32%), Positives = 144/287 (50%), Gaps = 46/287 (16%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSL---DIIEELNKNRQSPESARYGITEFSDLSE 92
LF FQ++++KSYS S+ R+ F+ +L +I+ L K +A YGIT+FSDLS
Sbjct: 126 LFEEFQRKFRKSYS-SDTAKRYALFKYNLLKMQLIQRLEKG-----TANYGITKFSDLSA 179
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF RHS +++ K + ++ T I +P DWR G + +
Sbjct: 180 EEF-----RHS------LANMKRRKSKGSQMETAIFPTTIQ---SLPPSFDWRANGAVTE 225
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CWAF+T E K L LS Q+++DC + C+GG L +W
Sbjct: 226 VKDQGMCGSCWAFATTGNIEGQWFRKTNKLISLSEQQLLDC-DTKDEACNGG----LPEW 280
Query: 213 MDVNKVV----LEPESEYPL-LLKDAACKRKATSPNGVKIKSYT--CDTLIPSESSILTD 265
+++V L E +YP +K+ +C + PN I +Y TL E+ +
Sbjct: 281 A-YDEIVKMGGLMSEKDYPYEAMKEQSCHLR--RPN---ISAYINGSATLPSDEAKLAAW 334
Query: 266 IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
+ +GP+ VNA Q+YLGG+ C S A ++HAV +VGY
Sbjct: 335 LVQNGPISVGVNANFLQFYLGGISHPPHMLC--SEAGLDHAVLLVGY 379
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/261 (32%), Positives = 128/261 (49%), Gaps = 36/261 (13%)
Query: 60 FEKSLDIIEELNKNRQSPESARY-GITEFSDLSEEEFKTRH-LRHSVNKHVLMSHHKHHD 117
F+++L IEE NK + + Y GI +F+D+ EEF+ + LR N +
Sbjct: 66 FKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFRMYNGLRRDYN-------YSREV 118
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
NH+ + P + DWR+ G + V+NQ CG+CW+FST + E H
Sbjct: 119 QCSNHLTPEYLVA--------PDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFH 170
Query: 178 KNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
K+G L LS Q+++DC+G GN GC+GG +++ N + E E EYP + C
Sbjct: 171 KSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGI-ETEEEYPYDARQERCH 229
Query: 237 RK-----ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
K AT+ V +KS E+ + +A GPV A++A ++Q Y GGV
Sbjct: 230 FKKSEVAATASGCVDVKS-------GDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVY 282
Query: 290 -QYNCDGSLANINHAVQIVGY 309
+ C S ++H V +VGY
Sbjct: 283 DEPKC--SSTELDHGVLVVGY 301
>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
Length = 379
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 143/309 (46%), Gaps = 30/309 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYS-KSEHDIRFKNFEKSL 64
V I L + C A+ + Q+ E LF F ++ + YS + E+ R+ F ++
Sbjct: 49 VFLIFVLFSSC--ALREMGKRKTATQRYEVLFDEFLYKFNRLYSSQEEYKYRYHIFVHNV 106
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
EE + R+ P + I EF+D SEEE + K ++ + + + +
Sbjct: 107 REFEE--EERKHP-GLDFDINEFTDWSEEELR---------KMIVDKKNVKEEKNAVRFE 154
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+++GI P I DWR+ G + ++NQ CG+CWAF+TV E+ HA+K G L
Sbjct: 155 GSVLSSGIKRPASI----DWRDQGKLTPIKNQGQCGSCWAFATVAAIEAQHAIKKGILVS 210
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL-LLKDAACKRKATSPN 243
LS QE++DC G N GCSGG + ++ N LE E YP LK C N
Sbjct: 211 LSEQEMVDCDGRNN-GCSGGYRPYAMRFVKENG--LETEKSYPYSALKHDQC---MLHQN 264
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY-YLGGVIQYNCD--GSLANI 300
K+ L SE +I + T GPV +N + Y Y G+ + + +
Sbjct: 265 DTKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRSGIFNPSAEDCAEKSMG 324
Query: 301 NHAVQIVGY 309
HA+ IVGY
Sbjct: 325 AHALTIVGY 333
>gi|319976406|gb|ADV90878.1| cysteine proteinase B [Leishmania donovani]
Length = 332
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/236 (32%), Positives = 115/236 (48%), Gaps = 23/236 (9%)
Query: 80 ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
AR+GIT+F DLSE EF R+L N + K H H + ++ +P
Sbjct: 3 ARFGITKFFDLSEAEFAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VP 51
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
DWRE G + V+NQ CG+CWAFS V ES A L LS Q+++ C N
Sbjct: 52 DAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN- 110
Query: 200 GCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTL 255
GC+GG +W+ + ++ E YP + A C + G +I Y +
Sbjct: 111 GCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VM 167
Query: 256 IPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
IPS +++ +A +GP+ AV+A ++ Y GV+ +C G +NH V +VGY+
Sbjct: 168 IPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 220
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 137/285 (48%), Gaps = 28/285 (9%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
++ + ++ ++ ++ K+Y+ E + RF F+ +L I+E N + R G+ F+D
Sbjct: 43 DEVMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLT---YRLGLNRFAD 99
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ EE+++ +L V K V ++S + +P DWR+ G
Sbjct: 100 LTNEEYRSMYL--GVKPGATRVTRK--------VSRKSDRFAARVGDALPDFIDWRKEGA 149
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L
Sbjct: 150 VVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGG----L 205
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ +N ++ E +YP D C + + N V I Y + + ++ + L
Sbjct: 206 MDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSIDGY--EDVPENDEAALKKA 263
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A +Q Y GV C SL +H V VGY
Sbjct: 264 VAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSL---DHGVAAVGY 305
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 129/270 (47%), Gaps = 32/270 (11%)
Query: 45 KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
K S +E D RF+ F+ +L I+E N S R G+T+F+DL+ +E+++ +L +
Sbjct: 51 KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 107
Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
+ K S+ + IP DWR+ G + +V++Q +CG+CWA
Sbjct: 108 KRKA---------------TKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 152
Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
FST+ E ++ + G L LS QE++DC + N GC+GG L+D+ + ++
Sbjct: 153 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGG----LMDYAFEFIIKNGGID 208
Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
E +YP D C + + V I SY D SE S L +H P+ A+
Sbjct: 209 TEEDYPYKGVDGRCDQTRKNAKVVTIDSYE-DVPANSEES-LKKALSHQPISVAIEGGGR 266
Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q Y G+ C +++H V VGY
Sbjct: 267 AFQLYDSGIFDGICG---TDLDHGVVAVGY 293
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 138/292 (47%), Gaps = 31/292 (10%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYY-LGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y LG + NC S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSLGIYYEPNC--SSKNLDHGVLLVGY 283
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 134/280 (47%), Gaps = 28/280 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ ++LF+S+ + K Y + + RF+ F+ +L+ I+E NK S G+ EF+D
Sbjct: 42 ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YWLGLNEFAD 98
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS +EF +++ ++ + S+ + + +P DWR+ G
Sbjct: 99 LSNDEFNEKYVGSLIDATIEQSYDEEFINEDT--------------VNLPENVDWRKKGA 144
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ VR+Q +CG+CWAFS V T E ++ ++ G L LS QE++DC + GC GG
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
L+++ N + L S+YP K C+ K G +K+ + P+ L +
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
PV V + +Q Y GG+ + C ++HAV V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
+ + L +C+ + + + P ++ + + SF++R+ K++ +E RF F+++
Sbjct: 10 AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+DL+ +EF +L N H K+H
Sbjct: 70 MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKNH------ 116
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + + P+G+ + DWR+ G + V+NQ CG+CWAFS + E A +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
LS Q ++ C N + GC+GG ++W M + + E+ YP C +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
G KI + +L E I + GPV AV+A TWQ Y GGV+ SL
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287
Query: 299 NINHAVQIVGYDNYSR 314
NH V IVG++ ++
Sbjct: 288 --NHGVLIVGFNKNAK 301
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 147/317 (46%), Gaps = 32/317 (10%)
Query: 2 FDVKNVLFIVALIALCF-LAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIRF 57
+VK V F+ AL +A + ++ +LE + +L+ ++ + S S E RF
Sbjct: 1 MEVKKVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHHTVSRSLDEKHNRF 60
Query: 58 KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
F+ ++ + NK + + + F+D++ EF++ + VN
Sbjct: 61 NVFKGNVMHVHSSNK---MDKPYKLKLNRFADMTNHEFRSIYAGSKVN------------ 105
Query: 118 HHHNHVKKRSITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
HH + G + + P DWR+ G + V++Q CG+CWAFST+ E +
Sbjct: 106 -HHRMFRGTPRGNGTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGI 164
Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
+ +K L LS QE++DC N GC+GG + ++ + + + S YP KD
Sbjct: 165 NQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEF--IKQYGITTASNYPYEAKDGT 222
Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN 292
C + V I + + + +E+++L +A H PV A+ A + +Q+Y GV N
Sbjct: 223 CDASKVNEPAVSIDGHE-NVPVNNEAALLKAVA-HQPVSVAIEAGGIDFQFYSEGVFTGN 280
Query: 293 CDGSLANINHAVQIVGY 309
C +L +H V IVGY
Sbjct: 281 CGTAL---DHGVAIVGY 294
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 135/281 (48%), Gaps = 27/281 (9%)
Query: 32 QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
Q F+++ + ++ Y+ E R+ F+ ++D ++E N + E+ G+ F+D++
Sbjct: 25 QYRNAFTNWMIQNQRHYASEEFAARYNIFKANMDYVQEWNS--KGSETV-LGLNTFADIT 81
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
+EF++ +L + +++ I DWR G +
Sbjct: 82 NQEFRSIYLGTPFDGSSIINTETEK-----------------IFAAPAASIDWRTKGAVT 124
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
++NQQ CG CW+FST + E A+ G L LS Q +IDC+G+ GN GC+GG
Sbjct: 125 PIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYGNNGCNGGLMTLAF 184
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ +N ++ ES YP KD + + G + SY+ + SE S L A G
Sbjct: 185 EYI-INNKGIDTESSYPYTAKDGKTCKYNPANIGATLSSYS-NVTSGSEPS-LESAANIG 241
Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A ++Q Y G I Y S +++H V +VGY
Sbjct: 242 PVSVAIDASHNSFQLYSSG-IYYEPACSTTSLDHGVLVVGY 281
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 133/281 (47%), Gaps = 29/281 (10%)
Query: 37 FSSFQQRYKK--SYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
++S+ ++ K + S S D RF+ F+++ IEE NR S R G+ +FSDL+ EE
Sbjct: 13 YASWCAKFGKECASSNSLGDHRFETFKENFRYIEE--HNRAGKHSYRLGLNQFSDLTSEE 70
Query: 95 FKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
F+ R L R + ++ + D I G +P DWR+ G +
Sbjct: 71 FRQRFLGLRPDLIDSPVLKMPRDSD----------IEEGFQ-NVDLPASVDWRQHGAVTA 119
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
++Q +CG CWAF+T E ++ + G L LS QE+IDC + GC GG +
Sbjct: 120 PKDQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQF 179
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
+ V L+ E++YP ++ C K + V I Y IP E ++L +A
Sbjct: 180 I-VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGY---KAIPEGDEQALLLAVAKQ- 234
Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ + +Q+Y GV +C INH V IVGY
Sbjct: 235 PVSVAIEGASKDFQHYASGVFTGHCG---EEINHGVLIVGY 272
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y +E +RF F+++LD+I NK S + G+ +F+D++ +EF
Sbjct: 60 FARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFTDMTWQEF 116
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK TG +P KDWRE GI+ V+
Sbjct: 117 QRTKLGAAQNCSATLKGTHK--------------LTG----EALPETKDWREDGIVSPVK 158
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP +D CK A + GV++ + + + +E + + PV
Sbjct: 219 KSNG-GLDTEEAYPYTGEDGTCKYSAENV-GVQVLD-SVNITLGAEDELKHAVGLLRPVS 275
Query: 274 AAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 276 IAFEVIHSFRLYKSGVYSDSHCGQTPMDVNHAVLAVGY 313
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 142/310 (45%), Gaps = 32/310 (10%)
Query: 7 VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
+ + + A+C + + P L+ L+ ++ +KKSY+ E R +EK+L
Sbjct: 1 MALYLGIAAICLTTVFAAPTTDPALDNHWNLWKNW---HKKSYAPKEEGWRRVLWEKNLR 57
Query: 66 IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+IE N ++ S G+ +F D++ EEF+ LM+ +K N K
Sbjct: 58 MIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQ-----------LMNGYK------NQKK 100
Query: 125 KRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
R T P P DWR+ G + V++Q CG+CWAFST E H G +
Sbjct: 101 IRGST--FLAPNNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKM 158
Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS Q ++DC+ GN GC+GG ++ N + + E YP KD +
Sbjct: 159 ISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYDPN 217
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
N + D SE ++ +A+ GPV AV+A ++Q+Y G I Y + S +
Sbjct: 218 YNSANDTGFV-DVTSESEKDLMNAVASVGPVSVAVDAGHQSFQFYKSG-IYYEPECSSED 275
Query: 300 INHAVQIVGY 309
++H V +VGY
Sbjct: 276 LDHGVLVVGY 285
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 131/282 (46%), Gaps = 29/282 (10%)
Query: 40 FQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
F+ ++ +SY+ E + K F +++ +I E N + G+ +F+DL+ EEF
Sbjct: 22 FKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHT---YTLGVNQFADLTVEEFSKT 78
Query: 99 HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
++ K+ D + R + G +PT + DW G + V+NQ
Sbjct: 79 YMGFK------KPAQKYGDAAY---LGRHVYNGEALPTSV----DWSSQGAVTPVKNQGQ 125
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNK 217
CG+CW+FST + E + + G L LS Q+ +DCAG GN GC+GG + + + N
Sbjct: 126 CGSCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN- 184
Query: 218 VVLEPESEYPLLLKDAACKRKATSPNGVK--IKSYTCDTLIPSESSILTDIATHGPVIAA 275
L E YP D +C+ + S K + Y D SE +++ +A PV A
Sbjct: 185 -ALCTEQSYPYKGTDGSCQASSCSTGLAKGSVSGYK-DVSSDSEQDMMSAVAQQ-PVSIA 241
Query: 276 VNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
+ A +Q Y GGV+ C SL +H V VGY S T
Sbjct: 242 IEADKSVFQLYSGGVLTGACGASL---DHGVLAVGYGTLSGT 280
>gi|218478069|dbj|BAH03395.1| cathepsin L-like cysteine peptidase [Taenia solium]
Length = 346
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 142/304 (46%), Gaps = 32/304 (10%)
Query: 19 LAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
LA+ V+ S E++L ++ ++ ++ + YS E R F ++L I+ N+ +
Sbjct: 16 LAVVVETSALLTERELSRQWAGWKLQHGRVYSGKEEAYRRGIFARNLLYIKGQNRRFNAG 75
Query: 78 -ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
ES G+ +F+DL EF R L V ++ I +
Sbjct: 76 LESYSTGLNQFADLESSEFSERFLGTRPESRVAG-------------RRGRIWKALASAA 122
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-G 195
G+P DWR+ ++ +V+NQ CG+CWAFS+ E A K G L LS Q+++DC+
Sbjct: 123 GLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLK 182
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
NGN GC+GG +++ + +EPES YP D C+ + GV + D
Sbjct: 183 NGNDGCNGGYMSYAFKYLEEH--FIEPESAYPYRATDGPCRYNESL--GVGTVTDIGDIP 238
Query: 256 IPSESSILTDIATHGPVIAAVNA--LTWQYYL--------GGVIQYNCDGSLANINHAVQ 305
+E++++ +AT GP+ A++A L + +Y G + C +NH V
Sbjct: 239 EGNETALMEAVATVGPISIAIDASSLGFMFYRQVATNPHHGIYKSHWCSSKF--LNHGVL 296
Query: 306 IVGY 309
+GY
Sbjct: 297 AIGY 300
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 134/276 (48%), Gaps = 24/276 (8%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF +R+ K Y ++SE RF F+++L+II +N + +A YGI +F+DLS EEF
Sbjct: 64 FTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKG--TAIYGINQFADLSPEEF 121
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
K HL H+ K DH + V + G+ +P DWRE G + KV+
Sbjct: 122 KKTHLPHT---------WKQPDHPNRIVDLAA--EGVDPKEPLPESFDWREHGAVTKVKT 170
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
+ C ACWAFS E L L LS Q+++DC + GC+GG L + ++
Sbjct: 171 EGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC-DVVDEGCNGG--FPLDAYKEI 227
Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
++ LEPE +YP K C+ P+ + + L E + + GP+
Sbjct: 228 VRMGGLEPEDKYPYEAKAEQCR---LVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISI 284
Query: 275 AVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+ Q+Y GGV + C L+++ H +VGY
Sbjct: 285 GITVDDIQFYKGGVSRPTTC--RLSSMIHGALLVGY 318
>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
Length = 471
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 138/309 (44%), Gaps = 24/309 (7%)
Query: 5 KNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
+ VL L+ + L +P + + E+ L F+ F+Q++ + Y + + F ++
Sbjct: 6 RFVLLAAVLVVMACL-VPAATASLHAEETLTSQFAEFKQKHGRVYESAARRLPLSVFREN 64
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
L + + + A +G+T FSDL+ EEF++R+ H H
Sbjct: 65 LFLAR---LHAAANPHATFGVTPFSDLTREEFRSRY-------------HNGAAHFAAAQ 108
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
++ + + + G P DWR G + V++Q CG+CWAFS + E L L+
Sbjct: 109 ERARVPVKVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLT 167
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS Q ++ C + GCSGG +W+ N + E YP + TS
Sbjct: 168 NLSEQMLVSC-DKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSG 226
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
+ V L E+ I +A +GPV AV+A +W Y GGV+ +C ++H
Sbjct: 227 HTVGATITGHVELPQDEAQIAACVAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDH 283
Query: 303 AVQIVGYDN 311
V +VGY++
Sbjct: 284 GVLLVGYND 292
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 134/308 (43%), Gaps = 21/308 (6%)
Query: 15 ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-----EHDIRFKNFEKSLDIIEE 69
A+ FL S+P E + F+ R+K +S++ E R + + +++ IE
Sbjct: 17 AVFFLHGSSATSRPATEDADPMAQRFR-RWKAEHSRTYATPEEERHRLRVYARNMRYIEA 75
Query: 70 LNKNRQSPESARYGITEFSDLSEEEFKTRH------LRHSVNKHVLMSHHKHHDHHHNHV 123
N + + + G T ++DL+ +EF + L + +
Sbjct: 76 TNGDAGAGLTYELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAG 135
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + G P DWRE G + V+NQ CG+CWAFSTV E +H +K G L+
Sbjct: 136 GGGWLQVYVNESAGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLA 195
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QE++DC + GC+GG L W+ N + + +YP KD C K S +
Sbjct: 196 SLSEQELVDC-DKLDHGCNGGVSYRALQWITSNGGITS-QDDYPYTAKDDTCDTKKLSHH 253
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
I + SE S+ +A PV ++ A +Q+Y GV C +N
Sbjct: 254 AASISGFQ-RVATRSELSLTNAVAMQ-PVAVSIEAGGANFQHYRNGVYNGPCG---TRLN 308
Query: 302 HAVQIVGY 309
H V +VGY
Sbjct: 309 HGVTVVGY 316
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 142/289 (49%), Gaps = 42/289 (14%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
F +++ ++ +SY S SE D R + + ++ +I+ N Q + R G+T ++DL EE
Sbjct: 26 FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT--GIPVKKDWREAGIIGK 152
FK +V L S N K R ++ + + +P DWR+ G +
Sbjct: 86 FK-----QTVFGVCLGSF--------NASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTP 132
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLD 211
V+NQ +CG+CW+FS+ E + K G L LS QE++DC+GN GN GC+GG
Sbjct: 133 VKNQGSCGSCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGG------- 185
Query: 212 WMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
WMD VNK + E YP + C R G Y D +E ++
Sbjct: 186 WMDNAFRYIVNKGGIHTEDSYPYEGQVGQC-RANYGEIGATCTGYY-DIPSGNEHALKEA 243
Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
+AT GPV A++A ++Q Y GV YN C G+ ++HAV IVGY
Sbjct: 244 VATFGPVSVAIHASDQSFQLYHSGV--YNNPYCSGTA--LDHAVLIVGY 288
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 143/291 (49%), Gaps = 20/291 (6%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
VK + +++ +L+ +++ + KSY+K E + + F K++ I+E N+ +R ++
Sbjct: 33 VKSLRQKIDEAFKLWDDYKEAFGKSYNKDEENDYMEAFVKNVIHIDEHNQEHRLGRKTFE 92
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
G+ +DL +++ ++ ++H + + ++ IP
Sbjct: 93 MGLNSIADLPFSQYRK------------LNGYRHRRNFGDSMQSNGTKWLAPFNVEIPDS 140
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
DWR+ G++ V+NQ CG+CWAFS E HA +G + LS Q ++DC+ GN G
Sbjct: 141 VDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHG 200
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG +++ N + + E YP + ++ C K G + K + D E
Sbjct: 201 CNGGLMDLAFEYIKDNHGI-DTEESYPYVGRETKCHFKKKDI-GAEDKGFV-DLPEGDEE 257
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ +AT GP+ A++A T+Q Y GV Y+ + S ++H V +VGY
Sbjct: 258 ALKVAVATQGPISIAIDAGHRTFQLYKKGVY-YDEECSSEELDHGVLLVGY 307
>gi|281207567|gb|EFA81750.1| cysteine protease 4 [Polysphondylium pallidum PN500]
Length = 432
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 142/312 (45%), Gaps = 24/312 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
M+ + L + L L+ ++ Q + F S+ Q Y E + R+ F
Sbjct: 1 MYRLSAYLLACTVFMLAVLSANAAFTE---RQYQDSFVSWMQTNNVKYDGKEFNHRYGVF 57
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+K++D +++ N S G+ F+DL+ E++ +L ++ L+
Sbjct: 58 KKNMDYVQQWNAK---GSSTVLGMNIFADLTNAEYQRIYLGTKIDASGLL---------- 104
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
N R+ I P DWR G + ++NQ CG+CW+FST + E H + G
Sbjct: 105 NVAAARAFDRNFNIKALNPTV-DWRAKGAVTPIKNQAQCGSCWSFSTTGSVEGAHEISTG 163
Query: 181 TLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q +IDC+ GN GC+GG A ++++ N + + ES YP R
Sbjct: 164 NLVALSEQNLIDCSVPEGNQGCNGGLMWAAMEYIIKNGGI-DTESSYPYTATGPNKCRYN 222
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
++ +G KI SY + SE+S L A PV A++A ++Q Y G I Y S
Sbjct: 223 SANSGAKISSYV-NVTSGSETS-LASAANVNPVSVAIDASHNSFQLYSSG-IYYEPACST 279
Query: 298 ANINHAVQIVGY 309
++H V +VGY
Sbjct: 280 TQLDHGVLVVGY 291
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/262 (29%), Positives = 127/262 (48%), Gaps = 23/262 (8%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
EHD RF+ F +L ++ N+ R R G+ +F+DL+ +EF+ +L +
Sbjct: 72 GEHDSRFRVFWDNLRFVDAHNE-RAGEHGFRLGMNQFADLTNDEFRAAYLGARI-PAARS 129
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ + H+ ++ +P DWRE G + V+NQ CG+CWAFS V +
Sbjct: 130 GNAVGEMYRHDGAEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVSS 177
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
ES++ + G + LS QE+++C+ + GN GC+GG A +++ + ++ E +YP
Sbjct: 178 VESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFI-IKNGGIDTEDDYPYK 236
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
D C + V I ++ D E S+ +A H PV A+ A +Q Y G
Sbjct: 237 AVDGKCDINRRNAKVVSIDAFE-DVPENDEKSLQKAVA-HQPVSVAIEAGGRQFQLYKSG 294
Query: 288 VIQYNCDGSLANINHAVQIVGY 309
V +C N++H V VGY
Sbjct: 295 VFSGSC---TTNLDHGVVAVGY 313
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 138/301 (45%), Gaps = 39/301 (12%)
Query: 21 IPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQS 76
++V+ L+ ++ +Q Y K Y E + R K F+++++ IE N N +
Sbjct: 22 FAIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASN-NAGN 80
Query: 77 PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
+ + GI +F+DL+ EEF + S +K H + + K S T
Sbjct: 81 NKLYKLGINQFADLTNEEF-------------IASRNKFKGHMCSSITKTS--TFKYENA 125
Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
+P DWR+ G + V+NQ CG CWAFS V E +H L G L LS QE++DC
Sbjct: 126 SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTK 185
Query: 197 G-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
G + GC GG L+D D K + L E++YP D C S + V I Y
Sbjct: 186 GVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGY 239
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
D +E ++ +A P+ A++A +Q+Y GV +C L +H V VG
Sbjct: 240 E-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHGVTAVG 294
Query: 309 Y 309
Y
Sbjct: 295 Y 295
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 144/311 (46%), Gaps = 33/311 (10%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDII 67
FI+ L + ++ +P++ + E + + + K Y+ + E + RF+ F+ +++ I
Sbjct: 13 FILILGMWAYEVASRELQEPSMSARHE---QWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 68 EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E N P + + +F+DL+ EE K R+ + + K + +V
Sbjct: 70 ESFNTAGNKP--YKLSVNKFADLTNEELKV--ARNGYRRPLQTRPMKVTSFKYENV---- 121
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
T +P DWR+ G + +++Q CG+CWAFSTV E ++ L G L LS
Sbjct: 122 --------TAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSE 173
Query: 188 QEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
QE++DC G + GC GG +++ N + E+ YP D C K + K
Sbjct: 174 QELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGIT-TEANYPYQAADGTCNSKKEASRIAK 232
Query: 247 IKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
I Y +P SE+++L +A+ P+ +++A +Q+Y GV C L +H
Sbjct: 233 ITGYES---VPANSEAALLKAVASQ-PISVSIDAGGSDFQFYSSGVFTGQCGTEL---DH 285
Query: 303 AVQIVGYDNYS 313
V VGY S
Sbjct: 286 GVTAVGYGETS 296
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 143/291 (49%), Gaps = 20/291 (6%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
VK + +++ +L+ +++ + KSY+K E + + F K++ I+E N+ +R ++
Sbjct: 33 VKSLRQKIDEAFKLWDDYKESFGKSYNKDEENDYMEAFVKNVIHIDEHNQEHRLGRKTFE 92
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
G+ +DL +++ ++ ++H + + ++ IP
Sbjct: 93 MGLNSIADLPFSQYRK------------LNGYRHRRNFGDSMQSNGTKWLAPFNVEIPDS 140
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
DWR+ G++ V+NQ CG+CWAFS E HA +G + LS Q ++DC+ GN G
Sbjct: 141 VDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHG 200
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG +++ N + + E YP + ++ C K G + K + D E
Sbjct: 201 CNGGLMDLAFEYIKDNHGI-DTEESYPYVGRETKCHFKKKDI-GAEDKGFV-DLPEGDEE 257
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ +AT GP+ A++A T+Q Y GV Y+ + S ++H V +VGY
Sbjct: 258 ALKVAVATQGPISIAIDAGHRTFQLYKKGVY-YDEECSSEELDHGVLLVGY 307
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 143/308 (46%), Gaps = 40/308 (12%)
Query: 10 IVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+ +A+C AIP+K + E + SF KK +++ E D R F +++
Sbjct: 4 LSVFLAICLAVVSAIPLK------DPSWEAWKSFHG--KKYHNQGEDDFRHYVFLQNIKT 55
Query: 67 IEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHNHVK 124
I N + + + I EFSDL+ +EF + R S+ K
Sbjct: 56 IAAHN----AKSTFKMAINEFSDLTRKEFVKTYNGYRLSMKKST---------------- 95
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ T + T +P + DWR+ G + ++NQ CG+CWAFST + E H K G L
Sbjct: 96 NKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVS 155
Query: 185 LSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS Q +IDC A GN GC GG +++ +N + + E+ YP +D C+ K T N
Sbjct: 156 LSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGI-DTEASYPYEGRDDICRYKKT--N 212
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
I + D SE + +AT GP+ A++A ++ Y GV + + S ++
Sbjct: 213 KGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTGVY-HEPECSQTVLD 271
Query: 302 HAVQIVGY 309
H V +VGY
Sbjct: 272 HGVLVVGY 279
>gi|118373823|ref|XP_001020104.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89301871|gb|EAR99859.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 337
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 146/316 (46%), Gaps = 31/316 (9%)
Query: 6 NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
N FI+ IAL +P+ +++ +KL ++ + + +++ +E + + L
Sbjct: 2 NSKFILLSIALL---MPLYLAQNVFIEKLIAYNKWSSKNLRTFLNNEEKLF-----RQLV 53
Query: 66 IIEELNK----NRQSPESARYGITEFSDLSEEEFKTRHLRHS--VNKHVLMSHHKHHDHH 119
E L K N Q + + +FSD++EEEF + L S V+ H+ + H+
Sbjct: 54 FFENLQKVNYHNAQDHHTYSLALNQFSDMTEEEFAEKILMQSDLVDLHI----QQTASHN 109
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
T+ + V DWR G + V+ Q C +CWAFS ES + +KN
Sbjct: 110 STSSTTGGSTSSNSTSNNATVTVDWRSKGAVTPVKQQGYCSSCWAFSAAGLMESFNFIKN 169
Query: 180 GTLSLLSVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
L+ S Q+++DC + N GCSGG + +D+ K + YP + C
Sbjct: 170 KNLTDFSEQQLVDCVNSANGYSSKGCSGGWPASAIDYSS--KFGITTLQNYPYIGVQKKC 227
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
T+ NG K KS+ IP+ S L + + PV AV+A TW +Y GV YN C+
Sbjct: 228 NITGTN-NGFKPKSW---KQIPNTSKDLQNALNYSPVSIAVDASTWSHYKSGV--YNGCN 281
Query: 295 GSLANINHAVQIVGYD 310
+ INH V +GYD
Sbjct: 282 QTDIKINHGVLAIGYD 297
>gi|343474209|emb|CCD14094.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 307
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 153/318 (48%), Gaps = 32/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q C + WAF+ + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPKTVDWRKKGAVTPVKDQGKCDSSWAFAAIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
L+ LS Q ++ C N ++GC G W+ N + E YP
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFLDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTC 226
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
K+ G I + ++ +E++I +A GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 NKSGKVVGANIDDHV--HILDNENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCISK 283
Query: 297 LANINHAVQIVGYDNYSR 314
+N A +VGYD+ S+
Sbjct: 284 --EVNSAALLVGYDDTSK 299
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 145/309 (46%), Gaps = 37/309 (11%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
V+L A ++I V + + E+ +++ + + +Y+ E + RF+ F +L I++
Sbjct: 18 VSLAAAADMSI-VSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ 76
Query: 70 LNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHHHNHVKK 125
N + S R G+ F+DL+ EE+++ +L + L + ++ D+
Sbjct: 77 HNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDE----- 131
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+P DWR+ G +G V++Q CG+CWAFS + E ++ + G + L
Sbjct: 132 ------------LPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPL 179
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
S QE++DC + N GC+GG L+D+ +N ++ E +YP +D C +
Sbjct: 180 SEQELVDCDTSYNQGCNGG----LMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNA 235
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
V I Y D + SE S+ +A P+ A+ A +Q Y G+ C +L
Sbjct: 236 KVVTIDGYE-DVPVNSEKSLQKAVANQ-PISVAIEAGGRAFQLYKSGIFTGTCGTAL--- 290
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 291 DHGVAAVGY 299
>gi|121531598|gb|ABM55484.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/311 (29%), Positives = 145/311 (46%), Gaps = 38/311 (12%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
+A+ A +A+ ++ + + +F+Q + K+Y E RF F+++L I+
Sbjct: 3 FLAIFATVLIAVTASTNE-------DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIK 55
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E N + + E+ G+T F+DL+ EEFK +L K+ K R
Sbjct: 56 EHNARYDKGEETYLLGVTRFADLTHEEFK----------DILKGQIKN--------KPRL 97
Query: 128 ITTGITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
T P +P DW E G + +V++Q CG+CWAFS + +A+ N L
Sbjct: 98 NATPTVFPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALKGQNAILNNVKISL 157
Query: 186 SVQEVIDC-AGNGNMGC-SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
S Q+++DC A GN C GGD A D+ V ++ E YP + K C+ A S
Sbjct: 158 SEQQLLDCSAAYGNGNCKEGGDMSAAFDY--VRDYGIQSEKSYPYIRKQTECQYDA-SKT 214
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
+KIK Y + SE + + T GP+ A+N+ Q Y G I + G +++H
Sbjct: 215 ILKIKGYK--NVTTSEEGLRKAVGTIGPISIAMNSDPLQLYYSGTI--SGKGCSHDLDHG 270
Query: 304 VQIVGYDNYSR 314
V +VGY S+
Sbjct: 271 VLVVGYGKASQ 281
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 142/310 (45%), Gaps = 32/310 (10%)
Query: 7 VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
+ + + A+C + + P L+ L+ ++ +KKSY+ E R +EK+L
Sbjct: 1 MALYLGIAAICLTTVFAAPTTDPALDNHWNLWKNW---HKKSYAPKEEGWRRVLWEKNLR 57
Query: 66 IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+IE N ++ S G+ +F D++ EEF+ LM+ +K N K
Sbjct: 58 MIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQ-----------LMNGYK------NQKK 100
Query: 125 KRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
R T P P DWR+ G + V++Q CG+CWAFST E H G +
Sbjct: 101 IRGST--FLAPNNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKM 158
Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
LS Q ++DC+ GN GC+GG ++ N + + E YP KD +
Sbjct: 159 ISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYDPN 217
Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
N + D SE ++ +A+ GPV AV+A ++Q+Y G I Y + S +
Sbjct: 218 YNSANDTGFV-DVTSGSEKDLMNAVASVGPVSVAVDAGHQSFQFYKSG-IYYEPECSSED 275
Query: 300 INHAVQIVGY 309
++H V +VGY
Sbjct: 276 LDHGVLVVGY 285
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 135/291 (46%), Gaps = 24/291 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
V + + E+ L++ ++ + K+Y+ E + R+ F +L I+E N + S
Sbjct: 26 VSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
R G+ F+DL+ EE++ +L ++ V R + +P
Sbjct: 86 RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR G + ++++Q CG+CWAFS + E ++ + G L LS QE++DC + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG D++ +N ++ E +YP KD C + V I SY D SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
S+ +A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E RFK F ++L +IE NK R G+ F+D + EEF
Sbjct: 51 FARFANRYGKRYDTVDEMKRRFKIFSENLQLIESTNKKRLG---YTLGVNHFADWTWEEF 107
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
++ L + N L +H+ D +P +KDWR+ GI+ +V+
Sbjct: 108 RSHRLGAAQNCSATLKGNHRITD------------------VVLPAEKDWRKEGIVSEVK 149
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST ES +A G LS Q+++DCAG N GC+GG +++
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 209
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N LE E YP ++ C K TS + + + + +E + +A PV
Sbjct: 210 KYNG-GLETEEAYPYTGQNGPC--KFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVS 266
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + ++ Y GV C + ++NHAV VGY
Sbjct: 267 VAFEVVDDFRLYKKGVYTSTTCGNTPMDVNHAVLAVGY 304
>gi|118365742|ref|XP_001016091.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297858|gb|EAR95846.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 335
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 152/313 (48%), Gaps = 32/313 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
+L I+ L+ LC LA + V +KL ++ +Q ++++ Y +EH+ ++ +
Sbjct: 6 LLSIIMLMPLC-LAQNITV------EKLLAYNQWQSQHQRIY-LNEHEKLYR----QMVF 53
Query: 67 IEELNK--NRQSPESARYGI--TEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHN 121
E+L K + + Y I +FSD+++EEF + L + + H++ + + H+
Sbjct: 54 FEKLQKINEHNNNSNNTYSIHLNQFSDMTKEEFTQKILMKQDLADHLMKAGSQEATHNDV 113
Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
+++ + + T+ T I DWR G + V+NQ CG+CW+FS ES + ++N
Sbjct: 114 NIEAKLNSKNSTLATSI----DWRTKGAVTSVKNQGNCGSCWSFSATGLMESFNFIQNKA 169
Query: 182 LSLLSVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
L S Q+++DC N GC GG +D+ +KV L +YP + C
Sbjct: 170 LVEFSEQQLLDCVTPANGYRIHGCDGGWPAYCVDY--ASKVGLTTLKKYPYVGVQNNCNV 227
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
T+ NG K K + +P+ S+ L PV V+A W Y G+ CD SL
Sbjct: 228 TGTN-NGFKPKKW---NQVPNTSNDLKTALNFSPVSVLVDANNWDGYQSGIFN-GCDQSL 282
Query: 298 ANINHAVQIVGYD 310
+NHAV VGYD
Sbjct: 283 IILNHAVLAVGYD 295
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 146/313 (46%), Gaps = 28/313 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQ--KLELFSSFQQRYKKSYSKS-EHDIRFK 58
F +KN+ ++ L ++ L P V+ NL++ LE ++ + + Y E + RFK
Sbjct: 5 FFLKNITVVLLLFSILSL-YPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFK 63
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F+++++ IE NKN + + + +++DL+ EEF T + + L+S +
Sbjct: 64 TFKENVEFIESFNKN--GTQRYKLAVNKYADLTTEEFTTSFMGLDTS---LLSQQES-TA 117
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
K S+T +P DWR+ G + V++Q CG CWAFS E + +
Sbjct: 118 TTTSFKYDSVTE-------VPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIA 170
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKR 237
N L LS Q+++DC+ N GC GG D+ + N + E+ YP CK
Sbjct: 171 NNELISLSEQQLLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKT 229
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGS 296
+ P V I Y ++PS+ S L + P+ + A + Y G+ +C+
Sbjct: 230 E--QPAAVTINGY---EVVPSDESSLLKAVVNQPISVGIAANDEFHMYGSGIYDGSCNSR 284
Query: 297 LANINHAVQIVGY 309
L NHAV ++GY
Sbjct: 285 L---NHAVTVIGY 294
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 151/331 (45%), Gaps = 50/331 (15%)
Query: 3 DVKNVLFIVALIALCFLAI-----PVKVSKPNLEQKLELF-SSFQQRYK------KSYSK 50
+V L I+ + + + A P++ ++E++ E + +RYK + +
Sbjct: 9 NVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGI 68
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+ ++RF N+ + + L N +F+D++ EE+K ++ L
Sbjct: 69 YQSNVRFINYINAQNFSFTLTDN------------QFADMTNEEYKALYMG-------LG 109
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ + + ++RS +P+ DWR+ G + VRNQ CG+CWAFSTV
Sbjct: 110 TSETSRKNQSSFKRERSKV--------LPISVDWRKMGAVTPVRNQGECGSCWAFSTVAA 161
Query: 171 AESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E ++ ++ G L LS QE++DC +GN GC+GG ++ N + + YP +
Sbjct: 162 VEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARN-YPYI 220
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
+ C + + + VKI Y +T+ P+ IL PV A++A +Q Y G
Sbjct: 221 GEQGICNKDKAANHVVKISGY--ETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKG 278
Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
+ C L NHAV ++GY DN + W
Sbjct: 279 IFNGFCGKQL---NHAVTVIGYGEDNGKKYW 306
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 143/287 (49%), Gaps = 34/287 (11%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E + ++ ++ ++ KSY+ E + RF+ F+ +L I+E N ++ + G+ F+D
Sbjct: 45 EDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRT---YKVGLNRFAD 101
Query: 90 LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
L+ EE+++ +L R + + S +K D + V G ++P + DWR+
Sbjct: 102 LTNEEYRSMYLGTRTAAKRR---SSNKISDRYAFRV-------GDSLPESV----DWRKK 147
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG
Sbjct: 148 GAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG--- 204
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ +N ++ E +YP D C + + V I Y D E S+
Sbjct: 205 -LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYE-DVPENDEKSLEK 262
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 263 AVANQ-PVSVAIEAGGREFQLYQSGIFTGRCGTAL---DHGVTAVGY 305
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 145/303 (47%), Gaps = 30/303 (9%)
Query: 14 IALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELN 71
+A+CFLA +S L E + +F+ ++ KSY S E R ++++ I+E N
Sbjct: 3 VAICFLAF-FAISHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHN 61
Query: 72 KNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
K ++ E S + + F DL + EFK ++ K N + T
Sbjct: 62 KRYENGEVSYKLKMNHFGDLMQHEFKA------------LNKLKRSAKQQNSGEVFRATG 109
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G +P K DWR+ G + V++ CG+CWAFS+ + LKN L LS Q++
Sbjct: 110 GK-----LPAKVDWRQKGAVTPVKDPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQL 164
Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+GN GN GC GG ++ N + + E YP +D C+ K S G K
Sbjct: 165 VDCSGNYGNDGCDGGIMVQAFQYIKGNGGI-DTEGSYPYEAEDDKCRYKTKSVAGTD-KG 222
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQI 306
Y D E+++ +A GP+ A++A L++Q+Y G+ + C S ++H V +
Sbjct: 223 YV-DIAQGDENALKEAVAEIGPISVAIDAGNLSFQFYSEGIYDEPFC--SNTELDHGVLV 279
Query: 307 VGY 309
VGY
Sbjct: 280 VGY 282
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 139/307 (45%), Gaps = 31/307 (10%)
Query: 8 LFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
+F+ L+ L A + +P EQ+ L+ + ++ + Y E + R+ F++++
Sbjct: 10 IFLPFLLILAAWATKI-ACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENI 68
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+ IE N S + G+ +F+DL+ EEF+ + + LMS +++ +
Sbjct: 69 ERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMSSSFRYENLSD--- 123
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
IP DWR G + V++Q TCG CWAFSTV E + L+ G L
Sbjct: 124 -------------IPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLIS 170
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS Q+++DC GN GC GG ++ + L E YP D C + +
Sbjct: 171 LSEQQLVDCTA-GNKGCQGGLMDTAFQYI-IRNGGLTSEDNYPYQGVDGTCSSEKAASTE 228
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
+I Y D +E+++L +A PV V+ +Q+Y GV +C NH
Sbjct: 229 AQITGYE-DVPQNNENALLQAVAKQ-PVSVGVDGGGNDFQFYKSGVFNGDCG---TQQNH 283
Query: 303 AVQIVGY 309
AV +GY
Sbjct: 284 AVTAIGY 290
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 123/247 (49%), Gaps = 20/247 (8%)
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
Q +S R G+T+F+D+ EE+K+ L+S + + ++ S +
Sbjct: 67 QGIKSYRLGMTQFADMDNEEYKS-----------LISLGCLRAFNTSAPRRGSAFFRLAE 115
Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
T +P DWR+ G + V++Q+ CG+CWAFS + E + K G L LS Q+++DC+
Sbjct: 116 GTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCS 175
Query: 195 GN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
G+ GNMGC+GG ++ N + + E YP +D C+ K + G K Y D
Sbjct: 176 GDYGNMGCNGGLMDYAFKYIQENGGI-DTEKSYPYEAEDGQCRFKPENV-GAKCTGYV-D 232
Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY-- 309
+ E ++ +AT GPV ++A ++Q Y GV D S +++H V VGY
Sbjct: 233 VTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYDEQ-DCSSQDLDHGVLAVGYGT 291
Query: 310 DNYSRTW 316
DN W
Sbjct: 292 DNGQDYW 298
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/325 (28%), Positives = 146/325 (44%), Gaps = 44/325 (13%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQ--------QRYKKSYSKSEHDI-----RF 57
+ ++ CF + V V+ L +L S Q + + SY + DI R+
Sbjct: 1 MGFVSQCFCLV-VMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRY 59
Query: 58 KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
K FE+++ +IE NK+ P + + +F+DL+ EEFK R+ H+ + K
Sbjct: 60 KIFEENVALIESSNKDANKP--YKLSVNQFADLTNEEFKAS--RNRFKGHICST--KSTS 113
Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
+ +V + +P DWR G + V++Q CG CWAFS V E + L
Sbjct: 114 FKYGNV------------SAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKL 161
Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
G L LS QE++DC +G + GC GG ++ N L E+ YP D C
Sbjct: 162 TTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNH-GLASEANYPYKGVDGTCN 220
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
+ + +I + D SE ++L +A H PV A++A +Q+Y GV C
Sbjct: 221 TNKQAIHAAEINGFE-DVPANSEEALLNAVA-HQPVSVAIDAGGSGFQFYSKGVFIGACG 278
Query: 295 GSLANINHAVQIVGY---DNYSRTW 316
L +H V VGY D+ ++ W
Sbjct: 279 TQL---DHGVTAVGYGTSDDGTKYW 300
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 131/275 (47%), Gaps = 28/275 (10%)
Query: 40 FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTR 98
++ +++ Y +E + R +EK++ +I+ N + + + F D++ EEF
Sbjct: 6 WKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF--- 62
Query: 99 HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
R VN + H H K R + + IP DWRE G + V+NQ
Sbjct: 63 --RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQ 108
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNK 217
CG+CWAFS E LK G L LS Q ++DC+ GN GC+GG ++ N
Sbjct: 109 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 168
Query: 218 VVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGPVIAAV 276
L+ E YP KD +CK +A + + T IP E +++ +AT GP+ A+
Sbjct: 169 -GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEKALMKAVATVGPISVAM 223
Query: 277 NAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A + Q+Y G I Y + S N++H V +VGY
Sbjct: 224 DASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 257
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F ++L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P DWR G + V++Q CG+CWAFS + E L
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C + GC GG +W+ N + E YP +
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPP 222
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
TS + V L E+ I +A +GPV AV+A +W Y GGV+ +C
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280
Query: 298 ANINHAVQIVGYDN 311
++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 132/288 (45%), Gaps = 24/288 (8%)
Query: 26 SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQ-SPESARYGI 84
+ P L+ +L+ S+ + K Y + E R +EK+L +IE N + S + G+
Sbjct: 2 ADPELDGHWQLWKSW---HNKDYHEREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGM 58
Query: 85 TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDW 144
+F D++ EEF+ LM+ + H + + + P DW
Sbjct: 59 NQFGDMTTEEFRQ-----------LMNGYAHKKSERKYRGSQFLEPSFLE---APRSVDW 104
Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSG 203
RE G + V++Q CG+CWAFST E H K G L LS Q ++DC+ GN GC+G
Sbjct: 105 REKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNG 164
Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
G ++ N + + E YP KD R N + D E +++
Sbjct: 165 GLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEYNAANDTGFV-DIPQGHERALM 222
Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A GPV A++A ++Q+Y G I Y D S +++H V +VGY
Sbjct: 223 KAVAAVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDLDHGVLVVGY 269
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 136/279 (48%), Gaps = 23/279 (8%)
Query: 34 LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ + S+ ++ KSY+ E + RF+ F+ + I+E N + S + G+ F+DL+
Sbjct: 41 MAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD--RSFKLGLNRFADLTN 98
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EE+++ K+ + ++ + G ++P + DWRE G +
Sbjct: 99 EEYRS--------KYTGIRTKDSRKKVSGKSQRYASLAGESLPESV----DWREHGAVAS 146
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG +
Sbjct: 147 VKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQF 206
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ +N ++ +++YP +D C + + V I SY + + + L A + P+
Sbjct: 207 I-INNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSY--EDVPEYDEKALQKAAANQPI 263
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A +Q+Y G+ C +++H V +VGY
Sbjct: 264 SVAIEASGRDFQFYDSGIFTGKCG---TDLDHGVVVVGY 299
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 30/308 (9%)
Query: 7 VLFIVALIALCFL-AIPVKVSKPN-LEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKS 63
V + + LC + A P S+ + ++ F + Y + Y ++ +R F+ F+ +
Sbjct: 5 VQLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNN 64
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
++ IE N NR S GI +F+D++ EF T++ S+ + D +
Sbjct: 65 VNHIETFN-NRNG-NSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVSFDDVNISA 122
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+SI DWR+ G + +V++Q CG+CWAFS + T E ++ + G L
Sbjct: 123 VGQSI--------------DWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLV 168
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LS QEV+DCA + GC GG D++ N V E++YP + C + PN
Sbjct: 169 SLSEQEVLDCAVSN--GCDGGFVDNAYDFIISNNGVAS-EADYPYQAYEGDCTANSW-PN 224
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANIN 301
I Y+ + ++ S + + P+ AA++A +QYY GGV C SL N
Sbjct: 225 SAYITGYS--YVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSL---N 279
Query: 302 HAVQIVGY 309
HA+ I+GY
Sbjct: 280 HAITIIGY 287
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + ++G
Sbjct: 16 LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72
Query: 84 IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
T F D++ EEF R VN + H H K R + + IP
Sbjct: 73 FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
GC+GG ++ N L+ E YP KD +CK +A + + T IP
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +AT GP+ A++A + Q+Y G I Y + S +++H V +VGY
Sbjct: 231 QEKALMKPVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 140/304 (46%), Gaps = 38/304 (12%)
Query: 19 LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
+A +K+ L + + F F + Y +SYS E +R + +++ P
Sbjct: 36 IARKLKLGDNELLRTEKKFKVFMENYGRSYSTEEEYLRRLGI-FAQNMVRAAEHQALDP- 93
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP--- 135
+A +G+T+FSDL+E+EF+ L VN S++ GI P
Sbjct: 94 TAVHGVTQFSDLTEDEFE--KLYTGVNGGFPSSNNA--------------AGGIAPPLEV 137
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
G+P DWRE G + +V+ Q CG+CWAFST + E + L G L LS Q+++DC
Sbjct: 138 DGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDN 197
Query: 196 NGNM--------GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++ GC+GG +++ + LE ES YP + CK P + +
Sbjct: 198 KCDITEKTSCDNGCNGGLMTNAYNYL-LESGGLEEESSYPYTGERGECK---FDPEKIAV 253
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQ 305
K + E+ I + +GP+ VNA+ Q Y+GGV +C S +NH V
Sbjct: 254 KITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGV---SCPLICSKKRLNHGVL 310
Query: 306 IVGY 309
+VGY
Sbjct: 311 LVGY 314
>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
Length = 331
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 137/278 (49%), Gaps = 28/278 (10%)
Query: 35 ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ F +F Y K Y+ SE + RF F+++L EE+N + +SA Y I +F+DLS+
Sbjct: 29 DYFETFLANYNKMYNDTSEKERRFSIFQQTL---EEINYKNRLNDSAVYQINKFADLSKN 85
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI-PVKKDWREAGIIGK 152
E +++ +N V + N K T I P G P+ DWR+ +
Sbjct: 86 EIISKYT--GLNMPVQTT---------NFCK----TIVIDQPPGKGPLNFDWRQQNKVTS 130
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
++NQ+ CGACWAF+T+ + ES +A+KN LS Q++IDC +MGC GG +
Sbjct: 131 IKNQKACGACWAFATLASIESQYAIKNNVHIDLSEQQMIDC-DYVDMGCDGGLLHTAFEQ 189
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH-GP 271
M + L E EYP + C+ + VK+K C + L D+ GP
Sbjct: 190 M-IQMGELVQEHEYPYAGVNKPCELRGDETGVVKVKG--CYRYVVFREEKLKDLLRAVGP 246
Query: 272 VIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A++A Y G+I Y C+ +NHAV +VGY
Sbjct: 247 IPMAIDASGIVNYHHGIIHY-CENY--GLNHAVLLVGY 281
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 137/279 (49%), Gaps = 23/279 (8%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFSDLSE 92
+LF + Q++ K+Y S+ E ++R K F + + +++ N ++ E + G+ +DL++
Sbjct: 66 DLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADLTK 125
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
+EFK ++ ++ V + P P + DW +G +
Sbjct: 126 DEFKK-----------MLGYNAALRASRAPVDASTWEYADVTP---PEEIDWVASGAVTP 171
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ+ CG+CWAFST E ++A+K G L LS +E+I C+ NGNMGC+GG +W
Sbjct: 172 VKNQKQCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEW 231
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ VN ++ E + + K+ C V I + D E S++ ++ PV
Sbjct: 232 I-VNNRGIDTEDGWEYVAKEEKCGFFRRHHRAVAIDGFK-DVPSNDEDSLMKAVSQQ-PV 288
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ A ++Q Y GGV Y+ ++H V +VGY
Sbjct: 289 SVAIEADHQSFQLYAGGV--YSAKDCGTELDHGVLLVGY 325
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 143/287 (49%), Gaps = 34/287 (11%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E + ++ ++ ++ KSY+ E + RF+ F+ +L I+E N ++ + G+ F+D
Sbjct: 47 EDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRT---YKVGLNRFAD 103
Query: 90 LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
L+ EE+++ +L R + + S +K D + V G ++P + DWR+
Sbjct: 104 LTNEEYRSMYLGTRTAAKRR---SSNKISDRYAFRV-------GDSLPESV----DWRKK 149
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +V++Q +CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG
Sbjct: 150 GAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG--- 206
Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ +N ++ E +YP D C + + V I Y D E S+
Sbjct: 207 -LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXVVTIDGYE-DVPENDEKSLEK 264
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 265 AVANQ-PVSVAIEAGGREFQLYQSGIFTGRCGTAL---DHGVTAVGY 307
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/241 (31%), Positives = 121/241 (50%), Gaps = 24/241 (9%)
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
Q +S R G+T F+D+ EE+K +++ L H N R +T +
Sbjct: 66 QGLKSYRLGMTYFADMENEEYK-----RVISQGCL--------HSFNASLPRRGSTFFRL 112
Query: 135 PTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
P G +P DWR+ G + V++Q+ CG+CWAFS + E H K GTL LS Q+++D
Sbjct: 113 PEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQHFRKTGTLVSLSEQQLVD 172
Query: 193 CAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
C+G+ GNMGC GG ++ N + + E YP ++ C+ + G YT
Sbjct: 173 CSGDYGNMGCMGGLMDYAFQYIQANGGI-DTEESYPYEAENGKCRYNPDNI-GATSTGYT 230
Query: 252 CDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINHAVQIVG 308
+ E ++ +AT GP+ ++A +++Q+Y GV YN D S ++H V VG
Sbjct: 231 -EVSQGDEDALKEAVATIGPISVGIDASQMSFQFYESGV--YNEPDCSSLELDHGVLAVG 287
Query: 309 Y 309
Y
Sbjct: 288 Y 288
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 32/316 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
+ + L +C+ + + + P ++ + + SF++R+ K++ +E RF F+++
Sbjct: 10 AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+DL+ +EF +L N H K H
Sbjct: 70 MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKDH------ 116
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + + P+G+ + DWR+ G + V+NQ CG+CWAFS + E A +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
LS Q ++ C N + GC+GG ++W M + + E+ YP C +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
G KI + +L E I + GPV AV+A TWQ Y GGV+ SL
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287
Query: 299 NINHAVQIVGYDNYSR 314
NH V IVG++ ++
Sbjct: 288 --NHGVLIVGFNKNAK 301
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 145/309 (46%), Gaps = 37/309 (11%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
V+L A ++I V + + E+ +++ + + +Y+ E + RF+ F +L I++
Sbjct: 18 VSLAAAADMSI-VSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ 76
Query: 70 LNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHHHNHVKK 125
N + S R G+ F+DL+ EE+++ +L + L + ++ D+
Sbjct: 77 HNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDE----- 131
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+P DWR+ G +G V++Q CG+CWAFS + E ++ + G + L
Sbjct: 132 ------------LPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPL 179
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
S QE++DC + N GC+GG L+D+ +N ++ E +YP +D C +
Sbjct: 180 SEQELVDCDTSYNQGCNGG----LMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNA 235
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
V I Y D + SE S+ +A P+ A+ A +Q Y G+ C +L
Sbjct: 236 KVVTIDGYE-DVPVNSEKSLQKAVANQ-PISVAIEAGGRAFQLYKSGIFTGTCGTAL--- 290
Query: 301 NHAVQIVGY 309
+H V VGY
Sbjct: 291 DHGVAAVGY 299
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/257 (32%), Positives = 127/257 (49%), Gaps = 22/257 (8%)
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
F+N K + + L++ +SP + GI +FSD+ E+EF T +N + + K
Sbjct: 11 FRNNIKKIQMHNYLHEQGKSPFTM--GINQFSDMDEKEFST-----IMNGFRMNNRTKVR 63
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
DH H+H IP +P + DWR+ G + V+NQ CG+CWAFS + E H
Sbjct: 64 DHLHSHY------ISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHF 117
Query: 177 LKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
K G L LS Q ++DC+ + GN GC+GG ++ N + E+ YP D C
Sbjct: 118 RKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGD-DTEACYPYEAVDGMC 176
Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYN 292
+ K G + YT D +E + +A GPV A++A ++ Y GGV ++
Sbjct: 177 RFKRECV-GATCRGYT-DLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKE 234
Query: 293 CDGSLANINHAVQIVGY 309
C S ++H V +VGY
Sbjct: 235 C--SPYQLDHGVLVVGY 249
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/308 (30%), Positives = 138/308 (44%), Gaps = 42/308 (13%)
Query: 18 FLAIPVKVSKPNLEQKLEL---FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKN 73
LA P S P + +EL F F Y KSY+ +E R F ++L++ +L +
Sbjct: 248 LLAEPHSSSLPRMGDSVELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQEL 307
Query: 74 RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
Q SA+YG+T+FSDL+EEEF+ +L + L+S + R++
Sbjct: 308 DQG--SAQYGVTKFSDLTEEEFRMFYL------NPLLSS----------LPGRALRPAPR 349
Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
P DWR+ G + +NQ CG+CWAFS E L+ G L LS QE++DC
Sbjct: 350 ARGPAPASWDWRDHGALTAAKNQGMCGSCWAFSVTGNVEGQWFLRRGALLTLSEQELVDC 409
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVV-------LEPESEYPLLLKDAACKRKATSPNGVK 246
+ C GG + N LE E +Y + C + SP+ +
Sbjct: 410 -DTLDQACGGG--------LPSNAYTAIETLGGLETEKDYSYEGRKERC---SFSPDKAR 457
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQ 305
+ L E I +A +GPV A+NA Q+Y GV + S I+HAV
Sbjct: 458 AYINSSVDLSRDEQEIAAWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWFIDHAVL 517
Query: 306 IVGYDNYS 313
+VGY + S
Sbjct: 518 LVGYGDRS 525
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E +L+ ++ + S S E RF F+ ++ + NK + + + +F+D+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EF++ + VN H + +H + K S+ P DWR+ G +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V++Q CG+CWAFST+ E ++ +K L LS QE++DC N GC+GG +
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ K + ES YP ++ C + V I + + + E+++L +A
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A +Q+Y GV +C+ ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 151/331 (45%), Gaps = 50/331 (15%)
Query: 3 DVKNVLFIVALIALCFLAI-----PVKVSKPNLEQKLELF-SSFQQRYK------KSYSK 50
+V L I+ + + + A P++ ++E++ E + +RYK + +
Sbjct: 5 NVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGI 64
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
+ ++RF N+ + + L N +F+D++ EE+K ++ L
Sbjct: 65 YQSNVRFINYINAQNFSFTLTDN------------QFADMTNEEYKALYMG-------LG 105
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ + + ++RS +P+ DWR+ G + VRNQ CG+CWAFSTV
Sbjct: 106 TSETSRKNQSSFKRERSKV--------LPISVDWRKMGAVTPVRNQGECGSCWAFSTVAA 157
Query: 171 AESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
E ++ ++ G L LS QE++DC +GN GC+GG ++ N + + YP +
Sbjct: 158 VEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARN-YPYI 216
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
+ C + + + VKI Y +T+ P+ IL PV A++A +Q Y G
Sbjct: 217 GEQGICNKDKAANHVVKISGY--ETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKG 274
Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
+ C L NHAV ++GY DN + W
Sbjct: 275 IFNGFCGKQL---NHAVTVIGYGEDNGKKYW 302
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y +E +RF F+++LD+I NK S + G+ +F+D++ +EF
Sbjct: 60 FARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADMTWQEF 116
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK TG +P KDWRE GI+ V+
Sbjct: 117 QRTKLGAAQNCSATLKGTHK--------------LTG----EALPETKDWREDGIVSPVK 158
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP +D CK A + GV++ + + + +E + + PV
Sbjct: 219 KSNG-GLDTEEAYPYTGEDGTCKYSAENV-GVEVLD-SVNITLGAEDELKHAVGLVRPVS 275
Query: 274 AAVNAL-TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 276 IAFEVIHSFRLYKSGVYSDSHCGQTPMDVNHAVLAVGY 313
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 152/307 (49%), Gaps = 30/307 (9%)
Query: 9 FIVALIALCFLAIPVKVSKP--NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
++ L+ + + + +P N E E + R+ ++Y +E + RF+ F+ +LD
Sbjct: 10 LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
IE NK ++ + G+ +FSDLSEEEF T + + + + + N K
Sbjct: 70 YIENFNKAFN--KTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTA---------NTTVK 118
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+ + +P DWRE G++ V+NQ CG CWAFS V E + G + L
Sbjct: 119 PTFFSNYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIA----GNGASL 174
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
S Q+++DC G+ N GC GG +++ N+ ++ +++YP C ++ S
Sbjct: 175 SAQQLLDCVGD-NSGCGGGTMIKAFEYIVQNQGIVS-DTDYPYEQTQEMC--RSGSNVAA 230
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT---WQYYLGGVIQYNCDGSLANINH 302
+I Y +++I SE ++ +A P+ A++A + ++ Y+ GV ++ + ++ H
Sbjct: 231 RITGY--ESVIQSEEALKRAVAKQ-PISVAIDASSGPNFKSYISGV--FSAEDCGTHLTH 285
Query: 303 AVQIVGY 309
AV +VGY
Sbjct: 286 AVTLVGY 292
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F ++L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRENL-FLARLHA--AANPHATFGVTAFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P DWR G + V++Q CG+CWAFS + E L
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C + GC GG +W+ N + E YP +
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
TS + V L E+ I +A +GPV AV+A +W Y GGV+ +C
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280
Query: 298 ANINHAVQIVGYDN 311
++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 131/279 (46%), Gaps = 29/279 (10%)
Query: 37 FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F FQ+ + K Y+ E + R+ F+ +L I N N Q S + +F DL+ EEF
Sbjct: 89 FYQFQRDHNKFYATEEERLKRYAIFKNNLTYIH--NHNMQG-YSYVLKMNKFGDLTLEEF 145
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ R+L + K L + + D V+ I P DWR+ G + V++
Sbjct: 146 RQRYLGY--KKPDLRTPPREVDTTLESVEDNDI----------PTHVDWRQRGCVTSVKD 193
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CWAFS E ++ K G L LS Q+++DC+ GN GC GG +++
Sbjct: 194 QGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVV 253
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPV 272
N + E+ YP + KD CK S + + T +P SE S+ T +A PV
Sbjct: 254 ENGGICSGEN-YPYMRKDGVCK----SSQCTSVATITGYRSVPRRSEKSMKTALALRSPV 308
Query: 273 IAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A+ N +Q+Y G+ C N++H V +VGY
Sbjct: 309 SVAIQANQAAFQFYYDGIFDAPCG---TNLDHGVLLVGY 344
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 149/309 (48%), Gaps = 30/309 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
++ L+A +A +S E + E +F+ ++ K YS+ E R F+ +L IE
Sbjct: 3 LLVLLACVAMATAASLS---FESQWE---AFKIKHDKVYSEKEEYARRLIFQDNLKTIES 56
Query: 70 LNKNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N+ + + + + G+ +F+D++ E+ L + ++ S N K S
Sbjct: 57 HNQEADTGKHSYWLGVNQFADMTHAEY----LNQVIGGCLITS---------NLTKTGSR 103
Query: 129 TTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
T +P + DWR+ G++ +++Q CG+CWAFST + E HA GTL LS
Sbjct: 104 ATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSE 163
Query: 188 QEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC GGD ++ NK + + E YP K+ CK S G
Sbjct: 164 QNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGI-DTEQCYPYKAKNHRCKFD-NSCIGAT 221
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHA 303
+ S+T D E ++ A GP+ ++A ++Q+Y GV ++ C S ++H
Sbjct: 222 MSSFT-DVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFEC--SSTKLDHG 278
Query: 304 VQIVGYDNY 312
V +VGY Y
Sbjct: 279 VLVVGYGTY 287
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYESAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F ++L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P DWR G + V++Q CG+CWAFS + E L
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C + GC GG +W+ N + E YP +
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
TS + V L E+ I +A +GPV AV+A +W Y GGV+ +C
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280
Query: 298 ANINHAVQIVGYDN 311
++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293
>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
Length = 401
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 131/296 (44%), Gaps = 23/296 (7%)
Query: 21 IPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
+P P + + F F+++Y K YS E + RF+ ++++++ I+ N S
Sbjct: 70 VPGDYVDPATREYRKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFS--- 126
Query: 80 ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
+ EF DLS+EEF R ++ S + V P I
Sbjct: 127 YVLEMNEFGDLSKEEFMAR-----FTGYIKDSKDDERVFKSSRVSASESEEEFVPPNSI- 180
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH-ALKNGTLSLLSVQEVIDCAG-NG 197
+W EAG + +RNQ+ CG+CWAFS V E A N L LS Q+ +DC+ NG
Sbjct: 181 ---NWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNG 237
Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
N GC GG + NK L +YP ++ C + N ++I + P
Sbjct: 238 NFGCDGGTMGLAFQYAIKNK-YLCTNDDYPYFAEEKTC-MDSFCENYIEIPVKAYKYVFP 295
Query: 258 SESSIL-TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
+ L T +A +GP+ A+ A +Q+Y GV C +NH V +VGYD
Sbjct: 296 RNINALKTALAKYGPISVAIQADQTPFQFYKSGVFDAPCG---TKVNHGVVLVGYD 348
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 127/273 (46%), Gaps = 23/273 (8%)
Query: 39 SFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
+ Q R +S EH RF+ F++++ I+ +NK + SP + G+ +F+DLS EEFK
Sbjct: 50 ALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNK-KDSP--YKLGLNKFADLSNEEFKAI 106
Query: 99 HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
++ K V+ S + P +P DWR+ G + V+NQ
Sbjct: 107 YM-----------GTKMDLRGDREVQSGSFMYQNSEP--LPASIDWRQKGAVAAVKNQGH 153
Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV 218
CG+CWAFSTV + E ++ + G L LS Q+++DC+ N GC+GG ++ +N
Sbjct: 154 CGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYI-INNG 211
Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
+ E YP + C + ++ + + + L + H PV A+ A
Sbjct: 212 GIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEA 271
Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+Q+Y GV C +L +H V VGY
Sbjct: 272 SGQDFQFYSTGVFTGKCGTAL---DHGVVAVGY 301
>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 146/311 (46%), Gaps = 38/311 (12%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
++A+ A +A+ ++ + + +F+Q + K+Y E RF F+++L I+
Sbjct: 3 LLAIFATVLIAVTASTNE-------DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIK 55
Query: 69 ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
E N + + E+ G+T F+DL+ EEFK +L K+ K R
Sbjct: 56 EHNARYDKGEETYLLGVTRFADLTHEEFK----------DILKGQIKN--------KPRL 97
Query: 128 ITTGITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
T P +P DW E G + +V++Q CG+CWAFS E +A+ N L
Sbjct: 98 NATPTVFPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNNVKISL 157
Query: 186 SVQEVIDC-AGNGNMGC-SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
S Q+++DC A GN C GGD A ++ V ++ E YP + K C+ A S
Sbjct: 158 SEQQLLDCSAAYGNGNCKEGGDMSAAFEY--VRDYGIQSEKSYPYIRKQTECQYDA-SKT 214
Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
+KIK Y + SE + + GP+ A+N+ Q Y G+I + G +++H
Sbjct: 215 ILKIKGYK--NVTTSEEGLRKAVGAIGPISIAMNSDPLQLYYSGII--SGKGCSHDLDHG 270
Query: 304 VQIVGYDNYSR 314
V +VGY S+
Sbjct: 271 VLVVGYGKASQ 281
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 132/277 (47%), Gaps = 25/277 (9%)
Query: 36 LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
LF+ F RY K Y E +I+ + FE LD ++ + + + S + G+ EF+D++ +EF
Sbjct: 60 LFARFAHRYGKRYETVE-EIK-QRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDITWDEF 117
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L + N ++K ++ +P KDWREAGI+ V+N
Sbjct: 118 RRDRLGAAQNCSATT---------KGNLKLTNVV--------LPETKDWREAGIVSPVKN 160
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 161 QGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 220
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N L+ E YP K+ CK + + GVK+ + + + +E + +A PV
Sbjct: 221 SNG-GLDTEEAYPYTGKNGLCKFSSENV-GVKVID-SVNITLGAEDELKYAVALVRPVSI 277
Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + QY G C + ++NHAV VGY
Sbjct: 278 AFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGY 314
>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
Length = 334
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 148/319 (46%), Gaps = 39/319 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+ L A C L I V P +Q L+ + ++ +++ Y +E R +EK++ +IE
Sbjct: 5 LVLAAFC-LGIASAV--PKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61
Query: 70 LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N Q + F D++ EEF+ +M ++ V + +
Sbjct: 62 HNGEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------MMGCFRNQKFRKGKVFREPL 110
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
+ +P + DWR+ G + V+NQ+ CG+CWAFS E K G L LS Q
Sbjct: 111 F--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC+GG ++ N L+ E YP + D CK + + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMGKAFQYVKENG-GLDSEESYPYVAMDEICKYRPEN----SV 219
Query: 248 KSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
+ T T++P E +++ +AT GP+ A++A ++Q+Y G I + D S N++H
Sbjct: 220 ANDTGFTVVPPGKEKALMKAVATVGPISVAMDAGHSSFQFYNQG-IYFEPDCSSENLDHG 278
Query: 304 VQIVGY------DNYSRTW 316
V +VGY N S+ W
Sbjct: 279 VLVVGYGFEGANSNNSKYW 297
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)
Query: 34 LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
+ ++ ++ ++ K+ S++ E D RF+ F+ +L ++E N+ S R G+T F+DL
Sbjct: 47 MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ +E+++++L + K ++ S+ + +P DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEK--------------KGERRTSLRYEARVGDELPESIDWRKKGAV 149
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205
Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ + ++ + +YP D C + + V I SY D SE S+ +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H P+ A+ A +Q Y G+ +C ++H V VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCG---TQLDHGVVAVGY 304
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+N+ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 128/285 (44%), Gaps = 31/285 (10%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
E + Y K Y +E D RF+ F+ +++ IE N + P + G+ +DL+ E
Sbjct: 36 ERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKP--YKLGVNHLADLTVE 93
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK + H+ K ++T IP DWR G + +
Sbjct: 94 EFKASR----------NGFKRPHEFSTTTFKYENVTA-------IPAAIDWRTKGAVTPI 136
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
++Q CG+CWAFST+ E +H + G L LS QE++DC G + GC GG ++
Sbjct: 137 KDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEF 196
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ N + E+ YP D C KATSP +IK Y + + P+ + L + PV
Sbjct: 197 IIKNGGITS-ETNYPYKAVDGKC-NKATSPV-AQIKGY--EKVPPNSETALQKAVANQPV 251
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
+++A + +Y G+ C L +H V VGY + T
Sbjct: 252 SVSIDADGAGFMFYSSGIYNGECGTEL---DHGVTAVGYGTANGT 293
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 146/311 (46%), Gaps = 28/311 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYSKSE-HDIRFKNFEKS 63
+ L ALC+ + + + ++ ++ F F++++ KS+ + RF F+++
Sbjct: 10 AMVATVLFALCYCSTVIARTLHGIDDEVASAHFMHFKKQHGKSFGEEAVEGHRFNAFKEN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+ L+ +EF ++L L +H K H +
Sbjct: 70 MQTAVYLNA--QNPH-AHYDVSGKFAALTPQEFAKQYLNPDYYTRQLKAH-KERAHVYEG 125
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
V+ G DWRE G + +V++Q CG+CWAFS + E AL TL
Sbjct: 126 VR------------GGLSAVDWREKGAVTEVKDQGLCGSCWAFSAIGNIEGQWALSGNTL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN-KVVLEPESEYPLLLKDAACKR-KAT 240
LS Q ++ C +MGC+GG W+ N + E YP D + +T
Sbjct: 174 VSLSEQMLVSC-DTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTSGDGSTASCLST 232
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
G +I +L E +I + +GP+ AV+A TWQ Y GGV+ NC N+
Sbjct: 233 GKVGARISGQV--SLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVS-NCFAY--NL 287
Query: 301 NHAVQIVGYDN 311
NH V +VGY+N
Sbjct: 288 NHGVLLVGYNN 298
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
M L + A++ + +P + + E+ L F+ F+Q++ + Y S +E R
Sbjct: 1 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYESAAEEAFRLS 60
Query: 59 NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F ++L + L+ + A +G+T FSDL+ EEF++R+ H H
Sbjct: 61 VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104
Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
++ + + + G P DWR G + V++Q CG+CWAFS + E L
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163
Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
L+ LS Q ++ C + GCSGG +W+ N + E YP +
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
TS + V L E+ I +A +GPV V+A +W Y GGV+ +C
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYTGGVMT-SCVSE- 280
Query: 298 ANINHAVQIVGYDN 311
++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293
>gi|326492229|dbj|BAK01898.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 148/322 (45%), Gaps = 30/322 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSKSEHDIRFKNFEKS 63
++ + +A+C LA K E + ++ F+ + K Y E IRF N++ +
Sbjct: 6 LITLTVFLAICSLAASSNTFKNPQEDVVLFYNIFKDWITQSNKQYGIEEMAIRFFNWKNN 65
Query: 64 LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
D ++E N Q+ + R + +++D++ EEF H+ +N +L + K+
Sbjct: 66 FDFVQE--HNAQAGLTFRLEMNDYADMTAEEFSALHM--GLNTELLAASKKNKTAPAAAK 121
Query: 124 KKRSITTGI------------TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
K + T TG+P D R+ G + V+NQ TCG C+AF+
Sbjct: 122 KANNTTNSTNATNAGFNKNSSAADTGLPKSVDCRKTGAVSGVKNQGTCGGCYAFAAAGAL 181
Query: 172 ESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
E ++A+KN L+ +SVQ++IDC+G GN GC GG + + V E ES Y
Sbjct: 182 EGLYAIKNKKLTDISVQQMIDCSGFFGNKGCDGGLMTTTFGFTQMFGV--EAESTYGYAA 239
Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV 288
C++ + + + ++ + + +++ L PV + A L Q + GV
Sbjct: 240 ALGECRQ---NTDNIVFRNSGYEEVPQNDTLALKKAVARQPVSVGIEASSLAVQLFKSGV 296
Query: 289 IQYNCDGSLANINHAVQIVGYD 310
+ C +L NHAV IVGYD
Sbjct: 297 LTGGCGTAL---NHAVLIVGYD 315
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + ++G
Sbjct: 16 LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72
Query: 84 IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
T F D++ EEF R VN + H H K R + + IP
Sbjct: 73 FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
GC+GG ++ N L+ E YP KD +CK +A + + T IP
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +AT GP+ A++A + Q+Y G I Y + S +++H V +VGY
Sbjct: 231 QEKALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 133/285 (46%), Gaps = 27/285 (9%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
+Q L L+ S+ ++ K+Y+ E + RF F+ ++ ++ N R +S + G+ +F+D
Sbjct: 54 DQLLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRN--QSYKLGLNKFAD 111
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
L+ +E+++ +L + K N RS +P DWR+ G
Sbjct: 112 LTNDEYRSLYLSGKMMKR----------ERKNEDGFRSDRFVFEDGDHLPESVDWRDRGA 161
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ V++Q CG+CWAFSTV E ++ + G L LS QE++DC N GC+GG L
Sbjct: 162 VAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGG----L 217
Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+D+ V ++ E +YP D C + + V I Y D E S+ +
Sbjct: 218 MDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYE-DVPHNDEKSLKKAV 276
Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A H PV A+ A +Q Y GV C L +H V VGY
Sbjct: 277 A-HQPVSVAIEAGGRAFQLYESGVFTGQCGTEL---DHGVVAVGY 317
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 132/283 (46%), Gaps = 24/283 (8%)
Query: 31 EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFS 88
E+ L++ ++ + KSY+ E + R+ F +L I+E N + S R G+ F+
Sbjct: 34 EEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93
Query: 89 DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
DL+ EE++ +L ++ V R + +P DWR G
Sbjct: 94 DLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPESVDWRTKG 139
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
+ ++++Q+ G+CWAFS + E ++ + G L LS QE++DC + N GC+GG
Sbjct: 140 AVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 199
Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
D++ +N ++ E +YP KD C + V I SY D SE+S+ +A
Sbjct: 200 AFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSETSLQKAVAN 257
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 258 Q-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296
>gi|330796919|ref|XP_003286511.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
gi|325083492|gb|EGC36943.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
Length = 325
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 151/313 (48%), Gaps = 38/313 (12%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
+ + I L FL V +K E + F S+ K Y + R++ F+ ++D I
Sbjct: 4 YFIGFILLIFLNQNVFCNKLFTEIIYQNKFISWANENNKFYETLDFKKRYEIFKYNMDFI 63
Query: 68 EELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
NK N Q+ G+ +++DLS EE+K+ L ++ N+++
Sbjct: 64 YSWNKGNSQTI----LGLNKYADLSNEEYKSLFLGSNI-------------KTQNYIRIN 106
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
S I P DWR G + V+NQ C + +AFS + + ES + ++NG L LS
Sbjct: 107 SSRYDI------PTTFDWRLKGAVTPVKNQGFCNSGYAFSAIGSLESSNKIENGQLIRLS 160
Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEP-ESEYPLLLKDAACKRKATSPNG 244
Q +IDC+G+ GN GC GG +++ ++ P ES YP + + C+ K G
Sbjct: 161 EQNLIDCSGSEGNRGCDGGTVVNSFNYLFKHQNGKIPKESSYPYEAQKSKCRFKDQFI-G 219
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI-- 300
+ ++ LI ES+I +AT GPV A++A + +Q Y GGV D +NI
Sbjct: 220 ATLNNFA--NLISDESTIQNAVATKGPVSVAIDASSIFFQLYFGGVYD---DLFCSNIYT 274
Query: 301 NHAVQIVGY-DNY 312
NH V IVGY +NY
Sbjct: 275 NHFVLIVGYTENY 287
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 140/279 (50%), Gaps = 25/279 (8%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
F +F + K YS+ E RF+ F ++L I+ N Q SA+YG+TEF+DLS+ EF+
Sbjct: 50 FENFLLEHPKMYSEQESHSRFQTFWENLKRIKFHNHIEQG--SAKYGVTEFADLSDFEFR 107
Query: 97 TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
+L + + + + K ++ K R+ + + + DW E G + +V+NQ
Sbjct: 108 RHYL--GLKPELKIPNRKKYER-----KSRNSSKKLKFAKTVDETFDWVEKGAVTEVKNQ 160
Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--WMD 214
CG+CWAFST E G L LS QE++DC + GC+GG L+D + +
Sbjct: 161 GMCGSCWAFSTTGNIEGAWFKATGDLVSLSEQELVDCD-QKDSGCNGG----LMDQAFEE 215
Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
V ++ LE E +YP C + S + V+I + + E I + HGP+
Sbjct: 216 VIRIGGLETEQQYPYDGVQETCNFEK-SLSKVQIDDFM--DIGEDEEEIAEALEEHGPLS 272
Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
A+NA Q+Y GG+ + + C S ++H V +VGY
Sbjct: 273 IAINAFGMQFYRGGISHPLSFLC--SQDGLDHGVLMVGY 309
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 132/280 (47%), Gaps = 34/280 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F F + +KK Y S+ E R+ F+ ++ +E L KN Q +A YG+T F+DL+ EEF
Sbjct: 196 FKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQG--TAVYGVTFFADLTPEEF 253
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ +L + L +K SI G I + DWRE + +V+N
Sbjct: 254 RKFYLSPQWKRDQLPQ------------RKASIPKG-----KIEDRWDWREHNAVTEVKN 296
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
Q CG+CWAF+T+ E + A+K G L LS QE++DC + GCSGG + +
Sbjct: 297 QGMCGSCWAFATIANVEGVWAVKKGELVSLSEQELVDC-DTLDQGCSGGYPSNAYKEI-I 354
Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVI 273
L E+ Y C+ K + K Y D +L E+ I I +GPV
Sbjct: 355 RLGGLTTETNYSYDGNQGTCRFKTQNA-----KVYINDSVSLPEDETEIAAYIRENGPVA 409
Query: 274 AAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYD 310
+NA +Y G+ ++ C S ++H V IVGYD
Sbjct: 410 VGINAFAMMFYRHGIAHPWRFLC--SPDALDHGVAIVGYD 447
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 136/283 (48%), Gaps = 37/283 (13%)
Query: 34 LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
L++F + + + + Y S SE RF+ F+++ I NK ++S G+ +FSDL+
Sbjct: 46 LDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKS---YWLGLNKFSDLTH 102
Query: 93 EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
+EF+ ++L VN+ ++ + D K DWR G +
Sbjct: 103 QEFRAQYLGTKPVNRQRKEANFMYED------------------VEAEPKVDWRLKGAVT 144
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
V++Q CG+CWAFS V + E ++A+K G L LS QE++DC N GC+GG L+D
Sbjct: 145 DVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGG----LMD 200
Query: 212 W---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
+ + ++ E +YP +D C + V I Y D SES+++ + T
Sbjct: 201 YAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQ-DVPTQSESALMKAL-T 258
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A+ A +Q+Y GGV C L +H V VGY
Sbjct: 259 KNPVSVAIEAGGRDFQHYQGGVFTGPCGSEL---DHGVLAVGY 298
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 130/303 (42%), Gaps = 31/303 (10%)
Query: 21 IPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDI---------RFKNFEKSLDIIEEL 70
IP S + E+ L L+ ++ RY S + + RF F ++ I E
Sbjct: 25 IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH--HHNHVKKRSI 128
N+ P R + +F+D++ +EF+ + S +HH + S
Sbjct: 85 NRRGGRP--FRLALNKFADMTTDEFR---------RTYAGSRARHHRSLRGGRGGEGGSF 133
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
G +P DWRE G + +++Q CG+CWAFS V E ++ +K G L LS Q
Sbjct: 134 RYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQ 193
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
E++DC N GC GG ++ N + ES YP + C + S + V I
Sbjct: 194 ELVDCDTGDNQGCDGGLMDYAFQFIKRNGGIT-TESNYPYRAEQGRCNKAKASSHDVTID 252
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
Y D ES++ +A PV AV A +Q+Y GV C +++H V
Sbjct: 253 GYE-DVPANDESALQKAVANQ-PVAVAVEASGQDFQFYSEGVFTGECG---TDLDHGVAA 307
Query: 307 VGY 309
VGY
Sbjct: 308 VGY 310
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 136/307 (44%), Gaps = 30/307 (9%)
Query: 14 IALCFLAIPVKVSKPNLEQKLELFSS--------FQQRYKKSYSK-SEHDIRFKNFEKSL 64
I LAI + + + LF + + R+ + YS SE RF+ F +L
Sbjct: 4 IVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNL 63
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
+E +N N + ++ + EFSDL++EEFK R+ V + M+ D H V
Sbjct: 64 KFVESINMN--TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEG--MTRISTTDSHET-VS 118
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
R G T + DW + G + V++QQ CG CWAFS V E M + NG L
Sbjct: 119 FRYENVGETGES-----MDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVS 173
Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
LS Q+++DC+ N GC GG D++ N+ + E YP C+ +
Sbjct: 174 LSEQQLLDCSTENN-GCGGGIMWKAFDYIKENQGIT-TEDNYPYQGAQQTCESNHLAA-- 229
Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLANINH 302
I Y +T+ ++ L + PV A+ +++ Y GG+ C L H
Sbjct: 230 ATISGY--ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLT---H 284
Query: 303 AVQIVGY 309
AV IVGY
Sbjct: 285 AVTIVGY 291
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)
Query: 34 LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
+ ++ ++ ++ K+ S++ E D RF+ F+ +L ++E N+ S R G+T F+DL
Sbjct: 47 MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ +E+++++L + K ++ S+ + +P DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205
Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ + ++ + +YP D C + + V I SY D SE S+ +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H P+ A+ A +Q Y G+ +C ++H V VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCG---TQLDHGVVAVGY 304
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)
Query: 34 LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
+ ++ ++ ++ K+ S++ E D RF+ F+ +L ++E N+ S R G+T F+DL
Sbjct: 47 MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ +E+++++L + K ++ S+ + +P DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205
Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ + ++ + +YP D C + + V I SY D SE S+ +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H P+ A+ A +Q Y G+ +C L +H V VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCGTQL---DHGVVAVGY 304
>gi|440797325|gb|ELR18416.1| cathepsin Llike cysteine protease [Acanthamoeba castellanii str.
Neff]
Length = 345
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 134/291 (46%), Gaps = 36/291 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
+ SF+ +Y KSY + E R F +++ I + + I EF+DL+ +EF
Sbjct: 27 WESFKAKYGKSYPTPHEEAHRRAVFHRNVAFIAAHHDPLYT-----VAINEFADLTFDEF 81
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
TR + S H R +P + DWRE G++ +V+N
Sbjct: 82 STRKMGLLPPPLPSSSSSSPGAAHLLEAATR-----------LPTQVDWREKGVVTRVKN 130
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMD 214
Q CG+CWAFS E AL+ G L LS + +IDC+ G+MGC GG ++
Sbjct: 131 QLDCGSCWAFSAAGAIEGQQALRTGRLVDLSEENLIDCSWAQGDMGCGGGLPSQAFQYVI 190
Query: 215 VNKVVLEPESEYPL---LLKDAACKR-------KATSPNGVKIKSYTCDTLIP--SESSI 262
NK + + E+ YPL + D ++ G + SYT +P SE+++
Sbjct: 191 DNKGI-DTEARYPLASVWISDCTAPELCPCTYNRSAGAVGAVVASYTS---LPAGSEAAL 246
Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQIVGYDN 311
+AT GP+ ++A Q+Y GGV +C + ++NHAV VGY +
Sbjct: 247 AHALATVGPISVCIDAEQGLQFYSGGVFSSRSCGSARTDLNHAVLAVGYGS 297
>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 363
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 131/305 (42%), Gaps = 39/305 (12%)
Query: 25 VSKPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNR------ 74
KP + EL +S +Q +Y K Y S E + RF F + + I + +
Sbjct: 28 AGKPAADDDSELRQRWSKWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSAV 87
Query: 75 -------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
Q+ + R G+ F DL E + + VL + HH+
Sbjct: 88 VGSFGAPQTVTTVRVGMNRFGDLQPREVLDQFTGFNNTAAVLKTPPPTRLPHHSRK---- 143
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
P DWR +G + V+ Q +C +CWAF+ V E M+ ++ GTL LS
Sbjct: 144 -----------PCCVDWRSSGAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSE 192
Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVK 246
Q+++DC NG+ GC+GG LD + + E Y + CK K +G
Sbjct: 193 QQLVDCD-NGSSGCAGGRTDTALDLVARRGGITSGE-RYAYGGFNGRCKVDKLLFDHGAA 250
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAV 304
+ + + P++ L PV A V+A TW Q+Y GG+ + C G A +NHAV
Sbjct: 251 VGGF--KAVPPNDEHQLAMAVARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAV 308
Query: 305 QIVGY 309
IVGY
Sbjct: 309 TIVGY 313
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 153/318 (48%), Gaps = 32/318 (10%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
+ + F V L+A+ +PV + + EQ L+ F++F+Q+Y +SY +E RF+ F+
Sbjct: 7 TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E + + A +G+T FSD+S EEF+ ++H +++
Sbjct: 67 QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110
Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+K+ R + + + TG P DWR+ G + V++Q C + WAF+ + E +
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
L+ LS Q ++ C N ++GC G W+ N + E YP
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTC 226
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
K+ G I + ++ +E++I +A GPV AV+A ++Q Y GGV+ +C
Sbjct: 227 NKSGKVVGANIDDHV--HILDNENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCISK 283
Query: 297 LANINHAVQIVGYDNYSR 314
+N A +VGYD+ S+
Sbjct: 284 --EVNSAALLVGYDDTSK 299
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 141/322 (43%), Gaps = 45/322 (13%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
+ L+ FLA V E + R+ K Y E + RF+ F ++++ +E
Sbjct: 108 LAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVE 167
Query: 69 ELNKNRQSPESARYGITEFSDLSEEEF---KTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
N P + GI +F DL+ +EF + R H + + + K+ +
Sbjct: 168 AFNNAANKP--YKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIRTTTFKYEN-------- 217
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
+TT +P DWR+ G + V++Q CG CWAFS V E +HAL G L L
Sbjct: 218 --VTT-------VPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISL 268
Query: 186 SVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKA 239
S QE++DC G + GC GG L+D D K + L E+ YP D C
Sbjct: 269 SEQELVDCDTKGVDQGCEGG----LMD--DAYKFIIQNHGLNTEANYPYKGVDGKCNANE 322
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSL 297
+ + I Y D +E ++ +A PV A++A + +Q+Y G +C L
Sbjct: 323 AANHAATITGYE-DVPANNEKALQKAVANQ-PVSVAIDASSSDFQFYKSGAFTGSCGTEL 380
Query: 298 ANINHAVQIVGY---DNYSRTW 316
+H V VGY D+ ++ W
Sbjct: 381 ---DHGVTAVGYGVSDHGTKYW 399
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 138/290 (47%), Gaps = 41/290 (14%)
Query: 32 QKLELFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
Q +LF +F YK Y E RF+ F++++ I ELN + + + Y +T F+D
Sbjct: 226 QAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERG--TGVYAVTRFTD 283
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT--GIPVKKDWREA 147
L+ EEFK+++L + N N + R IP +P DWR
Sbjct: 284 LTYEEFKSKYLGLNPNLK-----------KPNQIPMRQAE----IPKVHQLPASFDWRPL 328
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + +V++Q CG+CWAFS E LK G L LS QE++DC + GC GG
Sbjct: 329 GAVTEVKDQGACGSCWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMDD-GCDGG--- 384
Query: 208 ALLDWMD-VNKVV-----LEPESEYPLLLKDAACK-RKATSPNGVKIKSYTCDTLIPSES 260
+MD + + LE E EYP +D C K+ S K++ + +E+
Sbjct: 385 ----YMDNAYRAIEQLGGLETEEEYPYEAEDDKCSFNKSLS----KVQISGAVNISSNET 436
Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
++ + +GP+ +NA Q+Y+GGV + + NI+H V IVGY
Sbjct: 437 NMAKWLVHNGPISIGINANAMQFYVGGVSHPWKALCNPKNIDHGVLIVGY 486
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 151/310 (48%), Gaps = 41/310 (13%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
+LF + I+LC +K L +L+ F++F++++ K+Y + E R F +L
Sbjct: 2 ILFALIFISLC-------TAKDTLSVELQ-FAAFEKKFGKTYVGEEERRFRMSVFSNNLK 53
Query: 66 IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
I++ N ++QS S GIT F DLS +EF+ R ++ + K + +
Sbjct: 54 IVDYYN-SKQS--SFVLGITPFIDLSNDEFRERFASNT-------AFEKKAKSVESSSSQ 103
Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
++ ++P I DWR + V++Q+ CGACWAF+ V + E ++A K G +
Sbjct: 104 QTSQDYSSLPRSI----DWRAKNTVSSVKDQKNCGACWAFAAVASIEGVYAQKTGKILDF 159
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK--ATSPN 243
S Q+++DC ++GCSGG +++ N + L ES+YP +CK+ TS
Sbjct: 160 SPQQLVDC-DYSSLGCSGGLMTYAYEYVMNNGISL--ESDYPYKASQGSCKKVDFVTSIM 216
Query: 244 GVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
G +P S+ L T PV A+ A + +Q Y G++ G+ +
Sbjct: 217 GYY--------EVPVGSTYELLKATTKNPVSVAIGADSIFFQLYTSGILAEELCGT--TL 266
Query: 301 NHAVQIVGYD 310
NH V +VGY+
Sbjct: 267 NHGVLLVGYE 276
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 137/285 (48%), Gaps = 28/285 (9%)
Query: 27 KPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGIT 85
+PN + ++ F + Y + Y ++ +R F+ F+ ++ IE N ++ S GI
Sbjct: 1 EPN-DPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNS--RNGNSYTLGIN 57
Query: 86 EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
+F+D+++ EF ++ S+ ++ D + I + +P DWR
Sbjct: 58 QFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD-------------VNI-SAVPQSIDWR 103
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
+ G + +V+NQ CG+CWAF+ + T E ++ +K G L LS QEV+DCA + GC GG
Sbjct: 104 DYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCA--VSYGCKGGW 161
Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
D++ N V E YP C + PN I Y+ E S++
Sbjct: 162 VNKAYDFIISNNGVTT-EENYPYQAYQGTCNANSF-PNSAYITGYSY-VRRNDERSMMYA 218
Query: 266 IATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
++ P+ A ++A +QYY GGV C SL NHA+ I+GY
Sbjct: 219 VSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---NHAITIIGY 259
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 128/277 (46%), Gaps = 26/277 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL EE+ + R GI FSD+S EEF
Sbjct: 61 FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLPYRLGINRFSDMSWEEF 117
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L + ++ NH+ + + +P KDWRE GI+ V+N
Sbjct: 118 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKN 161
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N + + E YP + C KA + V++ + + + +E + + PV
Sbjct: 222 YNGGI-DTEESYPYKGVNGVCHYKAENA-AVQVLD-SVNITLNAEDELKNAVGLVRPVSV 278
Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + QY G +C + ++NHAV VGY
Sbjct: 279 AFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 315
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 139/286 (48%), Gaps = 38/286 (13%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F++R+ KSY ++ EHD RFK F+ ++ E +++ SA +G+T+FSDL+ EF
Sbjct: 59 FSLFKRRFGKSYATEEEHDRRFKIFKANMRRAE---RHQSFDPSAIHGVTQFSDLTPFEF 115
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT--IPT-GIPVKKDWREAGIIGK 152
+ L + H L + + T +PT +P+ DWR+ G + +
Sbjct: 116 RKAFL--GLRGHRL---------------RLPVDTNAAPILPTENLPIDFDWRQHGGVTR 158
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
V+NQ +CG+CW+FST E + L G L LS Q+++DC + GC+GG
Sbjct: 159 VKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGG 218
Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
+ ++ + L E +YP D S I +++ I E I
Sbjct: 219 LMNSAFEYT-LKAGGLMKEQDYPYAGIDRNTCNFDKSKIAASIANFSVVNSI-DEDQIAA 276
Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ +GP+ A+NA+ Q Y+GGV + C L +H V +VGY
Sbjct: 277 NLVKNGPLAIAINAVFMQTYIGGVSCPFICSKRL---DHGVLLVGY 319
>gi|1222695|gb|AAA92019.1| CP4 [Dictyostelium discoideum]
Length = 442
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + +K Q F+++ Q ++++YS E + R++ F+ ++D + +
Sbjct: 4 LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ +E++T +L + L+ + +K T
Sbjct: 64 NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
T+ DWR G + ++NQ CG CW+FST + E H + +GT L LS
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
Q +IDC+ + GN GC GG +++ +N ++ ES YP +D CK K TS G
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLGFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
+I SY + SE+S L + + PV A++A ++Q Y G I Y + ++H
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACTPTQLDHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 141/304 (46%), Gaps = 33/304 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + V + P L+Q L++ ++ ++ YKK Y+ +E D R +EK++ +IE N
Sbjct: 7 LAALC---LGVASAAPKLDQSLDVQWNQWRSTYKKPYAVNEEDWRRAVWEKNVKMIERHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
+ Q + F D++ EEF+ +M+ ++ H + +
Sbjct: 64 QEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------VMNGFQNQKHKKGKLFYEPVFG 112
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
I P DW + G + V+NQ CG+CWAFS E K G L LS Q +
Sbjct: 113 HI------PTSVDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIK 248
+DC+ GN GC+GG ++ N L+ E YP L D C K
Sbjct: 167 VDCSRREGNEGCNGGLMDNAFQYVQDNG-GLDSEESYPYLATDTHTCNYKPE----CSAA 221
Query: 249 SYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
+ T IP E +++ +AT GP+ A++A ++Q+Y G I Y S +++H V
Sbjct: 222 NDTGFVDIPQREKALMKAVATVGPISVAIDAGHESFQFYKSG-IYYEPGCSSKDLDHGVL 280
Query: 306 IVGY 309
+VGY
Sbjct: 281 LVGY 284
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E +L+ ++ + S S E RF F+ ++ + NK + + + +F+D+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EF++ + VN H + +H + K S+ P DWR+ G +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V++Q CG+CWAFST+ E ++ +K L LS QE++DC N GC+GG +
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ K + ES YP ++ C + V I + + + E+++L +A
Sbjct: 201 EFIK-QKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A +Q+Y GV +C+ ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 137/284 (48%), Gaps = 25/284 (8%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
NL++ + F SF + Y K+Y+ E + R+ F+ +L I N N +A Y I +F
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
SDLS+ E + S+ + V N K + P P+ DWRE
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-----------SNFCKTIILNQP---PDKGPLHFDWREQ 153
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF- 206
+ ++NQ CGACWAF+T+ + ES A+++ L LS Q++IDC + +MGC+GG
Sbjct: 154 NKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC-DSVDMGCNGGLLH 212
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
A + M + V + E +YP + ++ C P V + C + L D+
Sbjct: 213 TAFEEIMRMGGV--QTELDYPFVGRNRRCGLDRHRPYVVSLVG--CYRYVMVNEEKLKDL 268
Query: 267 ATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GP+ A++A Y GVI +C+ + +NHAV +VGY
Sbjct: 269 LRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GLNHAVLLVGY 309
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 135/275 (49%), Gaps = 36/275 (13%)
Query: 52 EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNK---HV 108
E ++R+K F++++ IE N +S + G+ +F+DL+EEEFK ++NK ++
Sbjct: 55 EKELRYKIFQQNVKGIEGFN--NAGNKSHKLGVNQFADLTEEEFK------AINKLKGYM 106
Query: 109 LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ-TCGACWAFST 167
+ + HV K +P DWR+ G + +++Q CG+CWAF+
Sbjct: 107 WSKISRTSTFKYEHVTK------------VPATLDWRQKGAVTPIKSQGLKCGSCWAFAA 154
Query: 168 VETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEY 226
V E + L G L LS QE+IDC NG N GC G ++ NK L E+ Y
Sbjct: 155 VAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNK-GLATEASY 213
Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYY 284
P D C K S + IK Y D +E+++L +A PV V++ +++Y
Sbjct: 214 PYQAVDGTCNAKVESKHVASIKGYE-DVPANNETALLNAVANQ-PVSVLVDSSDYDFRFY 271
Query: 285 LGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
GV+ +C + +HAV +VGY D+ ++ W
Sbjct: 272 SSGVLSGSCGTTF---DHAVTVVGYGVSDDGTKYW 303
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F +RY K Y S E +RF F K+LD+I N S R G+ +F+D S EEF
Sbjct: 62 FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS---YRLGLNKFADWSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N +HK +T + +P KDWRE+GI+ V+
Sbjct: 119 QRHRLGAAQNCSATTKGNHK-------------LTADV-----LPETKDWRESGIVSPVK 160
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q+++DCA N GC+GG +++
Sbjct: 161 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 220
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK + + GV++ + + + +E + + PV
Sbjct: 221 KYNG-GLDTEEAYPYTGKDGVCKFSSENV-GVQVLD-SVNITLGAEDELQHAVGLVRPVS 277
Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + +++Y GV C + ++NHAV VGY
Sbjct: 278 VAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 315
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/248 (31%), Positives = 123/248 (49%), Gaps = 22/248 (8%)
Query: 75 QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
Q +S R G+T+F+D+ EE+K L+S + + +K S +
Sbjct: 26 QGIKSYRLGMTQFADMDNEEYKR-----------LISLGCLGAFNASAPRKGSAFFRLAE 74
Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
T +P DWR+ G + V++Q+ CG+CWAFS + E + K G L LS Q+++DC+
Sbjct: 75 GTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLVSLSEQQLVDCS 134
Query: 195 GN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
G+ GNMGC GG + ++ N + + E YP +D C+ K + G K Y D
Sbjct: 135 GDYGNMGCGGGLMDSAFKYIQENGGI-DTEESYPYEAEDGKCRFKPQNI-GAKCTGYV-D 191
Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY- 309
E ++ +AT GPV A++A ++Q Y GV + C S +++H V VGY
Sbjct: 192 VTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELEC--SSEDLDHGVLAVGYG 249
Query: 310 -DNYSRTW 316
DN W
Sbjct: 250 TDNGQDYW 257
>gi|118365710|ref|XP_001016075.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297842|gb|EAR95830.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 335
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 154/312 (49%), Gaps = 30/312 (9%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
+L I+ L+ LC LA + V +KL ++ + ++++ Y +EH+ F+ F ++L
Sbjct: 6 LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
++E N N + S G+ FSD++++EF + L + + H + S + H+ ++
Sbjct: 58 QKVKEHNSNPNNTYSI--GLNLFSDMTKQEFAEKILMKQDLVDHYMKSISQKETHNDVNI 115
Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
+ + + +T+ T I DWR G + V+ Q CG+CW+F+ ES + ++N L
Sbjct: 116 ETQLNSKNLTLATSI----DWRTQGAVTSVKYQGNCGSCWSFAGAALMESFNFIQNKVLV 171
Query: 184 LLSVQEVIDC--AGNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
S Q+++DC + NG + GC+GG LD+ +KV + YP + C
Sbjct: 172 DFSEQQLVDCVISANGYQSEGCNGGFSFETLDY--ASKVGITTLDNYPYVEVQKKCNMTG 229
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLA 298
T+ NG K K + +PS S+ L PV VNA W Y G+ YN D S
Sbjct: 230 TN-NGFKPKQW---IQVPSTSNDLKHALNFSPVSVYVNAYNWVSYQSGI--YNGSDQSNI 283
Query: 299 NINHAVQIVGYD 310
NH V VGYD
Sbjct: 284 VFNHEVLAVGYD 295
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 154/310 (49%), Gaps = 34/310 (10%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
F + L +LC + + + P ++ L+ + ++ ++ KSY+ +E R +EK+L +I
Sbjct: 3 FYLCLASLC---LGLAAAIPPFDRALDSQWHQWKAQHGKSYAANEDSWRRATWEKNLKMI 59
Query: 68 EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
E N+ + + S + + +F D+S EEFK +M+ +K N +KR
Sbjct: 60 ERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQ-----------VMNGYKS-----NGSQKR 103
Query: 127 SITTGI--TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
+ + ++ +P DWRE G + V+ Q+ C +CWAFS E K G L
Sbjct: 104 TKGSLYRESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVS 163
Query: 185 LSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
LSVQ ++DC+ GN GC GG ++ N + + E YP + +D CK + +
Sbjct: 164 LSVQNLVDCSIPEGNNGCDGGLMGNAFQYVQDNGGI-DTEECYPYVAQDNECKYQPEC-S 221
Query: 244 GVKIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
G + + IPS E +++ +A GP+ A++A ++++Y GV Y+ S +
Sbjct: 222 GANVTGF---VKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQSGVY-YDPQCSSSQ 277
Query: 300 INHAVQIVGY 309
+NH V +VGY
Sbjct: 278 LNHGVLVVGY 287
>gi|405958752|gb|EKC24846.1| Cathepsin L1 [Crassostrea gigas]
Length = 290
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/261 (32%), Positives = 130/261 (49%), Gaps = 27/261 (10%)
Query: 55 IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH 113
IR +E +LD I + N Q S G+ EF+DLS EEF H+
Sbjct: 4 IRRGIWEANLDYINQHNDEFQRGAHSYTLGLNEFADLSHEEFL----------HLYGGGI 53
Query: 114 KHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
+ D V T + +G+P++ DWR+ G +G + NQ CG+CWAF+ E
Sbjct: 54 RPRDS----VSSDPDTDIVVDTSGLPLEVDWRKEGWVGPIGNQFACGSCWAFTATGALEG 109
Query: 174 MHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLK 231
K G L +LSVQ+++DC+ GN GC GG A ++ DV + E + YP
Sbjct: 110 QVRNKTGKLIVLSVQQMMDCSEKWGNHGCEGGLMDAAFKYIHDVGGI--ESNASYPYKPA 167
Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
+ CK ++ K+K Y L SE S++ +AT GP+ AA++A ++Q Y GV
Sbjct: 168 EEKCKFNKSAVV-AKVKGYK--DLPKSEESLMVAVATVGPISAALDASHSSFQLYKSGVY 224
Query: 290 -QYNCDGSLANINHAVQIVGY 309
+ NC S ++H++ +VGY
Sbjct: 225 DEPNC--SSGQVDHSLVVVGY 243
>gi|374414520|pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From
Tenebrio Molitor Larval Midgut
gi|374414521|pdb|3QJ3|B Chain B, Structure Of Digestive Procathepsin L2 Proteinase From
Tenebrio Molitor Larval Midgut
Length = 331
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 132/284 (46%), Gaps = 23/284 (8%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
E + +F+ Y +SY + E R + F+K L+ EE N K RQ S G+ F+D++
Sbjct: 20 EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAGIIG 151
EE K H L+ D H N + K G+ P DWR+ G++
Sbjct: 80 EEMKAY-------THGLI---MPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVS 129
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTL--SLLSVQEVIDCAGNGNMGCSGGDFCAL 209
V+NQ +CG+ WAFS+ ES + NG S +S Q+++DC N +GCSGG
Sbjct: 130 PVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA-LGCSGGWMNDA 188
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
++ N + + E YP + D C PN V + L + ++L D +AT
Sbjct: 189 FTYVAQNGGI-DSEGAYPYEMADGNCHY---DPNQVAARLSGYVYLSGPDENMLADMVAT 244
Query: 269 HGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
GPV A +A + Y GGV YN HAV IVGY N
Sbjct: 245 KGPVAVAFDADDPFGSYSGGVY-YNPTCETNKFTHAVLIVGYGN 287
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 132/280 (47%), Gaps = 22/280 (7%)
Query: 37 FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
F+ ++ ++ KSY E + K + + I+ N+ Q S R G+ +FSD+ EE
Sbjct: 22 FNEWKAKFGKSYPSLEKEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEE 81
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F+ + VL ++ R++ G+ DWR +G + ++
Sbjct: 82 FR---------QTVLTKMDPPKNNRGASEPFRALNVGLAASV------DWRTSGCVSPIK 126
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
NQ CG+CW+FS ES L+ G L LS Q+++DC+G+ GN GC+GG ++
Sbjct: 127 NQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYI 186
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N + + ES YP + C ++ + Y T + SES++ +A GP+
Sbjct: 187 QANGGI-DSESYYPYQARVGTCHYN-SAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244
Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313
A++A WQ Y GV +N +HAV +VGY Y+
Sbjct: 245 IAIDASGWQSYQSGV--FNDPSCSQTADHAVLLVGYGTYN 282
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 130/277 (46%), Gaps = 27/277 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y S E RF+ F +L +I NK S + G+ EF+DL+ +EF
Sbjct: 61 FARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLS---YKLGVNEFTDLTWDEF 117
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L + N ++K ++ +P KDWREAGI+ V+N
Sbjct: 118 RRDRLGAAQNCSATT---------KGNLKVTNVV--------LPETKDWREAGIVSPVKN 160
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ ++ G LS Q+++DCAG N GC+GG +++
Sbjct: 161 QGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 220
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N L+ E YP K+ CK + + GVK+ + + + +E + +A PV
Sbjct: 221 SNG-GLDTEEAYPYTGKNGLCKFSSENV-GVKVID-SVNITLGAEDELKYAVALVRPVSI 277
Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + QY G C + ++NHAV VGY
Sbjct: 278 AFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGY 314
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,913,491,364
Number of Sequences: 23463169
Number of extensions: 196878336
Number of successful extensions: 791882
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4656
Number of HSP's successfully gapped in prelim test: 1818
Number of HSP's that attempted gapping in prelim test: 777070
Number of HSP's gapped (non-prelim): 8164
length of query: 317
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 175
effective length of database: 9,027,425,369
effective search space: 1579799439575
effective search space used: 1579799439575
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)