BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017318
(373 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
Length = 373
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 308/366 (84%), Positives = 335/366 (91%), Gaps = 9/366 (2%)
Query: 8 LFLVSLVVF-SAVSSGTLIDD-VDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
F++S ++F SAV++ TL D D LIRQVTDG DE S N +LLGAEHHFSLFKK
Sbjct: 9 FFVISSILFVSAVTAETLTTDGEDPLIRQVTDGQDE-----SSANPNLLGAEHHFSLFKK 63
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF K YASQEEHD+RF IFK+NLRRA RHQKLDP+ATHG+TQFSDLT +EFRR +LGLRR
Sbjct: 64 KFKKTYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLGLRR 123
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
LRLPKDA++AP+LPTNDLPADFDWREKGAV VK+QGSCGSCWSFSTTGALEGAN+LAT
Sbjct: 124 -LRLPKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLAT 182
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQLVDCDHECDP E G+CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD
Sbjct: 183 GKLVSLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 242
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
RG AC+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGGVSCPYI
Sbjct: 243 RG-ACQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYI 301
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
CS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGE+WGE+GYYKICRGRN+CGVDSM
Sbjct: 302 CSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICGVDSM 361
Query: 366 VSTVAA 371
VSTVAA
Sbjct: 362 VSTVAA 367
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 297/354 (83%), Positives = 324/354 (91%), Gaps = 9/354 (2%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
A+S+ T D D LIRQV +G DE S++N L +HHFSLFK+KF K+Y SQEEH
Sbjct: 18 AISAETFNGD-DSLIRQVVEGQDE------SSSNLLTAEQHHFSLFKRKFKKSYLSQEEH 70
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP 137
D+RF++FK+NLRRAARHQKLDP+A+HG+TQFSDLT AEFR+ LGLR KLRLPKDA+ AP
Sbjct: 71 DYRFSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLR-KLRLPKDANTAP 129
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPTNDLP DFDWREKGAVGPVK+QGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLV
Sbjct: 130 ILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLV 189
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG DRG ACKFDK+K+
Sbjct: 190 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRG-ACKFDKNKV 248
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLV 317
AA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGGVSCPYICSRRLDHGVLLV
Sbjct: 249 AAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLV 308
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
GYGSA YAP+R+KEKPYWIIKNSWGESWGENG+YKICRGRN+CGVDSMVSTVAA
Sbjct: 309 GYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAA 362
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 296/354 (83%), Positives = 323/354 (91%), Gaps = 9/354 (2%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
A+S+ T D D LIRQV +G DE S++N L +HHFSLFK+KF K+Y SQEEH
Sbjct: 18 AISAETFNGD-DSLIRQVVEGQDE------SSSNLLTAEQHHFSLFKRKFKKSYLSQEEH 70
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP 137
D+RF++FK+NLRRAARHQKLDP+A+HG+TQFSDLT AEFR+ LGLR KLRLPKDA+ AP
Sbjct: 71 DYRFSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLR-KLRLPKDANTAP 129
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPTNDLP DFDWREKGAVGPVK+QGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLV
Sbjct: 130 ILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLV 189
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG DRG ACKFDK+K+
Sbjct: 190 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRG-ACKFDKNKV 248
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLV 317
AA VANFS VSLDEDQIAANLVKNGPLAVAINAV+MQTYIGGVSCPYICSRRLDHGVLLV
Sbjct: 249 AAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLV 308
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
GYGSA YAP+R+KEKPYWIIKNSWGESWGENG+YKICRGRN+CGVDSMVSTVAA
Sbjct: 309 GYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAA 362
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 296/354 (83%), Positives = 322/354 (90%), Gaps = 9/354 (2%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
A+S+ T D D LIRQV +G DE S+ N L +HH SLFK+KF K+Y SQEEH
Sbjct: 24 AISAETFNGD-DSLIRQVVEGQDE------SSPNLLTAEQHHLSLFKRKFKKSYLSQEEH 76
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP 137
D+RF++FK+NLRRAARHQKLDP+A+HG+TQFSDLT AEFR+ LGLR KLRLPKDA++AP
Sbjct: 77 DYRFSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLR-KLRLPKDANKAP 135
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPTNDLP DFDWREKGAVGPVK+QGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLV
Sbjct: 136 ILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLV 195
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG DRG ACKFDK K+
Sbjct: 196 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRG-ACKFDKDKV 254
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLV 317
AA VANFSVVSLDEDQIAANLVKNGPLAVA NAV+MQTYIGGVSCPYICSRRLDHGVLLV
Sbjct: 255 AAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQTYIGGVSCPYICSRRLDHGVLLV 314
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
GYGSAGYAP+R+KEKPYWIIKNSWGESWGENG+YKICRGRN+CGVDSMVSTVAA
Sbjct: 315 GYGSAGYAPVRMKEKPYWIIKNSWGESWGENGFYKICRGRNICGVDSMVSTVAA 368
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 616 bits (1588), Expect = e-174, Method: Compositional matrix adjust.
Identities = 294/356 (82%), Positives = 324/356 (91%), Gaps = 9/356 (2%)
Query: 16 FSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQE 75
SAV + TL D D LIR+V DG D S++N L +HHFSLFK KF K+Y SQE
Sbjct: 16 ISAVHAETLNGD-DPLIREVVDGQD------ASSSNLLSAEQHHFSLFKSKFKKSYGSQE 68
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ 135
EHD+RF++FKANLRRAARHQ+LDP+A+HG+TQFSDLTPAEFR+ LGLRR LRLPKDA++
Sbjct: 69 EHDYRFSVFKANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGLRR-LRLPKDANE 127
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
APILPT+DLP DFDWR+KGAVGP+K+QGSCGSCWSFS TGALEGA+FLATG+LVSLSEQQ
Sbjct: 128 APILPTSDLPEDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQ 187
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKS 255
LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR ACKFDK+
Sbjct: 188 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR-DACKFDKN 246
Query: 256 KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVL 315
K+AA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGGVSCPYICSRRLDHGVL
Sbjct: 247 KVAARVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVL 306
Query: 316 LVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LVGYGSAGY+P+R+KEKP+WIIKNSWGE WGENG+YKICRGRNVCGVDSMVSTVAA
Sbjct: 307 LVGYGSAGYSPVRMKEKPFWIIKNSWGEKWGENGFYKICRGRNVCGVDSMVSTVAA 362
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 292/364 (80%), Positives = 320/364 (87%), Gaps = 11/364 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
F+V ++ S+ +VD LI QVTDG HE LL AEHH+SLFKK+F
Sbjct: 15 FFIVGVICTETFSAEGF--EVDPLIEQVTDG-------HEGAEPQLLTAEHHYSLFKKRF 65
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K+Y SQ+EHD+RF IF+ NLRRAARHQ LDPSATHG+TQFSDLTP EFR+ YLGLRR L
Sbjct: 66 KKSYGSQKEHDYRFKIFQVNLRRAARHQNLDPSATHGVTQFSDLTPGEFRKAYLGLRR-L 124
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
RLPKDA +APILPT++LP DFDWREKGAV PVK+QGSCGSCWSFSTTGALEGANFLATGK
Sbjct: 125 RLPKDATEAPILPTDNLPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGK 184
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG
Sbjct: 185 LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 244
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS 307
CKFD +K+AA VANFSVVSLDEDQIAANL KNGPLAVAINAV+MQTYIGGVSCPYICS
Sbjct: 245 -TCKFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQTYIGGVSCPYICS 303
Query: 308 RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
+RLDHGVLLVGYGSAGYAP+R+K+KPYWIIKNSWGE+WGENG+Y+ICRGRN+CGVDSMVS
Sbjct: 304 KRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGENGFYRICRGRNICGVDSMVS 363
Query: 368 TVAA 371
TVAA
Sbjct: 364 TVAA 367
>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 287/372 (77%), Positives = 320/372 (86%), Gaps = 12/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M + +LFL +L+ +++ DD D LIRQV GD DLL A+HH
Sbjct: 1 MAFRFSLLFLCTLLATTSLVFAAEDDDGDDVLIRQVVGDGD----------GDLLNADHH 50
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F++FK++F KAYAS EEHD+R ++FKAN+RRA RHQ+LDP+A HG+TQFSDLTP EFRR
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRK 110
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+LGL R+L+ P DA APILPT++LP+DFDWR+ GAV PVK+QG+CGSCWSFSTTGALEG
Sbjct: 111 FLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEG 170
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLKAGGLMREEDY
Sbjct: 171 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 230
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG D C+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGG
Sbjct: 231 PYTGNDL-QVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGG 289
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGESWGENGYYKICRGRNV
Sbjct: 290 VSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNV 349
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVAA
Sbjct: 350 CGVDSMVSTVAA 361
>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
Length = 368
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 287/372 (77%), Positives = 320/372 (86%), Gaps = 12/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M + +LFL +L+ +++ DD D LIRQV GD DLL A+HH
Sbjct: 1 MAFRFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVVGDGD----------GDLLNADHH 50
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F++FK++F KAYAS EEHD+R ++FKAN+RRA RHQ+LDP+A HG+TQFSDLTP EFRR
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRK 110
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+LGL R+L+ P DA APILPT++LP+DFDWR+ GAV PVK+QG+CGSCWSFSTTGALEG
Sbjct: 111 FLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEG 170
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLKAGGLMREEDY
Sbjct: 171 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 230
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG D C+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGG
Sbjct: 231 PYTGNDL-QVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGG 289
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGESWGENGYYKICRGRNV
Sbjct: 290 VSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNV 349
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVAA
Sbjct: 350 CGVDSMVSTVAA 361
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 291/370 (78%), Positives = 327/370 (88%), Gaps = 10/370 (2%)
Query: 7 VLFLVSLVVFSAVSSGTLI--DDVDQLIRQVTDGGDEILSHHESTNND--LLGAEHHFSL 62
++ ++SL+ SA+ S + D D +IRQV D G +E +N D LLGA+HHFS+
Sbjct: 7 LIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDG----GVNEGSNGDDLLLGADHHFSV 62
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
FK+KF K+YAS+EEHDHRF +FKANL+RA RHQ LDPSATHG+TQFSDLTP+EFRR++LG
Sbjct: 63 FKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLG 122
Query: 123 LR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
LR R+L LP DA++APILPT+ LP DFDWR+KGAV VK+QGSCGSCWSFS TGALEGAN
Sbjct: 123 LRSRRLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEGAN 182
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK+GGLM+E+DYPY
Sbjct: 183 FLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDYPY 242
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
TGTDRG CKFDKSKIAASVANFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTYI GVS
Sbjct: 243 TGTDRG-TCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIKGVS 301
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
CPYICS+ LDHGVLLVGYGS GYAPIRLK+KPYWIIKNSWG +WGENGYYKICRGRN+CG
Sbjct: 302 CPYICSKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENGYYKICRGRNICG 361
Query: 362 VDSMVSTVAA 371
VDSMVSTVAA
Sbjct: 362 VDSMVSTVAA 371
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 599 bits (1545), Expect = e-169, Method: Compositional matrix adjust.
Identities = 281/346 (81%), Positives = 308/346 (89%), Gaps = 11/346 (3%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D LIRQV GD DLL A+HHF++FK++F KAYAS EEHD+R ++FK
Sbjct: 25 DGDDILIRQVVGDGD----------GDLLNADHHFAVFKRRFGKAYASDEEHDYRLSVFK 74
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
AN+RRA RHQ+LDP+A HG+TQFSDLTP EFRR +LGL R+L+ P DA APILPT++LP
Sbjct: 75 ANMRRAKRHQQLDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELP 134
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
+DFDWR++GAV PVK+QG+CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP
Sbjct: 135 SDFDWRDRGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 194
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG D C+FDK+KIAA VANFS
Sbjct: 195 EEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDL-QVCRFDKTKIAAKVANFS 253
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
VVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGGVSCPYICS+RLDHGVLLVGYGSAGYA
Sbjct: 254 VVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVGYGSAGYA 313
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
PIR+KEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 314 PIRMKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 359
>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 368
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 284/372 (76%), Positives = 318/372 (85%), Gaps = 12/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M + +LFL +L+ +++ DD D LIRQV GD DLL A+HH
Sbjct: 1 MAFRFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVVGDGD----------GDLLNADHH 50
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F++FK++F KAYAS EEHD+R ++FKAN+RRA RHQ+LDP+A HG+TQFSD TP EFRR
Sbjct: 51 FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDSTPTEFRRK 110
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+LGL R+L+ P DA APILPT++LP+DFDWR++GAV PVK+QG+CG CWSFSTTGALEG
Sbjct: 111 FLGLNRRLKFPADAKTAPILPTDELPSDFDWRDRGAVTPVKNQGTCGLCWSFSTTGALEG 170
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFLATGKLVSLSEQQLVDCDHECDPEE GSCD GCNGGLMNSAFEYTLKAGGLMREEDY
Sbjct: 171 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLKAGGLMREEDY 230
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG D C+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGG
Sbjct: 231 PYTGNDL-QVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGG 289
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGESWGENGYYKICRGRNV
Sbjct: 290 VSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNV 349
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVAA
Sbjct: 350 CGVDSMVSTVAA 361
>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
Length = 366
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 284/372 (76%), Positives = 318/372 (85%), Gaps = 14/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVT-DGGDEILSHHESTNNDLLGAEHH 59
M + +LFL +L+ + + D D LIRQV DGGD LL A+HH
Sbjct: 1 MAFRFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD------------LLNADHH 48
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F++FK++F K YAS EEHD+R ++FKAN+RRA +HQ+LDP+A HG+TQFSDLTP EFRR
Sbjct: 49 FTVFKRRFGKVYASDEEHDYRLSVFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRK 108
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+LGL R+L+ P DA APILPT++LP+DFDWR+ GAV PVK+QG+CGSCWSFSTTGALEG
Sbjct: 109 FLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEG 168
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLKAGGLMREEDY
Sbjct: 169 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 228
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG D C+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV++QTYIGG
Sbjct: 229 PYTGNDL-QVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFVQTYIGG 287
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGESWGENGYYKICRGRNV
Sbjct: 288 VSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNV 347
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVAA
Sbjct: 348 CGVDSMVSTVAA 359
>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
Length = 365
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 282/354 (79%), Positives = 314/354 (88%), Gaps = 14/354 (3%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
A +SG D D +I+Q+ DG + L A+HHF LFK++F K+YA+QE+H
Sbjct: 20 AAASGKSSDGEDLVIQQIVDG------------DHPLSADHHFRLFKRRFGKSYATQEDH 67
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP 137
D+RF++FK NLRRA HQ+LDPSA HG+TQFSDLTPAEFRR +LGL+R LR P DA++AP
Sbjct: 68 DYRFSVFKTNLRRARHHQRLDPSAVHGVTQFSDLTPAEFRRNHLGLKR-LRFPADANKAP 126
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPT DLPADFDWR+ GAV VK+QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV
Sbjct: 127 ILPTEDLPADFDWRDHGAVASVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 186
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEEPGSCDSGCNGGLMNSA EYTLKAGGLMREEDYPY+GTDRG CKFD++KI
Sbjct: 187 DCDHECDPEEPGSCDSGCNGGLMNSALEYTLKAGGLMREEDYPYSGTDRG-TCKFDETKI 245
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLV 317
AASVANFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTY+GGVSCPYICS+RLDHGVLLV
Sbjct: 246 AASVANFSVVSLDENQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLV 305
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
GYGSAGYAPIR+KEKPYWIIKNSWGESWGENG+YKIC+GRNVCGVDSMVSTVAA
Sbjct: 306 GYGSAGYAPIRMKEKPYWIIKNSWGESWGENGFYKICQGRNVCGVDSMVSTVAA 359
>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
Length = 377
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 285/355 (80%), Positives = 316/355 (89%), Gaps = 6/355 (1%)
Query: 17 SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEE 76
S + SG DD+ +IRQV ++ E N L HHFS+FK++F K+YASQEE
Sbjct: 23 SELHSGGSDDDI--IIRQVVPELGDVEGSEEE--NLLTADHHHFSIFKRRFGKSYASQEE 78
Query: 77 HDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQA 136
HD+RF +FKANLRRA RHQ+LDPSATHG+TQFSDLTPAEFR TYLGLR L+LP DA +A
Sbjct: 79 HDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGLR-PLKLPHDAQKA 137
Query: 137 PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQL 196
PILPTNDLP DFDWR+ GAV VK+QGSCGSCWSFSTTGALEGANFLATG LVSLSEQQL
Sbjct: 138 PILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQL 197
Query: 197 VDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSK 256
V+CDHECDPEE GSCDSGCNGGLMN+AFEYTLKAGGLM+EEDYPYTGTDRG +CKFDK+K
Sbjct: 198 VECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRG-SCKFDKTK 256
Query: 257 IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLL 316
IAASV+NFSV+SLDEDQIAANLVKNGPLAVAINAV+MQTY+GGVSCPYICS+RLDHGVLL
Sbjct: 257 IAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLL 316
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
VGYGSAGYAPIR+K+KPYWIIKNSWGE+WGENG+YKICRGRNVCGVDSMVSTVAA
Sbjct: 317 VGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAA 371
>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
Length = 377
Score = 591 bits (1524), Expect = e-166, Method: Compositional matrix adjust.
Identities = 286/361 (79%), Positives = 317/361 (87%), Gaps = 18/361 (4%)
Query: 17 SAVSSGTLIDDVDQLIRQVT------DGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKA 70
S + SG DD+ +IRQV +GG+E N L HHFS+FK++F K+
Sbjct: 23 SELHSGGSDDDI--IIRQVVPELGDVEGGEE--------ENLLTADHHHFSIFKRRFGKS 72
Query: 71 YASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLP 130
YASQEEHD+RF +FKANLRRA RHQ+LDPSATHG+TQFSDLTPAEFR TYLGLR L+LP
Sbjct: 73 YASQEEHDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGLR-PLKLP 131
Query: 131 KDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVS 190
DA +APILPTNDLP DFDWR+ GAV VK+QGSCGSCWSFSTTGALEGANFLATG LVS
Sbjct: 132 HDAQKAPILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVS 191
Query: 191 LSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHAC 250
LSEQQLV+CDHECDPEE GSCDSGCNGGLMN+AFEYTLKAGGLM+EEDYPYTGTDRG +C
Sbjct: 192 LSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRG-SC 250
Query: 251 KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRL 310
KFDK+KIAASV+NFSV+SLDEDQIAANLVK GPLAVAINAV+MQTY+GGVSCPYICS+RL
Sbjct: 251 KFDKTKIAASVSNFSVISLDEDQIAANLVKIGPLAVAINAVFMQTYVGGVSCPYICSKRL 310
Query: 311 DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 370
DHGVLLVGYGSAGYAPIR+K+KPYWIIKNSWGE+WGENG+YKICRGRNVCGVDSMVSTVA
Sbjct: 311 DHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVA 370
Query: 371 A 371
A
Sbjct: 371 A 371
>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
Length = 366
Score = 589 bits (1519), Expect = e-166, Method: Compositional matrix adjust.
Identities = 283/372 (76%), Positives = 316/372 (84%), Gaps = 14/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVT-DGGDEILSHHESTNNDLLGAEHH 59
M + +LFL +L+ + + D D LIRQV DGGD LL A+HH
Sbjct: 1 MAFRFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD------------LLNADHH 48
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F++FK++F K YAS EEHD+R + FKAN+RRA +HQ+LDP+A HG+TQFSDLTP EFRR
Sbjct: 49 FTVFKRRFGKVYASDEEHDYRLSEFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRK 108
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+LGL R+L+ P DA APILPT++LP+DFDWR+ GAV PVK+QG+CGSC SFSTTGALEG
Sbjct: 109 FLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCCSFSTTGALEG 168
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLKAGGLMREED+
Sbjct: 169 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDH 228
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG D C+FDK+KIAA VANFSVVSLDEDQIAANLVKNGPLAVAINAV+MQTYIGG
Sbjct: 229 PYTGNDL-QVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGG 287
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+RLDHGVLLVGYGSAGYAPIR+KEKPYWIIKNSWGESWGENGYYKICRGRNV
Sbjct: 288 VSCPYICSKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGENGYYKICRGRNV 347
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVAA
Sbjct: 348 CGVDSMVSTVAA 359
>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
Length = 373
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 279/369 (75%), Positives = 313/369 (84%), Gaps = 16/369 (4%)
Query: 10 LVSLVVFSAVSSGTLI-----DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFK 64
V L++F +VSSG + D D +IRQV DG + +L +E HFSLFK
Sbjct: 11 FVLLILFVSVSSGIVAETSSSDGDDLVIRQVVDGAEP----------KVLSSEDHFSLFK 60
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
+KF K YAS EEHD+R ++FKANLRRA RHQKLDPSA HG+TQFSDLT +EFR+ +LG+R
Sbjct: 61 RKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVR 120
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+LPKDA++APILPT +LP DFDWR++GAV PVK+QGSCGSCWSFS TGALEGANFLA
Sbjct: 121 GGFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLA 180
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLMREEDYPYTG
Sbjct: 181 TGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGK 240
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
D G CK DKSKI ASV+NFSV+S+DEDQIAANLVKNGPLAVAINA YMQTYIGGVSCPY
Sbjct: 241 D-GPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPY 299
Query: 305 ICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
IC+RRL+HGVLLVGYGSAGYAP R KEKPYWIIKNSWGESWGENG+YKIC+GRN+CGVDS
Sbjct: 300 ICARRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDS 359
Query: 365 MVSTVAAAV 373
+VSTV+A V
Sbjct: 360 LVSTVSATV 368
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 586 bits (1510), Expect = e-165, Method: Compositional matrix adjust.
Identities = 278/369 (75%), Positives = 316/369 (85%), Gaps = 12/369 (3%)
Query: 5 TVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFK 64
++ +F + +V SA S G DD+ +I+QV DGG E ++L +E HFSLFK
Sbjct: 7 SLSVFALLFIVVSASSDGNEGDDL--VIKQVVDGGAE---------PNVLSSEDHFSLFK 55
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
KKF K YAS+EEHD+RF++FK+NLRRA RHQKLDPSA HG+TQFSDLT +EF+R +LG++
Sbjct: 56 KKFGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSEFKRKHLGVK 115
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+LPKDA++APILPT +LP +FDWRE+GAV PVK+QGSCGSCWSFS TGALEGANFLA
Sbjct: 116 GGFKLPKDANKAPILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATGALEGANFLA 175
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLMREEDYPYTG
Sbjct: 176 TGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGK 235
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
D G CK DKSKI ASV+NFSV+S+DE+QIAANLVKNGPLAVAINA YMQTYIGGVSCPY
Sbjct: 236 D-GATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQTYIGGVSCPY 294
Query: 305 ICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
IC RRL+HGVLLVGYGSAGYAP R KEKPYWIIKNSWGE+WGE+G+YKICRGRNVCGVDS
Sbjct: 295 ICMRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGEDGFYKICRGRNVCGVDS 354
Query: 365 MVSTVAAAV 373
+VSTV A V
Sbjct: 355 LVSTVTATV 363
>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 280/372 (75%), Positives = 319/372 (85%), Gaps = 14/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M +T++LF+V L +FS + T + D +IRQV D LGAEHHF
Sbjct: 1 MDHRTLLLFVV-LFIFSVSAFSTPDEGEDPIIRQVVD-----------EEGVRLGAEHHF 48
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
+LFK KF K Y+S++EHD+RF IFK+NL RA RHQ +DPSA HG+T+FSDLTP EFR++
Sbjct: 49 NLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSV 108
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
LGLR + LPKDA+ APILPT++LP DFDWREKGAV VK+QGSCGSCWSFSTTGALEGA
Sbjct: 109 LGLR-GVGLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGA 167
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
+FL+TGKLVSLSEQQLVDCDHECDPE+PGSCD+GCNGGLMNSAFEY LK+GG+MREEDYP
Sbjct: 168 HFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMREEDYP 227
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV 300
Y+GTDRG +CKFDK KIAASVANFSVVSLDEDQIAANLVKNGPLA+A+NAVYMQTY+GGV
Sbjct: 228 YSGTDRG-SCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQTYVGGV 286
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
SCPYICS+RLDHGVLLVGYGS Y+PIRLKEKPYWIIKNSWGE+WGENGYYKICRGRN+C
Sbjct: 287 SCPYICSKRLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGENGYYKICRGRNIC 346
Query: 361 GVDSMVSTVAAA 372
GVDSMVSTVAA
Sbjct: 347 GVDSMVSTVAAV 358
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 279/372 (75%), Positives = 321/372 (86%), Gaps = 14/372 (3%)
Query: 1 MGSKTVVLFLVSL-VVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M + T+ L LV+ +VF+AVS+ + + + LI QV DGGD LGAEHH
Sbjct: 1 MNNPTLFLLLVAFSLVFAAVSASSDGGNEEPLIMQVVDGGDV-----------RLGAEHH 49
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK++F KAY S++EHD+R+ +FKAN+RRA RHQ LDPSA HG+T+FSDLTP+EFR
Sbjct: 50 FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVTRFSDLTPSEFRNK 109
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
LGLR +RLP DA++APILPT++LP+DFDWR+ GAV PVK+QGSCGSCWSFSTTGALEG
Sbjct: 110 VLGLR-GVRLPLDANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEG 168
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
A+FL+TG+LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY LK+GG+MREEDY
Sbjct: 169 AHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMREEDY 228
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PY+G D G CKFDK+KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA YMQTYIGG
Sbjct: 229 PYSGADSG-TCKFDKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAAYMQTYIGG 287
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPY+CSRRL+HGVLLVGYGS YAPIR+KEKP+WIIKNSWGE+WGENGYYKICRGRN+
Sbjct: 288 VSCPYVCSRRLNHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKICRGRNI 347
Query: 360 CGVDSMVSTVAA 371
CGVDSMVSTVA+
Sbjct: 348 CGVDSMVSTVAS 359
>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 580 bits (1496), Expect = e-163, Method: Compositional matrix adjust.
Identities = 277/367 (75%), Positives = 311/367 (84%), Gaps = 13/367 (3%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKK 66
V L L+V +VSS + D D +IRQV G + +L +E HFSLFK K
Sbjct: 10 VFVLFFLIV--SVSSSDVNDGDDLVIRQVVGGAEP----------QVLTSEDHFSLFKSK 57
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK 126
F K YAS EEHD+RF++FKANLRRA RHQKLDPSA HG+TQFSDLT +EFR+ +LG+R
Sbjct: 58 FGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVRAG 117
Query: 127 LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
+LPKDA++APILPT +LP DFDWR++GAV PVK+QGSCGSCWSFS TGALEGANFLATG
Sbjct: 118 FKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 177
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
KLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLM+EEDYPYTG D
Sbjct: 178 KLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD- 236
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
G CK DKSKI ASV+NFSV+S+DE+QIAANLVKNGPLAVAINA YMQTYIGGVSCPYIC
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC 296
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+RRL+HGVLLVGYGSAGYAP R KEKPYWIIKNSWGE+WGENG+YKIC+GRN+CGVDS+V
Sbjct: 297 TRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSLV 356
Query: 367 STVAAAV 373
STV AAV
Sbjct: 357 STVTAAV 363
>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
Length = 368
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 280/375 (74%), Positives = 320/375 (85%), Gaps = 16/375 (4%)
Query: 1 MGSKTVVLFLVSLVVFS----AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGA 56
M + ++F++S+++ + AV+ D D LIRQV GDE HH +L A
Sbjct: 1 MAHRFSLVFVLSILLTTSFLLAVNGEIKGGDDDILIRQVV--GDE--DHH------MLNA 50
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
EHHF+LFKK+F K YAS EEH +RF++FKANLRRA RHQKLDPSA HG+TQFSD+TP EF
Sbjct: 51 EHHFTLFKKRFGKTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEF 110
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+ +LG+ R+LR P DA++APILPT DLP+DFDWRE GAV PVK+QGSCGSCWSFSTTGA
Sbjct: 111 SQKFLGVNRRLRFPSDANKAPILPTEDLPSDFDWREHGAVTPVKNQGSCGSCWSFSTTGA 170
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEGANFLATGKLVSLSEQQLVDCDHECDPEE SCDSGC+GGLMNSAFEYTLKAGGLMRE
Sbjct: 171 LEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMRE 230
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
EDYPYTGTD+ CKFD +K+AA VANFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTY
Sbjct: 231 EDYPYTGTDKA-TCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTY 289
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
+GGVSCPYICS++LDHGVLLVGYG+ G++PIR+KEKPYWIIKNSWGE WGE+GYYKI RG
Sbjct: 290 VGGVSCPYICSKQLDHGVLLVGYGT-GFSPIRMKEKPYWIIKNSWGEKWGESGYYKIRRG 348
Query: 357 RNVCGVDSMVSTVAA 371
RNVCGVDSMVSTVAA
Sbjct: 349 RNVCGVDSMVSTVAA 363
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 579 bits (1492), Expect = e-163, Method: Compositional matrix adjust.
Identities = 278/366 (75%), Positives = 315/366 (86%), Gaps = 15/366 (4%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
++ FLV VF A S+ D + LI QV +G + LGAEHHF FK+
Sbjct: 7 IIFFLVIFSVFFAASADG--GDDEPLIMQVVEG-----------SGVRLGAEHHFLDFKR 53
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
+F KAYASQEEH++RF +FKAN+RRA RHQ LDPSA HG+T+FSDLT +EFR LGLR
Sbjct: 54 RFGKAYASQEEHNYRFEVFKANMRRARRHQSLDPSAAHGVTRFSDLTASEFRNKVLGLR- 112
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+RLP +A++APILPT++LP+DFDWR+ GAV PVK+QGSCGSCWSFSTTGALEGA+FL+T
Sbjct: 113 GVRLPSNANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLST 172
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK+GG+MREEDYPY+GTD
Sbjct: 173 GELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGTD 232
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
RG+ CKFDK+KIAASVANFSV+SLDEDQIAANLVKNGPLAVAINA YMQTYIGGVSCPYI
Sbjct: 233 RGN-CKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYI 291
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
CSRRLDHGVLLVGYGS YAPIR+KEKP+WIIKNSWGE+WGENGYYKICRGRN+CGVDSM
Sbjct: 292 CSRRLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKICRGRNICGVDSM 351
Query: 366 VSTVAA 371
VSTVAA
Sbjct: 352 VSTVAA 357
>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 576 bits (1485), Expect = e-162, Method: Compositional matrix adjust.
Identities = 279/368 (75%), Positives = 316/368 (85%), Gaps = 15/368 (4%)
Query: 7 VLFLVSLV-VFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
VLF VSL+ VF +VS + D D LIRQV D + +L +E HF+LFKK
Sbjct: 6 VLFSVSLLFVFVSVS---ICGDEDLLIRQVVDEAEP----------KVLSSEDHFTLFKK 52
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF K Y S EEH +RF++FKANLRRA RHQK+DPSA HG+TQFSDLT +EFRR +LG+
Sbjct: 53 KFGKDYGSIEEHYYRFSVFKANLRRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHLGVTG 112
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+LPKDA+QAPILPT++LP +FDWR++GAV PVK+QGSCGSCWSFSTTGALEGA+FLAT
Sbjct: 113 GFKLPKDANQAPILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLAT 172
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLMREEDYPYTGTD
Sbjct: 173 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGTD 232
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
G +CK D+SKI ASV+NFSVVS++EDQIAANLVKNGPLAVAINA YMQTYIGGVSCPYI
Sbjct: 233 -GGSCKLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYI 291
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
CSRRL+HGVLL+GYGS+GY+ RLKEKPYWIIKNSWGESWGENG+YKIC+GRN+CGVDS+
Sbjct: 292 CSRRLNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 351
Query: 366 VSTVAAAV 373
VSTVAAA
Sbjct: 352 VSTVAAAT 359
>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
Length = 368
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 275/372 (73%), Positives = 315/372 (84%), Gaps = 12/372 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M + ++ FLV ++ ++S T D++D LIRQV GD+ + LL AEHH
Sbjct: 1 MERRCLISFLVYALLSFTIASTTSPDELDDPLIRQVVPDGDQ---------DHLLNAEHH 51
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ FK KF K YA+QEEHD+RF +FKANLRRA +HQ +DP+A HG+T FSDLTP EFRR
Sbjct: 52 FTTFKAKFGKTYATQEEHDYRFKLFKANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQ 111
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGLRR LRLP DA +APILPTNDLP DFDWR+ GAV VK+QGSCGSCWSFS GALEG
Sbjct: 112 YLGLRR-LRLPADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEG 170
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
A+FLATG+LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEYTLKAGGL REEDY
Sbjct: 171 AHFLATGELVSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLKAGGLEREEDY 230
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGG 299
PYTG DRG CKFD++KI ASV+NFSVVS+DEDQIAANLVK+GPLAV INAV+MQTY+GG
Sbjct: 231 PYTGNDRG-PCKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYMGG 289
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
VSCPYICS+R DHGVLLVGYGSAGYAPIRLK+KP+WIIKNSWGESWGENGYY+ICRGRN+
Sbjct: 290 VSCPYICSKRQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWGENGYYRICRGRNI 349
Query: 360 CGVDSMVSTVAA 371
CGVD+MVS+VAA
Sbjct: 350 CGVDAMVSSVAA 361
>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
Length = 361
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 277/367 (75%), Positives = 314/367 (85%), Gaps = 15/367 (4%)
Query: 7 VLFLVSLV-VFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
VLF VSL+ VF +VS + D D LIRQV D T +L +E HF+LFKK
Sbjct: 7 VLFSVSLIFVFVSVS---VCGDEDVLIRQVVD----------ETEPKVLSSEDHFTLFKK 53
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF K Y S EEH +RF++FKANL RA RHQK+DPSA HG+TQFSDLT +EFRR +LG++
Sbjct: 54 KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG 113
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+LPKDA+QAPILPT +LP +FDWR++GAV PVK+QGSCGSCWSFSTTGALEGA+FLAT
Sbjct: 114 GFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLAT 173
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLMRE+DYPYTGTD
Sbjct: 174 GKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTD 233
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
G +CK D+SKI ASV+NFSVVS++EDQIAANL+KNGPLAVAINA YMQTYIGGVSCPYI
Sbjct: 234 -GGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYI 292
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
CSRRL+HGVLLVGYGSAG++ RLKEKPYWIIKNSWGESWGENG+YKIC+GRN+CGVDS+
Sbjct: 293 CSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 352
Query: 366 VSTVAAA 372
VSTVAA
Sbjct: 353 VSTVAAT 359
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 270/348 (77%), Positives = 301/348 (86%), Gaps = 11/348 (3%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D +IRQV G + +L +E HFSLFK+KF K YAS EEHD+RF++FK
Sbjct: 27 DGDDLVIRQVVGGAEP----------QVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFK 76
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
ANLRRA RHQKLDPSATHG+TQFSDLT +EFR+ +LG+R +LPKDA++APILPT +LP
Sbjct: 77 ANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKDANKAPILPTENLP 136
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR+ GAV PVK+QGSCGSCWSFS TGALEGANFLATGKLVSLSEQQLVDCDHECDP
Sbjct: 137 EDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDP 196
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE SCDSGCNGGLMNSAFEYTLK GGLM+EEDYPYTG D G CK DKSKI ASV+NFS
Sbjct: 197 EEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD-GKTCKLDKSKIVASVSNFS 255
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+S+DE+QIAANLVKNGPLAVAINA YMQTYIGGVSCPYIC+RRL+HGVLLVGYG+AGYA
Sbjct: 256 VISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYA 315
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
P R KEKPYWIIKNSWGE+WGENG+YKIC+GRN+CGVDSMVSTVAA V
Sbjct: 316 PARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATV 363
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 282/366 (77%), Positives = 306/366 (83%), Gaps = 12/366 (3%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQL-IRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
+LF L+ +AV++ IDD D L IRQV ++ HH LL AEHHFS FK
Sbjct: 6 ILFFGLLLFSAAVATVERIDDEDNLLIRQVVPDAED---HH------LLNAEHHFSAFKT 56
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF K YA+QEEHDHRF IFK NL RA HQKLDPSA HG+T+FSDLTP+EFR +LGL+
Sbjct: 57 KFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPSEFRGQFLGLK- 115
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
LRLP DA +APILPT+DLP DFDWR+ GAV VK+QGSCGSCWSFS GALEGA+FL+T
Sbjct: 116 PLRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLST 175
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEYTLKAGGLMREEDYPYTG D
Sbjct: 176 GGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKAGGLMREEDYPYTGRD 235
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
RG CKFDKSKIAASVANFSVVSLDE+QIAANLVKNGPLAV INAV+MQTYIGGVSCPYI
Sbjct: 236 RG-PCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQTYIGGVSCPYI 294
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
C + LDHGVLLVGYGS YAPIR KEKPYWIIKNSWGESWGE GYYKICRGRNVCGVDSM
Sbjct: 295 CGKHLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWGEEGYYKICRGRNVCGVDSM 354
Query: 366 VSTVAA 371
VSTVAA
Sbjct: 355 VSTVAA 360
>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
Length = 374
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 283/373 (75%), Positives = 311/373 (83%), Gaps = 9/373 (2%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQL-IRQVTDGGDEILSHHESTNNDLLGAEHHFS 61
S+ V+L S +VF+A +S D+ D L IRQV G D+ + N AEHHFS
Sbjct: 5 SRFVLLLFSSSLVFAATASTVSSDESDDLLIRQVVAGADDHDNDDLLLN-----AEHHFS 59
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
FKK+F KAY S +EHD RF +FKANLRRA R+Q LDPSA HG+TQF DLTPAEFRRTYL
Sbjct: 60 SFKKRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDPSAVHGVTQFFDLTPAEFRRTYL 119
Query: 122 GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
GL+R LRLP D +APILPTNDLPADFDWR+ GAV PVK+QGSCGSCWSFS TGALEGAN
Sbjct: 120 GLKR-LRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGAN 178
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FLATGKLVSLSEQQLVDCDH CD E+P SCDSGCNGGLM SAFEYTLKAGGL REEDYPY
Sbjct: 179 FLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKAGGLEREEDYPY 238
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
TGTD CKFDK+KIA S +NFSVVSLDE+QIAANLV NGPLA+ INA++MQTYIGGVS
Sbjct: 239 TGTDHSK-CKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQTYIGGVS 297
Query: 302 CPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
CPYICS+R LDHGVLLVGYGSAG+APIR KEKPYWIIKNSWGESWGE GYYKICRGRN+C
Sbjct: 298 CPYICSKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWGEKGYYKICRGRNIC 357
Query: 361 GVDSMVSTVAAAV 373
G+DSMVS VAAAV
Sbjct: 358 GMDSMVSAVAAAV 370
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 269/348 (77%), Positives = 301/348 (86%), Gaps = 11/348 (3%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D +IRQV G + +L +E HFSLFK+KF K YAS EEHD+RF++FK
Sbjct: 27 DGDDLVIRQVVGGAEP----------QVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFK 76
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
ANLRRA RHQKLDPSATHG+TQFSDLT +EFR+ +LG+R +LPKDA++APILPT +LP
Sbjct: 77 ANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKDANKAPILPTENLP 136
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR+ GAV PVK+QGSCGSCWSFS TGALEGANFLATGKLVSLSEQQLVDCDHECDP
Sbjct: 137 EDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDP 196
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE SCDSGCNGGLMNSAFE+TLK GGLM+EEDYPYTG D G CK DKSKI ASV+NFS
Sbjct: 197 EEADSCDSGCNGGLMNSAFEHTLKTGGLMKEEDYPYTGKD-GKTCKLDKSKIVASVSNFS 255
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+S+DE+QIAANLVKNGPLAVAINA YMQTYIGGVSCPYIC+RRL+HGVLLVGYG+AGYA
Sbjct: 256 VISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYA 315
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
P R KEKPYWIIKNSWGE+WGENG+YKIC+GRN+CGVDSMVSTVAA V
Sbjct: 316 PARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATV 363
>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 276/367 (75%), Positives = 313/367 (85%), Gaps = 15/367 (4%)
Query: 7 VLFLVSLV-VFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
VLF VSL+ VF +VS + D D LIRQV D T +L +E HF+LFKK
Sbjct: 7 VLFSVSLIFVFVSVS---VCGDEDVLIRQVVD----------ETEPKVLSSEDHFTLFKK 53
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF K Y S EEH +RF++FKANL RA RHQK+DPSA HG+TQFSDLT +EFRR +LG++
Sbjct: 54 KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG 113
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+LPKDA+QAPILPT +LP +FDWR++GAV PVK+QGSCGSCWSFSTTGALEGA+FLAT
Sbjct: 114 GFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLAT 173
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQLVDCDHECDPEE GSCDSGCNG LMNSAFEYTLK GGLMRE+DYPYTGTD
Sbjct: 174 GKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKTGGLMREKDYPYTGTD 233
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
G +CK D+SKI ASV+NFSVVS++EDQIAANL+KNGPLAVAINA YMQTYIGGVSCPYI
Sbjct: 234 -GGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYI 292
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
CSRRL+HGVLLVGYGSAG++ RLKEKPYWIIKNSWGESWGENG+YKIC+GRN+CGVDS+
Sbjct: 293 CSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 352
Query: 366 VSTVAAA 372
VSTVAA
Sbjct: 353 VSTVAAT 359
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 273/355 (76%), Positives = 307/355 (86%), Gaps = 10/355 (2%)
Query: 19 VSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHD 78
VS ++ D D LIRQV + D +HH LGAEHHFSLFK++F K+YA++EEHD
Sbjct: 25 VSQHSVEHDGDPLIRQVVEN-DGDFNHHA------LGAEHHFSLFKRRFGKSYATEEEHD 77
Query: 79 HRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAP 137
RF IFKAN+RRA RHQ DPSA HG+TQFSDLTP EFR+ +LGLR +LRLP D + AP
Sbjct: 78 RRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAP 137
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPT +LP DFDWR+ G V VK+QGSCGSCWSFSTTGALEGANFLATG+LVSLSEQQLV
Sbjct: 138 ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLV 197
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEE +CDSGCNGGLMNSAFEYTLKAGGLM+E+DYPY G DR + C FDKSKI
Sbjct: 198 DCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDR-NTCNFDKSKI 256
Query: 258 AASVANFSVV-SLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLL 316
AAS+ANFSVV S+DEDQIAANLVKNGPLA+AINAV+MQTYIGGVSCP+ICS+RLDHGVLL
Sbjct: 257 AASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSKRLDHGVLL 316
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
VGYGSAGYAPIR+++K YWIIKNSWGESWGENGYYKICRGRN+CGVDS+VSTVAA
Sbjct: 317 VGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKICRGRNICGVDSLVSTVAA 371
>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 270/356 (75%), Positives = 307/356 (86%), Gaps = 12/356 (3%)
Query: 17 SAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQE 75
SAV+S +D+D LIRQV G++ DLL AEHHF+ FK KF K YA+QE
Sbjct: 17 SAVASTVSSNDLDDPLIRQVVSDGED----------DLLNAEHHFTSFKSKFGKTYATQE 66
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ 135
EHD+RF +FKANLRRA +HQ +DP+A HGIT+FSDLTP EFRR +LGL+R LRLP DA++
Sbjct: 67 EHDYRFGVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANK 126
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
APILPT DLP D+DWR+ GAV VKDQGSCGSCWSFS TGALEGA++LATG+L SLSEQQ
Sbjct: 127 APILPTTDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQ 186
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKS 255
LVDCDHECDPEE G+CDSGC+GGLMN+AFEY LKAGGL REEDYPYTGTD G CKFDKS
Sbjct: 187 LVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTD-GGTCKFDKS 245
Query: 256 KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVL 315
K+ ASV+NFSVVS+DEDQIAANLVK+GPL+VAINA +MQTY+GGVSCPYICS+R DHGVL
Sbjct: 246 KVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVL 305
Query: 316 LVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LVGYGSAGYAPIR KEKP+WIIKNSWG++WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 306 LVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAA 361
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 278/356 (78%), Positives = 302/356 (84%), Gaps = 12/356 (3%)
Query: 17 SAVSSGTLIDDVDQL-IRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQE 75
+ V++ IDD D L IRQV ++ HH LL AEHHFS FK KF K YA+QE
Sbjct: 16 ATVAAAERIDDEDDLLIRQVVPDAED---HH------LLNAEHHFSAFKTKFGKTYATQE 66
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ 135
EHDHRF IFK NL RA HQKLDPSA HG+T+FSDLTPAEFRR +LGL+ LRLP DA +
Sbjct: 67 EHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQFLGLK-PLRLPSDAQK 125
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
APILPTNDLP DFDWRE GAV VK+QGSCGSCWSFS GALEGA+FL+TG+LVSLSEQQ
Sbjct: 126 APILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQ 185
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKS 255
LVDCDHECDPEE G+CDSGCNGGLM +AFEYTL+AGGLMRE+DYPYTG DRG CKFDKS
Sbjct: 186 LVDCDHECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMREKDYPYTGRDRG-PCKFDKS 244
Query: 256 KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVL 315
K+AASVANFSVVSLDE+QIAANLV+NGPLAV INAV+MQTYIGGVSCPYIC + LDHGVL
Sbjct: 245 KVAASVANFSVVSLDEEQIAANLVQNGPLAVGINAVFMQTYIGGVSCPYICGKHLDHGVL 304
Query: 316 LVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LVGYGS YAPIR KEKPYWIIKNSWGESWGE GYYKICRGRNVCGVDSMVSTVAA
Sbjct: 305 LVGYGSGAYAPIRFKEKPYWIIKNSWGESWGEEGYYKICRGRNVCGVDSMVSTVAA 360
>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 367
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 269/357 (75%), Positives = 307/357 (85%), Gaps = 14/357 (3%)
Query: 15 VFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQ 74
V S VSS L D + +I+ V+DG D DLL AEHHF+ FK KF K YA+Q
Sbjct: 19 VASTVSSTDLDDPL--IIQVVSDGED-----------DLLNAEHHFTSFKSKFGKTYATQ 65
Query: 75 EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD 134
EEHD+RF +FKANLRRA +HQ +DP+A HG+T+FSDLTP EFRR +LGL+R+LRLP DA+
Sbjct: 66 EEHDYRFGVFKANLRRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDAN 125
Query: 135 QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQ 194
+APILPT DLP D+DWR+ GAV VKDQGSCGSCWSFS TGALEGA++LATG+L SLSEQ
Sbjct: 126 KAPILPTTDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQ 185
Query: 195 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDK 254
QLVDCDHECDPEE G+CDSGC+GGLMN+AFEY LKAGGL REEDYPYTGTD G CKFDK
Sbjct: 186 QLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTD-GGTCKFDK 244
Query: 255 SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGV 314
SK+ ASV+NFSVVS+DEDQIAANLVK+GPL+VAINA +MQTY+GGVSCPYICS+R DHGV
Sbjct: 245 SKVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGV 304
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LLVGYGSAGYAPIR KEKP+WIIKNSWG++WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 305 LLVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAA 361
>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
Length = 367
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 269/356 (75%), Positives = 306/356 (85%), Gaps = 12/356 (3%)
Query: 17 SAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQE 75
SAV+S +D+D LIRQV G++ DLL AEHHF+ FK KF K YA+QE
Sbjct: 17 SAVASTVSSNDLDDPLIRQVVSDGED----------DLLNAEHHFTSFKSKFGKTYATQE 66
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ 135
EHD+RF +FKANLRRA +HQ +DP+A HGIT+FSDLTP EFRR +LGL+R LRLP DA++
Sbjct: 67 EHDYRFGVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANK 126
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
APILPT DLP D+DWR+ GAV VKDQGSCGSCWSFS TGALEGA++LATG+L SLSEQQ
Sbjct: 127 APILPTTDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQ 186
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKS 255
LVDCDHECDPEE G+CDSGC+GGLMN+AFEY LKAGGL RE DYPYTGTD G CKFDKS
Sbjct: 187 LVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREADYPYTGTD-GGTCKFDKS 245
Query: 256 KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVL 315
K+ ASV+NFSVVS+DEDQIAANLVK+GPL+VAINA +MQTY+GGVSCPYICS+R DHGVL
Sbjct: 246 KVVASVSNFSVVSIDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVL 305
Query: 316 LVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LVGYGSAGYAPIR KEKP+WIIKNSWG++WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 306 LVGYGSAGYAPIRFKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAA 361
>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
Length = 367
Score = 565 bits (1457), Expect = e-159, Method: Compositional matrix adjust.
Identities = 263/343 (76%), Positives = 299/343 (87%), Gaps = 11/343 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV G++ DLL AEHHF+ FK KF K YA+QEEHD+RF +FKANL
Sbjct: 30 DPLIIQVVSDGED----------DLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANL 79
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RRA +HQ +DP+A HG+T+FSDLTP EFRR +LGL+R+LRLP DA++APILPT DLP D+
Sbjct: 80 RRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPTTDLPTDY 139
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+ GAV VKDQGSCGSCWSFS TGALEGA++LATG+L SLSEQQLVDCDHECDPEE
Sbjct: 140 DWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEY 199
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
G+CDSGC+GGLMN+AFEY LKAGGL REEDYPYTGTD G CKFDKSK+ ASV+NFSVVS
Sbjct: 200 GACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGG-TCKFDKSKVVASVSNFSVVS 258
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
+DEDQIAANLVK+GPL+VAINA +MQTY+GGVSCPYICS+R DHGVLLVGYGSAGYAPIR
Sbjct: 259 IDEDQIAANLVKHGPLSVAINAAFMQTYVGGVSCPYICSKRQDHGVLLVGYGSAGYAPIR 318
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
KEKP+WIIKNSWG++WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 319 FKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDSMVSTVAA 361
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 563 bits (1450), Expect = e-158, Method: Compositional matrix adjust.
Identities = 273/370 (73%), Positives = 307/370 (82%), Gaps = 14/370 (3%)
Query: 3 SKTVVLFLVSLVVFSA-VSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFS 61
S++ +LFL+ ++FSA VS + + D LIRQV GD DLL AEH F
Sbjct: 5 SRSALLFLIPTLLFSAAVSDISSDESDDLLIRQVVPEGD-----------DLLSAEHQFG 53
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
LFK KF K Y++ EEHD+RF++F+ANLRRA RHQ LDPSA HG+T+FSDLTP EFRR YL
Sbjct: 54 LFKAKFGKTYSTVEEHDYRFSVFEANLRRARRHQLLDPSAVHGVTRFSDLTPDEFRRDYL 113
Query: 122 GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
GL+ LRLP DA +APILPTNDLP DFDWR+ GAV PVKDQGSCGSCWSFS GALEGA+
Sbjct: 114 GLK-PLRLPADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQGSCGSCWSFSAIGALEGAH 172
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FL TG L+S+SEQQLVDCDHECDPEE G+CD GCNGGLM SAFEY LKAGG+ REE YPY
Sbjct: 173 FLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPY 232
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
G+DRG +CKF+KS+I ASV+NFSVVSLDEDQIAAN+VKNGPLAV INAV+MQTY+ GVS
Sbjct: 233 IGSDRG-SCKFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVS 291
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
CPYICSR LDHGV+LVGYGSAGYAPIR KEKPYWIIKNSWGESWGE+GYYKICRG N CG
Sbjct: 292 CPYICSRNLDHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGEDGYYKICRGHNACG 351
Query: 362 VDSMVSTVAA 371
VDSMVSTVAA
Sbjct: 352 VDSMVSTVAA 361
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 269/367 (73%), Positives = 309/367 (84%), Gaps = 17/367 (4%)
Query: 10 LVSLVVFSAVSSGTLI----DDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFK 64
+SL+VF+ +SS L D++D LIRQV + LL A+HHF+ FK
Sbjct: 6 FLSLIVFAFLSSSILFTATSDELDDPLIRQVV----------PDVEDYLLSAQHHFTAFK 55
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
KF K YA+QEEHD+RF +FKANLRRA +HQ +DPSA HG+T+FSDLTP EFRR YLGL+
Sbjct: 56 AKFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDPSAVHGVTKFSDLTPREFRRQYLGLK 115
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
KLRLP DA +APILPT+ +P DFDWR+ GAV VK+QGSCGSCWSFS GALEGA+FLA
Sbjct: 116 -KLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLA 174
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+LVSLSEQQLVDCDHECDP E G+CDSGCNGGLM +AFEY LKAGGL REEDYPYTG+
Sbjct: 175 TGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKAGGLEREEDYPYTGS 234
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
DRG CKF+++KIAASV NFSVVS+DEDQIAANLV+NGPLAV INAV+MQTYIGGVSCPY
Sbjct: 235 DRG-PCKFERAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQTYIGGVSCPY 293
Query: 305 ICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
ICS+R DHGV+LVGYGSAGYAP+RLK+KP+WIIKNSWGE+WGENGYYKICRGRNVCGVD+
Sbjct: 294 ICSKRQDHGVVLVGYGSAGYAPVRLKDKPFWIIKNSWGENWGENGYYKICRGRNVCGVDA 353
Query: 365 MVSTVAA 371
MVSTVAA
Sbjct: 354 MVSTVAA 360
>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
Length = 369
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 263/353 (74%), Positives = 301/353 (85%), Gaps = 18/353 (5%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D D LIRQV G++ + LL A+HHF+LFK K+ K+YA+QEEHD+R ++FKA
Sbjct: 23 DEDPLIRQVVSDGED---------DALLNADHHFTLFKSKYGKSYATQEEHDYRLSVFKA 73
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR------RKLRLPKDADQAPILP 140
NLRRA RHQ LDPSA HG+T+FSDLTP EFRRT+LG+R RKL+LP DA A ILP
Sbjct: 74 NLRRAKRHQLLDPSAVHGVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILP 133
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
T+DLP+DFDWR+ GAV VKDQGSCGSCWSFSTTGALEGANFLATG+LVSLSEQQLVDCD
Sbjct: 134 TSDLPSDFDWRDYGAVTGVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCD 193
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
H CDPEE G+CDSGCNGGLM +A+EY L++GGL +E+DYPYTG D CKFDKSKIAA+
Sbjct: 194 HLCDPEEAGACDSGCNGGLMTTAYEYVLQSGGLEKEKDYPYTGKD--GTCKFDKSKIAAA 251
Query: 261 VANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGY 319
VANFSVVSLDEDQIAANLVK+GPL+V INAV+MQTYIGGVSCPYICS+R LDHGVLLVGY
Sbjct: 252 VANFSVVSLDEDQIAANLVKHGPLSVGINAVFMQTYIGGVSCPYICSKRNLDHGVLLVGY 311
Query: 320 GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
G+AGYAPIR K+KPYWI+KNSWGE+WGE GYYKICRG N+CG+DSMVSTV AA
Sbjct: 312 GAAGYAPIRFKDKPYWIVKNSWGENWGEEGYYKICRGNNICGIDSMVSTVTAA 364
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 267/370 (72%), Positives = 310/370 (83%), Gaps = 18/370 (4%)
Query: 9 FLVSLVVFSAVSSGT--LIDDV---DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLF 63
FL++L +F+ V++ L DD D LIRQV D + + +L AEHHF+ F
Sbjct: 5 FLIALFLFATVATAATTLSDDTNSDDLLIRQVVD----------TAEDHILNAEHHFTSF 54
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
K KF+K YA++EEHD+RF +FK+NL +A HQKLDPSA HGIT+FSDLT +EFRR +LGL
Sbjct: 55 KSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFRRQFLGL 114
Query: 124 RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
++LRLP A +APILPTN+LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGAN+L
Sbjct: 115 NKRLRLPAHAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYL 174
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
ATGKL SLSEQQLVDCDH CDPEE GSCDSGCNGGLMN+AFEY L++GG++ E+DY YTG
Sbjct: 175 ATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGGVVSEKDYAYTG 234
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP 303
D +CKFDKSK+ ASV+NFSVVSLDEDQIAANLVKNGPLAVAINA +MQTY+ GVSCP
Sbjct: 235 RD--GSCKFDKSKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCP 292
Query: 304 YICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
YIC++ RLDHGVLL+G+G GYAPIRLKEKPYWIIKNSWG++WGE GYYKICRGRNVCGV
Sbjct: 293 YICAKARLDHGVLLLGFGQGGYAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRGRNVCGV 352
Query: 363 DSMVSTVAAA 372
DSMVSTVAAA
Sbjct: 353 DSMVSTVAAA 362
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 261/346 (75%), Positives = 296/346 (85%), Gaps = 15/346 (4%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D VD +I QV D ++ LGAEHHF FK++F K YA++EEH +RF +FK
Sbjct: 24 DGVDPMICQVVD-------------DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFK 70
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+N+ RA RHQ LDPSA HG+T+FSDLTP EFR + LGLR + LP DAD APILPT++LP
Sbjct: 71 SNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLR-GVGLPSDADSAPILPTDNLP 129
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWRE GAV PVK+QGSCGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDH+CDP
Sbjct: 130 KDFDWREHGAVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDP 189
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE GSCDSGCNGGLMNSAFEY L GG+MREEDYPY+GT+ G CKFDK+KIAASVANFS
Sbjct: 190 EEAGSCDSGCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGG-TCKFDKAKIAASVANFS 248
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
VVS DEDQIAANLVKNGPLAVAINAVYMQTY+GGVSCPY+CS++L+HGVLLVGYGS YA
Sbjct: 249 VVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYA 308
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
PIR+K+KPYWIIKNSWGE+WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 309 PIRMKQKPYWIIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAA 354
>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
[Cucumis sativus]
Length = 381
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 266/355 (74%), Positives = 300/355 (84%), Gaps = 16/355 (4%)
Query: 19 VSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHD 78
VS ++ D D LIRQV + D +HH LGAEHHFSLFK++F K+YA++EEHD
Sbjct: 25 VSQHSVEHDGDPLIRQVVEN-DGDFNHHA------LGAEHHFSLFKRRFGKSYATEEEHD 77
Query: 79 HRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAP 137
RF IFKAN+RRA RHQ DPSA HG+TQFSDLTP EFR+ +LGLR +LRLP D + AP
Sbjct: 78 RRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAP 137
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPT +LP DFDWR+ G V VK+QGSCGSCWSFSTTGALEGANFL LSEQQLV
Sbjct: 138 ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFL------XLSEQQLV 191
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCDHECDPEE +CDSGCNGGLMNSAFEYTLKAGGLM+E+DYPY G DR + C FDKSKI
Sbjct: 192 DCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDR-NTCNFDKSKI 250
Query: 258 AASVANFSVV-SLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLL 316
AAS+A+FSVV S+DEDQIAANLVKNGPLA+AINAV+MQTYIGGVSCP+ICS+RLDHGVLL
Sbjct: 251 AASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSKRLDHGVLL 310
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
VGYGSAGYAPIR+++K YWIIKNSWGESWGENGYYKICRGRN+CGVDS+VSTVAA
Sbjct: 311 VGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKICRGRNICGVDSLVSTVAA 365
>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
Length = 365
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 267/356 (75%), Positives = 301/356 (84%), Gaps = 17/356 (4%)
Query: 16 FSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQE 75
S VSS L DD+ LIRQV S ++DLL AEHHF+ FK +F K YA+ E
Sbjct: 22 MSDVSSNEL-DDL--LIRQVV-----------SNSDDLLSAEHHFAAFKARFRKTYATAE 67
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ 135
EHD+RF+IFKANLRRA R+Q LDPSA HG+T+FSDLTPAEFR+ YLGL+ LR P D Q
Sbjct: 68 EHDYRFSIFKANLRRAKRNQLLDPSAVHGVTRFSDLTPAEFRQNYLGLK-PLRFPIDTQQ 126
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
APILPTNDLP DFDWR+ GAV VKDQG CGSCWSFSTTGALEGA+FLATG LVSLSEQQ
Sbjct: 127 APILPTNDLPTDFDWRDHGAVTAVKDQGECGSCWSFSTTGALEGAHFLATGNLVSLSEQQ 186
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKS 255
LVDCDHECDPEE G+CD GCNGGLMN+AFEY LKAGG++R EDYPYTGTD GH CKFDK+
Sbjct: 187 LVDCDHECDPEEYGACDRGCNGGLMNTAFEYILKAGGVVRGEDYPYTGTD-GH-CKFDKT 244
Query: 256 KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVL 315
KIAASV+NFS VS+DEDQIAANLVKNGPLAV INA++MQ+Y GGVSCP+ICS L+HGVL
Sbjct: 245 KIAASVSNFSTVSIDEDQIAANLVKNGPLAVGINAIFMQSYAGGVSCPFICSTSLNHGVL 304
Query: 316 LVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LVGYGSAGY+PIR KEKPYW++KNSWG++WGE+GYYKICRG N+CGVDSMVSTVAA
Sbjct: 305 LVGYGSAGYSPIRFKEKPYWLLKNSWGQNWGEHGYYKICRGHNICGVDSMVSTVAA 360
>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
Length = 313
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 254/313 (81%), Positives = 286/313 (91%), Gaps = 1/313 (0%)
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
+LFKKKF K Y S EEH +RF++FKANL RA RHQK+DPSA HG+TQFSDLT +EFRR +
Sbjct: 1 ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKH 60
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
LG++ +LPKDA+QAPILPT +LP +FDWR++GAV PVK+QGSCGSCWSFSTTGALEGA
Sbjct: 61 LGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGA 120
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
+FLATGKLVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYTLK GGLMRE+DYP
Sbjct: 121 HFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYP 180
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV 300
YTGTD G +CK D+SKI ASV+NFSVVS++EDQIAANL+KNGPLAVAINA YMQTYIGGV
Sbjct: 181 YTGTD-GGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGV 239
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
SCPYICSRRL+HGVLLVGYGSAG++ RLKEKPYWIIKNSWGESWGENG+YKIC+GRN+C
Sbjct: 240 SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNIC 299
Query: 361 GVDSMVSTVAAAV 373
GVDS+VSTVAA
Sbjct: 300 GVDSLVSTVAATT 312
>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 549 bits (1415), Expect = e-154, Method: Compositional matrix adjust.
Identities = 267/366 (72%), Positives = 302/366 (82%), Gaps = 14/366 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
LFL+SL+ F SS D D LIRQV E+ ++ LL AEHHFSLFK KF
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVS---------ETDDSHLLNAEHHFSLFKSKF 54
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YAS+EEHDHRF +FKANLRRA RHQ LDPSA HGIT+FSDLTP+EFRRTYLGL +
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKPK 114
Query: 128 RLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
PK +A++APILPT+DLPADFDWR+ GAV VK+QGSCGSCWSFSTTGA+EGA+FLATG
Sbjct: 115 --PKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATG 172
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEYTLKAGGL E+DYPYTG D
Sbjct: 173 ELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG 232
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
C FDKSKIAA+V NFSV+ LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC
Sbjct: 233 --KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLIC 290
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+R DHGVLLVGYGS G+APIRLKEK YWIIKNSWGE+WGE+GYYKICRG N+CGVD+MV
Sbjct: 291 FKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMV 350
Query: 367 STVAAA 372
STV AA
Sbjct: 351 STVTAA 356
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 549 bits (1414), Expect = e-154, Method: Compositional matrix adjust.
Identities = 262/344 (76%), Positives = 292/344 (84%), Gaps = 15/344 (4%)
Query: 28 VDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKAN 87
VD LIRQV DG + LGAEHHF FK++F K Y S+EEH +RF +FK+N
Sbjct: 26 VDPLIRQVVDG-------------EGLGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSN 72
Query: 88 LRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD 147
+ RA RHQ LDPSA HG+T+FSDLTP EFR + LGLR + LP DAD APIL T++LP D
Sbjct: 73 MHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLR-GVGLPSDADSAPILRTDNLPKD 131
Query: 148 FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 207
FDWRE GAV PVK+QGSCG+CWSFS TGALEGA+FL+TGKLVSLSEQQLVDCDHECDPEE
Sbjct: 132 FDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEE 191
Query: 208 PGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVV 267
GSCDSGC GGLMNSAFEY L GG+MREEDYPY+GT G CKFD++KIAASVANFSVV
Sbjct: 192 AGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGT-AGGTCKFDQTKIAASVANFSVV 250
Query: 268 SLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPI 327
S DEDQIAANLVKNGPLAVAINAVYMQTY+GGVSCPY+CS++L+HGVLLVGYGS YAPI
Sbjct: 251 SRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPI 310
Query: 328 RLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
R+K+KPYWIIKNSWGE+WGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 311 RMKQKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAA 354
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 258/368 (70%), Positives = 312/368 (84%), Gaps = 17/368 (4%)
Query: 9 FLVSLVVFSAVSSGTLIDDV---DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
F+ ++V+F+AV++ + DD D +IRQV D ++ LL AEHHF+ FK
Sbjct: 5 FIFAIVLFAAVATSS-TDDTNTDDFIIRQVVDNEED----------HLLNAEHHFTSFKS 53
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
KF+K+Y+++EEHD+RF +FK+NL +A HQKLDP+A HGIT+FSDLT +EFRR +LGL++
Sbjct: 54 KFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKK 113
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+LRLP A +APILPT +LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGA++LAT
Sbjct: 114 RLRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLAT 173
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQLVDCDH CDPE+ GSCDSGCNGGLMN+AFEY L++GG+++E+DY YTG D
Sbjct: 174 GKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD 233
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
+CKFDKSK+ ASV+NFSVVSLDE+QIAANLVKNGPLAV INA +MQTY+ GVSCPY+
Sbjct: 234 --GSCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCPYV 291
Query: 306 CSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
C++ RLDHGVLLVG+G YAPIRLKEKPYWI+KNSWG++WGE GYYKICRGRNVCGVDS
Sbjct: 292 CAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDS 351
Query: 365 MVSTVAAA 372
MVSTVAAA
Sbjct: 352 MVSTVAAA 359
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 261/341 (76%), Positives = 290/341 (85%), Gaps = 11/341 (3%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
LIRQV G+ + LL AEHHFS FK KF K YA++EEHDHRF +FK+N+RR
Sbjct: 30 LIRQVVPEGE--------VEDHLLNAEHHFSTFKAKFGKTYATKEEHDHRFGVFKSNMRR 81
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDW 150
A H +LDPSA HG+T+FSDLTPAEF R +LGL+ LRLP A +APILPTN+LP DFDW
Sbjct: 82 ARLHAQLDPSAVHGVTKFSDLTPAEFHRKFLGLK-PLRLPAHAQKAPILPTNNLPKDFDW 140
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
R+KGAV VKDQGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDH CDPEE GS
Sbjct: 141 RDKGAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGS 200
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD 270
CDSGCNGGLMN+AFEY + +GG+ RE+DYPYTG D CKFDKSKIAASV+N+SV+SLD
Sbjct: 201 CDSGCNGGLMNNAFEYLIGSGGVQREKDYPYTGRDG--TCKFDKSKIAASVSNYSVISLD 258
Query: 271 EDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLK 330
E+QIAANLVKNGPLAVAINAVYMQTY+GGVSCPYIC + LDHGVLLVGYG YAPIR K
Sbjct: 259 EEQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFK 318
Query: 331 EKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
EKPYWIIKNSWGE+WGENGYYKICRGRNVCGVDSMVSTV A
Sbjct: 319 EKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVGA 359
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 261/347 (75%), Positives = 296/347 (85%), Gaps = 16/347 (4%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D VD +I QV D ++ LGAEHHF FK++F K YA++EEH +RF +FK
Sbjct: 24 DGVDPMICQVVD-------------DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFK 70
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+N+ RA RHQ LDPSA HG+TQFSDLTP EF+ + LGLR + LP DAD APILPT++LP
Sbjct: 71 SNMHRARRHQLLDPSAVHGVTQFSDLTPMEFQHSVLGLR-GVGLPSDADSAPILPTDNLP 129
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE-CD 204
DFDWRE GAV PVK+QGSCGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDH+ CD
Sbjct: 130 KDFDWREHGAVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCD 189
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF 264
PEE GSCDSGCNGGLMNSAFEY L GG+MREEDYPY+GT+ G CKFDK+KIAASVANF
Sbjct: 190 PEEAGSCDSGCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGG-TCKFDKAKIAASVANF 248
Query: 265 SVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGY 324
SVVS DEDQIAANLVKNGPLAVAINAVYMQTY+GGVSCPY+CS++L+HGVLLVGYGS Y
Sbjct: 249 SVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESY 308
Query: 325 APIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
APIR+K+KPYWIIKNSWGE+WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 309 APIRMKQKPYWIIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAA 355
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/348 (73%), Positives = 299/348 (85%), Gaps = 14/348 (4%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D LIRQVTDG HH +L AEHHF+ FK KF K+YA+QEEHD+RF +F+
Sbjct: 21 DQADPLIRQVTDG-----DHH------MLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFR 69
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
ANLRRA H KLDPSA HG+T+FSDLTP EF+R YLGL+ LRLP A++APILPT+DLP
Sbjct: 70 ANLRRAKLHAKLDPSAEHGVTKFSDLTPEEFKRQYLGLK-PLRLPSTANKAPILPTSDLP 128
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
+FDWR+KGAV PVK+QGSCGSCW+FSTTGALEGA++L+TG+LVSLSEQQLVDCDH CDP
Sbjct: 129 ENFDWRDKGAVTPVKNQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDP 188
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE G+CD+GCNGGLMN+AF+Y L+AGG+ E+DYPY+G D CKFDKSK+AA+VANFS
Sbjct: 189 EEYGACDAGCNGGLMNNAFDYILQAGGVQTEKDYPYSGRDE--TCKFDKSKVAATVANFS 246
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
VVSLDEDQIAANLVK+GPLAV INA++MQTYIGGVSCPYIC + LDHGVLLVGYG+AGYA
Sbjct: 247 VVSLDEDQIAANLVKHGPLAVGINAIFMQTYIGGVSCPYICGKNLDHGVLLVGYGAAGYA 306
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
PIR K+KP+WIIKNSWGESWGE+GYYKICRG+NVCGVDSMVS+V A
Sbjct: 307 PIRFKDKPFWIIKNSWGESWGEDGYYKICRGKNVCGVDSMVSSVVATT 354
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 256/367 (69%), Positives = 311/367 (84%), Gaps = 15/367 (4%)
Query: 9 FLVSLVVFSAVSSGTL--IDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKK 66
F+ ++V+F+AV++ + + D +IRQV D ++ LL AEHHF+ FK K
Sbjct: 5 FIFAIVLFAAVATSSTDNTNTDDFIIRQVVDNEED----------HLLNAEHHFTSFKSK 54
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK 126
F+K+Y+++EEHD+RF +FK+NL +A HQKLDP+A HGIT+FSDLT +EFRR +LGL+++
Sbjct: 55 FSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKR 114
Query: 127 LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
LRLP A +APILPT +LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGA++LATG
Sbjct: 115 LRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATG 174
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
KLVSLSEQQLVDCDH CDPE+ GSCDSGCNGGLMN+AFEY L++GG+++E+DY YTG D
Sbjct: 175 KLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD- 233
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
+CKFDKSK+ ASV+NFSVVSLDE+QIAANLVKNGPLAV INA +MQTY+ GVSCPY+C
Sbjct: 234 -GSCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCPYVC 292
Query: 307 SR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
++ RLDHGVLLVG+G YAPIRLKEKPYWI+KNSWG++WGE GYYKICRGRNVCGVDSM
Sbjct: 293 AKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSM 352
Query: 366 VSTVAAA 372
VSTVAAA
Sbjct: 353 VSTVAAA 359
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 264/346 (76%), Positives = 292/346 (84%), Gaps = 11/346 (3%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D LIRQV G E+ H LL AEHHFS FK KF K YA++EEHDHRF +FK
Sbjct: 25 DADDILIRQVVPEG-EVEDH-------LLNAEHHFSTFKSKFGKTYATKEEHDHRFGVFK 76
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+N+RRA H +LDPSA HG+T+FSDLTPAEF R +LGL+ LRLP A +APILPTN+LP
Sbjct: 77 SNMRRARLHAQLDPSAVHGVTKFSDLTPAEFHRKFLGLK-PLRLPAHAQKAPILPTNNLP 135
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR+KGAV VKDQGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDH CDP
Sbjct: 136 KDFDWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDP 195
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE GSCDSGCNGGLMN+AFEY + +GG+ RE+DYPYTG D CKFDKSKIAASV+N+S
Sbjct: 196 EEYGSCDSGCNGGLMNNAFEYLIGSGGVQREKDYPYTGRDG--TCKFDKSKIAASVSNYS 253
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+SLDE+QIAANLVKNGPLAVAINAVYMQTY+GGVSCPYIC + LDHGVLLVGYG YA
Sbjct: 254 VISLDEEQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYA 313
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
PIR KEKPYWIIKNSWGE+WG NGYYKICRGRNVCGVDSMVSTV A
Sbjct: 314 PIRFKEKPYWIIKNSWGENWGGNGYYKICRGRNVCGVDSMVSTVGA 359
>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 361
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 265/364 (72%), Positives = 297/364 (81%), Gaps = 12/364 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
LFL+S + F+ SS D D LIRQV G D+ N +L AEHHFSLFK KF
Sbjct: 2 LFLLSFLAFALFSSAIAFSDDDPLIRQVVSGNDD---------NHMLNAEHHFSLFKAKF 52
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YASQEEHDHR +FKANL RA RHQ LDPSA HGITQFSDLTP+EFRRTYLGL K
Sbjct: 53 GKIYASQEEHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLN-KP 111
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
R +A++APILPT DLP+DFDWREKGAV VK+QGSCGSCWSFSTTGA+EGA+FLATG+
Sbjct: 112 RPNLNAEKAPILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGE 171
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQQLVDCDHECDP E CD+GCNGGLM +AFEYTLKAGGL E+DYPYTG R
Sbjct: 172 LVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLEKDYPYTG--RN 229
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS 307
C FDKS+IAASV+NFSVV LDEDQIAANL+K+GPLAV INA +MQTY+ GVSCP IC
Sbjct: 230 GKCHFDKSRIAASVSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQTYVRGVSCPLICF 289
Query: 308 RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
+R DHGVLLVGYGS G+APIRLK KPYWIIKNSWG++WGE+GYYKICRG ++CGVD+MVS
Sbjct: 290 KRQDHGVLLVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGEHGYYKICRGHHICGVDAMVS 349
Query: 368 TVAA 371
TV A
Sbjct: 350 TVTA 353
>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
Length = 371
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 264/343 (76%), Positives = 286/343 (83%), Gaps = 13/343 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV GD DLL AE+ F+ FK KF K YA+ EEHDHRF +FKANL
Sbjct: 36 DLLIHQVVSDGD-----------DLLNAEYQFAEFKTKFGKTYATAEEHDHRFNVFKANL 84
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RRA RHQ LDPSA HG+TQFSDLTP EFR+ YLGL+R L+LP DA +APILPT DLP DF
Sbjct: 85 RRAKRHQLLDPSAEHGVTQFSDLTPREFRQNYLGLKR-LQLPADAQKAPILPTKDLPTDF 143
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+ GAV VKDQG CGSCWSFST GALEGA+FLATG LVSLS QQL+DCD ECDPEE
Sbjct: 144 DWRDHGAVTAVKDQGYCGSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEY 203
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
+CD GCNGGLMN+AFEY LKAGG+ +EEDYPYTGTDRG C+F+K+KIAASVANFSVVS
Sbjct: 204 DACDDGCNGGLMNNAFEYILKAGGVAQEEDYPYTGTDRG-LCRFNKTKIAASVANFSVVS 262
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
LDEDQIAANLVKNGPLAV INAV+MQTY GVSCPYICS LDHGVLLVGYGSAGY+PIR
Sbjct: 263 LDEDQIAANLVKNGPLAVGINAVFMQTYKSGVSCPYICSSTLDHGVLLVGYGSAGYSPIR 322
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
KEKPYWIIKNSWGESWGE GYYKICRG N+CGVDSMVSTVAA
Sbjct: 323 FKEKPYWIIKNSWGESWGEQGYYKICRGHNICGVDSMVSTVAA 365
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 263/341 (77%), Positives = 292/341 (85%), Gaps = 11/341 (3%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
LIRQV G E+ H LL AEHHFS FK KF K YA++EEHDHRF +FK+NLRR
Sbjct: 29 LIRQVVPEG-EVEDH-------LLNAEHHFSNFKAKFGKTYATKEEHDHRFGVFKSNLRR 80
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDW 150
A H +LDPSA HG+T+FSDLT AEF+R +LGL+ L LP +A +APILPTN+LP DFDW
Sbjct: 81 ARLHAQLDPSAVHGVTKFSDLTAAEFQRQFLGLK-PLGLPANAQKAPILPTNNLPKDFDW 139
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
R+KGAV VKDQG+CGSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDH CDPEE G+
Sbjct: 140 RDKGAVTNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGA 199
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD 270
CDSGCNGGLMN+AFEY L AGG+ REEDYPY G D +CKFDKSKIAASVAN+SV+SLD
Sbjct: 200 CDSGCNGGLMNNAFEYILGAGGVQREEDYPYAGRDS--SCKFDKSKIAASVANYSVISLD 257
Query: 271 EDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLK 330
EDQIAANLVKNGPLAV INAVYMQTYIGGVSCPYIC++RLDHGV +VGYG +GYAPIR K
Sbjct: 258 EDQIAANLVKNGPLAVGINAVYMQTYIGGVSCPYICAKRLDHGVQIVGYGESGYAPIRFK 317
Query: 331 EKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
EKPYWIIKNSWGESWGENGYYKICRG+N CGVDSMVSTV A
Sbjct: 318 EKPYWIIKNSWGESWGENGYYKICRGQNACGVDSMVSTVGA 358
>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 546 bits (1408), Expect = e-153, Method: Compositional matrix adjust.
Identities = 266/366 (72%), Positives = 301/366 (82%), Gaps = 14/366 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
LFL+SL+ F SS D D LIRQV E+ ++ LL AEHHFSLFK KF
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVS---------ETDDSHLLNAEHHFSLFKSKF 54
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YAS+EEHDHRF +FKAN RRA RHQ LDPSA HGIT+FSDLTP+EFRRTYLGL +
Sbjct: 55 GKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKPK 114
Query: 128 RLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
PK +A++APILPT+DLPADFDWR+ GAV VK+QGSCGSCWSFSTTGA+EGA+FLATG
Sbjct: 115 --PKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATG 172
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDHECDPE+ +CD+GC GGLM +AFEYTLKAGGL E+DYPYTG D
Sbjct: 173 ELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG 232
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
C FDKSKIAA+V NFSV+ LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC
Sbjct: 233 --KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLIC 290
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+R DHGVLLVGYGS G+APIRLKEK YWIIKNSWGE+WGE+GYYKICRG N+CGVD+MV
Sbjct: 291 FKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMV 350
Query: 367 STVAAA 372
STV AA
Sbjct: 351 STVTAA 356
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 546 bits (1407), Expect = e-153, Method: Compositional matrix adjust.
Identities = 262/347 (75%), Positives = 294/347 (84%), Gaps = 16/347 (4%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D VD +I QV D ++ LGAEHHF FK++F K YA++EEH +RF +FK
Sbjct: 24 DGVDPMICQVVD-------------DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFK 70
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+N+ RA RHQ LDPSA HG+T+FSDLTP EFR + LGLR + LP DAD APILPT++LP
Sbjct: 71 SNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLR-GVGLPSDADSAPILPTDNLP 129
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE-CD 204
DFDWRE GAV PVK+QGSCGSCWSFS TGALEGA+FL+TGKLVSLSEQQLVDCDHE CD
Sbjct: 130 KDFDWREHGAVTPVKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCD 189
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF 264
PEE GSCDSGC GGLMNSAFEY L GG+MREEDYPY+GT G CKFD++KIAASVANF
Sbjct: 190 PEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGT-AGGTCKFDQTKIAASVANF 248
Query: 265 SVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGY 324
SVVS DEDQIAANLVKNGPLAVAINAVYMQTY+GGVSCPY+CS++L+HGVLLVGYGS Y
Sbjct: 249 SVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESY 308
Query: 325 APIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
APIR+K+KPYWIIKNSWGE+WGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 309 APIRMKQKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAA 355
>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
Length = 367
Score = 546 bits (1406), Expect = e-153, Method: Compositional matrix adjust.
Identities = 269/368 (73%), Positives = 300/368 (81%), Gaps = 14/368 (3%)
Query: 8 LFLVSLVVFSAVSSGTL-IDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKK 66
LFL+SL+VF+ SS D D LIRQVT D+ NN LL AEHHFSLFK K
Sbjct: 4 LFLLSLLVFTIFSSSAFAFSDEDPLIRQVTSESDD-------NNNHLLNAEHHFSLFKSK 56
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK 126
F K YA+QEEHDHR +FKANLRRA RHQ LDP+A HGIT+FSDLTP+EFRRTYLGL +
Sbjct: 57 FGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGLHKP 116
Query: 127 LRLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
PK +APILPT+DLP DFDWREKGAV VK+QGSCGSCWSFSTTGA+EGA+FLAT
Sbjct: 117 K--PKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLAT 174
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G+LVSLSEQQLVDCDHECD E+ CD+GC GGLM +AFEYTLKAGGL RE+DYPYTG
Sbjct: 175 GELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTG-- 232
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
R C FDKSKIAASV N+SVV LDEDQIAANLVK+GPLAV IN+ +MQTYIGGVSCP +
Sbjct: 233 RNGQCHFDKSKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQTYIGGVSCPLV 292
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDS 364
C + DHGVLLVGYGSAG+APIRLK KPYWIIKNSWGE WGE+GYYKICRG+ N+CGVD+
Sbjct: 293 CFKHQDHGVLLVGYGSAGFAPIRLKAKPYWIIKNSWGEHWGEHGYYKICRGQHNICGVDA 352
Query: 365 MVSTVAAA 372
MVSTV AA
Sbjct: 353 MVSTVTAA 360
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 258/343 (75%), Positives = 291/343 (84%), Gaps = 5/343 (1%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LIRQV D + E + LL AEHHF+ FK KF K YA++EEHD RF +FK+NL
Sbjct: 24 DILIRQVVP--DAVGEAAEKEEDHLLNAEHHFASFKAKFGKKYATKEEHDRRFGVFKSNL 81
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RRA H KLDPSA HG+T+FSDLTPAEFRR +LG + LRLP +A +APILPT DLP DF
Sbjct: 82 RRARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGFK-PLRLPANAQKAPILPTKDLPKDF 140
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+KGAV VKDQG+CGSCWSFSTTGALEGA++LATG+LVSLSEQQLVDCDH CDPEE
Sbjct: 141 DWRDKGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEY 200
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
G+CDSGCNGGLMN+AFEY L++GG+ +E+DYPYTG D CKFDK+K+AA+V+N+SVVS
Sbjct: 201 GACDSGCNGGLMNNAFEYILQSGGVQKEKDYPYTGRDG--TCKFDKTKVAATVSNYSVVS 258
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
LDEDQIAANLVKNGPLAV INAV+MQTYIGGVSCPYIC + LDHGVL+VGYG YAPIR
Sbjct: 259 LDEDQIAANLVKNGPLAVGINAVFMQTYIGGVSCPYICGKHLDHGVLIVGYGEGAYAPIR 318
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
K KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 319 FKNKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 361
>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
Length = 363
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 263/345 (76%), Positives = 290/345 (84%), Gaps = 12/345 (3%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D D LIRQV E+ +N +L AEHHFSLFK K+ K YASQEEHDHR +FKA
Sbjct: 23 DDDPLIRQVVS---------ETDDNHMLNAEHHFSLFKSKYGKIYASQEEHDHRLKVFKA 73
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPA 146
NLRRA RHQ LDP+A HGITQFSDLTP+EFRRTYLGL K R +A +APILPT+DLP
Sbjct: 74 NLRRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLH-KPRPKLNAQKAPILPTSDLPE 132
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
DFDWREKGAV VK+QGSCGSCWSFSTTGA+EGA+FLATG+LVSLSEQQLVDCDHECD E
Sbjct: 133 DFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAE 192
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
E CD+GCNGGLM +AFEYTLKAGGL RE+DYPYTG D C FDKSKIAASVANFSV
Sbjct: 193 EKSECDAGCNGGLMTTAFEYTLKAGGLQREKDYPYTGRDG--KCHFDKSKIAASVANFSV 250
Query: 267 VSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAP 326
+ LDEDQIAANLVK+GPLAV INA +MQTY+ GVSCP IC +R DHGVLLVGYGSAG+AP
Sbjct: 251 IGLDEDQIAANLVKHGPLAVGINAAWMQTYMRGVSCPLICFKRQDHGVLLVGYGSAGFAP 310
Query: 327 IRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
IRLKEKPYWIIKNSWGE+WGE+GYYKICRG N+CGVD+MVSTV A
Sbjct: 311 IRLKEKPYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVSTVTA 355
>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 543 bits (1398), Expect = e-152, Method: Compositional matrix adjust.
Identities = 265/343 (77%), Positives = 300/343 (87%), Gaps = 11/343 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LIRQV G++ LL AEHHF+ FK KF K YA+QEEHD+RF++FKANL
Sbjct: 30 DPLIRQVVSEGED----------HLLNAEHHFTTFKSKFGKNYATQEEHDYRFSVFKANL 79
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RA +HQ +DP+A HG+T+FSDLTP EFRR LGL+R+LRLP DA++APILPT DLP DF
Sbjct: 80 LRAKKHQIMDPTAAHGVTKFSDLTPKEFRRQLLGLKRRLRLPTDANKAPILPTGDLPTDF 139
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+ GAV VKDQGSCGSCWSFS TGALEGA++LATG+LVSLSEQQLVDCDHECDPEE
Sbjct: 140 DWRDHGAVTSVKDQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEY 199
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
G+CDSGC+GGLMN+AFEY LKAGGL RE+DYPYTG DRG ACKF+KSK+AASV+NFSVVS
Sbjct: 200 GACDSGCSGGLMNNAFEYALKAGGLEREKDYPYTGNDRG-ACKFEKSKVAASVSNFSVVS 258
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
LDEDQIAANLVK+GPL+VAINAV+MQTYIGGVSCPYICS+ DHGVLLVGYG+AGYAPIR
Sbjct: 259 LDEDQIAANLVKHGPLSVAINAVFMQTYIGGVSCPYICSKHQDHGVLLVGYGAAGYAPIR 318
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
KEKP+WIIKNSWGE+WGENGYYKICR RN+CGVDSMVSTVAA
Sbjct: 319 FKEKPFWIIKNSWGENWGENGYYKICRARNICGVDSMVSTVAA 361
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 543 bits (1398), Expect = e-152, Method: Compositional matrix adjust.
Identities = 256/345 (74%), Positives = 296/345 (85%), Gaps = 13/345 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LIRQV D + + LL AEHHF+ FK KF+K+YA++EEHD+RF +FKANL
Sbjct: 22 DFLIRQVVD----------NEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANL 71
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+A HQKLDP+A HGIT+FSDLT +EFRR +LGL ++LRLP A +APILPT +LP DF
Sbjct: 72 IKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPEDF 131
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWREKGAV PVKDQGSCGSCW+FSTTGALEGA++LATGKLVSLSEQQLVDCDH CDPEE
Sbjct: 132 DWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEA 191
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
GSCDSGCNGGLMN+AFEY L++GG+++E+DY YTG D +CKFDKSK+ ASV+NFSVVS
Sbjct: 192 GSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD--GSCKFDKSKVVASVSNFSVVS 249
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPI 327
LDE+QIAANLVKNGPLAVAINA +MQ Y+ GVSCPY+C++ RLDHGVLLVG+G YAPI
Sbjct: 250 LDEEQIAANLVKNGPLAVAINAAWMQAYMSGVSCPYVCAKARLDHGVLLVGFGKGAYAPI 309
Query: 328 RLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
RLKEKPYWIIKNSWG++WGE GYYKICRGRNVCGVDSMVSTVAAA
Sbjct: 310 RLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAA 354
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 542 bits (1397), Expect = e-152, Method: Compositional matrix adjust.
Identities = 257/341 (75%), Positives = 290/341 (85%), Gaps = 8/341 (2%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
LIRQV E ++LL AEHHF+ FK KF K YA++EEHDHRF +FK+NLRR
Sbjct: 32 LIRQVVPDVGEA-----EEEDNLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNLRR 86
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDW 150
A H KLDPSA HG+T+FSDLTPAEFRR +LGL+ LR P A +APILPT DLP DFDW
Sbjct: 87 ARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGLK-PLRFPAHAQKAPILPTKDLPKDFDW 145
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
R+KGAV VKDQG+CGSCWSFSTTGALEGA++LATG+LVSLSEQQLVDCDH CDPEE G+
Sbjct: 146 RDKGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGA 205
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD 270
CDSGCNGGLMN+AFEY L++GG+ +E+DYPYTG D CKFDK+K+AA+V+N+SVVSLD
Sbjct: 206 CDSGCNGGLMNNAFEYILQSGGVQKEKDYPYTGRDG--TCKFDKTKVAATVSNYSVVSLD 263
Query: 271 EDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLK 330
E+QIAANLVKNGPLAVAINAV+MQTY+GGVSCPYIC + LDHGVLLVGYG YAPIR K
Sbjct: 264 EEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFK 323
Query: 331 EKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 324 NKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 364
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 257/359 (71%), Positives = 303/359 (84%), Gaps = 15/359 (4%)
Query: 15 VFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQ 74
V +AV+ T DD +IRQV D ++ LL AEHHF+ FK KF+K+YA++
Sbjct: 15 VATAVTDDTNNDDF--IIRQVVDNEED----------HLLNAEHHFTSFKSKFSKSYATK 62
Query: 75 EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD 134
EEHD+RF +FK+NL +A HQ DP+A HGIT+FSDLT +EFRR +LGL+++LRLP A
Sbjct: 63 EEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQ 122
Query: 135 QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQ 194
+APILPT +LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGA++LATGKLVSLSEQ
Sbjct: 123 KAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQ 182
Query: 195 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDK 254
QLVDCDH CDPE+ GSCDSGCNGGLMN+AFEY L++GG+++E+DY YTG D +CKFDK
Sbjct: 183 QLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD--GSCKFDK 240
Query: 255 SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHG 313
SK+ ASV+NFSVV+LDEDQIAANLVKNGPLAVAINA +MQTY+ GVSCPY+C++ RLDHG
Sbjct: 241 SKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYVCAKSRLDHG 300
Query: 314 VLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
VLLVG+G YAPIRLKEKPYWIIKNSWG++WGE GYYKICRGRNVCGVDSMVSTVAAA
Sbjct: 301 VLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAA 359
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 258/343 (75%), Positives = 296/343 (86%), Gaps = 14/343 (4%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LIRQV D ++ +L AEHHFS FK KF+K YA++EEHD+RF +FK+N+
Sbjct: 1 DLLIRQVVDDNED----------HVLNAEHHFSTFKSKFSKTYATKEEHDYRFGVFKSNV 50
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RRA H KLDPSA HG+T+FSDLTP+EFRR +LGL+ LRLP+ A +APILPT+DLP DF
Sbjct: 51 RRAKLHAKLDPSAVHGVTKFSDLTPSEFRRQFLGLK-PLRLPEHAQKAPILPTHDLPEDF 109
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+KGAV VK+QGSCGSCW+FSTTGALEG++FLATG+LVSLS+QQLVDCDH CDPE+
Sbjct: 110 DWRDKGAVTHVKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQY 169
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
G+CDSGCNGGLMN+AFEY L++GG+ REEDYPYTG DRG A D++ AASV+NFSVVS
Sbjct: 170 GACDSGCNGGLMNNAFEYILESGGVQREEDYPYTGRDRGPA--IDEAN-AASVSNFSVVS 226
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
LDEDQI+ANLVKNGPLA+ INAV+MQTYIGGVSCPYIC + LDHGVLLVGYG AGYAPIR
Sbjct: 227 LDEDQISANLVKNGPLAIGINAVFMQTYIGGVSCPYICGKNLDHGVLLVGYGKAGYAPIR 286
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA
Sbjct: 287 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 329
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 262/370 (70%), Positives = 300/370 (81%), Gaps = 18/370 (4%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
S TV + S + SAVS D+ D LIRQV G D+ L AE HF
Sbjct: 16 SATVAYGVSSDQINSAVS-----DEEDILIRQVVSGADD----------RPLTAEQHFQD 60
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
FK KF K Y + EEHD+RF +FKANLR+A RHQKLDP A HG+T+FSDLT +EFR ++G
Sbjct: 61 FKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDPDAVHGVTRFSDLTESEFRENFVG 120
Query: 123 LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L R LRLP DA QAPILPT++L +DFDWR++GAV PVKDQGSCGSCWSFS GALEGANF
Sbjct: 121 LNR-LRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALEGANF 179
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
L+TGKL+SLSEQQLVDCDHECDPEE G+CD+GCNGGLM SAFEY +KAGGL REEDYPYT
Sbjct: 180 LSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKAGGLEREEDYPYT 239
Query: 243 GTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSC 302
GTDRG +CKF KIAAS ANFSV+S D DQIAANLVKNGPLA+ INAV+MQTY+ G+SC
Sbjct: 240 GTDRG-SCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQTYMKGISC 298
Query: 303 PYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
PYICS+R LDHGVLLVGYG+AG+APIRLKEKPYWIIKNSWGE+WGENGYY IC+G+N+CG
Sbjct: 299 PYICSKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWGENGYYFICKGKNICG 358
Query: 362 VDSMVSTVAA 371
+SMVS+VAA
Sbjct: 359 SESMVSSVAA 368
>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 365
Score = 539 bits (1388), Expect = e-151, Method: Compositional matrix adjust.
Identities = 265/366 (72%), Positives = 302/366 (82%), Gaps = 12/366 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
LFL+SL F+ SS D D LIRQV +S E+ ++ LL AEHHFSLFK KF
Sbjct: 4 LFLLSLPRFALFSSAIAFPDEDPLIRQV-------VSETETDDSHLLNAEHHFSLFKSKF 56
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YAS+EEHDHRF +FKANLRRA +Q LDPSA HGIT+FSDLTP+EFRRTYLGL +
Sbjct: 57 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKPK 116
Query: 128 RLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
PK +A++APILPT+DLPAD+DWR+ GAV VK+QGSCGSCWSFSTTGA+EGA+FLATG
Sbjct: 117 --PKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATG 174
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDHECD E+ SCD+GC GGLM +AFEYTLKAGGL E+DYPYTG D
Sbjct: 175 ELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDG 234
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
C FDKSKIAA+V NFSV+ LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC
Sbjct: 235 --KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLIC 292
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+R DHGVLLVGYGS G+APIRLKEK YWIIKNSWGE+WGE+GYYKICRG N+CGVD+MV
Sbjct: 293 FKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMV 352
Query: 367 STVAAA 372
STV AA
Sbjct: 353 STVTAA 358
>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
Length = 364
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 260/346 (75%), Positives = 292/346 (84%), Gaps = 12/346 (3%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
DD + LIRQV + GDE LL AEHHFS FK KF+K YA++EEHD+RF +FK
Sbjct: 25 DDDNILIRQVVEDGDE----------HLLNAEHHFSAFKTKFSKTYATKEEHDYRFGVFK 74
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+NL RA HQ+LDPSA HG+T+FSDLTP+EFR +LGL+ L LP DA APILPT++LP
Sbjct: 75 SNLLRAKSHQELDPSAIHGVTKFSDLTPSEFRSQFLGLK-PLSLPSDAHNAPILPTDNLP 133
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR+ GAV VK+QG+ GSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDHECDP
Sbjct: 134 KDFDWRDHGAVTNVKNQGTGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDP 193
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
+ +CDSGCNGGLM +AF YT KAGGL+REEDY YTG DRG CKFDKSKIAASV+NFS
Sbjct: 194 DLNDACDSGCNGGLMTTAFGYTKKAGGLVREEDYLYTGRDRG-PCKFDKSKIAASVSNFS 252
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
VVSLDEDQIAANLVKNGPL+V INAVYMQTYIGGVSCP+IC + LDHGVLLVGYG+ GYA
Sbjct: 253 VVSLDEDQIAANLVKNGPLSVGINAVYMQTYIGGVSCPFICGKHLDHGVLLVGYGAGGYA 312
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
PIR KEKPYWIIKNSWGE+WGENGYYKICRG N+CGVDSMVSTV A
Sbjct: 313 PIRFKEKPYWIIKNSWGENWGENGYYKICRGPNMCGVDSMVSTVIA 358
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 537 bits (1383), Expect = e-150, Method: Compositional matrix adjust.
Identities = 253/347 (72%), Positives = 291/347 (83%), Gaps = 8/347 (2%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D LIRQV DE E ++ LL EHHF LFK KF + Y ++EEH++R T+FK
Sbjct: 21 DSSDPLIRQVVQN-DET----EIESDPLLDPEHHFKLFKNKFGRTYDTEEEHEYRLTVFK 75
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+NLRRA RHQ LDP+A HG+T+FSDLTP+EFR+ YLGL+ KL+LP DA++APILPT++LP
Sbjct: 76 SNLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLP 135
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR+KGAV PVK+QGSCGSCWSFSTTGALEG++FL TG+LVSLSEQQLVDCDHECDP
Sbjct: 136 QDFDWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDP 195
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
E SCDSGCNGGLMN+AFEY LKAGGL +E DYPYTG R CKFDKSKIAASVANFS
Sbjct: 196 AEYNSCDSGCNGGLMNNAFEYILKAGGLQKEADYPYTG--RDGTCKFDKSKIAASVANFS 253
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGY 324
VVS DEDQIAANLV NGPLA+ INA +MQTYIG VSCPYICS+ ++DHGVLLVGYGSAGY
Sbjct: 254 VVSTDEDQIAANLVTNGPLAIGINAAWMQTYIGQVSCPYICSKTKMDHGVLLVGYGSAGY 313
Query: 325 APIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
AP+R KEKPYWIIKNSWGE WGE+GYYK+C G N CG+D+MVS V +
Sbjct: 314 APLRFKEKPYWIIKNSWGEDWGEDGYYKLCSGYNACGMDTMVSAVVS 360
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 536 bits (1382), Expect = e-150, Method: Compositional matrix adjust.
Identities = 269/365 (73%), Positives = 319/365 (87%), Gaps = 15/365 (4%)
Query: 9 FLVSLVVFSAVSSGTLIDDVDQ-LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
FL++L++FS V++ T D+ D LIRQVTD HE ++ LL AEHHF+ FK KF
Sbjct: 5 FLLALLLFSVVATATKDDNNDDFLIRQVTD--------HE--DDQLLNAEHHFTTFKSKF 54
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
+K+YA++EEHD+RF +FK+NL++A HQKLDPSA HG+T+FSDLT +EFRR +LGL+++L
Sbjct: 55 SKSYATKEEHDYRFGVFKSNLKKAKLHQKLDPSAEHGVTKFSDLTASEFRRQFLGLKKRL 114
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
RLP A +APILPTN+LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGAN+LATGK
Sbjct: 115 RLPAHAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGK 174
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQQLVDCDH CDP+E SCDSGCNGGLMN+AFEY L++GG++RE+DY YTG D
Sbjct: 175 LVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLMNNAFEYLLQSGGVVREQDYSYTGRD-- 232
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS 307
+CKFDKSKIAASV+NFSVVS+DEDQIAANLVKNGPLAVAINA +MQTY+ GVSCPYIC+
Sbjct: 233 GSCKFDKSKIAASVSNFSVVSVDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYICA 292
Query: 308 R-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+ RLDHGVLLVG+G+ G+APIRLKEKPYWIIKNSWG++WGE GYYKICRGRN+CGVDSMV
Sbjct: 293 KSRLDHGVLLVGFGN-GFAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRGRNICGVDSMV 351
Query: 367 STVAA 371
STVAA
Sbjct: 352 STVAA 356
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 249/343 (72%), Positives = 286/343 (83%), Gaps = 11/343 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D +IRQV + LL AE HFS F ++ K+YA + EH +RF++FK+NL
Sbjct: 24 DPVIRQVVSDDQQ----------QLLSAEAHFSSFLSRYGKSYADEAEHAYRFSVFKSNL 73
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RRA RHQ+LDP+A HG+T+F+DLTP+EFRRTYLGLRR+ R APILPTN+LPADF
Sbjct: 74 RRARRHQRLDPTAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADF 133
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+ GAV PVK+QGSCGSCWSFS GALEGAN+L+TG LVSLSEQQLVDCDHECD EP
Sbjct: 134 DWRDHGAVTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEP 193
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
SCD GCNGGLM +AFEY LK+GGL RE DYPYTGTDRG CKF+K+KI+A +NFSVVS
Sbjct: 194 DSCDQGCNGGLMTTAFEYILKSGGLEREADYPYTGTDRG-TCKFNKAKISAVASNFSVVS 252
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
+DEDQIAANLVK+GPLAV INAV+MQTY+GGVSCPYIC + LDHGVLLVGYGSAG+APIR
Sbjct: 253 IDEDQIAANLVKHGPLAVGINAVFMQTYVGGVSCPYICGKHLDHGVLLVGYGSAGFAPIR 312
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
KEKPYWIIKNSWGE+WGENGYYKICRGRNVCGVDSMVS+V+A
Sbjct: 313 FKEKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSSVSA 355
>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
Length = 363
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 262/366 (71%), Positives = 298/366 (81%), Gaps = 14/366 (3%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
LFL+SL+ F SS D D LIRQV E+ ++ LL AEHHFSLFK KF
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVS---------ETDDSHLLNAEHHFSLFKSKF 54
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YAS+EEHDHRF +FKANLRRA +Q LDPSA HGIT+FSDLTP+EFRRTYLGL +
Sbjct: 55 GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKPK 114
Query: 128 RLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
PK +A++APILPT+DLPADFDWR+ GAV VK+QGSCGSCWSFSTTGA+EGA+FLATG
Sbjct: 115 --PKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATG 172
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDHECDPE+ +CD+GC GG +AFEYTLKAGGL E+DYPYTG D
Sbjct: 173 ELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLEKDYPYTGKDG 232
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
C FDKSKI A+V NFSV+ LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC
Sbjct: 233 --KCHFDKSKICAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLIC 290
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+R DHGVLLVGYGS G+APIRLKEK YWIIKNSWGE+WGE+GYYKICRG N+CGVD+MV
Sbjct: 291 FKRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMV 350
Query: 367 STVAAA 372
STV AA
Sbjct: 351 STVTAA 356
>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
Length = 355
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 258/344 (75%), Positives = 294/344 (85%), Gaps = 12/344 (3%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D D LIRQV +S E+ ++ LL AEHHFSLFK KF K YAS+EEHDHRF +FKA
Sbjct: 23 DEDPLIRQV-------VSETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKA 75
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPK-DADQAPILPTNDLP 145
NLRRA RHQ LDPSA HGIT+FSDLTP+EFRRTYLGL + PK +A++APILPT+DLP
Sbjct: 76 NLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKPK--PKLNAEKAPILPTSDLP 133
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
AD+DWR+ GAV VK+QGSCGSCWSFSTTGA+EGA+FLATG+LVSLSEQQLVDCDHECDP
Sbjct: 134 ADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDP 193
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
E+ SCD+GC+GGLM +AFEYTLKAGGL RE+DYPYTG + C FDKSKIAA+V NFS
Sbjct: 194 EQQDSCDAGCSGGLMTTAFEYTLKAGGLQREKDYPYTG--KXGKCHFDKSKIAAAVTNFS 251
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+ LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC +R DHGVLLVGYGS G+A
Sbjct: 252 VIGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFKRQDHGVLLVGYGSHGFA 311
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTV 369
PIRLKEK YWIIKNSWGE+WGE+GYYKICRG N+CGVD+MVSTV
Sbjct: 312 PIRLKEKAYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVSTV 355
>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
Length = 368
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 254/356 (71%), Positives = 289/356 (81%), Gaps = 13/356 (3%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
AVS + + + +IRQV ES +D L AE HF FK +F K YA+ EEH
Sbjct: 21 AVSEASFDESDNLMIRQV-----------ESHVDDFLNAERHFEKFKARFQKTYATPEEH 69
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP 137
D+RF +FKANLRRA RHQ LDPSA HG+TQFSDLTPAEFRR YLGL LR P DA QAP
Sbjct: 70 DYRFNVFKANLRRAKRHQLLDPSAVHGVTQFSDLTPAEFRRDYLGLN-PLRFPADAQQAP 128
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
ILPT++LP DFDWRE GAV PVK+QG+CGSCWSFST GALEGA+FLATG L SLSEQQLV
Sbjct: 129 ILPTDNLPTDFDWRENGAVTPVKNQGNCGSCWSFSTIGALEGAHFLATGNLESLSEQQLV 188
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCD ECDPEE +CD GCNGGLMN+AFEY LK GG+ RE+DYPYTG DR CKF++SKI
Sbjct: 189 DCDRECDPEEYDACDDGCNGGLMNNAFEYILKTGGVEREKDYPYTGRDRS-PCKFNESKI 247
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLV 317
ASV+NFSVVS+DEDQIAANLVKNGPLAV INAV+MQTY GVSCP++CS LDHGVLLV
Sbjct: 248 VASVSNFSVVSIDEDQIAANLVKNGPLAVGINAVFMQTYTAGVSCPFLCSGELDHGVLLV 307
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
GYGSAGY+PIR KEKPYWI+KNSW + WGE+GYY+ICRG+N+CGVDSMVS+V AA+
Sbjct: 308 GYGSAGYSPIRFKEKPYWILKNSWSKYWGEHGYYRICRGQNMCGVDSMVSSVVAAI 363
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 266/375 (70%), Positives = 303/375 (80%), Gaps = 17/375 (4%)
Query: 4 KTVVLFLVSLVVF-----SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH 58
+ V FL++ + SAV SG + IRQV E + LL AEH
Sbjct: 3 RVVFFFLIAATLLAVSLGSAVISGEVNYGFVNPIRQVVP---------EENDEHLLNAEH 53
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HFSLFK K+ K YA+QEEHDHRF +FKANLRRA R+Q LDPSA HG+TQFSDLTP EFRR
Sbjct: 54 HFSLFKSKYEKTYATQEEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRR 113
Query: 119 TYLGLRRK-LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+LGL+R+ RLP D APILPT+DLP +FDWRE+GAV PVK+QG CGSCWSFS GAL
Sbjct: 114 KFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGAL 173
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EGA+FLAT +LVSLSEQQLVDCDHECDP + SCDSGC+GGLMN+AFEY LKAGGLM+EE
Sbjct: 174 EGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEE 233
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DYPYTG D ACKFDKSKIAASV+NFSVVS DEDQIAANLVK+GPLA+AINA++MQTYI
Sbjct: 234 DYPYTGRDNT-ACKFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQTYI 292
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG- 356
GGVSCPY+CS+ DHGVLLVG+GS+GYAPIRLKEKPYWIIKNSWG WGE+GYYKICRG
Sbjct: 293 GGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGP 352
Query: 357 RNVCGVDSMVSTVAA 371
N+CG+D+MVSTVAA
Sbjct: 353 HNMCGMDTMVSTVAA 367
>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
Length = 371
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 250/372 (67%), Positives = 302/372 (81%), Gaps = 20/372 (5%)
Query: 10 LVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNK 69
L SL++ + ++ + D D LIRQV G++ + LL A+HHF+LFK K+ K
Sbjct: 6 LPSLLIHALTAACVVRADEDPLIRQVVSDGED---------DALLNADHHFTLFKSKYGK 56
Query: 70 AYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRL 129
+YA+QEEHD+R ++FKANLRRA RHQ LDPSA HG+T+FSDLTP EFRRTYLG+R+
Sbjct: 57 SYATQEEHDYRLSVFKANLRRAKRHQMLDPSAVHGVTKFSDLTPKEFRRTYLGIRKSSSS 116
Query: 130 --------PKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
P DA A ILPT+DLP DF+WR+ GAV VKDQG CGSCWSFSTTG LEG N
Sbjct: 117 KQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGAVTGVKDQGLCGSCWSFSTTGTLEGTN 176
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FLATG+L+SL+EQ+LVDCDH CDP++ G+CD+GCNGGLM +A+EY L++GGL +E+DYPY
Sbjct: 177 FLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQSGGLEKEKDYPY 236
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
TG D CKFDKSKIAA+VANFSVVSLDEDQIAANLVK+GPL+V IN+++MQTYIGGVS
Sbjct: 237 TGRD--GTCKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQTYIGGVS 294
Query: 302 CPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
CPYICS++ LDHGVL+VGYG+AGYAPIR K+KPYWIIKNSWGE+WGE GYYKICRG N+C
Sbjct: 295 CPYICSKKNLDHGVLIVGYGAAGYAPIRFKDKPYWIIKNSWGENWGEEGYYKICRGNNIC 354
Query: 361 GVDSMVSTVAAA 372
GVDSMVS+V AA
Sbjct: 355 GVDSMVSSVTAA 366
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 262/375 (69%), Positives = 301/375 (80%), Gaps = 17/375 (4%)
Query: 4 KTVVLFLVSLVVF-----SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH 58
+ V FL++ + S V SG + D IRQV E + LL AEH
Sbjct: 3 RVVFFFLIAATLLAGSLGSTVISGEVTDGFVNPIRQVVP---------EENDEQLLNAEH 53
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF+LFK K+ K YA+Q EHDHRF +FKANLRRA R+Q LDPSA HG+TQFSDLTP EFRR
Sbjct: 54 HFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRR 113
Query: 119 TYLGLRRK-LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+LGL+R+ RLP D APILPT+DLP +FDWRE+GAV PVK+QG CGSCWSFS GAL
Sbjct: 114 KFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGAL 173
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EGA+FLAT +LVSLSEQQLVDCDHECDP + SCDSGC+GGLMN+AFEY LKAGGLM+EE
Sbjct: 174 EGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEE 233
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DYPYTG D ACKFDKSKI ASV+NFSVVS DEDQIAANLV++GPLA+AINA++MQTYI
Sbjct: 234 DYPYTGRDHT-ACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYI 292
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG- 356
GGVSCPY+CS+ DHGVLLVG+GS+GYAPIRLKEKPYWIIKNSWG WGE+GYYKICRG
Sbjct: 293 GGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGP 352
Query: 357 RNVCGVDSMVSTVAA 371
N+CG+D+MVSTVAA
Sbjct: 353 HNMCGMDTMVSTVAA 367
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 259/346 (74%), Positives = 294/346 (84%), Gaps = 15/346 (4%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D VD +I QV D ++ LGAEHHF FK++F K YA++EEH +RF +FK
Sbjct: 24 DGVDPMICQVVD-------------DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFK 70
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
+N+ RA RHQ LDPSA HG+TQFSDLTP EF+ + LGLR + LP DAD APILPT++LP
Sbjct: 71 SNMHRARRHQLLDPSAVHGVTQFSDLTPMEFQHSVLGLR-GVGLPSDADSAPILPTDNLP 129
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWR GAV PVK+QGSCGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDH+CDP
Sbjct: 130 KDFDWRGHGAVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDP 189
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
EE GSC SGCNGGLMNSAFEY L GG+MREEDYPY+GT+ G CKFDK+KIAASVANFS
Sbjct: 190 EEAGSCGSGCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGG-TCKFDKAKIAASVANFS 248
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
VVS DEDQIAANLVKNGPLAVAINAVYMQTY+GGVSCPY+CS++L+HGVLLVGYGS YA
Sbjct: 249 VVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYA 308
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
PIR+K+KPYWIIKNSWGE+WGENGYYKICRGRN+CGVDSMVSTVAA
Sbjct: 309 PIRMKQKPYWIIKNSWGENWGENGYYKICRGRNICGVDSMVSTVAA 354
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 253/354 (71%), Positives = 284/354 (80%), Gaps = 18/354 (5%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D D LIRQV GGD+ N L AE HF F ++F K+Y EEH +R +IFK
Sbjct: 22 DTEDPLIRQVVPGGDD--------NELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFK 73
Query: 86 ANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILP 140
ANLRRA RHQ LDPSA HG+T+FSDLTPAEFRRTYLGLR+ R L K A++AP+LP
Sbjct: 74 ANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLP 133
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
T+ LP DFDWR+ GAV PVK+QGSCGSCWSFST+GALEGA++LATGKL LSEQQ+VDCD
Sbjct: 134 TDGLPDDFDWRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCD 193
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
H CD EP SCDSGCNGGLM +AF Y KAGGL E+DYPYTG+D CKFDKSKI AS
Sbjct: 194 HVCDTSEPDSCDSGCNGGLMTNAFSYLQKAGGLESEKDYPYTGSD--DKCKFDKSKIVAS 251
Query: 261 VANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYG 320
V NFSVVS+DE QIAANL+K+GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGYG
Sbjct: 252 VQNFSVVSVDEGQIAANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRTLDHGVLLVGYG 311
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
+AG+APIRLK+KPYWIIKNSWGE+WGENGYYKICRG RN CGVDSMVSTV+A
Sbjct: 312 AAGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSA 365
>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
Length = 371
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 251/355 (70%), Positives = 285/355 (80%), Gaps = 20/355 (5%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDL-LGAEHHFSLFKKKFNKAYASQEEHDHRFTIF 84
D D LIRQV GGD+ NDL L AE HF F ++F K+Y +EH +R ++F
Sbjct: 22 DAEDPLIRQVVPGGDD---------NDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVF 72
Query: 85 KANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPIL 139
KANLRRA RHQ LDPSA HG+T+FSDLTPAEFRRTYLGLR+ R L + A +AP+L
Sbjct: 73 KANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVL 132
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
PT+ LP DFDWR+ GAVGPVK+QGSCGSCWSFS +GALEGA++LATGKL LSEQQ VDC
Sbjct: 133 PTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDC 192
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
DHECD EP SCDSGCNGGLM +AF Y KAGGL E+DYPYTG+D CKFDKSKI A
Sbjct: 193 DHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDG--KCKFDKSKIVA 250
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGY 319
SV NFSVVS+DE QI+ANL+K+GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGY
Sbjct: 251 SVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGY 310
Query: 320 GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
G++G+APIRLK+KPYWIIKNSWGE+WGENGYYKICRG RN CGVDSMVSTV+A
Sbjct: 311 GASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSA 365
>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 377
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 251/354 (70%), Positives = 280/354 (79%), Gaps = 18/354 (5%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D D LIRQV G D +NDL HF+ F ++F K Y EEH HR ++FKA
Sbjct: 28 DEDPLIRQVVGGAD-------GDDNDLE-LSSHFTSFVQRFGKTYKDAEEHAHRLSVFKA 79
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILPT 141
NLRRA RHQ LDPSA HGIT+FSDLTPAEFRRT+LGL+ R + A AP+LPT
Sbjct: 80 NLRRARRHQLLDPSAEHGITKFSDLTPAEFRRTFLGLKTSRRSFLREIGGSAHDAPVLPT 139
Query: 142 NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDH 201
+ LP DFDWR+ GAVGPVK+QGSCGSCWSFS +GALEGAN+LATGK+ LSEQQ VDCDH
Sbjct: 140 DGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFVDCDH 199
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
ECDPEEP SCD+GCNGGLM SAF Y LK+GGL RE+DYPYTG D CKFDKSKI ASV
Sbjct: 200 ECDPEEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGRD--GTCKFDKSKIVASV 257
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGS 321
NFSVVS+DE+QIAANLVK+GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGYG+
Sbjct: 258 QNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRSLDHGVLLVGYGA 317
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAAA 372
+G+AP RLK KPYW+IKNSWGE+WGE GYYKICRG RN CGVDSMVSTVAAA
Sbjct: 318 SGFAPSRLKNKPYWVIKNSWGENWGEKGYYKICRGSNVRNKCGVDSMVSTVAAA 371
>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
Length = 371
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 250/355 (70%), Positives = 284/355 (80%), Gaps = 20/355 (5%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDL-LGAEHHFSLFKKKFNKAYASQEEHDHRFTIF 84
D D LIRQV GGD+ NDL L AE HF F ++F K+Y +EH +R ++F
Sbjct: 22 DAEDPLIRQVVPGGDD---------NDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVF 72
Query: 85 KANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPIL 139
K NLRRA RHQ LDPSA HG+T+FSDLTPAEFRRTYLGLR+ R L + A +AP+L
Sbjct: 73 KDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVL 132
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
PT+ LP DFDWR+ GAVGPVK+QGSCGSCWSFS +GALEGA++LATGKL LSEQQ VDC
Sbjct: 133 PTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDC 192
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
DHECD EP SCDSGCNGGLM +AF Y KAGGL E+DYPYTG+D CKFDKSKI A
Sbjct: 193 DHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDG--KCKFDKSKIVA 250
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGY 319
SV NFSVVS+DE QI+ANL+K+GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGY
Sbjct: 251 SVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGY 310
Query: 320 GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
G++G+APIRLK+KPYWIIKNSWGE+WGENGYYKICRG RN CGVDSMVSTV+A
Sbjct: 311 GASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSA 365
>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
Length = 376
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 248/351 (70%), Positives = 280/351 (79%), Gaps = 18/351 (5%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV GGDE N L AE HF+ F ++FNK+Y +EH HR ++F ANL
Sbjct: 29 DPLIEQVV-GGDE-------KNELELNAEAHFASFVQRFNKSYRDADEHAHRLSVFTANL 80
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILPTND 143
RRA RHQ+LDPSA HG+T+FSDLTP EFR +LGLR+ R L A AP LPT+
Sbjct: 81 RRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRKYRRSFLKGLSGSAHDAPALPTDG 140
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP +FDWRE GAVGPVKDQGSCGSCWSFST+GALEGA++LATGKL LSEQQ+VDCDHEC
Sbjct: 141 LPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHEC 200
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
DP EP +CD+GCNGGLM +AF Y KAGGL E+DYPYTG RG ACKFDKSKIAA V N
Sbjct: 201 DPSEPRACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYTG--RGGACKFDKSKIAAQVKN 258
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAG 323
FS V++DEDQIAANLVK+GPLA+ INAV+MQTYIGGVSCP+IC R LDHGVLLVGYGSAG
Sbjct: 259 FSTVAVDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPFICGRHLDHGVLLVGYGSAG 318
Query: 324 YAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
YAP+R KEKPYWIIKNSWGE+WGE+GYYKICRG +N CGVDSMVSTV A
Sbjct: 319 YAPLRFKEKPYWIIKNSWGENWGESGYYKICRGAHVKNKCGVDSMVSTVTA 369
>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
Group]
gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
Length = 373
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 247/362 (68%), Positives = 287/362 (79%), Gaps = 18/362 (4%)
Query: 18 AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEH 77
AV++ ++ + + LIRQV GGD+ N L AE HF+ F ++F K+Y +EH
Sbjct: 16 AVAAASVPGEEEPLIRQVVGGGDD--------NELELNAERHFASFVQRFGKSYRDADEH 67
Query: 78 DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKD 132
+R ++FKANLRRA RHQ LDPSA HG+T+FSDLTPAEFRR YLGLR R L
Sbjct: 68 AYRLSVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGS 127
Query: 133 ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
A +AP+LPT+ LP DFDWR+ GAVGPVK+QGSCGSCWSFS +GALEGAN+LATGK+ LS
Sbjct: 128 AHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLS 187
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKF 252
EQQ+VDCDHECD EP SCD+GCNGGLM +AF Y LK+GGL E+DYPYTG D CKF
Sbjct: 188 EQQMVDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKSGGLESEKDYPYTGRD--GTCKF 245
Query: 253 DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDH 312
DKSKI SV NFSVVS+DEDQIAANLVK+GPLA+ INA YMQTYIGGVSCPYIC R LDH
Sbjct: 246 DKSKIVTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDH 305
Query: 313 GVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTV 369
GVLLVGYG++G+APIRLK+K YWIIKNSWGE+WGE+GYYKICRG RN CGVDSMVSTV
Sbjct: 306 GVLLVGYGASGFAPIRLKDKAYWIIKNSWGENWGEHGYYKICRGSNVRNKCGVDSMVSTV 365
Query: 370 AA 371
+A
Sbjct: 366 SA 367
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 243/368 (66%), Positives = 289/368 (78%), Gaps = 10/368 (2%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHE--STNNDLLGAEHHFSLFK 64
L + +FS + + D LIRQVTD E++S + + L AE HF F
Sbjct: 5 TLLFSAFCIFSVIFLSSATKPDDDLIRQVTD---EVVSDPQILDARSALFNAEVHFRHFI 61
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
+++ K Y+ EEH+HRF +FK+NL RA HQKLDP A+HG+T+FSDLT EFR YLGLR
Sbjct: 62 RRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLR 121
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+DA APILPTNDLP DFDWREKGAV VK+QGSCGSCW+FSTTGALEGANFL
Sbjct: 122 APPL--RDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+LVSLSEQQLVDCDHECDP + SCDSGCNGGLM SA++Y LK+GGL +EEDYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
D C F+K+KI A V+NFSVVS+DE QIAANLVKNGPL+V INA +MQTY+GGVSCPY
Sbjct: 240 DG--TCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPY 297
Query: 305 ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVD 363
+CS+R LDHGVLLVGYG+A +APIR+K+KPYW+IKNSWG +WGENGYYK+CRG NVCG++
Sbjct: 298 VCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGIN 357
Query: 364 SMVSTVAA 371
+MVSTVAA
Sbjct: 358 NMVSTVAA 365
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 506 bits (1302), Expect = e-141, Method: Compositional matrix adjust.
Identities = 243/368 (66%), Positives = 289/368 (78%), Gaps = 10/368 (2%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHE--STNNDLLGAEHHFSLFK 64
L + +FS + + D LIRQVTD E++S + + L AE HF F
Sbjct: 5 TLLFSAFCIFSVIFLSSATRPDDDLIRQVTD---EVVSDPQILDARSALFNAEVHFRHFI 61
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
+++ K Y+ EEH+HRF +FK+NL RA HQKLDP A+HG+T+FSDLT EFR YLGLR
Sbjct: 62 RRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLR 121
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+DA APILPTNDLP DFDWREKGAV VK+QGSCGSCW+FSTTGALEGANFL
Sbjct: 122 APPL--RDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+LVSLSEQQLVDCDHECDP + SCDSGCNGGLM SA++Y LK+GGL +EEDYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
D C F+K+KI A V+NFSVVS+DE QIAANLVKNGPL+V INA +MQTY+GGVSCPY
Sbjct: 240 DG--TCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPY 297
Query: 305 ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVD 363
+CS+R LDHGVLLVGYG+A +APIR+K+KPYW+IKNSWG +WGENGYYK+CRG NVCG++
Sbjct: 298 VCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGIN 357
Query: 364 SMVSTVAA 371
+MVSTVAA
Sbjct: 358 NMVSTVAA 365
>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
vulgare]
gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 377
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 245/353 (69%), Positives = 279/353 (79%), Gaps = 18/353 (5%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D + LIRQV G D + +NDL + F F ++F K Y EEH HR ++FKA
Sbjct: 28 DEEPLIRQVVGGADPL-------DNDLE-LDSQFVGFVQRFGKTYRDAEEHAHRLSVFKA 79
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILPT 141
NLRRA RHQ LDPSA HG+T+FSDLTPAEFRRTYLGL+ R + A AP+LPT
Sbjct: 80 NLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAHDAPVLPT 139
Query: 142 NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDH 201
+ LP DFDWR+ GAVGPVK+QGSCGSCWSFS +GALEGAN+LA+GK+ LSEQQLVDCDH
Sbjct: 140 DGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDH 199
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
ECDP EP SCD+GCNGGLM SAF Y LK+GGL RE+DYPYTG D CKFDKSKIAASV
Sbjct: 200 ECDPSEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD--GTCKFDKSKIAASV 257
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGS 321
N+SVV++DE+QIAANLVK GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGYG+
Sbjct: 258 QNYSVVAVDEEQIAANLVKYGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGA 317
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
+G+AP R KEKPYWIIKNSWGE+WG+ GYYKICRG RN CGVDSMVSTV+A
Sbjct: 318 SGFAPSRFKEKPYWIIKNSWGENWGDKGYYKICRGSNVRNKCGVDSMVSTVSA 370
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 242/368 (65%), Positives = 288/368 (78%), Gaps = 10/368 (2%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHE--STNNDLLGAEHHFSLFK 64
L + +FS + + D LIRQVTD E++S + + L AE HF F
Sbjct: 5 TLLFSAFCIFSVIFLSSATRPDDDLIRQVTD---EVVSDPQILDARSALFNAEVHFRHFI 61
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
+++ K Y+ EEH+HRF +FK+NL RA HQKLDP A+HG+T+FSDLT FR YLGLR
Sbjct: 62 RRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEGFRHQYLGLR 121
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+DA APILPTNDLP DFDWREKGAV VK+QGSCGSCW+FSTTGALEGANFL
Sbjct: 122 APPL--RDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+LVSLSEQQLVDCDHECDP + SCDSGCNGGLM SA++Y LK+GGL +EEDYPYTG
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
D C F+K+KI A V+NFSVVS+DE QIAANLVKNGPL+V INA +MQTY+GGVSCPY
Sbjct: 240 DG--TCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGGVSCPY 297
Query: 305 ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVD 363
+CS+R LDHGVLLVGYG+A +APIR+K+KPYW+IKNSWG +WGENGYYK+CRG NVCG++
Sbjct: 298 VCSKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGIN 357
Query: 364 SMVSTVAA 371
+MVSTVAA
Sbjct: 358 NMVSTVAA 365
>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 381
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 246/351 (70%), Positives = 278/351 (79%), Gaps = 19/351 (5%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV GGD + N L AE HF+ F ++F K+Y +EH+HR ++F+ANL
Sbjct: 35 DPLIEQVV-GGD-------AENELELNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANL 86
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILPTND 143
RRA RHQ+LDPSA HGIT+FSDLTP EFR +LGLR+ R + A AP LPT+
Sbjct: 87 RRARRHQRLDPSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDG 146
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP +FDWRE GAVGPVKDQGSCGSCWSFST+GALEGAN+LATGKL LSEQQLVDCDHEC
Sbjct: 147 LPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHEC 206
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
DP EP +CD+GCNGGLM +AF Y KAGGL E+DYPYTG R ACKFDKSKIAA V N
Sbjct: 207 DPSEPRACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYTG--RNSACKFDKSKIAAQVKN 264
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAG 323
FS V++DEDQIAANLVK+GPLA+ INAV+MQTYIGGVSCPYIC R LDH V LVGYGSAG
Sbjct: 265 FSTVAIDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPYICGRHLDH-VFLVGYGSAG 323
Query: 324 YAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
YAP+R KEKPYWIIKNSWGE+WGE+GYYKICRG +N CGVDSMVSTV A
Sbjct: 324 YAPLRFKEKPYWIIKNSWGENWGESGYYKICRGPHVKNKCGVDSMVSTVTA 374
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 243/370 (65%), Positives = 287/370 (77%), Gaps = 9/370 (2%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDV-DQLIRQVTD-GGDEILSHHESTNNDLLGAEHHFSLFK 64
+LFLV + + + ++ V IR+VTD G+ ++ + LL AE HF+ F
Sbjct: 23 LLFLVPTITAHVHEASSDLNAVLPNPIREVTDMDGEGVI---DDLRRGLLNAEAHFAHFV 79
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
KKFNK Y+ EEH RF+IFK NL +A RHQKLD A HGI +FSDLT EF YLGL
Sbjct: 80 KKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDRDAIHGINKFSDLTEEEFHEQYLGLT 139
Query: 125 RKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
R L + APILPT+DLP DFDWRE GAV PVK+QG+CGSCW+FSTTGA+EGANF+
Sbjct: 140 TPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTGAMEGANFM 199
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TGKL+SLSEQQLVDCDHECD EP CDSGCNGGLM +A++Y LKAGGL REEDYPYTG
Sbjct: 200 KTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKAGGLQREEDYPYTG 259
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP 303
D +CKFD +K+AA VANFS VS+DEDQIAANLVKNGPLAV INA +MQTY+GGVSCP
Sbjct: 260 IDG--SCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQTYVGGVSCP 317
Query: 304 YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
Y+C+++ LDHGVLLVGYG+AGYAP RLK KP+WIIKNSWG WGE+GYYK+CRG NVCG+
Sbjct: 318 YVCNKQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWGEDGYYKLCRGHNVCGI 377
Query: 363 DSMVSTVAAA 372
++MVSTVAAA
Sbjct: 378 NTMVSTVAAA 387
>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
Length = 377
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 243/353 (68%), Positives = 279/353 (79%), Gaps = 18/353 (5%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D + LIRQV G D + + E ++ LLG F ++F K Y EEH HR ++FKA
Sbjct: 28 DEEPLIRQVVGGADPLDNDLE-LDSQLLG-------FVQRFGKTYRDAEEHAHRLSVFKA 79
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR-----LPKDADQAPILPT 141
NLRRA RHQ LDPSA HG+T+FSDLTPAEFRRT+LGL+ R + A AP+LPT
Sbjct: 80 NLRRARRHQMLDPSAEHGVTKFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAHDAPVLPT 139
Query: 142 NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDH 201
+ LP DFDWR+ GAVGPVK+QGSC SCWSFS +GALEGAN+LATGK+ LSEQQLVDCDH
Sbjct: 140 DGLPEDFDWRDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDH 199
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
ECDP EP SCD+GCNGGLM SAF Y LK+GGL RE+DYPYTG D CKF+KSKIAASV
Sbjct: 200 ECDPAEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD--GTCKFEKSKIAASV 257
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGS 321
NFSVV++DE+QIAANLV+ GPLA+ INA YMQTYIGGVSCPYIC R LDHGVLLVGYG+
Sbjct: 258 QNFSVVAVDEEQIAANLVEYGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGA 317
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
+G+AP R KEKPYWIIKNSWGE+WG+ GYYKICRG RN CGVDSMVSTV+A
Sbjct: 318 SGFAPSRFKEKPYWIIKNSWGENWGDKGYYKICRGSNVRNKCGVDSMVSTVSA 370
>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
Length = 292
Score = 493 bits (1268), Expect = e-137, Method: Compositional matrix adjust.
Identities = 231/285 (81%), Positives = 255/285 (89%), Gaps = 2/285 (0%)
Query: 88 LRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR-KLRLPKDADQAPILPTNDLPA 146
+RRA RHQ+LDP+A HG+TQFSDLTP EF+RTYLGLR+ K L A +AP+LPTNDLP
Sbjct: 1 MRRARRHQQLDPTAVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
DFDWR+KGAV VK+QGSCGSCWSFST+GALEGANFLATGKL +LSEQQ+VDCDHECD E
Sbjct: 61 DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
EP CD GCNGGLMN+AF+Y K GGL E+DYPYTGTDRG CKFD+SKI ASV NFSV
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKVGGLESEKDYPYTGTDRG-TCKFDESKIKASVHNFSV 179
Query: 267 VSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAP 326
VS+DE+QIAANLVK+GPLA+AINAV+MQTYIGGVSCPYIC + LDHGVLLVGYGSAGYAP
Sbjct: 180 VSIDEEQIAANLVKHGPLAIAINAVFMQTYIGGVSCPYICGKHLDHGVLLVGYGSAGYAP 239
Query: 327 IRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
IRLKEKPYWIIKNSWGE+WGENGYYKICRGRNVCGVDSMVSTV A
Sbjct: 240 IRLKEKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSMVSTVTA 284
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 493 bits (1268), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/349 (68%), Positives = 276/349 (79%), Gaps = 16/349 (4%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV GG+E + L AE HF+ F+++F + Y E +R ++F ANL
Sbjct: 35 DPLIEQVVGGGEE--------EDAQLDAEAHFASFERRFGRTYRDAGERAYRMSVFAANL 86
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR---KLRLPKDADQAPILPTNDLP 145
RRA RHQ+LDP+ATHG+T+FSDLTP EFR +LGLRR + + + +APILPT+ LP
Sbjct: 87 RRARRHQRLDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLP 146
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWRE GAVGPVKDQGSCGSCWSFST+GALEGA+FLATGKL LSEQQ+VDCDHECD
Sbjct: 147 DDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDA 206
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
E +CDSGCNGGLM +AF Y +K+GGL E+DYPY G R + CKFDKSKI A V NFS
Sbjct: 207 SESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAG--RENTCKFDKSKIVAQVKNFS 264
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+S++EDQIAANLVK+GPLA+AINA YMQTYIGGVSCP+IC R LDHGVLLVGYGSAGYA
Sbjct: 265 VISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYA 324
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
PIR KEKPYWIIKNSWGE+WGE GYYKICRG +N CGVDSMVS+V A
Sbjct: 325 PIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTA 373
>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
Length = 319
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 234/314 (74%), Positives = 268/314 (85%), Gaps = 11/314 (3%)
Query: 62 LFKKKFN-KAYASQEEHDHRFTIFKANLRRAARHQKLDPSAT---HGITQFSDLTPAEFR 117
L + KF + YA++EEHDHRF +FK+NLRRA+ PS+T HG+T+FSDLTPAEFR
Sbjct: 7 LSRPKFRPRPYATKEEHDHRFGVFKSNLRRAS----CTPSSTPRVHGVTKFSDLTPAEFR 62
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R +LGL+ +R P A +APILPT DLP DFDWR+KGAV VKDQG CGSCWSFSTTGAL
Sbjct: 63 RQFLGLK-AVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGAL 121
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EGA +LATG+LVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFEY L++GG+ +E+
Sbjct: 122 EGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEK 181
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DYPYTG D CKFDK+K+AA+V+N+SVV LDE+QIAANLVKNGPLAVAINAV+MQTY+
Sbjct: 182 DYPYTGRD--GTCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYV 239
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
GGVSCPYIC + LDHGVLLVGYG YAPIR K KPYWIIKNSWGESWGENGY +ICRGR
Sbjct: 240 GGVSCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYDEICRGR 299
Query: 358 NVCGVDSMVSTVAA 371
NVCGVDSMVSTVAA
Sbjct: 300 NVCGVDSMVSTVAA 313
>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
Length = 367
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 227/343 (66%), Positives = 270/343 (78%), Gaps = 3/343 (0%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D IR+VTD + + LL E HF F +F KAYA+ E + HR +F+ANL
Sbjct: 27 DSGIREVTDTARDESNGRLDAAKALLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANL 86
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RA HQ LDPSA HGITQFSDLT EF++ +LGLR RL ++A++AP+LPTNDLP DF
Sbjct: 87 VRAVSHQALDPSAVHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDF 145
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE GAV VK+QG+CGSCW+FSTTGA+EGA+FL TGKL+SLSEQQLVDCDH CDP +
Sbjct: 146 DWREHGAVTEVKNQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDK 205
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
SCD+GCNGGLM +A++Y +K+GGL E DYPYTG G C+F+ +KI ASVANFS VS
Sbjct: 206 VSCDAGCNGGLMTNAYDYVMKSGGLETETDYPYTGNSNG-KCQFNANKIVASVANFSTVS 264
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPI 327
LDEDQIAANLVK+GPLA+ INAV+MQTYIGGVSCP ICS+ +DHGVLLVGYG+ GYAPI
Sbjct: 265 LDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPIICSKHHIDHGVLLVGYGAKGYAPI 324
Query: 328 RLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 370
R EKPYWIIKNSWG +WGE GYYKICRG +CG+++MVSTVA
Sbjct: 325 RFTEKPYWIIKNSWGATWGEQGYYKICRGHGMCGMNTMVSTVA 367
>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
Length = 330
Score = 476 bits (1224), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/319 (69%), Positives = 261/319 (81%), Gaps = 3/319 (0%)
Query: 53 LLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLT 112
LL E HF F +F KAYA+ E + HR +F+ANL RA HQ LDPSA HGITQFSDLT
Sbjct: 14 LLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSAVHGITQFSDLT 73
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF++ +LGLR RL ++A++AP+LPTNDLP DFDWRE GAV VK+QG+CGSCW+FS
Sbjct: 74 EEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDFDWREHGAVTEVKNQGACGSCWAFS 132
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGA+EGA+FL TGKL+SLSEQQLVDCDH CDP + SCD+GCNGGLM +A++Y +K+GG
Sbjct: 133 TTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKSGG 192
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E DYPYTG G C+F+ +KI ASVANFS VSLDEDQIAANLVK+GPLA+ INAV+
Sbjct: 193 LETETDYPYTGNSNG-KCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVF 251
Query: 293 MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
MQTYIGGVSCP ICS+ +DHGVLLVGYG+ GYAPIR EKPYWIIKNSWG +WGE GYY
Sbjct: 252 MQTYIGGVSCPIICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSWGATWGEQGYY 311
Query: 352 KICRGRNVCGVDSMVSTVA 370
KICRG +CG+++MVSTVA
Sbjct: 312 KICRGHGMCGMNTMVSTVA 330
>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
Length = 364
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 230/349 (65%), Positives = 264/349 (75%), Gaps = 33/349 (9%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D LI QV GG+E + L AE HF+ F+++F + Y
Sbjct: 35 DPLIDQVVGGGEE--------EDAQLDAEAHFASFERRFGRTYPGP-------------- 72
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR---KLRLPKDADQAPILPTNDLP 145
RRA R LDP+ATHG+T+FSDLTP EFR +LGLRR + + + +APILPT+ LP
Sbjct: 73 RRARR---LDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLP 129
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
DFDWRE GAVGPVKDQGSCGSCWSFST+GALEGA+FLATGKL LSEQQ+VDCDHECD
Sbjct: 130 DDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDA 189
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
E +CDSGCNGGLM +AF Y +K+GGL E+DYPY G R + CKFDKSKI A V NFS
Sbjct: 190 SESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAG--RENTCKFDKSKIVAQVKNFS 247
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
V+S++EDQIAANLVK+GPLA+AINA YMQTYIGGVSCP+IC R LDHGVLLVGYGSAGYA
Sbjct: 248 VISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYA 307
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
PIR KEKPYWIIKNSWGE+WGE GYYKICRG +N CGVDSMVS+V A
Sbjct: 308 PIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTA 356
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 220/376 (58%), Positives = 276/376 (73%), Gaps = 12/376 (3%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHH--ESTNNDLLGAEH 58
M S+ ++L + ++ F+ ++ D IR+VTD + LS+ E + L+GAE
Sbjct: 1 MESRGLLLVGIVVLGFAGFAASLPTGDT---IREVTD---DALSNGSVEQFAHALIGAEK 54
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F F K F K Y S EE++HRF +FK+NL +A +HQ LDP+A+HG+T FSDLT EF
Sbjct: 55 RFESFMKDFGKVYHSVEEYEHRFGVFKSNLLKALKHQALDPTASHGVTMFSDLTEEEFTS 114
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YLGL+R L A QAP LPT DLP +FDWREKGAVGPVKDQG CGSCW+FSTTGA+E
Sbjct: 115 KYLGLKRPSVL-SSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVE 173
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
GA+FL +GKLVSLSEQQLVDCDH+CD EE +CD+GCNGG M +A++Y AGGL E D
Sbjct: 174 GAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYVEAAGGLELESD 233
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY G D CKFD +K+A V+NF+ + +DEDQ+AA L+K+GPLA+ INA +MQTYI
Sbjct: 234 YPYEGRD--GKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQTYIA 291
Query: 299 GVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
GVSCP C++R LDHGVLLVGY G+AP RL KPYWIIKNSWG +WG+NGYYKICRG
Sbjct: 292 GVSCPIFCNKRNLDHGVLLVGYAERGFAPARLAYKPYWIIKNSWGPNWGDNGYYKICRGH 351
Query: 358 NVCGVDSMVSTVAAAV 373
CG+++MVS V+A+V
Sbjct: 352 GECGLNTMVSAVSASV 367
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 213/344 (61%), Positives = 261/344 (75%), Gaps = 5/344 (1%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
+I+QVTDG + E + LLGAE F F K+F K Y + EE++HRF +FK+NL R
Sbjct: 28 VIQQVTDG-VRVDGSVEQFAHALLGAEKQFESFIKEFGKVYHTVEEYEHRFKVFKSNLLR 86
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDW 150
A +HQ LDP+A+HG+T FSDLT EF YLGL+R L A A LPT DLP FDW
Sbjct: 87 ALKHQALDPTASHGVTMFSDLTEEEFATQYLGLKRPSAL-STAPTAEPLPTGDLPPSFDW 145
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
REKGAVGPVK+QGSCGSCW+FSTTGA+EGA+FLATGKL+SLSEQQLVDCDH+CDPEE +
Sbjct: 146 REKGAVGPVKNQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQA 205
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD 270
CD+GC GGLM +A++Y +AGGL E DYPY G D C+F+ +K+AA V+NF+ + +D
Sbjct: 206 CDAGCGGGLMTNAYKYVEEAGGLELESDYPYKGRDG--KCQFNPNKVAAKVSNFTNIPID 263
Query: 271 EDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRL 329
EDQ+AA L+K+GPLA+ INA +MQTY+ GVSCP C++R LDHGVLLVGY G+AP RL
Sbjct: 264 EDQVAAYLIKSGPLAIGINAEFMQTYVAGVSCPIFCNKRNLDHGVLLVGYAEHGFAPARL 323
Query: 330 KEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
KPYWIIKNSWG WG+ GYYKICRG CG+++MVS VAA V
Sbjct: 324 AYKPYWIIKNSWGPMWGDKGYYKICRGHGECGLNTMVSAVAANV 367
>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
Length = 348
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/288 (73%), Positives = 239/288 (82%), Gaps = 8/288 (2%)
Query: 90 RAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR---KLRLPKDADQAPILPTNDLPA 146
R R +LDP+ATHG+T+FSDLTP EFR LGLRR + + + +APILPT+ LP
Sbjct: 55 RELRAARLDPTATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHEAPILPTDGLPD 114
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
DFDWRE GAVGPVKDQGSCGSCWSFST+GALEGA+FLATGKL LSEQQ+VDCDHECD
Sbjct: 115 DFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDAS 174
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
E +CDSGCNGGLM +AF Y +K+GGL E+DYPY G R + CKFDKSKI A V NFSV
Sbjct: 175 ESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAG--RENTCKFDKSKIVAQVKNFSV 232
Query: 267 VSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAP 326
+S++EDQIAANLVK+GPLA+AINA YMQTYIGGVSCP+IC R LDHGVLLVGYGSAGYAP
Sbjct: 233 ISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYGSAGYAP 292
Query: 327 IRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
IR KEKPYWIIKNSWGE+WGE GYYKICRG +N CGVDSMVS+V A
Sbjct: 293 IRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTA 340
>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 367
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 209/378 (55%), Positives = 269/378 (71%), Gaps = 18/378 (4%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGA--EH 58
M +K + + +++F V + D IRQVT + + +LLG E
Sbjct: 1 MVAKALAQLITCIILFCHVVASV----EDLTIRQVT-------ADNRRIRPNLLGTHTES 49
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F LF + K Y+++EE+ HR IF N+ +AA HQ +DPSA HG+TQFSDLT EF+R
Sbjct: 50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKR 109
Query: 119 TYLGLRR--KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y G+ R +AP++ + LP DFDWREKG V VK+QG+CGSCW+FSTTGA
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
EGA+F++TGKL+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY ++AGGL E
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEE 229
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
YPYTG RGH CKFD K+A V NF+ + LDE+QIAANLV++GPLAV +NAV+MQTY
Sbjct: 230 RSYPYTG-KRGH-CKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTY 287
Query: 297 IGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
IGGVSCP ICS+R ++HGVLLVGYGS G++ +RL KPYWIIKNSWG+ WGENGYYK+CR
Sbjct: 288 IGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCR 347
Query: 356 GRNVCGVDSMVSTVAAAV 373
G ++CG++SMVS VA V
Sbjct: 348 GHDICGINSMVSAVATQV 365
>gi|357473731|ref|XP_003607150.1| Cysteine proteinase [Medicago truncatula]
gi|355508205|gb|AES89347.1| Cysteine proteinase [Medicago truncatula]
Length = 326
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 220/372 (59%), Positives = 262/372 (70%), Gaps = 56/372 (15%)
Query: 3 SKTVVLFLVSLVVFSA-VSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFS 61
+KT++LF V + FS ++ T D D +I+QV D G GAEH F+
Sbjct: 4 NKTLMLFSVLFLFFSVDLAFSTPNDREDPIIQQVVDKG---------------GAEHQFN 48
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
FK++F K Y+S++EHD+RF +FK+NL RA RH +DPSATHG+T+FSDLTP EFR + L
Sbjct: 49 EFKQRFGKVYSSKDEHDYRFNVFKSNLHRAKRHVIMDPSATHGVTRFSDLTPREFRNSIL 108
Query: 122 GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
GL+ + LP+ A APIL + +LP DFDWREKGAV PV++QG CGS WSFST GALEGAN
Sbjct: 109 GLK-GVGLPRHAKAAPILSSENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAN 167
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FL+TG+LVSLS+QQ VDCDH EY K+GGLMR EDY Y
Sbjct: 168 FLSTGELVSLSDQQHVDCDH-----------------------EYIKKSGGLMRVEDYTY 204
Query: 242 TGTDRGHACKFDKSKIAASV-ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV 300
K+ IA SV ANFS V +D+DQIAANL+K GPLAVAINA YMQTY+GGV
Sbjct: 205 Y-----------KTNIARSVAANFSSVLVDDDQIAANLLKYGPLAVAINAAYMQTYVGGV 253
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
SCPY C+RRLDHGVLLVGYGS Y KEKPYWI+K+SWGE+WGENGYYKICRGRN+C
Sbjct: 254 SCPYTCTRRLDHGVLLVGYGSGAYT----KEKPYWIVKSSWGETWGENGYYKICRGRNIC 309
Query: 361 GVDSMVSTVAAA 372
GVDSMVSTVAAA
Sbjct: 310 GVDSMVSTVAAA 321
>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
lyrata]
Length = 368
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 210/379 (55%), Positives = 267/379 (70%), Gaps = 19/379 (5%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGA--EH 58
M +K + + ++ F V + D IRQVT DE +LLG E
Sbjct: 1 MVAKALAQLITCIIFFCHVVASV----EDLTIRQVT--ADE-----RRVRPNLLGTHTES 49
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F +F + K Y+++EE+ HR IF N+ +AA HQ +DP+A HG+TQFSDLT EF+R
Sbjct: 50 KFRVFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKR 109
Query: 119 TYLGLRR--KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y G+ R +AP++ + LP DFDWREKG V VK+QG+CGSCW+FSTTGA
Sbjct: 110 MYTGVADVGGSRGHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHE-CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
EGA+F++TGKL+SLSEQQLVDCD CDP++ +CD+GC GGLM +A+EY ++AGGL
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEE 229
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E YPYTG RGH CKFD K+A V NF+ + LDEDQIAANLV+ GPLAV +NAV+MQT
Sbjct: 230 ERSYPYTG-KRGH-CKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQT 287
Query: 296 YIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
YIGGVSCP ICS+R ++HGVLLVGYGS G++ +RL KPYWIIKNSWG+ WGENGYYK+C
Sbjct: 288 YIGGVSCPLICSKRKVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLC 347
Query: 355 RGRNVCGVDSMVSTVAAAV 373
RG ++CG++SMVS VA V
Sbjct: 348 RGHDICGINSMVSAVATQV 366
>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
Length = 257
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 198/239 (82%), Positives = 215/239 (89%), Gaps = 2/239 (0%)
Query: 134 DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
D+APILPT+DLP DFDWREKGAV VK+QGSCGSCWSFSTTGA+EGA+FLATG+LVSLSE
Sbjct: 14 DKAPILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSE 73
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
QQLVDCDHECD E+ CD+GC GGLM +AFEYTLKAGGL RE+DYPYTG D C FD
Sbjct: 74 QQLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRDG--KCHFD 131
Query: 254 KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHG 313
KSKIAASVANFSVV LDEDQIAANLVK+GPLAV INA +MQTY+GGVSCP IC +R DHG
Sbjct: 132 KSKIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPLICFKRQDHG 191
Query: 314 VLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
VLLVGYGSAG+APIRLKEKPYWIIKNSWGESWGE GYYKICRGRN+CGVD+MVSTV AA
Sbjct: 192 VLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQGYYKICRGRNICGVDAMVSTVTAA 250
>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
Length = 331
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 211/371 (56%), Positives = 259/371 (69%), Gaps = 50/371 (13%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDV-DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFS 61
++T +LF V + FS + ++ D D +I+QV D G GAE+ F+
Sbjct: 5 NQTFMLFSVLFLFFSVDLAFSMPKDREDPIIQQVVDKG---------------GAEYQFN 49
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
FK++F K Y+S++EHD+RF +FK+NL RA RH +DPSATHG+T+FSDLTP EFR + L
Sbjct: 50 EFKQRFGKVYSSKDEHDYRFNVFKSNLHRAKRHGIMDPSATHGVTRFSDLTPREFRNSIL 109
Query: 122 GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
GL+ + LP+ A APIL T +LP DFDWREKGAV PV++QG CGS WSFST GALEGA+
Sbjct: 110 GLK-GVGLPRHAKAAPILSTENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAH 168
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FL++G+LVSLSEQ VDCDHE Y K GGLMR EDY Y
Sbjct: 169 FLSSGELVSLSEQHHVDCDHE-----------------------YIQKYGGLMRVEDYTY 205
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
T+ + ANFS +S+D++QI ANLVK+GPLA AINAVYMQTY+GG+S
Sbjct: 206 YKTNTARSV----------AANFSSISVDDNQITANLVKHGPLAAAINAVYMQTYVGGIS 255
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
CPYIC+RRLD GVLLVGYGS A ++ KEKPYWI+KNSWGE+WGENGYYKICRGRN+CG
Sbjct: 256 CPYICTRRLDLGVLLVGYGSGAGADMKEKEKPYWIVKNSWGETWGENGYYKICRGRNICG 315
Query: 362 VDSMVSTVAAA 372
VDSMVSTVAAA
Sbjct: 316 VDSMVSTVAAA 326
>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
gi|1096153|prf||2111244A Cys protease
Length = 380
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 204/365 (55%), Positives = 262/365 (71%), Gaps = 20/365 (5%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
V LFL +L + +A S T+ D + R++ G +N+LL E F +F +
Sbjct: 15 VSLFLCALTLSAAHGSTTVQD----IARKLKLG-----------DNELLRTEKKFKVFME 59
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
+ ++Y+++EE+ R IF N+ RAA HQ LDP+A HG+TQFSDLT EF + Y G+
Sbjct: 60 NYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDEFEKLYTGVNG 119
Query: 126 KLRLPKDADQ--APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+A AP L + LP +FDWREKGAV VK QG CGSCW+FSTTG++EGANFL
Sbjct: 120 GFPSSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFL 179
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
ATGKLVSLSEQQL+DCD++CD E SCD+GCNGGLM +A+ Y L++GGL E YPYTG
Sbjct: 180 ATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG 239
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP 303
+RG CKFD KIA + NF+ + DE+QIAA LVKNGPLA+ +NA++MQTYIGGVSCP
Sbjct: 240 -ERGE-CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCP 297
Query: 304 YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
ICS +RL+HGVLLVGYG+ G++ +RL KPYWIIKNSWGE WGE+GYYK+CRG +CG+
Sbjct: 298 LICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMCGI 357
Query: 363 DSMVS 367
++MVS
Sbjct: 358 NTMVS 362
>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
Length = 363
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/378 (54%), Positives = 265/378 (70%), Gaps = 22/378 (5%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGA--EH 58
M +K + + +++F V + D IRQVT + + +LLG E
Sbjct: 1 MVAKALAQLITCIILFCHVVASV----EDLTIRQVT-------ADNRRIRPNLLGTHTES 49
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F LF + K Y+++EE+ HR IF N+ +AA HQ +DPSA HG+TQFSDLT EF+R
Sbjct: 50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKR 109
Query: 119 TYLGLRR--KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y G+ R +AP++ + LP DFDWREKG V VK+QG+CGSCW+FSTTGA
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
EGA+F++TGKL+SLSEQQLVDCD + +CD+GC GGLM +A+EY ++AGGL E
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQA----DKKACDNGCGGGLMTNAYEYLMEAGGLEEE 225
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
YPYTG RGH CKFD K+A V NF+ + LDE+QIAANLV++GPLAV +NAV+MQTY
Sbjct: 226 RSYPYTG-KRGH-CKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTY 283
Query: 297 IGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
IGGVSCP ICS+R ++HGVLLVGYGS G++ +RL KPYWIIKNSWG+ WGENGYYK+CR
Sbjct: 284 IGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCR 343
Query: 356 GRNVCGVDSMVSTVAAAV 373
G ++CG++SMVS VA V
Sbjct: 344 GHDICGINSMVSAVATQV 361
>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
[Glycine max]
Length = 374
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 202/368 (54%), Positives = 259/368 (70%), Gaps = 21/368 (5%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
V LFL +L + SA S T+ D + R++ G +N+LL E F +F +
Sbjct: 15 VSLFLFALTLSSAHESTTVHD----IARKLKVG-----------DNELLRTEKKFKVFME 59
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
+ ++Y+++EE+ R IF N+ RAA HQ LDP+A HG+TQFSDLT EF + Y G
Sbjct: 60 NYGRSYSTREEYLRRLGIFSQNMLRAAEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPS 119
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
AP L LP +FDWREKGAV VK QG CGSCW+FSTTG++EGANFLAT
Sbjct: 120 T---NTAGGVAPPLEVEGLPENFDWREKGAVTEVKIQGRCGSCWAFSTTGSIEGANFLAT 176
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GKLVSLSEQQL+DCD++C+ E SCD+GCNGGLM +A+ Y L++GGL E YPYTG +
Sbjct: 177 GKLVSLSEQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-E 235
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYI 305
RG CKFD KI + NF+ + +DE+QIAA LVKNGPLA+ +NA++MQTYIGGVSCP I
Sbjct: 236 RGE-CKFDPEKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPLI 294
Query: 306 CS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
CS +RL+HGVLLVGYG+ G++ +RL KPYWIIKNSWG+ WGE+GYYK+CRG +CG+++
Sbjct: 295 CSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGKKWGEDGYYKLCRGHGMCGINT 354
Query: 365 MVSTVAAA 372
MVS A
Sbjct: 355 MVSAAMVA 362
>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
Length = 397
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/383 (54%), Positives = 267/383 (69%), Gaps = 26/383 (6%)
Query: 11 VSLVVFSAVSSGTLIDDVDQ------LIRQVTDGGDEILSHH---ESTNNDLLGA--EHH 59
++L+ + +SS T ++ +IRQVTD HH S N+ LLG E H
Sbjct: 16 ITLLSCALISSTTFQHEIQYRVQDPLMIRQVTDNHHH--RHHPGRSSANHRLLGTTTEVH 73
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F +++ K Y++ EE+ HR IF NL +AA HQ +DPSA HG+TQFSDLT EF T
Sbjct: 74 FKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEEEFEAT 133
Query: 120 YLGLRR------KLRLPKD-ADQAP---ILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
Y+GL+ +L KD D++ ++ +DLP FDWREKGAV VK QG CGSCW
Sbjct: 134 YMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLPESFDWREKGAVTEVKTQGRCGSCW 193
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FSTTGA+EGANF+ATGKL+SLSEQQLVDCDH CD +E CD GC+GGLM +AF Y ++
Sbjct: 194 AFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAFNYLIE 253
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
AGG+ E YPYTG RG CKF+ K+A V NF+ + DE QIAAN+V NGPLA+ +N
Sbjct: 254 AGGIEEEVTYPYTG-KRGE-CKFNPEKVAVKVRNFAKIPEDESQIAANVVHNGPLAIGLN 311
Query: 290 AVYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
AV+MQTYIGGVSCP IC +R++HGVLLVGYGS G++ +RL KPYWIIKNSWG+ WGE+
Sbjct: 312 AVFMQTYIGGVSCPLICDKKRINHGVLLVGYGSRGFSILRLGYKPYWIIKNSWGKRWGEH 371
Query: 349 GYYKICRGRNVCGVDSMVSTVAA 371
GYY++CRG N+CG+ +MVS V
Sbjct: 372 GYYRLCRGHNMCGMSTMVSAVVT 394
>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
Length = 379
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 200/364 (54%), Positives = 257/364 (70%), Gaps = 19/364 (5%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
V +FL +L + S++ TLI DV + + E +NDLL E F LF K
Sbjct: 15 VAIFLCALTLSSSLHHETLIQDVARKL--------------ELKDNDLLTTEKKFKLFMK 60
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
++K Y++ EE+ R IF N+ +AA HQ LDP+A HG+TQFSDL+ EF R Y G +
Sbjct: 61 DYSKKYSTTEEYLLRLGIFAKNMVKAAEHQALDPTAIHGVTQFSDLSEEEFERFYTGFKG 120
Query: 126 KLRLPKDADQ-APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
A AP L P +FDWREKGAV +K QG CGSCW+F+TTG++EGANFLA
Sbjct: 121 GFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLA 180
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TGKLVSLSEQQLVDCD++CD + SCD+GCNGGLM +A++Y ++AGGL E YPYTG
Sbjct: 181 TGKLVSLSEQQLVDCDNKCDITKT-SCDNGCNGGLMTTAYDYLMEAGGLEEETSYPYTGA 239
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY 304
CKFD +K+A V+NF+ + DE+QIAA LV +GPLA+A+NAV+MQTY+GGVSCP
Sbjct: 240 Q--GECKFDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVGGVSCPL 297
Query: 305 ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVD 363
ICS RRL+HGVLLVGY + G++ +RL++KPYW IKNSWGE WGE GYYK+CRG +CG++
Sbjct: 298 ICSKRRLNHGVLLVGYNAEGFSILRLRKKPYWTIKNSWGEQWGEKGYYKLCRGHGMCGMN 357
Query: 364 SMVS 367
+MVS
Sbjct: 358 TMVS 361
>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
Length = 245
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 195/242 (80%), Positives = 218/242 (90%), Gaps = 4/242 (1%)
Query: 132 DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSL 191
D ++AP LPT++LP +FDWREKGAV VK+QGSCGSCWSFSTTGALEGAN+LATG+L+SL
Sbjct: 2 DENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELISL 61
Query: 192 SEQQLVDCDHECDPEEPG-SCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHAC 250
SEQQLVDCDHECDPEE SCD+GCNGGLMN+AFEY LKAGGL +E+DYPYTG D C
Sbjct: 62 SEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQKEKDYPYTGKD--GTC 119
Query: 251 KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRL 310
KFDK+KIAASV NFSVVS+DEDQIAANLVK GPLAV INA +MQTYIGGVSCPYIC + L
Sbjct: 120 KFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQTYIGGVSCPYICGKSL 179
Query: 311 DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 370
DHGVL+VGYG+ GYAP+RLK KPYWIIKNSWGESWGE+GYYKICRGRNVCGV+SMVS+V
Sbjct: 180 DHGVLIVGYGT-GYAPVRLKNKPYWIIKNSWGESWGESGYYKICRGRNVCGVESMVSSVT 238
Query: 371 AA 372
AA
Sbjct: 239 AA 240
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 189/323 (58%), Positives = 235/323 (72%), Gaps = 3/323 (0%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
+LLG E F +F K+ NK YA++EE+ HRF IF NL RA HQ LDP+A HG+T F DL
Sbjct: 6 NLLGTEEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVTPFMDL 65
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
T EF R Y G+ +P + + + LP FDWREKGAV VK QGSCGSCW+F
Sbjct: 66 TEEEFERMYAGVLGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAF 125
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
STTG++EGANF+ATGKL++LSEQQLVDCD CD + SCD GC GGLM +A+ Y ++AG
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIEAG 185
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E YPYTG + CKFD KIA VANF+ +++DE+QIAANLV +GPLA+ +NA+
Sbjct: 186 GLQEESSYPYTG--KSGECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAI 243
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+MQTYIGGVSCP IC ++ L+HGVLLVGYG+ GY+ +R KPYWIIKNSWG WGE GY
Sbjct: 244 FMQTYIGGVSCPLICGKKWLNHGVLLVGYGARGYSILRFGYKPYWIIKNSWGNHWGEKGY 303
Query: 351 YKICRGRNVCGVDSMVSTVAAAV 373
Y++CRG +CG++ MVS V V
Sbjct: 304 YRLCRGHGMCGMNKMVSAVVTKV 326
>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 193/363 (53%), Positives = 260/363 (71%), Gaps = 16/363 (4%)
Query: 14 VVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYAS 73
++ SA+ S T + + +RQVTDG E NN G+E F +F +K+ K+Y +
Sbjct: 51 LLISAIPSATALRRDPEFLRQVTDG--------EIFNNLPAGSERKFVMFMEKYGKSYPT 102
Query: 74 QEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR---LP 130
++E+ HRF IF NL RAA HQ LDP+A HG+TQFSDL+ EF R ++G+R LP
Sbjct: 103 RKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGEGLP 162
Query: 131 K--DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKL 188
+ A + LP FDWR+KGAV VK QG+CGSCW+FST GA+EGANF+ATG L
Sbjct: 163 EMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIATGNL 222
Query: 189 VSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGH 248
++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y +++GGL E YPYTG R
Sbjct: 223 LNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTG--RSG 280
Query: 249 ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR 308
C F KIA V+NF+ + +DE+QIAA+LV++GPLAV +NAV+MQTYIGGVSCP IC +
Sbjct: 281 QCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPLICGK 340
Query: 309 R-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
R ++HGVL+VGYG G++ +R ++ PYW+IKNSWGE WGE+GYY++CRG +CG+++MVS
Sbjct: 341 RFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGEHGYYRLCRGHGMCGINTMVS 400
Query: 368 TVA 370
V
Sbjct: 401 AVV 403
>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
Length = 406
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 193/363 (53%), Positives = 260/363 (71%), Gaps = 16/363 (4%)
Query: 14 VVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYAS 73
++ SA+ S T + + +RQVTDG E NN G+E F +F +K+ K+Y +
Sbjct: 51 LLISAIPSATALRRDPEFLRQVTDG--------EIFNNLPAGSERKFVMFMEKYGKSYPT 102
Query: 74 QEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR---LP 130
++E+ HRF IF NL RAA HQ LDP+A HG+TQFSDL+ EF R ++G+R LP
Sbjct: 103 RKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGEGLP 162
Query: 131 K--DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKL 188
+ A + LP FDWR+KGAV VK QG+CGSCW+FST GA+EGANF+ATG L
Sbjct: 163 EMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIATGNL 222
Query: 189 VSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGH 248
++LSEQQLVDCDH CDP + +C++GCNGGLM +A++Y +++GGL E YPYTG R
Sbjct: 223 LNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTG--RSG 280
Query: 249 ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR 308
C F KIA V+NF+ + +DE+QIAA+LV++GPLAV +NAV+MQTYIGGVSCP IC +
Sbjct: 281 QCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIGGVSCPLICGK 340
Query: 309 R-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
R ++HGVL+VGYG G++ +R ++ PYW+IKNSWGE WGE+GYY++CRG +CG+++MVS
Sbjct: 341 RFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGEHGYYRLCRGHGMCGINTMVS 400
Query: 368 TVA 370
V
Sbjct: 401 AVV 403
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/344 (55%), Positives = 249/344 (72%), Gaps = 8/344 (2%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D I QVTDG SH + + +LG E F +F +K+ K Y+S+EE+ HR IF N+
Sbjct: 34 DPNIVQVTDG----HSHRKFGVDGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNM 89
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKD-ADQAPILPTNDLPAD 147
RAA HQ LDP+A HG+T FSDL+ EF R + G+ + + A+ A L + LP
Sbjct: 90 VRAAEHQALDPTALHGVTPFSDLSEEEFERMFTGVVGRPHMKGGVAETAAALEVDGLPES 149
Query: 148 FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 207
FDWREKGAV VK QG+CGSCW+FSTTGA+EGA+F++T KL++LSEQQLVDCDH CD +
Sbjct: 150 FDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRD 209
Query: 208 PGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVV 267
+CDSGC GGLM +A++Y ++AGGL E YPYTG + CKF ++A V NF+ V
Sbjct: 210 KTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTG--KHGECKFKPDRVAVRVVNFTEV 267
Query: 268 SLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAP 326
++E+QIAANLV +GPLAV +NA++MQTYIGGVSCP IC +R ++HGVLLVGYG+ GY+
Sbjct: 268 PINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPLICPKRWINHGVLLVGYGAKGYSI 327
Query: 327 IRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 370
+R KPYWIIKNSWG+ WGE+GYY++CRG +CG+++MVS V
Sbjct: 328 LRFGYKPYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAVV 371
>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 377
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/360 (53%), Positives = 254/360 (70%), Gaps = 15/360 (4%)
Query: 11 VSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKA 70
+SLV+F+ S RQ T +I + +N LL E F++F + + K
Sbjct: 15 ISLVLFALTLSSA---------RQTTV--HDIAKKLKLQDNQLLRTEKKFNVFMENYGKK 63
Query: 71 YASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLP 130
Y+++EE+ R IF N+ RA +Q LDP+A HG+TQFSDLT EF+R Y G+
Sbjct: 64 YSTREEYLQRLEIFAGNMLRAPENQALDPTAIHGVTQFSDLTEDEFQRHYTGVNGGFPWN 123
Query: 131 KDA-DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLV 189
D AP L + LP DFDWREKGAV VK QG CGSCW+FSTTG++EGANF+ATGKL+
Sbjct: 124 NGVRDVAPPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLL 183
Query: 190 SLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHA 249
+LSEQQLVDCD +CD E +CD+GC GGLM +A++Y L++GGL E YPYTG +G
Sbjct: 184 NLSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQSGGLEEESSYPYTGA-KGE- 241
Query: 250 CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR 309
CKFD K+A + NF+ + +DE+QIAA LVK+GPLAV +NA++MQTYIGGVSCP ICS++
Sbjct: 242 CKFDPGKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQTYIGGVSCPLICSKK 301
Query: 310 -LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
L+HGVLLVGY + G++ +RL KPYWIIKNSWG+ WG +GYYK+CRG +CG+++MVST
Sbjct: 302 WLNHGVLLVGYRAKGFSILRLGNKPYWIIKNSWGKRWGVDGYYKLCRGHGMCGMNTMVST 361
>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
Length = 363
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/367 (53%), Positives = 254/367 (69%), Gaps = 41/367 (11%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
V LFL +L + +A S T+ D + R++ G +N+LL E F +F +
Sbjct: 15 VSLFLCALTLSAAHGSTTVQD----IARKLKLG-----------DNELLRTEKKFKVFME 59
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
+ ++Y+++EE+ R IF N+ RAA HQ LDP+A HG+TQFS
Sbjct: 60 NYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFS---------------- 103
Query: 126 KLRLPKDADQA----PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
LP + A P L + LP +FDWREKGAV VK QG CGSCW+FSTTG++EGAN
Sbjct: 104 ---LPVSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGAN 160
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
FLATGKLVSLS+QQL+DCD++CD E SCD+GCNGGLM +A+ Y L++GGL E YPY
Sbjct: 161 FLATGKLVSLSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPY 220
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVS 301
TG +RG CKFD KIA + NF+ + DE+QIAA LVKNGPLA+ +NA++MQTYIGGVS
Sbjct: 221 TG-ERGE-CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGVS 278
Query: 302 CPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
CP ICS +RL+HGVLLVGYG+ G++ +RL KPYWIIKNSWGE WGE+GYYK+CRG +C
Sbjct: 279 CPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMC 338
Query: 361 GVDSMVS 367
G+++MVS
Sbjct: 339 GINTMVS 345
>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
Length = 403
Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust.
Identities = 186/315 (59%), Positives = 232/315 (73%), Gaps = 4/315 (1%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F + K Y++ EE+ R IF+ NL +AA +Q LDP+A HGIT FSDLT EF
Sbjct: 90 FDKFIVEHGKVYSTIEEYVRRLRIFEKNLLKAAENQALDPTAVHGITPFSDLTEYEFESR 149
Query: 120 YLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
Y GL + L + A ILP +DLPA+FDWREKGAV VK QG+CGSCW+FSTTG +E
Sbjct: 150 YTGLLGVRQGLVNEKQTAEILPVDDLPANFDWREKGAVTEVKTQGNCGSCWAFSTTGVVE 209
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
GANFLATGKL++LSEQQL+DCDH+CDP +CD+GC+GGLM +A+ Y ++AGG+ ++
Sbjct: 210 GANFLATGKLLNLSEQQLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLMEAGGIEEAKN 269
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPYTG CKF+ A NF+ V+LDE QIAANLVK+GPLAV +NA +MQTYIG
Sbjct: 270 YPYTGVQGD--CKFNPDLAAVKAINFTTVNLDEKQIAANLVKHGPLAVGLNAAFMQTYIG 327
Query: 299 GVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
GVSCP ICS+R ++HGVLLVGYG G+A +RL +PYWIIKNSWG+ WGE+GYYK+CRG
Sbjct: 328 GVSCPLICSKRFINHGVLLVGYGHKGFALLRLGYRPYWIIKNSWGKRWGEHGYYKLCRGH 387
Query: 358 NVCGVDSMVSTVAAA 372
CG++ MVS V A
Sbjct: 388 GECGMNKMVSAVIPA 402
>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
Length = 381
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 195/349 (55%), Positives = 245/349 (70%), Gaps = 11/349 (3%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGA--EHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D I QVTD LS N LG E +F +F K++K Y ++EE+ HR +F
Sbjct: 39 DPTILQVTDDPSVTLS-----NRKFLGTNTEENFKMFMIKYDKEYDTREEYMHRLGVFAK 93
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP-ILPTNDLP 145
NL RAA HQ LDP+A HGIT F DLT EF R Y G+ + + A L T LP
Sbjct: 94 NLIRAAEHQVLDPTAVHGITPFMDLTEEEFERMYTGVVGGGAVGAEGVTATSFLETAGLP 153
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
+ FDWR+KGAV VK QG+CGSCW+FSTTGA+EGANF+ATGKL++LSEQQLVDCD CD
Sbjct: 154 SSFDWRKKGAVTDVKMQGACGSCWAFSTTGAIEGANFIATGKLLNLSEQQLVDCDRVCDI 213
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
+E +CD GC GGLM +A+ Y ++AGGL E YPYTG + CKFD+ KIA V NF+
Sbjct: 214 KEKTACDDGCGGGLMTNAYRYLIEAGGLEDEISYPYTG--KPGKCKFDEKKIAVRVVNFT 271
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGY 324
+ +DE+QIAA+LV +GPLA+ +NAV+MQTYIGGVSCP IC ++ ++HGVLLVGYG+ G+
Sbjct: 272 SIPIDENQIAAHLVHHGPLAIGLNAVFMQTYIGGVSCPLICGKKWINHGVLLVGYGAKGF 331
Query: 325 APIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAAV 373
+ +RL KPYWIIKNSWG+ WGE GYY+IC+G +CG+D MVS V V
Sbjct: 332 SILRLGYKPYWIIKNSWGKRWGEEGYYRICKGYGMCGMDRMVSAVVTQV 380
>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
Length = 384
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/234 (78%), Positives = 202/234 (86%), Gaps = 5/234 (2%)
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
T+ LP DFDWRE GAVGPVKDQGSCGSCWSFST+GALEGA+FLATGKL LSEQQ+VDCD
Sbjct: 145 TDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCD 204
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
HECD E +CDSGCNGGLM +AF Y +K+GGL E+DYPY G R + CKFDKSKI A
Sbjct: 205 HECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAG--RENTCKFDKSKIVAQ 262
Query: 261 VANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYG 320
V NFSV+S++EDQIAANLVK+GPLA+AINA YMQTYIGGVSCP+IC R LDHGVLLVGYG
Sbjct: 263 VKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCPFICGRHLDHGVLLVGYG 322
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAA 371
SAGYAPIR KEKPYWIIKNSWGE+WGE GYYKICRG +N CGVDSMVS+V A
Sbjct: 323 SAGYAPIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSSVTA 376
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 180/318 (56%), Positives = 234/318 (73%), Gaps = 4/318 (1%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTP 113
+G E F +F +K+ K Y+S+EE+ HR IF N+ RAA HQ LDP A HG+T FSDL+
Sbjct: 1 MGGEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSE 60
Query: 114 AEFRRTYLGLRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G+ + + A+ A L + LP FDWREKGAV VK QG+CGSCW+FS
Sbjct: 61 EEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGA+EGA+F++T KL++LSEQQLVDCDH CD + +CDSGC GGLM +A++Y ++AGG
Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E YPYTG + CKF ++A V NF+ V +BE+QIAANLV +GPLAV +NA +
Sbjct: 181 LEEESSYPYTG--KHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXF 238
Query: 293 MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
MQTYIGGVSCP IC +R ++HGVLLVGYG+ GY+ +R KPYWIIKNSWG WGE+GYY
Sbjct: 239 MQTYIGGVSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWGEHGYY 298
Query: 352 KICRGRNVCGVDSMVSTV 369
++CRG +CG+++MVS V
Sbjct: 299 RLCRGHGMCGMNTMVSAV 316
>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
Length = 318
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/246 (75%), Positives = 208/246 (84%), Gaps = 5/246 (2%)
Query: 127 LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
+R P A +APILPT DLP DFDWR+KGAV VKD G CGSCWSFSTTGALE + +LATG
Sbjct: 71 VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFE L++GG+ +E+D PYTG D
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE-ILQSGGVQKEKDIPYTGRD- 188
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
CKFDK+K+AA+ VSLDE+QIAANLVKNGPLAVAINAV+MQTY+GGVSCPYIC
Sbjct: 189 -GTCKFDKTKVAATDL-IKRVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYIC 246
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN-GYYKICRGRNVCGVDSM 365
+ LDHGVLLVGYG YAPIR K KPYWIIKNSWGESWGEN GY +ICRGRNVCGVD+M
Sbjct: 247 GKHLDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGENDGYDEICRGRNVCGVDAM 306
Query: 366 VSTVAA 371
VSTVAA
Sbjct: 307 VSTVAA 312
>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/351 (56%), Positives = 239/351 (68%), Gaps = 16/351 (4%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
+ +IRQVTD G H + + LL E F+ F ++ K Y+ EE+ R +F AN+
Sbjct: 26 EDVIRQVTDSG------HGAGHPGLL-PEAQFAAFVRRHGKEYSGPEEYARRLRVFAANV 78
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL------RRKLRLPKDADQAPILPTN 142
RAA HQ LDP A HG+T FSDLT EF GL R R A A
Sbjct: 79 ARAAAHQALDPGARHGVTPFSDLTREEFEARLTGLVGAGDVLRSARRMPAAAPATEEEVA 138
Query: 143 DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE 202
LPA FDWR+KGAV VK QG CGSCW+FSTTGA+EGANF+ATGKL+ LSEQQLVDCDH
Sbjct: 139 ALPASFDWRDKGAVTDVKMQGVCGSCWAFSTTGAVEGANFVATGKLLDLSEQQLVDCDHT 198
Query: 203 CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
CD C+SGC+GGLM +A+ Y + +GGLM + YPYTG C+FD+ K+A VA
Sbjct: 199 CDAVAKTECNSGCSGGLMTNAYRYLMSSGGLMEQAAYPYTGAQ--GPCRFDRGKVAVRVA 256
Query: 263 NFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRL-DHGVLLVGYGS 321
NF+ V LDEDQ+ A LV+ GPLAV +NA +MQTY+GGVSCP IC R + +HGVLLVGYG+
Sbjct: 257 NFTAVPLDEDQMRAALVRGGPLAVGLNAAFMQTYVGGVSCPLICPRAMVNHGVLLVGYGA 316
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
G++ +RL +PYW+IKNSWG WGE GYYK+CRGRNVCGVDSMVS VA A
Sbjct: 317 RGFSALRLGYRPYWLIKNSWGAQWGEGGYYKLCRGRNVCGVDSMVSAVAVA 367
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 199/354 (56%), Positives = 245/354 (69%), Gaps = 15/354 (4%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYAS-QEEHDHRFTIFKAN 87
D +IRQVTD G S L E F+ F ++ K Y+ EE+ R +F AN
Sbjct: 26 DDVIRQVTDNGAPAARRPPSPG---LLPEAKFAAFVRRHGKEYSGGAEEYARRLRVFAAN 82
Query: 88 LRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK---LRLPKDADQAPILPTNDL 144
L RAA HQ LDP A HG+T FSDLTP EF+ GL+++ +P A +A L
Sbjct: 83 LARAAAHQALDPGARHGVTPFSDLTPEEFQARLTGLQQQGTNNNMPAAA-RATAEELATL 141
Query: 145 PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECD 204
PA FDWR KGAV VK QG CGSCW+FSTTGA+EGA+F+ATGKL++LSEQQLVDCDH CD
Sbjct: 142 PASFDWRAKGAVTEVKMQGMCGSCWAFSTTGAVEGAHFVATGKLLNLSEQQLVDCDHTCD 201
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF 264
CDSGC+GGLM +A+ Y ++AGGLM + YPYTG C+FD +K+A V +F
Sbjct: 202 AVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQAAYPYTGAQ--GTCRFDANKVAVRVTSF 259
Query: 265 SVVSL-DEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRL-DHGVLLVGYGSA 322
+ V DEDQI A+LV+ GPLAV +NA +MQTY+GGVSCP +C R+L +HGVLLVGYG+
Sbjct: 260 TAVPPDDEDQIRASLVRAGPLAVGLNAAFMQTYLGGVSCPLLCPRKLINHGVLLVGYGAR 319
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVSTVAAAV 373
G AP+RL +PYWIIKNSWG+ WGE GYY++CRG RNVCGVDSMVS VA A+
Sbjct: 320 GLAPLRLGYRPYWIIKNSWGKEWGEGGYYRLCRGARNRNVCGVDSMVSAVAVAL 373
>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
Length = 373
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/361 (54%), Positives = 238/361 (65%), Gaps = 29/361 (8%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLG--AEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
D IRQVTDG LG E F+ F ++ + Y+ EE+ R +F A
Sbjct: 22 DGFIRQVTDG------RRSRAGAGALGLLPEAQFAAFVRRHGRRYSGPEEYARRLRVFAA 75
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-------RKLRLPKDADQAPIL 139
NL RAA HQ LDP+A HG+T FSDLT EF G+R ++L + AP
Sbjct: 76 NLARAAAHQALDPTARHGVTPFSDLTREEFEARLTGVRAGAGGDVQRLVM----SGAPAA 131
Query: 140 P------TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
P + LPA FDWR+KGAV VK QG+CGSCW+FSTTGA+EGANFLATGKL+ LSE
Sbjct: 132 PPASQEEVSRLPASFDWRDKGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGKLLELSE 191
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
QQLVDCDH C C++GC GGLM +A+ Y +K+GGLM + YPYTG C+FD
Sbjct: 192 QQLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKSGGLMEQRAYPYTGAP--GPCRFD 249
Query: 254 KSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LD 311
+K A VANF+ V DE QI A LV+ GPLAV +NA +MQTY+GGVSCP +C R ++
Sbjct: 250 PAKAAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPLLCPRAWVN 309
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
HGVLLVGYG+ G+A +RL +PYWIIKNSWGE WGE GYY++CRG NVCGVDSMVS VA
Sbjct: 310 HGVLLVGYGARGFAALRLGYRPYWIIKNSWGERWGEQGYYRLCRGSNVCGVDSMVSAVAV 369
Query: 372 A 372
A
Sbjct: 370 A 370
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/352 (54%), Positives = 235/352 (66%), Gaps = 16/352 (4%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
D+ IRQVT G LL E F+ F ++ + Y+ +E+ R +F ANL
Sbjct: 35 DKFIRQVTTQGT-----RAGAGPGLL-PEAQFAAFVRRHGRRYSGPKEYARRLRVFAANL 88
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND----- 143
RAA HQ LDP+A HG+T FSDLT EF GLR + + P P
Sbjct: 89 ARAAAHQALDPTARHGVTPFSDLTREEFEARLTGLRAGGDVQRLMSGVPAAPPASKEEVA 148
Query: 144 -LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE 202
LPA FDWR+KGAV VK QG+CGSCW+FSTTGA+EGANFLATG+LV LSEQQLVDCDH
Sbjct: 149 RLPASFDWRDKGAVTGVKTQGACGSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHT 208
Query: 203 CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
C C++GC GGLM +A+ Y +++GGLM + YPYTG C+FD +++A VA
Sbjct: 209 CSAVAQNECNNGCAGGLMTNAYSYLMESGGLMEQSAYPYTGA--AGPCRFDPTQVAVRVA 266
Query: 263 NFSVVSL-DEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYG 320
NF+ V DE QI A LV+ GPLAV +NA +MQTY+GGVSCP IC R ++HGVLLVGYG
Sbjct: 267 NFTAVPAGDEAQIRAALVRRGPLAVGLNAAFMQTYVGGVSCPLICPRAWVNHGVLLVGYG 326
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
+ G+A +RL +PYWIIKNSWG+ WGE GYY++CRG NVCGVDSMVS VA A
Sbjct: 327 ARGFAALRLGYRPYWIIKNSWGKQWGEQGYYRLCRGSNVCGVDSMVSAVAVA 378
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/356 (54%), Positives = 240/356 (67%), Gaps = 27/356 (7%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA 91
IRQVTDGG L E F+ F ++ + Y+ EE+ R +F ANL RA
Sbjct: 29 IRQVTDGGYWPPG---------LLPEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARA 79
Query: 92 ARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-------RKLRLPKDADQAPILPTNDL 144
A HQ LDP+A HG+T FSDLT EF GL R+ +P A A + L
Sbjct: 80 AAHQALDPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPSAA-PATEEEVSGL 138
Query: 145 PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECD 204
PA FDWR++GAV VK QG+CGSCW+FSTTGA+EGANFLATG L+ LSEQQLVDCDH CD
Sbjct: 139 PASFDWRDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCD 198
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF 264
E+ CDSGC GGLM +A+ Y + +GGLM + YPYTG C+FD +++A VANF
Sbjct: 199 AEKKTECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ--GTCRFDANRVAVRVANF 256
Query: 265 SVVSLD-------EDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLL 316
+VV+ + Q+ A LV++GPLAV +NA YMQTY+GGVSCP +C R ++HGVLL
Sbjct: 257 TVVAPPGGNDGDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVCPRAWVNHGVLL 316
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
VGYG G+A +RL +PYWIIKNSWG++WGE GYY++CRGRNVCGVD+MVS VA A
Sbjct: 317 VGYGERGFAALRLGHRPYWIIKNSWGKAWGEQGYYRLCRGRNVCGVDTMVSAVAVA 372
>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
Length = 709
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/354 (54%), Positives = 240/354 (67%), Gaps = 31/354 (8%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA 91
IRQVTDGG L E F+ F ++ + Y+ EE+ R +F ANL RA
Sbjct: 29 IRQVTDGGYWPPG---------LLPEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARA 79
Query: 92 ARHQKLDPSATHGITQFSDLTPAEFRRTYLGL----------RRKLRLPKDADQAPILPT 141
A HQ LDP+A HG+T FSDLT EF GL RR+L +P A A
Sbjct: 80 AAHQALDPTARHGVTPFSDLTREEFEARLTGLATDVGDDDVRRRRLPMPSAAP-ATEEEV 138
Query: 142 NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDH 201
+ LP+ FDWR++GAV VK QG+CGSCW+FSTTGA+EGANFLATG L+ LSEQQLVDCDH
Sbjct: 139 SGLPSSFDWRDRGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDH 198
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
CD E+ CDSGC GGLM +A+ Y + +GGLM + YPYTG AC+FD +++A V
Sbjct: 199 TCDAEKKTECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ--GACRFDANRVAVRV 256
Query: 262 ANFSVVSL-------DED-QIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDH 312
ANF+VV+ D D Q+ A LV++GPLAV +NA YMQTY+GGVSCP +C R ++H
Sbjct: 257 ANFTVVAPAAGPGGNDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVCPRAWVNH 316
Query: 313 GVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
GVLLVGYG G+A +RL +PYWIIKNSWG++WGE GYY++CRGRNVCGVD+M+
Sbjct: 317 GVLLVGYGERGFAALRLGHRPYWIIKNSWGKAWGEQGYYRLCRGRNVCGVDTML 370
>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
Length = 209
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 174/209 (83%), Positives = 192/209 (91%), Gaps = 3/209 (1%)
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGS W+FSTTGALEGAN+LATGKLVSLSEQQLVDCDH CDPEE SCDSGCNGGLMN+AF
Sbjct: 1 CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
EY L++GG++ E+DY YTG D +CKFDKSKI ASV+NFSVVSLDEDQIAANLVKNGPL
Sbjct: 61 EYILQSGGVVSEKDYAYTGRD--GSCKFDKSKIVASVSNFSVVSLDEDQIAANLVKNGPL 118
Query: 285 AVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AVAINA +MQTY+ GVSCP+IC++ RLDHGVLLVG+GS GYAPIRLKEKPYWIIKNSWG+
Sbjct: 119 AVAINAAWMQTYMSGVSCPHICAKARLDHGVLLVGFGSGGYAPIRLKEKPYWIIKNSWGQ 178
Query: 344 SWGENGYYKICRGRNVCGVDSMVSTVAAA 372
+WGE GYYKICRGRNVCGVDSMVSTVAAA
Sbjct: 179 NWGEEGYYKICRGRNVCGVDSMVSTVAAA 207
>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 190
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 165/185 (89%), Positives = 175/185 (94%), Gaps = 2/185 (1%)
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
+LVSLSEQQLVDCDHECDPEE SCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR
Sbjct: 3 ELVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 62
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
CKFD +K+AA VANFSVVSLDE+QIAANLVKNGPLAVAINAV+MQTY+GGVSCPYIC
Sbjct: 63 AK-CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYIC 121
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
S+R DHGVLLVGYGS G+APIR+KEKPYWIIKNSWGE WGE+GYYKICRGRNVCGVDSMV
Sbjct: 122 SKRQDHGVLLVGYGS-GFAPIRMKEKPYWIIKNSWGEKWGESGYYKICRGRNVCGVDSMV 180
Query: 367 STVAA 371
STVAA
Sbjct: 181 STVAA 185
>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
Length = 293
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/292 (57%), Positives = 216/292 (73%), Gaps = 9/292 (3%)
Query: 88 LRRAARHQKLD-PSATHGITQFSDLTPAEFRRTYLG---LRRKLRLPKDADQAPI--LPT 141
L RAA Q D SA HG+T+FSDLTP EF YLG L + R A I LPT
Sbjct: 3 LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62
Query: 142 NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDH 201
LPA+FDWR KGAV VKDQG CGSCW+FSTTGA+EGA+F++TGKLV LSEQQL+DCD
Sbjct: 63 KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
CDP+ P +CDSGCNGGL ++A EY ++ GG+ E+ YPY G ++G CK D+ + A++
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTEKSYPYVG-EKGE-CKADEGTLGATL 180
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC-SRRLDHGVLLVGYG 320
NFS VS DE Q+AA LVK+GPL++ INA +MQTYIGGV+CP++C S LDHGVL+VGYG
Sbjct: 181 KNFSYVSSDEKQMAAALVKHGPLSIGINAAWMQTYIGGVACPWLCDSEALDHGVLIVGYG 240
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
S+G+AP+R +++PYWI+KNSW +WGE GYY+IC+ + CG+++MV A
Sbjct: 241 SSGFAPVRWQQEPYWIVKNSWSPAWGEGGYYRICKDKGSCGINNMVVAAHGA 292
>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
Length = 343
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 181/374 (48%), Positives = 239/374 (63%), Gaps = 37/374 (9%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
+K + + LV L++ S + D+ + IRQVTD N ++ E HF
Sbjct: 2 AKALAIILVGLLILVVCCSSSNRLDIGK-IRQVTD------------NLEVKDVEGHFKH 48
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
F +KF K Y + EE+ HR +F+ANL +K DP+A HGIT F+DLTP E R +LG
Sbjct: 49 FMQKFGKVYGTTEEYVHRLKVFQANLAHVMSLKKQDPTAIHGITSFADLTPEELSR-FLG 107
Query: 123 LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
R+ + +QAP+LPT++LP FDWRE GAV PVK QG CGSCW+FSTTG +EGANF
Sbjct: 108 FRKAYS-NRVVNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANF 166
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
L TGKL+SLSE+QL+DCD++ D+GC GG M SA+EY +KA GL EEDYPY
Sbjct: 167 LKTGKLISLSEEQLIDCDYK---------DNGCEGGDMLSAYEY-VKARGLEAEEDYPYE 216
Query: 243 GTDRGHA-----CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
H C++ SK+ A++AN+S VS DEDQIAANLVKNGPL++A+ + TY
Sbjct: 217 ELGYRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYE 276
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
GGV+CP IC ++HGVLLVGYG +R YW KN+W + +GENGY+++CRG
Sbjct: 277 GGVACPRICPGEINHGVLLVGYGVEN--GLR-----YWTFKNTWTDEFGENGYFRLCRGV 329
Query: 358 NVCGVDSMVSTVAA 371
VC ++S V TV+
Sbjct: 330 GVCDMNSEVGTVST 343
>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
Length = 343
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 181/374 (48%), Positives = 238/374 (63%), Gaps = 37/374 (9%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
+K + + LV L++ S + D+ + IRQVTD N ++ E HF
Sbjct: 2 AKALAIILVGLLILVICCSSSNRLDIGK-IRQVTD------------NLEVDDVEGHFKH 48
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
F +KF K Y + EE+ HR +F+ANL +K DP+A HGIT F+DLTP E R +LG
Sbjct: 49 FMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKKQDPTAIHGITSFADLTPEELSR-FLG 107
Query: 123 LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
R+ + +QAP+LPT++LP FDWRE GAV PVK QG CGSCW+FSTTG +EGANF
Sbjct: 108 FRKAYS-NRVVNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANF 166
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
L TGKL+SLSE+QL+DCD++ D+GC GG M SA+EY +KA GL +EDYPY
Sbjct: 167 LKTGKLISLSEEQLIDCDYK---------DNGCEGGDMLSAYEY-VKARGLEADEDYPYE 216
Query: 243 GTDRGHA-----CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
H C++ SK+ A++AN+S VS DEDQIAANLVKNGPL++A+ + TY
Sbjct: 217 ELGYRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYE 276
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
GGV+CP IC ++HGVLLVGYG +R YW KNSW + +GENGY+++CRG
Sbjct: 277 GGVACPRICPGEINHGVLLVGYGVEN--GLR-----YWTFKNSWTDEFGENGYFRLCRGV 329
Query: 358 NVCGVDSMVSTVAA 371
VC + S V TV+
Sbjct: 330 GVCDMTSEVGTVST 343
>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
Length = 500
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 158/328 (48%), Positives = 219/328 (66%), Gaps = 29/328 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTPA 114
F KK++ + ++EE++ R IF+ N +RA + D SA HG+T+F DL+
Sbjct: 176 QFPEKKKEYERK--TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEE 233
Query: 115 EFRRTYLGL---------------RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
EFR YLGL + ++ P + D LP +DWR +GAV PV
Sbjct: 234 EFREQYLGLLSTSTSSSASKDAFRKHQMEAPSEED------LEKLPQYYDWRARGAVTPV 287
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQG CGSCW+FSTTGA+EGANF+ TGKLVSLSEQQL+DCD C P+ P +CDSGCNGGL
Sbjct: 288 KDQGQCGSCWTFSTTGAIEGANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGL 347
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
++A EY ++ GGL E+ YPY + C+ + K+ A+++N++ V +E +A LV
Sbjct: 348 PSNAMEYIVEHGGLDTEKSYPYKAY-KEDTCRAKEGKLGATISNYTFVGKNETHMAHALV 406
Query: 280 KNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
K GPL++ INA +MQ+Y+GGV+CP++C++ LDHGVL+VGYG G+AP RL ++PYW+IK
Sbjct: 407 KYGPLSIGINAAWMQSYVGGVACPWLCNKDALDHGVLIVGYGEEGFAPARLHKEPYWVIK 466
Query: 339 NSWGESWGENGYYKICRGRNVCGVDSMV 366
NSWG WGE GYY+IC+ + CGV++MV
Sbjct: 467 NSWGMGWGEEGYYRICKDKGNCGVNNMV 494
>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 272
Score = 320 bits (820), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 152/266 (57%), Positives = 196/266 (73%), Gaps = 9/266 (3%)
Query: 108 FSDLTPAEFRRTYLG------LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKD 161
FSDLT EF YLG R+ R + + LP LP +FDWR KGAV VKD
Sbjct: 2 FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG CGSCW+FSTTGA+EGA+F++TGKLV LSEQQLVDCD CDP+ P +CDSGCNGGL +
Sbjct: 62 QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
+A EY ++ GG+ E+ YPY G ++G CK K K+ A++ NFS VS DE Q+AA LVK
Sbjct: 122 NAMEYIVEHGGIDTEKSYPYVG-EKGE-CKAKKGKLGATLKNFSFVSDDEKQMAAALVKY 179
Query: 282 GPLAVAINAVYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
GPL++ INA +MQ+YIGGV+CP++C + LDHGVL+VGYGS+G+AP+R +PYWI+KNS
Sbjct: 180 GPLSIGINAAWMQSYIGGVACPWLCDAESLDHGVLIVGYGSSGFAPVRWAPEPYWIVKNS 239
Query: 341 WGESWGENGYYKICRGRNVCGVDSMV 366
W +WGE GYY+IC+ + CG+++MV
Sbjct: 240 WSPAWGEGGYYRICKDKGSCGINNMV 265
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 145/238 (60%), Positives = 187/238 (78%), Gaps = 3/238 (1%)
Query: 133 ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
A+ A L + LP FDWREKGAV VK QG+CGSCW+FSTTGA+EGA+F++T KL++LS
Sbjct: 6 AETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLS 65
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKF 252
EQQLVDCDH CD + +CDSGC GGLM +A++Y ++AGGL E YPYTG + CKF
Sbjct: 66 EQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTG--KHGECKF 123
Query: 253 DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LD 311
++A V NF+ V ++E+QIAANLV +GPLAV +NA++MQTYIGGVSCP IC +R ++
Sbjct: 124 KPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPLICPKRWIN 183
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTV 369
HGVLLVGYG+ GY+ +R KPYWIIKNSWG+ WGE+GYY++CRG +CG+++MVS V
Sbjct: 184 HGVLLVGYGAKGYSILRFGYKPYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAV 241
>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 291
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/284 (55%), Positives = 199/284 (70%), Gaps = 10/284 (3%)
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK----LRLPKDADQAPILPTNDLPA 146
A R + SA HG+TQFSDLTP EF T+LG + + P P +DLP
Sbjct: 5 AERQAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLPL 64
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
+FDWRE+GAV PVK+QG+CGSCW+FS TGA+EGANFL TG+LVSLSEQQLVDCDH CDP
Sbjct: 65 EFDWRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDPS 124
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
P +CD GCNGGL +A Y K GL E +YPY G D G AASV++F++
Sbjct: 125 APRNCDYGCNGGLPLNAMRYVQKH-GLDTESNYPYKGVD-GKCASARHGPAAASVSSFNL 182
Query: 267 VSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYA 325
VS +E QIAA L+K+GPL++ I+A +MQTY+GGV+CP+IC++ LDHGVL+VGYG G A
Sbjct: 183 VSTNETQIAAALLKHGPLSIGIDAAWMQTYVGGVACPWICNKAGLDHGVLIVGYGVNGTA 242
Query: 326 PIRL--KEKPYWIIKNSWGESWG-ENGYYKICRGRNVCGVDSMV 366
P R + + YWI+KNSWG +WG E GYY IC+ R CG+++MV
Sbjct: 243 PARPWHRRQDYWIVKNSWGPNWGVEGGYYHICKDRAACGLNTMV 286
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 209/326 (64%), Gaps = 22/326 (6%)
Query: 57 EHHFSLFKKKFNKAYASQ-EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAE 115
E F F + K YAS +E+ R IF N+ RA D A +G T F+DLT E
Sbjct: 5 ERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARD-GAEYGATPFADLTEDE 63
Query: 116 FRRTYLGLRRKLRLPKDADQA------------PILPTNDLPADFDWREKGAVGPVKDQG 163
F + L +R P DA + P LPT ++P +FDWR GAV PVK+QG
Sbjct: 64 FASSLL-----MREPIDAARVERLKRHESSRVLPHLPTENIPLNFDWRALGAVTPVKNQG 118
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCWSFS TGA+EGA+F+ +G LVSLSEQQLVDCDH CDP+ +CDSGC+GGL +A
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKF-DKSKIAASVANFSVVSLDEDQIAANLVKNG 282
Y +K GGL E YPY G CK + AA++ N+S VS DE QIAA LVK+G
Sbjct: 179 MAYVVKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVKHG 238
Query: 283 PLAVAINAVYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIR-LKEKPYWIIKNS 340
PL+V I+A +MQ Y GV+CP+ C + RLDHGVL+VG+G+ G AP R + +P+W+IKNS
Sbjct: 239 PLSVGIDARWMQLYRRGVACPWACDKTRLDHGVLIVGFGAEGRAPARGFRREPFWLIKNS 298
Query: 341 WGESWGENGYYKICRGRNVCGVDSMV 366
WG WGE GYYKIC+ + CGV++MV
Sbjct: 299 WGARWGEEGYYKICKDKGSCGVNTMV 324
>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
Length = 241
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/229 (66%), Positives = 182/229 (79%), Gaps = 15/229 (6%)
Query: 9 FLVSLVVFSAVSSGT--LIDDV---DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLF 63
FL++L +F+ V++ L DD D LIRQV D + + +L AEHHF+ F
Sbjct: 5 FLIALFLFATVATAATTLSDDTNSDDLLIRQVVD----------TAEDHILNAEHHFTSF 54
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
K KF+K YA++EEHD+RF +FK+NL +A HQKLDPSA HGIT+FSDLT +EFRR +LGL
Sbjct: 55 KSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFRRQFLGL 114
Query: 124 RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
++LRLP A +APILPTN+LP DFDWREKGAV PVKDQGSCGSCW+FSTTGALEGAN+L
Sbjct: 115 NKRLRLPAHAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYL 174
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
ATGKL SLSEQQLVDCDH CDPEE GSCDSGCNGGLMN+AFEY L++GG
Sbjct: 175 ATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGG 223
>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
Length = 350
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 215/319 (67%), Gaps = 17/319 (5%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F KK K Y + E+H R+ IFK+N+ +A + + T G+++F DLTP EF+R
Sbjct: 36 FVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRETFGVSKFMDLTPEEFKRM 94
Query: 120 YL-------GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+L R+ L PK+A D P +DWR+KGAV PVK+QG+CGSCW+FS
Sbjct: 95 FLMKTYTPEEARKILAAPKEA-VVTAQQVKDTPTSWDWRQKGAVTPVKNQGACGSCWTFS 153
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE-PGSCDSGCNGGLMNSAFEYTLKAG 231
TTG +EG + + TGKLVSLSEQQLVDCDH C + +CD+GCNGGLM SAF+Y +K G
Sbjct: 154 TTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAFQYVIKTG 213
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL+ E+ YPY G D C+F+KS +A ++ +++ + DE ++AA L NGP+++AINA
Sbjct: 214 GLVTEDSYPYEGVD--DTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANGPISIAINAE 271
Query: 292 YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKP-YWIIKNSWGESWGENG 349
++QTY G+S P+ C+ + LDHGVL+VG+G+ L EK YWIIKNSWG WGE+G
Sbjct: 272 WLQTYTSGISNPWFCNPQDLDHGVLIVGFGTGSN---WLGEKEDYWIIKNSWGADWGESG 328
Query: 350 YYKICRGRNVCGVDSMVST 368
Y++I RG+ CG++S+ S+
Sbjct: 329 YFRIVRGKGKCGLNSVPSS 347
>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
Length = 252
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 160/242 (66%), Positives = 184/242 (76%), Gaps = 12/242 (4%)
Query: 17 SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEE 76
S SS + D LI QV DE + L AE HFS F ++F K+YA ++E
Sbjct: 20 SVASSWPSYAEDDPLIVQVVPESDE--------DELRLNAEAHFSSFLRRFGKSYADEKE 71
Query: 77 HDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPK---DA 133
H +RF++FKANLRRA RHQK+DP+A HGIT+FSDLTPAEFRRTYLGLR RL + +
Sbjct: 72 HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131
Query: 134 DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
+APILPTN+LP DFDWR+ GAV VKDQGSCGSCWSFS +GALEGANFLATG+L SLSE
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQLESLSE 191
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
QQLVDCDHECD EP SCDSGCNGGLM +AFEY LK+GGL E+DYPYTGTDRG CKFD
Sbjct: 192 QQLVDCDHECDSSEPDSCDSGCNGGLMTTAFEYLLKSGGLELEKDYPYTGTDRGR-CKFD 250
Query: 254 KS 255
+S
Sbjct: 251 ES 252
>gi|2253415|gb|AAB62937.1| stress-induced cysteine proteinase [Lavatera thuringiaca]
Length = 175
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 138/170 (81%), Positives = 158/170 (92%), Gaps = 1/170 (0%)
Query: 202 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV 261
ECDP++ G+C++GC+GGLM SAFEYTLKAGGL REE+YPYTG DRG CKFDK+KIAASV
Sbjct: 1 ECDPQQYGACNAGCSGGLMTSAFEYTLKAGGLEREEEYPYTGIDRG-GCKFDKTKIAASV 59
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGS 321
+NFSV+S+DEDQIAAN+VK+GPLAV INA +MQTYIGGVSCPYIC R LDHGVLLVGYG+
Sbjct: 60 SNFSVISVDEDQIAANMVKHGPLAVGINAAFMQTYIGGVSCPYICFRSLDHGVLLVGYGA 119
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
AGYAP+R KEKP+WIIKNSWG +WGE+GYYKICRGRNVCGVDSMVS+VAA
Sbjct: 120 AGYAPVRFKEKPFWIIKNSWGANWGEDGYYKICRGRNVCGVDSMVSSVAA 169
>gi|118488886|gb|ABK96252.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 156
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 141/152 (92%), Positives = 149/152 (98%), Gaps = 1/152 (0%)
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
MNSAFEYTLKAGGLMREEDYPYTGTDRG ACKFDK+K+AA VANFSVVSLDEDQIAANLV
Sbjct: 1 MNSAFEYTLKAGGLMREEDYPYTGTDRG-ACKFDKNKVAARVANFSVVSLDEDQIAANLV 59
Query: 280 KNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
KNGPLAVAINAV+MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGY+P+R+KEKP+WIIKN
Sbjct: 60 KNGPLAVAINAVFMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKN 119
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
SWGE WGENG+YKICRGRNVCGVDSMVSTVAA
Sbjct: 120 SWGEKWGENGFYKICRGRNVCGVDSMVSTVAA 151
>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
Length = 347
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 218/320 (68%), Gaps = 19/320 (5%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F +K+ K Y + EEH++R+ IFKAN+ ++ + + GIT+FSDLTP EF+R
Sbjct: 33 FIKFSRKYAKVYGT-EEHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRM 91
Query: 120 YLGLRRKLRLPKDADQAPILPTNDL---------PADFDWREKGAVGPVKDQGSCGSCWS 170
+L K P++A + P + + P FDWR+ GAV VK+QG+CGSCW+
Sbjct: 92 FL---MKTYTPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWT 148
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHEC-DPEEPGSCDSGCNGGLMNSAFEYTLK 229
FSTTG +EG + GKLVSLSEQQLVDCDH C + +CDSGCNGGLM SAF+Y +K
Sbjct: 149 FSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIK 208
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL E+ YPY G D C+F+KS +AA++++++ +S DE+Q+AA L NGP+++AIN
Sbjct: 209 NGGLDTEDSYPYEGVD--DTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPISIAIN 266
Query: 290 AVYMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
A ++Q Y G+S P+ C+ + LDHGVL+VGYG G + + E+ YWI+KNSWG WGE+
Sbjct: 267 AEWLQYYTSGISDPWFCNPQDLDHGVLIVGYG-VGKSWLG-SEENYWIVKNSWGSDWGED 324
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY++I RG+ CG++S+ S+
Sbjct: 325 GYFRIIRGKGKCGLNSVPSS 344
>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
Length = 356
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 207/327 (63%), Gaps = 22/327 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ F++K K Y +++ D R+ IFK N+ RA L G+T+FSDLTP EF
Sbjct: 34 QQLFTQFRRKHVKLYGTKQVQDRRYQIFKQNVERARFENYLTERDNMGVTRFSDLTPDEF 93
Query: 117 RRTYLGLRRKLRLPKDAD-------QAP------ILPTNDLPADFDWREKGAVGPVKDQG 163
+ +L K PK A Q P + +D P +FDWRE AV PVKDQG
Sbjct: 94 KSMFL---MKSYTPKQARELLSGMRQYPANAKLTMKQVSDAPKEFDWREHNAVTPVKDQG 150
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE-PGSCDSGCNGGLMNS 222
+CGSCW+FSTTG +EG TGKL+SLSEQQLVDCDH C E +C++GCNGGLM S
Sbjct: 151 NCGSCWTFSTTGNVEGMYAAKTGKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWS 210
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNG 282
+FE+ +K GGL+ EE YPY D + C+F+ S ++N++ VS +ED++AA L NG
Sbjct: 211 SFEHIIKTGGLVTEESYPYEAVD--NRCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNG 268
Query: 283 PLAVAINAVYMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
P+A+AINA Y+Q Y G+ P C L+HGVL+VGYG A ++++ YWI+KNSW
Sbjct: 269 PIAIAINADYLQYYRKGILNPSRCDPEELNHGVLIVGYGEEKAANGKVEK--YWIVKNSW 326
Query: 342 GESWGENGYYKICRGRNVCGVDSMVST 368
SWGE GY ++ RG+ VCG++++ S+
Sbjct: 327 SASWGEKGYVRVLRGKGVCGLNAVPSS 353
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 202/324 (62%), Gaps = 17/324 (5%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLT 112
L E F F K+ K Y ++EE + RF IFK NL Q+ + + +G+TQF+DLT
Sbjct: 725 LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLT 784
Query: 113 PAEFRRTYLGLRRKLRLPKDADQA-PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
AEF+ +LGL+ L+ D +P +LP+D+DWR V PVKDQGSCGSCW+F
Sbjct: 785 KAEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG + G+L+SLSEQ+LVDCD DSGCNGGL ++A+ + G
Sbjct: 845 SVTGNIEGQYAIKHGELLSLSEQELVDCD---------KLDSGCNGGLPDTAYRAIEELG 895
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY D C F+K+K+ ++ + ++ +E Q+A LVKNGP+++ INA
Sbjct: 896 GLELESDYPYDAED--EKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINAN 953
Query: 292 YMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y+GGVS P ++CS LDHGVL+VGYG Y PI K PYWIIKNSWG WGE
Sbjct: 954 AMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFY-PIFKKTMPYWIIKNSWGPRWGEQ 1012
Query: 349 GYYKICRGRNVCGVDSMVSTVAAA 372
GYY++ RG CGV+ MV++ A
Sbjct: 1013 GYYRVYRGDGTCGVNKMVTSAVVA 1036
>gi|144228217|gb|ABO93617.1| papain-like cysteine proteinase [Vitis vinifera]
Length = 161
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 135/162 (83%), Positives = 147/162 (90%), Gaps = 1/162 (0%)
Query: 195 QLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDK 254
QLVDCDHECDPEE G+CD GCNGGLM SAFEY LKAGG+ REE YPY G+DRG +CKF+K
Sbjct: 1 QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRG-SCKFNK 59
Query: 255 SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGV 314
S+I ASV+NFSVVSLDEDQIAAN+VKNGPLAV INAV+MQTY+ GVSCPYICSR LDHGV
Sbjct: 60 SQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQTYMKGVSCPYICSRNLDHGV 119
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
+LVGYGSAGYAPIR KEKPYWIIKNSWGESWGE+GY K CRG
Sbjct: 120 VLVGYGSAGYAPIRFKEKPYWIIKNSWGESWGEDGYDKNCRG 161
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 203/333 (60%), Gaps = 22/333 (6%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
++L E ++LL F F K +NK Y S +E R+ +F+ NL+ + +K +
Sbjct: 565 KMLKMAEDYKDELL-----FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQG 619
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD-QAPILPTNDLPADFDWREKGAVGP 158
+A +G+T F+DLTP EF+ YLGL+ L D Q ++P DLP FDWRE AV P
Sbjct: 620 TAVYGVTMFADLTPEEFKTKYLGLKTNLNQENDIPLQEAVIPDIDLPPKFDWREYNAVTP 679
Query: 159 VKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGG 218
VKDQG CGSCW+FS G +EG + KL+SLSEQ+LVDCD+ D GC GG
Sbjct: 680 VKDQGQCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDCDN---------LDDGCGGG 730
Query: 219 LMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANL 278
M +A++ K GGL E DYPY R C F K+K VA+ ++ DE ++A L
Sbjct: 731 YMINAYKTVEKLGGLELETDYPYDA--RNEKCHFLKNKAKVQVASALNITNDEKKMAQWL 788
Query: 279 VKNGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYW 335
VKNGP++V INA MQ Y GGVS P ++C LDHGVL+VGY ++ Y P+ K+ PYW
Sbjct: 789 VKNGPISVGINANAMQFYFGGVSHPFKFLCDPANLDHGVLIVGYATSTY-PLFKKKLPYW 847
Query: 336 IIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
IIKNSWG WGE GYY++ RG CGV++M S+
Sbjct: 848 IIKNSWGPKWGEQGYYRVYRGDGTCGVNAMASS 880
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 199/324 (61%), Gaps = 23/324 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
E F F KKF K Y S +E RF IFK NL+ Q + +A +G+T F+DLTP E
Sbjct: 576 ETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKE 635
Query: 116 FRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
F+ YLGLR +L K ++ P+ +P LP FDWR+ V PVKDQG CGSCW+F
Sbjct: 636 FKARYLGLRPEL---KHENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGSCWAF 692
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG + +L+SLSEQ+LVDCD S D GCNGG M +A++ + G
Sbjct: 693 SVTGNVEGQYAIKHNQLLSLSEQELVDCD---------SLDEGCNGGDMENAYKAIERLG 743
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY D C F ++K V + ++ DE ++A LVKNGP++V INA
Sbjct: 744 GLELESDYPYDAKD--EKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPISVGINAN 801
Query: 292 YMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y GGVS P ++C+ + LDHGVL+VGYG + Y P+ KE PYWIIKNSWG WGE
Sbjct: 802 AMQFYFGGVSHPLNFLCNPKNLDHGVLIVGYGISKY-PLFHKELPYWIIKNSWGPRWGER 860
Query: 349 GYYKICRGRNVCGVDSMVSTVAAA 372
GYY++ RG CGV++M ++ A
Sbjct: 861 GYYRVYRGDGTCGVNTMATSAVVA 884
>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
Length = 343
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 198/325 (60%), Gaps = 17/325 (5%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA------ARHQKLDPSATHGITQ 107
L + F F+ KFNK Y S EE+ RF IFK+NL + A + K D G+ +
Sbjct: 23 LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKAD--TKFGVNK 79
Query: 108 FSDLTPAEFRRTYLGLRRKL---RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGS 164
F+DL+ EF+ YL + + LP AD N +P FDWR +GAV PVK+QG
Sbjct: 80 FADLSSDEFKNYYLNNKEAIFTDDLPV-ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQ 138
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC-DPEEPGSCDSGCNGGLMNSA 223
CGSCWSFSTTG +EG +F++ KLVSLSEQ LVDCDHEC + E +CD GCNGGL +A
Sbjct: 139 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNA 198
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+ Y +K GG+ E YPYT + G C F+ + I A ++NF+++ +E +A +V GP
Sbjct: 199 YNYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGP 257
Query: 284 LAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
LA+A +AV Q YIGGV LDHG+L+VGY + I K PYWI+KNSWG
Sbjct: 258 LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGA 315
Query: 344 SWGENGYYKICRGRNVCGVDSMVST 368
WGE GY + RG+N CGV + VST
Sbjct: 316 DWGEQGYIYLRRGKNTCGVSNFVST 340
>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
Length = 343
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 197/322 (61%), Gaps = 17/322 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA------ARHQKLDPSATHGITQFSD 110
+ F F+ KFNK Y S EE+ RF IFK+NL + A + K D G+ +F+D
Sbjct: 26 QSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKAD--TKFGVNKFAD 82
Query: 111 LTPAEFRRTYLGLRRKL---RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
L+ EF+ YL + + LP AD N +P FDWR +GAV PVK+QG CGS
Sbjct: 83 LSSDEFKNYYLNNKEAIFTDDLPV-ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGS 141
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC-DPEEPGSCDSGCNGGLMNSAFEY 226
CWSFSTTG +EG +F++ KLVSLSEQ LVDCDHEC + E +CD GCNGGL +A+ Y
Sbjct: 142 CWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY 201
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
+K GG+ E YPYT + G C F+ + I A ++NF+++ +E +A +V GPLA+
Sbjct: 202 IIKNGGIQTESSYPYTA-ETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAI 260
Query: 287 AINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A +AV Q YIGGV LDHG+L+VGY + I K PYWI+KNSWG WG
Sbjct: 261 AADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWG 318
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GY + RG+N CGV + VST
Sbjct: 319 EQGYIYLRRGKNTCGVSNFVST 340
>gi|118483347|gb|ABK93575.1| unknown [Populus trichocarpa]
Length = 157
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 131/152 (86%), Positives = 144/152 (94%), Gaps = 1/152 (0%)
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
MN+AFEY LKAGGL RE+DYPYTG DRG ACKF+KSK+AASV+NFSVVSLDEDQIAANLV
Sbjct: 1 MNNAFEYALKAGGLEREKDYPYTGNDRG-ACKFEKSKVAASVSNFSVVSLDEDQIAANLV 59
Query: 280 KNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
K+GPL+VAINAV+MQTYIGGVSCPYICS+ DHGVLLVGYG+AGYAPIR KEKP+WIIKN
Sbjct: 60 KHGPLSVAINAVFMQTYIGGVSCPYICSKHQDHGVLLVGYGAAGYAPIRFKEKPFWIIKN 119
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
SWGE+WGENGYYKICR RN+CGVDSMVSTVAA
Sbjct: 120 SWGENWGENGYYKICRARNICGVDSMVSTVAA 151
>gi|323713078|gb|ADY04293.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713086|gb|ADY04297.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 128/145 (88%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+RLKEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRLKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
Length = 465
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 192/313 (61%), Gaps = 20/313 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLR------RAARHQKLDPSATHGITQFSD 110
E F F+ K+NK Y S E + RF FK+NL+ R A +K S G+ +F+D
Sbjct: 25 ETQFRQFQIKYNKQYTSSE-YAERFATFKSNLKVIDEKNRDAASRK--SSVRFGVNEFAD 81
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
L+ +EFR TYL + +R P +A A LP DLP FDWR KGAV VK+QG CGSCWS
Sbjct: 82 LSQSEFRATYLNSVQAVRDP-NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWS 140
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS--CDSGCNGGLMNSAFEYTL 228
FSTTG +EG FLA L LSEQ LVDCDHEC E G CD GCNGGL +A+ Y +
Sbjct: 141 FSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHEC-MEYLGDNVCDQGCNGGLQPNAYTYII 199
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
K GG+ E YPY G D C F + I A ++N++ VS +E Q+AA LV NGPLA+A
Sbjct: 200 KNGGIDTEASYPYQGVD--GTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAA 257
Query: 289 NAVYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+AV Q Y+GGV P C LDHG+L+VGY + I K+K YWI+KNSWG +WGE
Sbjct: 258 DAVEWQFYLGGVFDVP--CGNTLDHGILIVGYSAEN--TIFHKDKAYWIVKNSWGATWGE 313
Query: 348 NGYYKICRGRNVC 360
GY I RG C
Sbjct: 314 QGYIYISRGNGEC 326
>gi|323713016|gb|ADY04262.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713018|gb|ADY04263.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713020|gb|ADY04264.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713022|gb|ADY04265.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713024|gb|ADY04266.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713026|gb|ADY04267.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713030|gb|ADY04269.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713032|gb|ADY04270.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713034|gb|ADY04271.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713036|gb|ADY04272.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713038|gb|ADY04273.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713040|gb|ADY04274.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713042|gb|ADY04275.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713044|gb|ADY04276.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713046|gb|ADY04277.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713048|gb|ADY04278.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713050|gb|ADY04279.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713052|gb|ADY04280.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713054|gb|ADY04281.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713056|gb|ADY04282.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713058|gb|ADY04283.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713060|gb|ADY04284.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713062|gb|ADY04285.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713064|gb|ADY04286.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713066|gb|ADY04287.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713068|gb|ADY04288.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713070|gb|ADY04289.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713072|gb|ADY04290.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713074|gb|ADY04291.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713076|gb|ADY04292.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713080|gb|ADY04294.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713084|gb|ADY04296.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713088|gb|ADY04298.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713090|gb|ADY04299.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713092|gb|ADY04300.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713094|gb|ADY04301.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713096|gb|ADY04302.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713098|gb|ADY04303.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713100|gb|ADY04304.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713102|gb|ADY04305.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713104|gb|ADY04306.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713106|gb|ADY04307.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713108|gb|ADY04308.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713110|gb|ADY04309.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713112|gb|ADY04310.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713114|gb|ADY04311.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713116|gb|ADY04312.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713118|gb|ADY04313.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713120|gb|ADY04314.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713122|gb|ADY04315.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713124|gb|ADY04316.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713126|gb|ADY04317.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713128|gb|ADY04318.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713130|gb|ADY04319.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713132|gb|ADY04320.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713134|gb|ADY04321.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713136|gb|ADY04322.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713138|gb|ADY04323.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713140|gb|ADY04324.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713142|gb|ADY04325.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713144|gb|ADY04326.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713146|gb|ADY04327.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713148|gb|ADY04328.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713150|gb|ADY04329.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713152|gb|ADY04330.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713154|gb|ADY04331.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713156|gb|ADY04332.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713158|gb|ADY04333.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713160|gb|ADY04334.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713162|gb|ADY04335.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713166|gb|ADY04337.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713168|gb|ADY04338.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713170|gb|ADY04339.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713172|gb|ADY04340.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713174|gb|ADY04341.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713180|gb|ADY04344.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713182|gb|ADY04345.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713184|gb|ADY04346.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713186|gb|ADY04347.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713188|gb|ADY04348.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713190|gb|ADY04349.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713192|gb|ADY04350.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713194|gb|ADY04351.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713196|gb|ADY04352.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713198|gb|ADY04353.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713200|gb|ADY04354.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713202|gb|ADY04355.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713204|gb|ADY04356.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713206|gb|ADY04357.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713212|gb|ADY04360.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713216|gb|ADY04362.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713218|gb|ADY04363.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713220|gb|ADY04364.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713222|gb|ADY04365.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713224|gb|ADY04366.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713226|gb|ADY04367.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713230|gb|ADY04369.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713232|gb|ADY04370.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713234|gb|ADY04371.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713236|gb|ADY04372.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713238|gb|ADY04373.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713240|gb|ADY04374.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713246|gb|ADY04377.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713248|gb|ADY04378.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713250|gb|ADY04379.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713252|gb|ADY04380.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713254|gb|ADY04381.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713256|gb|ADY04382.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713258|gb|ADY04383.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713260|gb|ADY04384.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713262|gb|ADY04385.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713264|gb|ADY04386.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713266|gb|ADY04387.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713268|gb|ADY04388.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713270|gb|ADY04389.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713274|gb|ADY04391.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713276|gb|ADY04392.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713278|gb|ADY04393.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713280|gb|ADY04394.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713282|gb|ADY04395.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713284|gb|ADY04396.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713286|gb|ADY04397.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713288|gb|ADY04398.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713290|gb|ADY04399.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713292|gb|ADY04400.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713294|gb|ADY04401.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713296|gb|ADY04402.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713298|gb|ADY04403.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713300|gb|ADY04404.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713302|gb|ADY04405.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713304|gb|ADY04406.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713306|gb|ADY04407.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713308|gb|ADY04408.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713310|gb|ADY04409.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713312|gb|ADY04410.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713314|gb|ADY04411.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713316|gb|ADY04412.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713318|gb|ADY04413.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713322|gb|ADY04415.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713324|gb|ADY04416.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713326|gb|ADY04417.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713328|gb|ADY04418.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713330|gb|ADY04419.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713332|gb|ADY04420.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713334|gb|ADY04421.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713336|gb|ADY04422.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713338|gb|ADY04423.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713340|gb|ADY04424.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713342|gb|ADY04425.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713344|gb|ADY04426.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713346|gb|ADY04427.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713348|gb|ADY04428.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713350|gb|ADY04429.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713352|gb|ADY04430.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713354|gb|ADY04431.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713356|gb|ADY04432.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713358|gb|ADY04433.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713360|gb|ADY04434.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713362|gb|ADY04435.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713364|gb|ADY04436.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713366|gb|ADY04437.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713368|gb|ADY04438.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713370|gb|ADY04439.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713372|gb|ADY04440.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713374|gb|ADY04441.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713376|gb|ADY04442.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713378|gb|ADY04443.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713380|gb|ADY04444.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713382|gb|ADY04445.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713384|gb|ADY04446.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713386|gb|ADY04447.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713388|gb|ADY04448.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713390|gb|ADY04449.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713392|gb|ADY04450.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713394|gb|ADY04451.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713396|gb|ADY04452.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713398|gb|ADY04453.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713400|gb|ADY04454.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713402|gb|ADY04455.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713404|gb|ADY04456.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713408|gb|ADY04458.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713410|gb|ADY04459.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713412|gb|ADY04460.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713414|gb|ADY04461.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713416|gb|ADY04462.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713418|gb|ADY04463.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713420|gb|ADY04464.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713422|gb|ADY04465.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713424|gb|ADY04466.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713426|gb|ADY04467.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713428|gb|ADY04468.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713430|gb|ADY04469.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713432|gb|ADY04470.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713434|gb|ADY04471.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713436|gb|ADY04472.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713438|gb|ADY04473.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713440|gb|ADY04474.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713442|gb|ADY04475.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713444|gb|ADY04476.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713448|gb|ADY04478.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713454|gb|ADY04481.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713458|gb|ADY04483.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713460|gb|ADY04484.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713462|gb|ADY04485.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713464|gb|ADY04486.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713466|gb|ADY04487.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713468|gb|ADY04488.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713470|gb|ADY04489.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713474|gb|ADY04491.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713478|gb|ADY04493.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713494|gb|ADY04501.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713496|gb|ADY04502.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713498|gb|ADY04503.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713500|gb|ADY04504.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713502|gb|ADY04505.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713504|gb|ADY04506.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713506|gb|ADY04507.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713508|gb|ADY04508.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713510|gb|ADY04509.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713512|gb|ADY04510.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713514|gb|ADY04511.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713516|gb|ADY04512.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713518|gb|ADY04513.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713520|gb|ADY04514.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713522|gb|ADY04515.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713524|gb|ADY04516.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713526|gb|ADY04517.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713528|gb|ADY04518.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 127/145 (87%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713210|gb|ADY04359.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 127/145 (87%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDRWGEEGFYKICRGRNICG 144
>gi|323713228|gb|ADY04368.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713242|gb|ADY04375.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713244|gb|ADY04376.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713272|gb|ADY04390.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713446|gb|ADY04477.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713450|gb|ADY04479.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 127/145 (87%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGNKWGEEGFYKICRGRNICG 144
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus pulchellus]
Length = 475
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/313 (49%), Positives = 204/313 (65%), Gaps = 20/313 (6%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-GITQFSDLTPAEFRR 118
FS+F + +NK Y +EEH+ RF IFK NL+R A +L+ H G+T+FSDL+P+EF R
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225
Query: 119 TYLGLRRKLRLPKDADQAPIL--PTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YLGL++ L K A+ PI P N+ LP FDWR KGAV VK+QG CGSCW+FS TG
Sbjct: 226 HYLGLKKDLAEHK-AEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTG 284
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+EG FL+ KL+SLSEQ+LVDCDH D GC GG M A + ++ GGL
Sbjct: 285 NVEGQWFLSRSKLLSLSEQELVDCDH---------GDHGCKGGYMGQAMKAVIEMGGLET 335
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E +YPY G D C+F+K++ A V +F + +E ++A L+K+GP+++ INA MQ
Sbjct: 336 ESEYPYKGVD--GTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVSIGINANAMQF 393
Query: 296 YIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GG+S P ++CS LDHGVLLVG+G + R K PYWI+KNSWG+ WGE GYY+
Sbjct: 394 YFGGISHPWKFLCSPTDLDHGVLLVGFGVDKRS-FRRKPVPYWIVKNSWGKYWGEKGYYR 452
Query: 353 ICRGRNVCGVDSM 365
+ RG CGV+ M
Sbjct: 453 VYRGDGTCGVNQM 465
>gi|323713456|gb|ADY04482.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 127/145 (87%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRVKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 206/326 (63%), Gaps = 25/326 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A F F + K Y+ QE H RF F NL+R H ++ SA +G+T+F+DL+
Sbjct: 46 ARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDF 104
Query: 115 EFRRTYLGLRRKLRLP-------KDADQAPILP-TNDLPADFDWREKGAVGPVKDQGSCG 166
EFRR YLGL+ +L++P K + + L + FDW EKGAV VK+QG CG
Sbjct: 105 EFRRHYLGLKPELKIPNRKKYERKSRNSSKKLKFAKTVDETFDWVEKGAVTEVKNQGMCG 164
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FSTTG +EGA F ATG LVSLSEQ+LVDCD + DSGCNGGLM+ AFE
Sbjct: 165 SCWAFSTTGNIEGAWFKATGDLVSLSEQELVDCDQK---------DSGCNGGLMDQAFEE 215
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
++ GGL E+ YPY G C F+KS + +F + DE++IA L ++GPL++
Sbjct: 216 VIRIGGLETEQQYPYDGVQE--TCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSI 273
Query: 287 AINAVYMQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKE-KPYWIIKNSWG 342
AINA MQ Y GG+S P ++CS+ LDHGVL+VGYG + R + +PYW IKNSWG
Sbjct: 274 AINAFGMQFYRGGISHPLSFLCSQDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWG 333
Query: 343 ESWGENGYYKICRGRNVCGVDSMVST 368
WGE+GYY++ RG+ VCGV+ MVST
Sbjct: 334 PRWGEDGYYRVARGKGVCGVNKMVST 359
>gi|323713208|gb|ADY04358.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYG++GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGTSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713452|gb|ADY04480.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+++KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVKMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713164|gb|ADY04336.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713178|gb|ADY04343.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLK GGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKTGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713406|gb|ADY04457.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKI ASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIVASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713214|gb|ADY04361.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAG LM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGALMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713028|gb|ADY04268.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 142/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKP+WI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPHWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|323713082|gb|ADY04295.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+ +KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVSMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/326 (44%), Positives = 201/326 (61%), Gaps = 21/326 (6%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL--RRAARHQKLDPSATHGITQFSDL 111
+ +E F F +N+ YA++EE + R +IF+ NL R R + + +G+ QF+D+
Sbjct: 721 MRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQG-TGQYGVNQFADV 779
Query: 112 TPAEFRRTYLGLRRKLRLPKDAD--QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ EF YLGLR LR + QA I P +LP FDWR+KGAV PVK+QG CGSCW
Sbjct: 780 STEEFHAFYLGLRPDLRTENNIPLRQAEI-PDIELPNSFDWRQKGAVTPVKNQGMCGSCW 838
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS TG +EG + KL+SLSEQ+LVDCD D GCNGGL ++A+ K
Sbjct: 839 AFSVTGNVEGQYAIKHNKLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRAIEK 889
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL E DYPY + C F K+ V + ++ +E QIA LV NGP+++ IN
Sbjct: 890 LGGLELESDYPYEAEN--ERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGIN 947
Query: 290 AVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y+GGVS P ++C+ + LDHGVL+VGYG++ Y P+ K+ PYWI+KNSWG+ WG
Sbjct: 948 ANAMQFYMGGVSHPFKFLCNPKNLDHGVLIVGYGTSNY-PLFHKKLPYWIVKNSWGDRWG 1006
Query: 347 ENGYYKICRGRNVCGVDSMVSTVAAA 372
E GYY++ RG CG+++M S+
Sbjct: 1007 EQGYYRVYRGDGTCGLNTMASSAVVV 1032
>gi|323713176|gb|ADY04342.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 125/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLK GGLM+EEDYPYTGTD+G +CKF+KSKIAA+VANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKTGGLMKEEDYPYTGTDKG-SCKFEKSKIAAAVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 152/334 (45%), Positives = 206/334 (61%), Gaps = 21/334 (6%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H +S +++L EH F+ F+ KF + Y + E RF IFK NL+ + +
Sbjct: 147 EKKTHKKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG 206
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI--LPTNDLPADFDWREKGAVG 157
SA +GIT+F+D+T E+++ GL + R P+ A P +P DLP +FDWREKGA+
Sbjct: 207 SAKYGITEFADMTSPEYKQR-TGLWQ--RDPQKAASNPKAEIPNIDLPKEFDWREKGAIS 263
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
VK+QG+CGSCW+FS TG +EG + + TG L SEQ+L+DCD + DS CNG
Sbjct: 264 AVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNG 314
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAAN 277
GL ++A+E K GGL E DYPY R C F+ +KI V + +E IA
Sbjct: 315 GLPDNAYEAIEKIGGLELESDYPYHA--RKDQCHFNSTKIHVKVKGHVDLPKNETAIAQW 372
Query: 278 LVKNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPY 334
L+ NGP+++ INA MQ Y GGVS P +CSR+ LDHGVL+VGYG + Y P+ K PY
Sbjct: 373 LIANGPISIGINANAMQFYRGGVSHPPHILCSRKNLDHGVLIVGYGVSDY-PMFKKTLPY 431
Query: 335 WIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
WI+KNSWG+ WGE GYY++ RG N CGV M S+
Sbjct: 432 WIVKNSWGKKWGEQGYYRVYRGDNTCGVSEMSSS 465
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 204/327 (62%), Gaps = 24/327 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLT 112
+ AE F+ F +N+ Y+S E + RF IF+ NL ++ + + +G+ F+D++
Sbjct: 464 MKAERLFNNFMTTYNRTYSSLE-RNLRFKIFRENLNFIEELRETEQGTGIYGVNMFADMS 522
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
EFR YLGLR L + ++ P+ +P DLP+ FDWR+KG V PVK+QG CGSC
Sbjct: 523 QKEFRTRYLGLRPDL---QSENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQGQCGSC 579
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS TG +EG + G+L+SLSEQ+LVDCDH D GCNGGL ++A+
Sbjct: 580 WAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDH---------LDEGCNGGLPDNAYRAIE 630
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
+ GGL E DYPY + C F ++ + +A+ ++ +E QIA LV+NGP+A+ I
Sbjct: 631 QLGGLELESDYPYEAEN--EKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPIAIGI 688
Query: 289 NAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
NA MQ Y+GGVS P +C+ L+HGVL+VGYG++ Y P+ K PYWIIKNSWG+SW
Sbjct: 689 NANAMQFYMGGVSHPLKILCNPNNLNHGVLIVGYGTSRY-PLFHKNLPYWIIKNSWGKSW 747
Query: 346 GENGYYKICRGRNVCGVDSMVSTVAAA 372
GE GYY++ RG CG+++M S+
Sbjct: 748 GEQGYYRVYRGDGTCGLNTMASSAVVV 774
>gi|323713320|gb|ADY04414.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 144
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 126/145 (86%), Positives = 141/145 (97%), Gaps = 1/145 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVS DEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSHDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICRGRNVCG 361
IKNSWG+ WGE G+YKICRGRN+CG
Sbjct: 120 IKNSWGDKWGEEGFYKICRGRNICG 144
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 197/316 (62%), Gaps = 23/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F+ F + Y S++E RF IF N+R+A + Q ++ +A +G+T+F+D++ +EF++
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFKQ 477
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
Y+G K +A I N LP FDWRE GAV VK+QGSCGSCW+FSTTG +E
Sbjct: 478 -YVGKVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGNIE 536
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G ++ KLVSLSEQ+LVDCD D GCNGGL + A++ ++ GGL E D
Sbjct: 537 GQWAISKKKLVSLSEQELVDCD---------KVDEGCNGGLPSQAYKEIIRLGGLETETD 587
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
Y Y G + C DKSKI + +S +E ++AA LVKNGP+++ INA MQ Y+G
Sbjct: 588 YKYRGHNE--KCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFYMG 645
Query: 299 GVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ C+ + LDHGVL+VGYG G KPYWIIKNSWG WGE GYY + R
Sbjct: 646 GISHPWKIFCNPKELDHGVLIVGYGVKG-------SKPYWIIKNSWGPDWGEKGYYLVYR 698
Query: 356 GRNVCGVDSMVSTVAA 371
G VCG+++M ++
Sbjct: 699 GAGVCGLNTMCTSAVV 714
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 197/322 (61%), Gaps = 17/322 (5%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-GITQFSDLTPA 114
+E F+ F +N+ Y++ EE + R IF+ NL +K + H + F+D++P
Sbjct: 578 SEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVNMFADMSPE 637
Query: 115 EFRRTYLGLRRKLRLPKDAD-QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EFR YLGLR LR D + +P +LP FDWREK V PVKDQG CGSCW+FS
Sbjct: 638 EFRSRYLGLRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQGMCGSCWAFSV 697
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG +EG + G+L+SLSEQ+LVDCD D GCNGGL ++A+ K GGL
Sbjct: 698 TGNIEGQYAIKHGRLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRAIEKLGGL 748
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E DYPY + C F K+ +A+ ++ +E Q+A LV+NGP+++ INA M
Sbjct: 749 ELESDYPYEAEN--EKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIGINANAM 806
Query: 294 QTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y+GGVS P ++C+ + LDHGVL+VGYG++ Y P+ K+ PYW IKNSWG+ WGE GY
Sbjct: 807 QFYVGGVSHPFKFLCNPKNLDHGVLIVGYGTSDY-PLFHKKLPYWTIKNSWGKRWGEQGY 865
Query: 351 YKICRGRNVCGVDSMVSTVAAA 372
Y++ RG CG++++ ++
Sbjct: 866 YRVYRGDGTCGLNTLATSAVVV 887
>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
Length = 227
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 178/244 (72%), Gaps = 25/244 (10%)
Query: 136 APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQ 195
AP+LPT++LP FDWRE GA+ PVK+QGSCGSCW+FS+TGA+EGA+FL + +L+SL E+Q
Sbjct: 1 APLLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQ 60
Query: 196 LVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG------HA 249
LVDCD D GC GG M +A+EY +KA GL EEDYPY + H
Sbjct: 61 LVDCDR---------MDGGCKGGDMLNAYEY-IKAKGLEAEEDYPYQEENYKEYMFPHHR 110
Query: 250 CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC--S 307
C F SK+AA++AN+S VS DEDQIAANLVKNGPL++A+NA Y+ Y+GGV+CP IC
Sbjct: 111 CHFRPSKVAATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGVACPRICPGG 170
Query: 308 RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
++H VLLVGYG G +KPYWI+KNSW E++GE+GY+++CRG VCG+++ VS
Sbjct: 171 DNMNHAVLLVGYGMDG-------DKPYWILKNSWSENYGEDGYFRLCRGFGVCGMNTRVS 223
Query: 368 TVAA 371
TV+A
Sbjct: 224 TVSA 227
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 148/330 (44%), Positives = 192/330 (58%), Gaps = 14/330 (4%)
Query: 43 LSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SA 101
L+ ++ +D L + F F NK Y S EE RF IF AN+++ Q + SA
Sbjct: 263 LTTKKNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSA 322
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKD 161
+G TQF+DLT EF++ YLGL + K A I + +P +FDWR V PVK+
Sbjct: 323 IYGATQFADLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKN 382
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG+CGSCW+FS +EG L + +L+SLSEQ+L+DCD+ D+GC GGLM
Sbjct: 383 QGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMT 433
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
AFE GGL E DYPY G C+ KS + S++ VS DE+ IA LVK+
Sbjct: 434 QAFEAVENLGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKH 493
Query: 282 GPLAVAINAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GPL+V +NA MQ Y+GGVS P +CS + LDHGV +VGYG K PYW+IK
Sbjct: 494 GPLSVGVNANAMQFYMGGVSHPIHALCSPKSLDHGVAIVGYG-VHRTKYTHKNLPYWLIK 552
Query: 339 NSWGESWGENGYYKICRGRNVCGVDSMVST 368
NSWG WGE GYY + RG CGV+ MVS+
Sbjct: 553 NSWGPGWGEKGYYLLYRGDGSCGVNQMVSS 582
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 205/335 (61%), Gaps = 25/335 (7%)
Query: 45 HHESTNNDL---LGAEHHFSLFKKKFNKAYAS-QEEHDHRFTIFKANLRRAAR---HQKL 97
+HE+ ++ L AEH F F + Y + + RF IFK N+R+ H++
Sbjct: 2353 YHEAATAEVYHHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHER- 2411
Query: 98 DPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD-QAPILPTNDLPADFDWREKGAV 156
+AT+G+T+F+DLT EF ++G++ LR P + ++P P FDWR+ GAV
Sbjct: 2412 -GTATYGVTRFADLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAV 2470
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGSCGSCW+FS TG +EG + TG LVSLSEQ+LVDCD D GCN
Sbjct: 2471 TGVKDQGSCGSCWAFSVTGNIEGQWKMKTGDLVSLSEQELVDCD---------KLDQGCN 2521
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGL ++A+ + GGL E+DYPY G+D C F+K+ ++ ++ +E +A
Sbjct: 2522 GGLPDNAYRAIEQLGGLESEDDYPYEGSD--DKCSFNKTLARVQISGAVNITSNETDMAK 2579
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKP 333
LVK+GP+++ INA MQ Y+GG+S P+ +C+ LDHGVL+VGYG+ Y P+ K P
Sbjct: 2580 WLVKHGPISIGINANAMQFYMGGISHPWRMLCNPSNLDHGVLIVGYGAKDY-PLFHKHLP 2638
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWIIKNSWG SWGE GYY++ RG CGV+ M S+
Sbjct: 2639 YWIIKNSWGTSWGEQGYYRVYRGDGTCGVNQMASS 2673
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/334 (45%), Positives = 205/334 (61%), Gaps = 21/334 (6%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H +S +++L EH F+ F+ KF + Y + E RF IFK NL+ + +
Sbjct: 147 EKKTHKKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG 206
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI--LPTNDLPADFDWREKGAVG 157
SA +GIT+F+D+T E+++ GL + R P+ A P +P DLP +FDWREKGA+
Sbjct: 207 SAKYGITEFADMTSPEYKQR-TGLWQ--RDPQKAASNPKAEIPNIDLPKEFDWREKGAIS 263
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
VK+QG+CGSCW+FS TG +EG + + TG L SEQ+L+DCD + DS CNG
Sbjct: 264 AVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNG 314
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAAN 277
GL ++A+E K GGL E DYPY R C F+ +KI V + +E IA
Sbjct: 315 GLPDNAYEAIEKIGGLELESDYPYHA--RKDQCHFNSTKIHVKVKGHVDLPKNETAIAQW 372
Query: 278 LVKNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPY 334
L+ NGP+++ INA MQ Y GGVS P +CSR+ LDHGVL+VGY + Y P+ K PY
Sbjct: 373 LIANGPISIGINANAMQFYRGGVSHPPHILCSRKNLDHGVLIVGYRVSDY-PMFKKTLPY 431
Query: 335 WIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
WI+KNSWG+ WGE GYY++ RG N CGV M S+
Sbjct: 432 WIVKNSWGKKWGEQGYYRVYRGDNTCGVSEMSSS 465
>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
Length = 346
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/324 (46%), Positives = 200/324 (61%), Gaps = 22/324 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAAR---HQKLDPSATH-GITQFSDLT 112
+ F F++K+NK Y+S E+ +F FKANL A+ KL S T G+ +F+DL+
Sbjct: 26 QTQFVAFQQKYNKVYSS-NEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLS 84
Query: 113 PAEFRRTYLGL---RRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCG 166
AEFR+ YL + LP AP+L L P FDWR KGAV VK+QG CG
Sbjct: 85 AAEFRKYYLNAQVAKPDASLP----MAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQCG 140
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC-DPEEPGSCDSGCNGGLMNSAFE 225
SCWSFSTTG +EG +LA LV LSEQ LVDCDH+C + + SCD+GC+GGL +A+
Sbjct: 141 SCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYR 200
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
Y ++ GGL E YPY G +CKF +AA ++NF+++ +E Q+A L +GPLA
Sbjct: 201 YVIENGGLDSENSYPYLAV-TGDSCKFKSGNVAAKISNFTMIPQNETQMAGYLATHGPLA 259
Query: 286 VAINAVYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
+A +A Q YIGGV P C + LDHG+L+VG+ + I KPYWI+KNSWG S
Sbjct: 260 IAADAAEWQFYIGGVFDLP--CGQSLDHGILIVGFSAE--KNIFGHLKPYWIVKNSWGAS 315
Query: 345 WGENGYYKICRGRNVCGVDSMVST 368
WGE GY + +G+N+CGV VST
Sbjct: 316 WGEQGYLYLGKGKNLCGVSDFVST 339
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 158/337 (46%), Positives = 203/337 (60%), Gaps = 47/337 (13%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A F F + K Y+ QE H RF F NL+R H ++ SA +G+T+F+DL+
Sbjct: 46 ARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDF 104
Query: 115 EFRRTYLGLR-------------------RKLRLPKDADQAPILPTNDLPADFDWREKGA 155
EFRR YLGL+ +KL+ K AD+ FDW EKGA
Sbjct: 105 EFRRHYLGLKPELKNLNRKKYERKSRNSSKKLKFAKTADET-----------FDWVEKGA 153
Query: 156 VGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGC 215
V VK+QG CGSCW+FSTTG +EGA F ATG L+SLSEQ+LVDCD + DSGC
Sbjct: 154 VTEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLISLSEQELVDCDQK---------DSGC 204
Query: 216 NGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIA 275
NGGLM+ AFE ++ GGL E+ YPY G C F+KS + +F + DE++IA
Sbjct: 205 NGGLMDQAFEEVIRIGGLETEQQYPYDGVQE--TCNFEKSLSKVQIDDFMDIGEDEEEIA 262
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKE- 331
L ++GPL++AINA MQ Y GGVS P ++CS LDHGVL+VGYG + R +
Sbjct: 263 EALEEHGPLSIAINAFGMQFYRGGVSHPLSFLCSPDGLDHGVLMVGYGVEHHTTWRHRHP 322
Query: 332 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
+PYW IKNSWG WGE+GYY++ RG+ VCGV+ MVST
Sbjct: 323 RPYWKIKNSWGPRWGEDGYYRVARGKGVCGVNKMVST 359
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/330 (44%), Positives = 192/330 (58%), Gaps = 14/330 (4%)
Query: 43 LSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SA 101
L+ ++ +D L + F F NK Y S EE RF IF AN+++ Q + SA
Sbjct: 263 LTTKKNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSA 322
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKD 161
+G TQF+DLT EF++ YLGL + K A I + +P +FDWR V PVK+
Sbjct: 323 IYGATQFADLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKN 382
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG+CGSCW+FS +EG L + +L+SLSEQ+L+DCD+ D+GC GGLM
Sbjct: 383 QGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMT 433
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
AFE GGL E DYPY G C+ KS + S++ VS DE+ IA LVK+
Sbjct: 434 QAFEAVENLGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKH 493
Query: 282 GPLAVAINAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GPL+V +NA MQ Y+GGVS P +CS + LDHGV +VGYG Y P P+W IK
Sbjct: 494 GPLSVGVNANAMQFYMGGVSHPIHALCSPKSLDHGVAIVGYGVHKY-PYLNATLPFWTIK 552
Query: 339 NSWGESWGENGYYKICRGRNVCGVDSMVST 368
NSWG+ WG GYY + RG CGV+ MVS+
Sbjct: 553 NSWGDKWGMQGYYLLYRGDGSCGVNQMVSS 582
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 199/321 (61%), Gaps = 17/321 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
E F F KFNK ++S E +RF IFK NL+ Q + +A +G+T F+DLTP E
Sbjct: 573 ETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFADLTPKE 632
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
F+ YLG R +L+ + A I ++ LP FDWR+ AV PVKDQG CGSCW+FS T
Sbjct: 633 FKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLCGSCWAFSVT 692
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + KL+SLSEQ+L+DCD + D GCNGG M +A++ K GGL
Sbjct: 693 GNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLGGLE 743
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E DYPY G R C F K V ++ +E ++A L+KNGP+++ INA MQ
Sbjct: 744 LESDYPYDG--RNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801
Query: 295 TYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
YIGGVS P ++C+ + LDHGVL+VGYG + Y P+ KE PYWIIKNSWG WGENGYY
Sbjct: 802 FYIGGVSHPFHFLCNPKDLDHGVLIVGYGISKY-PLFHKELPYWIIKNSWGSRWGENGYY 860
Query: 352 KICRGRNVCGVDSMVSTVAAA 372
++ RG CGV++M S+ A
Sbjct: 861 RVYRGDGTCGVNAMASSAIVA 881
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 194/321 (60%), Gaps = 25/321 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F ++N+ Y+SQEE D R +F NL+ A + Q LD +A +G+T+FSD
Sbjct: 171 ELLG---QFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSD 227
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
LT EFR YL + + + + +P P +DWRE GAV PVK+QG CGSCW+
Sbjct: 228 LTEEEFRTLYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWA 287
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TG +EG F TGKLVSLSEQ+LVDCD + D C GGL ++A+E K
Sbjct: 288 FSVTGNIEGQWFAKTGKLVSLSEQELVDCD---------TVDQACGGGLPSNAYEAIEKL 338
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GGL E DY YTG + +C F K+ A + + +S DE++IAA L +NGP++VA+NA
Sbjct: 339 GGLETETDYSYTG--KKQSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVSVALNA 396
Query: 291 VYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
MQ Y GVS P + +DH VLLVGYG + KP+W IKNSWGE +GE
Sbjct: 397 FAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGYGER-------QGKPFWAIKNSWGEDYGE 449
Query: 348 NGYYKICRGRNVCGVDSMVST 368
GYY + RG +CG++ M S+
Sbjct: 450 QGYYYLYRGSRLCGINKMCSS 470
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 150/292 (51%), Positives = 186/292 (63%), Gaps = 26/292 (8%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA 91
IRQVTDGG L E F+ F ++ + Y+ EE+ R +F ANL RA
Sbjct: 29 IRQVTDGGYWPPG---------LLPEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARA 79
Query: 92 ARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-------RKLRLPKDADQAPILPTNDL 144
A HQ LDP+A HG+T FSDLT EF GL R+ +P A A + L
Sbjct: 80 AAHQALDPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPSAA-PATEEEVSGL 138
Query: 145 PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECD 204
PA FDWR++GAV VK QG+CGSCW+FSTTGA+EGANFLATG L+ LSEQQLVDCDH CD
Sbjct: 139 PASFDWRDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCD 198
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF 264
E+ CDSGC GGLM +A+ Y + +GGLM + YPYTG C+FD +++A VANF
Sbjct: 199 AEKKTECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ--GTCRFDANRVAVRVANF 256
Query: 265 SVVSL----DED---QIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR 309
+VV+ D D Q+ A LV++GPLAV +NA YMQTY+GGVSCP +C R
Sbjct: 257 TVVAPPGGNDGDGDAQMRAALVRHGPLAVGLNAAYMQTYVGGVSCPLVCRAR 308
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 198/322 (61%), Gaps = 27/322 (8%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F ++N+ Y+SQE+ D R IF NL+ A + Q LD +A +G+T+FSD
Sbjct: 172 ELLG---QFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSD 228
Query: 111 LTPAEFRRTYLG-LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
LT EFR YL L + +L + A +P P +DWRE GAV PVK+QG CGSCW
Sbjct: 229 LTEEEFRTLYLNPLLSQQKLQRSMKPAA-MPHGPAPPSWDWREHGAVSPVKNQGMCGSCW 287
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS TG +EG F+ TGKLVSLSEQ+LVDCD + D C GGL ++A+E K
Sbjct: 288 AFSVTGNIEGQWFVKTGKLVSLSEQELVDCD---------TADQACGGGLPSNAYEAIEK 338
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E DY YTG + +C F K+ A + + +S DE++IAA L +NGP++VA+N
Sbjct: 339 LGGVETETDYSYTG--KKQSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSVALN 396
Query: 290 AVYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y GVS P + +DH VLLVGYG + KP+W IKNSWGE +G
Sbjct: 397 AFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGYGER-------QGKPFWAIKNSWGEDYG 449
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GYY + RG +CG+++M S+
Sbjct: 450 EQGYYYLYRGSRLCGINTMCSS 471
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 197/317 (62%), Gaps = 27/317 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + KAYA++++ RF IFK NL RA ++Q + +A +G+TQFSDLTP
Sbjct: 28 ARELYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EF YLGLR + + D+ + PA DWREKGAVGP+++QGSCGSCW+FS
Sbjct: 87 EFEAKYLGLR----IDEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSVV 142
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG FL TG LVSLS+QQLVDCD + D+GC GG ++ + GGL
Sbjct: 143 GNIEGQWFLKTGYLVSLSKQQLVDCD---------TVDNGCYGGYPPYTYKEIKRMGGLE 193
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
+ DYPYTG GH C+ D+SK+ A + + V+ DE++ AA L ++GP++ +NA Y+Q
Sbjct: 194 LQSDYPYTGW--GHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQ 251
Query: 295 TYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+ P +CS L+H VL VGY + PYWIIKNSWG SWGE+GY+
Sbjct: 252 FYQSGILHPSKAMCSPEGLNHAVLTVGYDTK-------HGIPYWIIKNSWGTSWGEDGYF 304
Query: 352 KICRGRNVCGVDSMVST 368
+I RG CG+D + ++
Sbjct: 305 RIYRGDGTCGIDRLTTS 321
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 142/294 (48%), Positives = 192/294 (65%), Gaps = 28/294 (9%)
Query: 81 FTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL--GLRRKLRLPKDADQAP 137
IF++N+R+AA+ QK+D +A +G T FSDL+ EFR+ + G + L KDA+
Sbjct: 1 MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKDAE--- 57
Query: 138 ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLV 197
+P D+P DWR+KG V PVK+QGSCGSCW+FSTTG +EG + TGKLVSLSEQ+LV
Sbjct: 58 -IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELV 116
Query: 198 DCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI 257
DCD + D GC GGL ++A++ K GGL E DYPY G D CKF+K+++
Sbjct: 117 DCD---------TIDKGCEGGLPSNAYKQIEKLGGLESESDYPYKGAD--SKCKFNKAEV 165
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICS-RRLDHGV 314
++ + V+S DE +IAA L KNGP+++ INA MQ Y+GG++ P+ C+ L+HGV
Sbjct: 166 KVTINSSVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCNPSSLNHGV 225
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
L+VGYG PYWIIKNSWG SWGE GYY I RG CG+++M ++
Sbjct: 226 LIVGYGVK-------NGTPYWIIKNSWGPSWGEKGYYLIYRGGGCCGLNTMCTS 272
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis mellifera]
Length = 881
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 198/321 (61%), Gaps = 17/321 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
E F F KFNK ++S E +RF IFK NL+ Q + +A +G+T F+DLTP E
Sbjct: 573 EMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKE 632
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
F+ YLG R +L+ + A I ++ LP FDWR+ V PVKDQG CGSCW+FS T
Sbjct: 633 FKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVT 692
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + KL+SLSEQ+L+DCD + D GCNGG M +A++ K GGL
Sbjct: 693 GNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLGGLE 743
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E DYPY G R C F K V ++ +E ++A L+KNGP+++ INA MQ
Sbjct: 744 LESDYPYDG--RNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801
Query: 295 TYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
YIGGVS P ++C+ + LDHGVL+VGYG + Y P+ K+ PYWIIKNSWG WGENGYY
Sbjct: 802 FYIGGVSHPFHFLCNPKDLDHGVLIVGYGISKY-PLFHKKLPYWIIKNSWGSRWGENGYY 860
Query: 352 KICRGRNVCGVDSMVSTVAAA 372
++ RG CGV++M S+ A
Sbjct: 861 RVYRGDGTCGVNAMASSAIVA 881
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 195/315 (61%), Gaps = 29/315 (9%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL 121
F ++N+ Y++++E RF I+K NLR A Q + +A +G TQFSDLT AEFR+ L
Sbjct: 10 FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRKIML 69
Query: 122 GLRRKLRLPKDADQAPI-----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
K PK ++ + ND+P FDWREK AV VK+QGSCGSCW+FS TG
Sbjct: 70 PY--KWETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVTGN 127
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EGA + T KLVSLSEQ+LVDCD D GCNGGL ++A+ ++ GGL E
Sbjct: 128 IEGAWAIKTSKLVSLSEQELVDCD---------IIDQGCNGGLPSNAYREIIRMGGLEAE 178
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
DYPY G RG C K IA + + + DE+++AA LV GP+++ +NA +Q Y
Sbjct: 179 SDYPYDG--RGEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQFY 236
Query: 297 IGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
G++ P+ CS + LDHGVL+VGYGS +KPYWIIKNSWG WGE GY+++
Sbjct: 237 RHGIAHPWRVFCSPKHLDHGVLIVGYGSE-------TDKPYWIIKNSWGTKWGEEGYFRL 289
Query: 354 CRGRNVCGVDSMVST 368
RG+NVCG+ M +T
Sbjct: 290 FRGKNVCGIQEMATT 304
>gi|323713472|gb|ADY04490.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713476|gb|ADY04492.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713480|gb|ADY04494.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713482|gb|ADY04495.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713484|gb|ADY04496.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713486|gb|ADY04497.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713488|gb|ADY04498.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713490|gb|ADY04499.1| cysteine protease [Clarkia xantiana var. xantiana]
gi|323713492|gb|ADY04500.1| cysteine protease [Clarkia xantiana var. xantiana]
Length = 138
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 122/139 (87%), Positives = 136/139 (97%), Gaps = 1/139 (0%)
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLMNSAFEYTLKAGGLM+EEDYPYTGTD+G +CKF+KSKIAASVANFSVVSLDEDQIAA
Sbjct: 1 GGLMNSAFEYTLKAGGLMKEEDYPYTGTDKG-SCKFEKSKIAASVANFSVVSLDEDQIAA 59
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
NLVKNGPLA+AINAV+MQTY+GGVSCPYICS+RLDHGVLLVGYGS+GY+P+R+KEKPYWI
Sbjct: 60 NLVKNGPLAIAINAVFMQTYMGGVSCPYICSKRLDHGVLLVGYGSSGYSPVRMKEKPYWI 119
Query: 337 IKNSWGESWGENGYYKICR 355
IKNSWG+ WGE G+YKICR
Sbjct: 120 IKNSWGDKWGEEGFYKICR 138
>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
Length = 352
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 200/332 (60%), Gaps = 30/332 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK----LDPSATHGITQFSDLT 112
E F F+ K+NK Y S EE+ +F FK+NL K + G+ +F+DL+
Sbjct: 24 ESQFIAFQNKYNKIY-SAEEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLS 82
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILP--TNDL----PADFDWREKGA---------VG 157
EF++ YL ++ RL D P+LP ++D+ PA FDWR G V
Sbjct: 83 KEEFKKYYLS-SKEARL---TDDLPMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVT 138
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC-DPEEPGSCDSGCN 216
VK+QG CGSCWSFSTTG +EG ++L+TG LV LSEQ LVDCDH C E C++GC+
Sbjct: 139 AVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCD 198
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGL +A+ Y +K GG+ E YPYT D CKF+ +++ A +++F++V +E QIA+
Sbjct: 199 GGLQPNAYNYIIKNGGIQTEATYPYTAVD--GECKFNSAQVGAKISSFTMVPQNETQIAS 256
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
L NGPLA+A +A Q Y+GGV + C + LDHG+L+VGYG+ I K PYWI
Sbjct: 257 YLFNNGPLAIAADAEEWQFYMGGV-FDFPCGQTLDHGILIVGYGAQD--TIVGKNTPYWI 313
Query: 337 IKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
IKNSWG WGE GY K+ R + CGV + VS+
Sbjct: 314 IKNSWGADWGEAGYLKVERNTDKCGVANFVSS 345
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 208/339 (61%), Gaps = 20/339 (5%)
Query: 42 ILSHHESTNNDL--LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP 99
IL+ + T L HF F +KF + Y EE+++R +F+ N+ + R +
Sbjct: 17 ILAFDQETYQPLSETAVRDHFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIREG 76
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDL---------PADFDW 150
+ +GIT+FSDLT EFR+ YL ++ PK+ + + +N + P +DW
Sbjct: 77 NNNYGITKFSDLTSDEFRKFYLMEKKT---PKEIQKMMRMDSNKMVSNSYAKPAPDHYDW 133
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP-EEPG 209
R GA+ VKDQG CGSCW+FS G++EG+ + +LVS SEQQLVDCD+ C E
Sbjct: 134 RNHGAITGVKDQGQCGSCWAFSAIGSIEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQ 193
Query: 210 SCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL 269
SCD GCNGGL SA++Y +KAGG++ E+DYPY + C+ + A ++N++++S
Sbjct: 194 SCDDGCNGGLQWSAYQYLMKAGGVVTEKDYPYYA--ERYKCEVKPANFVAKLSNWTMLST 251
Query: 270 DEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIR 328
+E ++A L +NGP+AVA+NA ++Q Y G++ P C +LDHGVL+VGYG +
Sbjct: 252 NETEMANWLAENGPIAVALNADFLQNYNNGIADPAWCDPTQLDHGVLIVGYGLETF--WF 309
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
K +PYWI+KNSWG +GE+GY++I +G CG++++ S
Sbjct: 310 GKPQPYWIVKNSWGYDFGEDGYFRIVKGVGRCGINTVPS 348
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 204/331 (61%), Gaps = 38/331 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-GITQFSDLTPAEFRR 118
FS+F + +NK Y +EEH+ RF IFK NL+R A +L+ H G+T+FSDL+P+EF R
Sbjct: 34 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93
Query: 119 TYLGLRRKLRLPKDADQAPIL--PTND-LPADFDWREKGAVGPVKDQGSCGSCWS----- 170
YLGL++ L K A+ PI P N+ LP FDWR KGAV VK+QG CGSCW+
Sbjct: 94 HYLGLKKDLAEHK-AEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSXXT 152
Query: 171 -------------FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
FS TG +EG FL+ KL+SLSEQ+LVDCDH D GC G
Sbjct: 153 EVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDH---------GDHGCKG 203
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAAN 277
G M A + ++ GGL E +YPY G D C+F+K++ A V +F + +E ++A
Sbjct: 204 GYMGQAMKAVIEMGGLETESEYPYKGVD--GTCEFNKTESKARVQSFVGLPQNETELAYW 261
Query: 278 LVKNGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPY 334
L+K+GP+++ INA MQ Y GG+S P ++CS LDHGVLLVG+G + R K PY
Sbjct: 262 LMKHGPVSIGINANAMQFYFGGISHPWKFLCSPTDLDHGVLLVGFGVDKRS-FRRKPVPY 320
Query: 335 WIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
WI+KNSWG+ WGE GYY++ RG CGV+ M
Sbjct: 321 WIVKNSWGKYWGEKGYYRVYRGDGTCGVNQM 351
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 194/328 (59%), Gaps = 39/328 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F K+NK Y+SQ+E D R +IF NL+ A + Q LD SA +G+T+FSD
Sbjct: 172 ELLG---QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSD 228
Query: 111 LTPAEFRRTYLG-------LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
LT EFR TYL L R ++ P + P PA +DWR+ GAV VK+QG
Sbjct: 229 LTEEEFRSTYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQG 281
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A
Sbjct: 282 MCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNA 332
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+E K GGL E DY Y G + +C F K+AA + + +S DE +IAA L +NGP
Sbjct: 333 YEAIEKLGGLETETDYSYIG--KKQSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGP 390
Query: 284 LAVAINAVYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VA+NA MQ Y GVS P + +DH VL+VGYG K P+W IKNS
Sbjct: 391 VSVALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGER-------KGIPFWAIKNS 443
Query: 341 WGESWGENGYYKICRGRNVCGVDSMVST 368
WGE +GE GYY + RG N CG++ M S+
Sbjct: 444 WGEDYGEQGYYNLYRGSNACGINKMCSS 471
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 194/328 (59%), Gaps = 39/328 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F K+NK Y+SQ+E D R +IF NL+ A + Q LD SA +G+T+FSD
Sbjct: 172 ELLG---QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSD 228
Query: 111 LTPAEFRRTYLG-------LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
LT EFR TYL L R ++ P + P PA +DWR+ GAV VK+QG
Sbjct: 229 LTEEEFRSTYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQG 281
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FS TG +EG FL G LVSLSEQ+LVDCD D CNGGL ++A
Sbjct: 282 MCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNA 332
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+E K GGL E DY Y G + +C F K+AA + + +S DE +IAA L +NGP
Sbjct: 333 YEAIEKLGGLETETDYSYIG--KKQSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGP 390
Query: 284 LAVAINAVYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VA+NA MQ Y GVS P + +DH VL+VGYG K P+W IKNS
Sbjct: 391 VSVALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGER-------KGIPFWAIKNS 443
Query: 341 WGESWGENGYYKICRGRNVCGVDSMVST 368
WGE +GE GYY + RG N CG++ M S+
Sbjct: 444 WGEDYGEQGYYYLHRGSNACGINKMCSS 471
>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
Length = 347
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 188/323 (58%), Gaps = 20/323 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLT 112
E F F+ K+NK Y S E +F FK NL R A G+ +F+DL+
Sbjct: 24 EIQFRDFQVKYNKVYGSHE-FSQKFVTFKDNLNRIDTLNANAAASGSDTKFGVNEFADLS 82
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCW 169
EFR+ Y+ +P DA A L P+ FDWR KGAV PVK+QG CGSCW
Sbjct: 83 VQEFRKFYMN-AVPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQGQCGSCW 141
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC---DPEEPGSCDSGCNGGLMNSAFEY 226
SFSTTG +EG FLA L LSEQ LVDCDH C D ++ SCD GCNGGL +AF+Y
Sbjct: 142 SFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQ--SCDDGCNGGLQPNAFQY 199
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
+ GG+ E YPY + C+F S I A ++N+ ++S +E QIAA L NGP+++
Sbjct: 200 IIGNGGIDTETSYPYLAVAQ-DKCQFKASNIGAKISNWQMLSTNETQIAAYLALNGPVSI 258
Query: 287 AINAVYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
A +A Q YIGGV P C + LDHG+L+VGY + I KPYW +KNSWG SW
Sbjct: 259 AADAAEWQFYIGGVFDLP--CGKALDHGILIVGYDTE--TNIFGHAKPYWWVKNSWGASW 314
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY K+ RG CG+++ VST
Sbjct: 315 GEQGYLKVLRGAGECGLNTFVST 337
>gi|371781479|emb|CCA95098.1| putative responsive to dehydration 19, partial [Liriodendron
tulipifera]
Length = 150
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 125/151 (82%), Positives = 138/151 (91%), Gaps = 2/151 (1%)
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
+P SCD+GCNGGLM SAF+YTLK+GGL +EEDYPYTG D G CKF+KSKIAAS N++V
Sbjct: 1 DPSSCDAGCNGGLMTSAFKYTLKSGGLEKEEDYPYTGKD-GATCKFEKSKIAASALNYTV 59
Query: 267 VSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYA 325
VS+DEDQIAANLVK GPLAV INAV+MQTYIGGVSCPYICS+R LDHGVLLVGYG+AGYA
Sbjct: 60 VSIDEDQIAANLVKFGPLAVGINAVFMQTYIGGVSCPYICSKRLLDHGVLLVGYGAAGYA 119
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
PIR K+KPYWIIKNSWGESWGENGYYKICRG
Sbjct: 120 PIRFKDKPYWIIKNSWGESWGENGYYKICRG 150
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/328 (43%), Positives = 194/328 (59%), Gaps = 27/328 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A H F FK K ++ Y S EH+ RF IFK NL + + K + +A +GIT F+D+T A
Sbjct: 854 ARHLFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSA 913
Query: 115 EFRRTYLGLRRKLRLPKDAD-------QAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
E+R+ R L +P+D D +A I +LP FDWRE GAV PVK+QG+CGS
Sbjct: 914 EYRQ-----RTGLVIPRDEDRNHVGNPKAEIDENMELPESFDWRELGAVSPVKNQGNCGS 968
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FS G +EG + + T L SEQ+L+DCD + DS C GG M+ A++
Sbjct: 969 CWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AVDSACQGGYMDDAYKAI 1019
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
K GGL E +YPY + C F+ +++ V + +E +A LV NGP+++
Sbjct: 1020 EKIGGLELESEYPYLA-KKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIG 1078
Query: 288 INAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
+NA MQ Y GG+S P+ +CS++ LDHGVL+VGYG Y P+ K PYWI+KNSWG
Sbjct: 1079 LNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFNKTMPYWIVKNSWGPK 1137
Query: 345 WGENGYYKICRGRNVCGVDSMVSTVAAA 372
WGE GYY+I RG N CGV M S+ A
Sbjct: 1138 WGEQGYYRIFRGDNTCGVSEMASSAVLA 1165
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 192/315 (60%), Gaps = 20/315 (6%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEF 116
F F KFNK Y S+EE RF IF+AN+++ K + +A +GIT+FSDL+ EF
Sbjct: 132 QSFKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFSDLSVTEF 191
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+ YLGL++K P+ +P LP +FDWR AV PVK+QGSCGSCW+FS TG
Sbjct: 192 K-NYLGLKKK---PESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCWAFSVTGN 247
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG + +L+SLSEQ+L+DCD D+GCNGG M +E +K GGL E
Sbjct: 248 IEGLWAIKKHELLSLSEQELIDCD---------KIDNGCNGGYMPETYEAIMKLGGLETE 298
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
DYPY + C +K++I + ++ E IA L KNGP++ +NA MQ Y
Sbjct: 299 TDYPYEAEN--EKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNANAMQFY 356
Query: 297 IGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
+GG+S P +C+ DHG+L+VGYG + ++ + PYWIIKNSWG+ WGE GYY++
Sbjct: 357 LGGISHPPKILCNPEEQDHGILIVGYGIHKSSILK-RTIPYWIIKNSWGKHWGEKGYYRL 415
Query: 354 CRGRNVCGVDSMVST 368
RG VCG++ MVS+
Sbjct: 416 YRGSGVCGINQMVSS 430
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 198/337 (58%), Gaps = 30/337 (8%)
Query: 44 SHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SAT 102
+HH S + D + H F FK + N+ Y S EH+ RF IFK NL + + K + +A
Sbjct: 1132 AHHYSKSED--HSRHLFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAK 1189
Query: 103 HGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQ--------APILPTNDLPADFDWREKG 154
+GIT F+D+T AE+R R L +P++ D+ A I +LP FDWRE G
Sbjct: 1190 YGITHFADMTSAEYR-----ARTGLVVPREGDEVNHIRNPMAEIDEHMELPDAFDWRELG 1244
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV VK+QG+CGSCW+FS G +EG + + T KL SEQ+L+DCD + DS
Sbjct: 1245 AVSEVKNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCD---------TVDSA 1295
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
CNGG M+ A++ K GGL E +YPY + C F+K+ V + +E I
Sbjct: 1296 CNGGFMDDAYKAIEKIGGLELESEYPYLAK-KQKTCHFNKTMAHVRVKGAVDLPKNETAI 1354
Query: 275 AANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKE 331
A LV NGP+++ +NA MQ Y GG+S P+ +CS++ LDHGVL+VGYG Y P+ K
Sbjct: 1355 AQFLVANGPVSIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFNKT 1413
Query: 332 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
PYWI+KNSWG WGE GYY++ RG N CGV M ++
Sbjct: 1414 LPYWIVKNSWGPKWGEQGYYRVFRGDNTCGVSEMATS 1450
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/325 (44%), Positives = 194/325 (59%), Gaps = 23/325 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQE-EHDHRFTIFKANLRRAAR---HQKLDPSATHGITQFSDL 111
AE F F + Y + E RF IFK N+++ H++ + + +T+F+DL
Sbjct: 227 AEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHER--GTGVYAVTRFTDL 284
Query: 112 TPAEFRRTYLGLRRKLRLPKD--ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
T EF+ YLGL L+ P QA I + LPA FDWR GAV VKDQG+CGSCW
Sbjct: 285 TYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGSCW 344
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS TG +EG L TGKL+SLSEQ+LVDCD D GC+GG M++A+ +
Sbjct: 345 AFSVTGNIEGQWKLKTGKLLSLSEQELVDCD---------KMDDGCDGGYMDNAYRAIEQ 395
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE+YPY D C F+KS ++ +S +E +A LV NGP+++ IN
Sbjct: 396 LGGLETEEEYPYEAED--DKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISIGIN 453
Query: 290 AVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y+GGVS P+ +C+ + +DHGVL+VGYG Y P+ K+ PYW++KNSWG WG
Sbjct: 454 ANAMQFYVGGVSHPWKALCNPKNIDHGVLIVGYGIKEY-PLFNKQLPYWVVKNSWGPGWG 512
Query: 347 ENGYYKICRGRNVCGVDSMVSTVAA 371
E GYY++ RG CGV++M S+
Sbjct: 513 EQGYYRVFRGDGTCGVNTMASSAVV 537
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 193/333 (57%), Gaps = 44/333 (13%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPA 114
A + F F + K Y S+ + RF +FK NL+ Q K + +A +GITQFSDLTP
Sbjct: 153 AWNQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPE 212
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPT-------------NDLPADFDWREKGAVGPVKD 161
EF++ YL P D+ PI+P LP FDWR+ GAV VK+
Sbjct: 213 EFKKIYL--------PYIWDE-PIVPNRMVDLTAEGVHLNETLPESFDWRDHGAVTDVKN 263
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG CGSCW+FSTTG +EG FLA KLVSLSEQ+LVDCD D GC GGL +
Sbjct: 264 QGFCGSCWAFSTTGNIEGQWFLAKKKLVSLSEQELVDCD---------KVDDGCEGGLPS 314
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
A++ ++ GGL E YPY G RG C ++++ A + + + DE+ + A LVK
Sbjct: 315 QAYKEIMRMGGLETESAYPYDG--RGEECHINRTEFAVYINDSVELPHDEESMKAWLVKK 372
Query: 282 GPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GP+++ INA +Q Y G+S P + C L+HGVLLVGYGS K KPYWIIK
Sbjct: 373 GPISIGINANPLQFYRHGISHPWKFFCEPYMLNHGVLLVGYGSE-------KNKPYWIIK 425
Query: 339 NSWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
NSWG WGENGYY++ RG+NVCGV M ++
Sbjct: 426 NSWGPKWGENGYYRLYRGKNVCGVHEMPTSAVV 458
>gi|24417396|gb|AAN60308.1| unknown [Arabidopsis thaliana]
Length = 193
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 128/195 (65%), Positives = 153/195 (78%), Gaps = 11/195 (5%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
+F++S + VSS + D D +IRQV G + +L +E HFSLFK+KF
Sbjct: 10 VFVLSFFIV-LVSSSDVNDGDDLVIRQVVGGAEP----------QVLTSEDHFSLFKRKF 58
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K YAS EEHD+RF++FKANLRRA RHQKLDPSATHG+TQFSDLT +EFR+ +LG+R
Sbjct: 59 GKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGF 118
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
+LPKDA++APILPT +LP DFDWR+ GAV PVK+QGSCGSCWSFS TGALEGANFLATGK
Sbjct: 119 KLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGK 178
Query: 188 LVSLSEQQLVDCDHE 202
LVSLSEQQLVDCDH+
Sbjct: 179 LVSLSEQQLVDCDHQ 193
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 192/319 (60%), Gaps = 28/319 (8%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATH---GITQFS 109
L + + FK + NK+Y + E RFTIF+ +LR+ H K D + G+T+F+
Sbjct: 17 LSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFA 76
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
DLT EF LG+ R + + + P DLP+ FDWREKGAV VKDQGSCGSCW
Sbjct: 77 DLTEKEFSDM-LGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCW 135
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
SFSTTG +EGA FL TGKLVSLSEQ LVDC E C GC+GG M+ A EY
Sbjct: 136 SFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-------DC-YGCSGGYMDKALEYIET 187
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAI 288
AGG+M E DYPY G D C+FD SK+AA ++NF+ + DED + ++ GP++VAI
Sbjct: 188 AGGIMSENDYPYEGID--DKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAI 245
Query: 289 NAVY-MQTYIGGV---SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
+A + Q Y G+ S Y L+HGVL+VGYG+ KE+ YWI+KNSWG
Sbjct: 246 DASFNFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTE-------KEQDYWIVKNSWGAD 298
Query: 345 WGENGYYKICRGR-NVCGV 362
WG +GY + R + N CG+
Sbjct: 299 WGMDGYIWMSRNKNNQCGI 317
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 202/332 (60%), Gaps = 30/332 (9%)
Query: 49 TNNDLLG------AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SA 101
+N +LLG F F++KF K+Y+S + R+ +FK NL + Q+L+ +A
Sbjct: 110 SNIELLGFRLPQNTSRLFEEFQRKFRKSYSS--DTAKRYALFKYNLLKMQLIQRLEKGTA 167
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPV 159
+GIT+FSDL+ EFR + ++R+ + A I PT LP FDWR GAV V
Sbjct: 168 NYGITKFSDLSAEEFRHSLANMKRRKSKGSQMETA-IFPTTIQSLPPSFDWRANGAVTEV 226
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQG CGSCW+F+TTG +EG F T KL+SLSEQQL+DCD + D CNGGL
Sbjct: 227 KDQGMCGSCWAFATTGNIEGQWFRKTNKLISLSEQQLLDCDTK---------DEACNGGL 277
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
A++ +K GGLM E+DYPY + +C + I+A + + + DE ++AA LV
Sbjct: 278 PEWAYDEIVKMGGLMSEKDYPYEAM-KEQSCHLRRPNISAYINGSATLPSDEAKLAAWLV 336
Query: 280 KNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWI 336
+NGP++V +NA ++Q Y+GG+S P +CS LDH VLLVGYG + + +PYWI
Sbjct: 337 QNGPISVGVNANFLQFYLGGISHPPHMLCSEAGLDHAVLLVGYGVSTFL-----RRPYWI 391
Query: 337 IKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
+KNSWG WGE GY+++ RG CG+++ +T
Sbjct: 392 VKNSWGGGWGEKGYFRMYRGDGTCGINADPTT 423
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/335 (42%), Positives = 194/335 (57%), Gaps = 19/335 (5%)
Query: 45 HHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATH 103
H + ++ L EH F F+ KF + Y + E R IF+ NLR + + SA +
Sbjct: 288 HKKHNHHSLDKVEHLFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAKY 347
Query: 104 GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQA--PILPTNDLPADFDWREKGAVGPVKD 161
GIT+F+D+T E++ +R P +A P P +LP +FDWR+KGAV VK+
Sbjct: 348 GITEFADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVKN 407
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QGSCGSCW+FST G +EG N + TG+L SEQ+L+DCD + DS CNGGL +
Sbjct: 408 QGSCGSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCD---------TKDSACNGGLPD 458
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVK 280
+A++ + GGL E +YPY R C F+K+ V F + +E + L+
Sbjct: 459 NAYKAIQEIGGLEYESEYPYKA--RKEQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIA 516
Query: 281 NGPLAVAINAVYMQTYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
NGP+++ INA MQ Y GGVS P+ +C + LDHGVL+VGYG + Y P K PYWI+
Sbjct: 517 NGPISIGINANAMQFYRGGVSHPWKILCEKSNLDHGVLIVGYGVSDY-PNFHKTLPYWIV 575
Query: 338 KNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
KNSWG WGE GYY++ RG N CGV M S+ A
Sbjct: 576 KNSWGPRWGEQGYYRVYRGDNTCGVSEMASSAILA 610
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/304 (44%), Positives = 187/304 (61%), Gaps = 16/304 (5%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKL 127
++Y + EE RF IF+AN+++A QK + +A +G+T FSD++ EF++ YLGL+++
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGLKKRT 568
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K + +P LP ++DWR AV PVK+QG CGSCW+FS TG +EG + TG
Sbjct: 569 PDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQYAIKTGN 628
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQ+LVDCD D GC GGL +A+ + GGL E DYPY+G R
Sbjct: 629 LVSLSEQELVDCD---------KYDDGCEGGLFETAYHAIEELGGLELESDYPYSG--RD 677
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP--YI 305
+ C F+ S++ S+ + +S DE +A LV NGP+++ INA MQ Y+GGVS P ++
Sbjct: 678 NTCHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPLKFL 737
Query: 306 CS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
C + LDHGVL+VGYG + + PYW+IKNSW WG GYY + RG CGV+
Sbjct: 738 CDPKTLDHGVLIVGYG-IHRTWLLHRHLPYWLIKNSWSSYWGAKGYYMLYRGDGSCGVNQ 796
Query: 365 MVST 368
S+
Sbjct: 797 WPSS 800
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 187/325 (57%), Gaps = 33/325 (10%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F K+NK Y+SQEE D R IF NL+ A + Q LD SA +G+T+FSD
Sbjct: 172 ELLG---QFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSD 228
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDL----PADFDWREKGAVGPVKDQGSCG 166
LT EFR TYL L + P+ P P +DWR+ GAV PVK+QG CG
Sbjct: 229 LTEEEFRSTYLNPL----LSQWTLHQPMKPATPAKGPSPDSWDWRDHGAVSPVKNQGMCG 284
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS G +EG FL G L+SLSEQ+LVDCD D C GGL ++A+E
Sbjct: 285 SCWAFSVIGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEA 335
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
K GGL E DY YTG C F K+AA + + + DE +IAA L +NGP++V
Sbjct: 336 IEKLGGLETESDYSYTG--HKQRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSV 393
Query: 287 AINAVYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
A+NA MQ Y G+S P + +DH VLLVGYG K P+W IKNSWGE
Sbjct: 394 ALNAFAMQFYRKGISHPLKIFCNPWMIDHAVLLVGYGER-------KGIPFWAIKNSWGE 446
Query: 344 SWGENGYYKICRGRNVCGVDSMVST 368
+GE GYY + RG N CG++ M S+
Sbjct: 447 DYGEQGYYYLYRGSNACGINKMCSS 471
>gi|16076437|emb|CAC94443.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 120/134 (89%), Positives = 130/134 (97%), Gaps = 1/134 (0%)
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
DHECDPEE GSCDSGC+GGLMNSAFEYTLKAGGLMREEDYPYTGTDR CKFDKSKIAA
Sbjct: 1 DHECDPEEQGSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDR-STCKFDKSKIAA 59
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGY 319
SV+NFSV+SLDEDQIAANLVKNGPLAVAINAV+MQT++GGVSCPYICSRRLDHGVLLVG+
Sbjct: 60 SVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTHVGGVSCPYICSRRLDHGVLLVGF 119
Query: 320 GSAGYAPIRLKEKP 333
GSAGY+P+R+KEKP
Sbjct: 120 GSAGYSPVRMKEKP 133
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 197/326 (60%), Gaps = 26/326 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+H F +F+ K+ + YA+ EH R IF+ NLR Q+L+ SA +GIT+F+D+T
Sbjct: 296 DHLFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTI---QELNDNEQGSAKYGITEFADMT 352
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+E+ + +R P A ++P +LP +FDWREK AV VK+QGSCGSCW+
Sbjct: 353 SSEYTQRAGLWQRSANKPTGGKPA-VVPAYKGELPKEFDWREKNAVTQVKNQGSCGSCWA 411
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TG +EG + TG+L SEQ+L+DCD S DS CNGGLM++A++
Sbjct: 412 FSVTGNIEGLYAIKTGELREFSEQELLDCD---------STDSACNGGLMDNAYKAIKDI 462
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GGL E +YPY + C F+K+ VA+F + +E + L+ NGP+++ +N
Sbjct: 463 GGLEYESEYPYLAKKK--QCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIGLN 520
Query: 290 AVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WG
Sbjct: 521 ANAMQFYRGGVSHPWGPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 579
Query: 347 ENGYYKICRGRNVCGVDSMVSTVAAA 372
E GYY+I RG N CGV M ++ A
Sbjct: 580 EQGYYRIYRGDNTCGVSEMATSAVLA 605
>gi|357473429|ref|XP_003606999.1| Cysteine proteinase [Medicago truncatula]
gi|355508054|gb|AES89196.1| Cysteine proteinase [Medicago truncatula]
Length = 210
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/203 (63%), Positives = 155/203 (76%), Gaps = 13/203 (6%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M +T++LF+V L +FS + T + D +IRQV D LGAEHHF
Sbjct: 1 MDHRTLLLFVV-LFIFSVSAFSTPDEGEDPIIRQVVD-----------EEGVRLGAEHHF 48
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
+LFK KF K Y+S++EHD+RF IFK+NL RA RHQ +DPSA HG+T+FSDLTP EFR++
Sbjct: 49 NLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSV 108
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
LGLR + LPKDA+ APILPT++LP DFDWREKGAV VK+QGSCGSCWSFSTTGALEGA
Sbjct: 109 LGLR-GVGLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGA 167
Query: 181 NFLATGKLVSLSEQQLVDCDHEC 203
+FL+TGKLVSLSEQQLVDCDHE
Sbjct: 168 HFLSTGKLVSLSEQQLVDCDHEV 190
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 193/317 (60%), Gaps = 30/317 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F F K ++K Y ++EEH R+ IF+ NL +A R Q+ + + +G+T+F DL+ EFR+
Sbjct: 54 FQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFRK 113
Query: 119 TYLGLRRKLRLP--KDADQAPILPTNDLPADFDWRE--KGAVGPVKDQGSCGSCWSFSTT 174
YL + P K A+ +P PA FDWR+ K AV VK+QG+CGSCW+FSTT
Sbjct: 114 YYLTPVWRGSDPHMKKAE----IPKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFSTT 169
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + G LVSLSEQ+LVDCD D GCNGGL ++A++ ++ GG+M
Sbjct: 170 GNIEGQWKIKKGTLVSLSEQELVDCD---------KLDQGCNGGLPSNAYQEIMRFGGIM 220
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+DYPYTG D+ CK + + + +S DE +A+ L NGP+++ INA MQ
Sbjct: 221 SEDDYPYTGRDQ--DCKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQ 278
Query: 295 TYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y GGVS P+ C+ LDHGVL+VGYG+ PYWIIKNSWG SWG GYY
Sbjct: 279 FYFGGVSHPWKIFCNPENLDHGVLIVGYGTK-------DGTPYWIIKNSWGRSWGVEGYY 331
Query: 352 KICRGRNVCGVDSMVST 368
+ RG VCG++ M ++
Sbjct: 332 LVYRGGGVCGLNEMCTS 348
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 200/335 (59%), Gaps = 29/335 (8%)
Query: 51 NDLLGA---EHHFSLFKK--KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHG 104
++LLG E+H SLF K ++E+ +RF +F NL + + +AT+G
Sbjct: 1443 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1502
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVK 160
IT+F+D+T EF R+ LGLR LR + ++ P +P +LP +FDWR+K V VK
Sbjct: 1503 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1558
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+Q CGSCW+FS TG +EG L GKL+ SEQ+LVDCD + D GCNGGLM
Sbjct: 1559 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1609
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVK 280
++A+ K GGL E+DYPY D C F+++ V +S +E +A LV
Sbjct: 1610 DTAYRSIEKIGGLETEQDYPYDAED--EKCHFNRTLARVQVTGALNISHNETDMAKWLVA 1667
Query: 281 NGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
NGP+++AINA MQ Y+GGVS P ++CS + LDHGVL+VGYG Y P+ K PYWI+
Sbjct: 1668 NGPISIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNY-PLFKKSLPYWIV 1726
Query: 338 KNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
KNSWG WGE GYY++ RG CG++ S+ A
Sbjct: 1727 KNSWGTGWGEQGYYRVYRGDGTCGLNQTPSSAIVA 1761
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 200/335 (59%), Gaps = 29/335 (8%)
Query: 51 NDLLGA---EHHFSLFKK--KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHG 104
++LLG E+H SLF K ++E+ +RF +F NL + + +AT+G
Sbjct: 1408 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1467
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVK 160
IT+F+D+T EF R+ LGLR LR + ++ P +P +LP +FDWR+K V VK
Sbjct: 1468 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1523
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+Q CGSCW+FS TG +EG L GKL+ SEQ+LVDCD + D GCNGGLM
Sbjct: 1524 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1574
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVK 280
++A+ K GGL E+DYPY D C F+++ V +S +E +A LV
Sbjct: 1575 DTAYRSIEKIGGLETEQDYPYDAED--EKCHFNRTLARVQVTGALNISHNETDMAKWLVA 1632
Query: 281 NGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
NGP+++AINA MQ Y+GGVS P ++CS + LDHGVL+VGYG Y P+ K PYWI+
Sbjct: 1633 NGPISIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNY-PLFKKSLPYWIV 1691
Query: 338 KNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
KNSWG WGE GYY++ RG CG++ S+ A
Sbjct: 1692 KNSWGTGWGEQGYYRVYRGDGTCGLNQTPSSAIVA 1726
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 139/339 (41%), Positives = 199/339 (58%), Gaps = 20/339 (5%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H + ++ L EH F F+ K+ + YA+ EH R IF+ NLR +
Sbjct: 292 EKKTHKKRNHHTLNKIEHLFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERG 351
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILP--TNDLPADFDWREKGAVG 157
SA +GITQF+D+T E++ + GL ++ A ++P ++P +FDWR+K AV
Sbjct: 352 SAKYGITQFADMTSTEYK-LHAGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVT 410
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
VK+QG CGSCW+FS TG +EG + TG+L SEQ+L+DCD S DS CNG
Sbjct: 411 HVKNQGQCGSCWAFSVTGNIEGLYAIKTGELEEFSEQELLDCD---------STDSACNG 461
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAA 276
GLM++A++ GGL E +YPY + C F+++ ++ F + +E +
Sbjct: 462 GLMDNAYKAIKDIGGLEYESEYPYAA--KKMQCHFNRTMSHVQLSGFVDLPKGNETAMQE 519
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKP 333
L+ NGP+++ +NA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K P
Sbjct: 520 WLLSNGPISIGLNANAMQFYRGGVSHPWAPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLP 578
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
YWI+KNSWG WGE GYY+I RG N CGV M ++ A
Sbjct: 579 YWIVKNSWGPRWGEQGYYRIYRGDNTCGVSEMATSAVLA 617
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 187/317 (58%), Gaps = 26/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F + K Y ++ E RF +FK N + QK + +A +G T+FSD+T EF+
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235
Query: 119 TYLGLRRKLRLPKDA----DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
T L + + +P D + + DLP FDWRE GAV VK+QGSCGSCW+FSTT
Sbjct: 236 TMLPYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTT 295
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EGA FLA KLVSLSEQ+LVDCD S D GCNGGL ++A++ ++ GGL
Sbjct: 296 GNIEGAWFLAKKKLVSLSEQELVDCD---------SVDQGCNGGLPSNAYKEIIRMGGLE 346
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY G RG C + IA + + DE ++ LV GP+++ +NA +Q
Sbjct: 347 PEDAYPYDG--RGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQ 404
Query: 295 TYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y GV P+ C L+HGVL+VGYG G KPYWI+KNSWG +WGE GY+
Sbjct: 405 FYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG-------RKPYWIVKNSWGPTWGEAGYF 457
Query: 352 KICRGRNVCGVDSMVST 368
K+ RG+NVCGV M ++
Sbjct: 458 KLYRGKNVCGVQEMATS 474
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 187/317 (58%), Gaps = 26/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F + K Y ++ E RF +FK N + QK + +A +G T+FSD+T EF+
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235
Query: 119 TYLGLRRKLRLPKDA----DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
T L + + +P D + + DLP FDWRE GAV VK+QGSCGSCW+FSTT
Sbjct: 236 TMLPYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTT 295
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EGA FLA KLVSLSEQ+LVDCD S D GCNGGL ++A++ ++ GGL
Sbjct: 296 GNIEGAWFLAKKKLVSLSEQELVDCD---------SVDQGCNGGLPSNAYKEIIRMGGLE 346
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY G RG C + IA + + DE ++ LV GP+++ +NA +Q
Sbjct: 347 PEDAYPYDG--RGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQ 404
Query: 295 TYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y GV P+ C L+HGVL+VGYG G KPYWI+KNSWG +WGE GY+
Sbjct: 405 FYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG-------RKPYWIVKNSWGPTWGEAGYF 457
Query: 352 KICRGRNVCGVDSMVST 368
K+ RG+NVCGV M ++
Sbjct: 458 KLYRGKNVCGVQEMATS 474
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 192/320 (60%), Gaps = 33/320 (10%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K+YA+ ++ + RF IFK NL RA +Q + +A +G+TQFSDLTP
Sbjct: 28 ARELYEQFKRDYGKSYAN-DDDEKRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSF 171
EF +L R DQ + NDL P DWRE GAV PV+DQGSCGSCW+F
Sbjct: 87 EFAAKFLSSRFD-------DQVERVQLNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAF 139
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG FL TG+LVSLS+QQLVDCD + DSGC+GG + + ++ G
Sbjct: 140 SVAGNVEGQWFLKTGQLVSLSKQQLVDCDVQ---------DSGCDGGYPPTTYGEIIRMG 190
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL + DYPY G R CK D+SK+ A + + V+ +E + AA + ++GP++ INAV
Sbjct: 191 GLEAQRDYPYVG--REQPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAV 248
Query: 292 YMQTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Q Y G+S P + L+HGVL VGYG+ PYWIIKNSWG WGE
Sbjct: 249 TLQFYQSGISHPSKSQCQPDWLNHGVLSVGYGTEDGV-------PYWIIKNSWGTGWGEK 301
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY+++ RG CG++ +VS+
Sbjct: 302 GYFRLYRGDGTCGIEKVVSS 321
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 191/327 (58%), Gaps = 30/327 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F+ + YAS EH+ RF IF+ NL + + K + +A +G+T+F+D+T AE+R
Sbjct: 1524 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYR- 1582
Query: 119 TYLGLRRKLRLPKD----------ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
+ GL +PK A + + DLP FDWR+ GAV VK+QGSCGSC
Sbjct: 1583 AHTGLV----VPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSC 1638
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS G +EG + + T KL S SEQ+L+DCD D+GC GG M+ AF+
Sbjct: 1639 WAFSAVGNVEGLHQIKTKKLESYSEQELIDCD---------KVDNGCGGGYMDDAFKAIE 1689
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
+ GGL E DYPY + +C F++S V + +E IA L+KNGP+A+ +
Sbjct: 1690 QLGGLELENDYPYEAKAQ-KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGL 1748
Query: 289 NAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
NA MQ Y GG+S P+ +C+ + +DHGVL+VGYG Y P+ K PYWIIKNSWG W
Sbjct: 1749 NANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKEY-PMFNKTLPYWIIKNSWGPRW 1807
Query: 346 GENGYYKICRGRNVCGVDSMVSTVAAA 372
GE GYY+I RG N CGV M S+ A
Sbjct: 1808 GEQGYYRIYRGDNSCGVSEMASSAILA 1834
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 139/339 (41%), Positives = 199/339 (58%), Gaps = 20/339 (5%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H + ++ L +H F F+ +F + Y S E R IF+ NL+ +
Sbjct: 290 EKKTHKKHSHRGLDKVDHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMG 349
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVG 157
SA +GIT+F+DLT +E++ GL ++ A ++P +LP +FDWR+K AV
Sbjct: 350 SAKYGITEFADLTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVT 408
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
PVK+QGSCGSCW+FS TG +EG + TG+L SEQ+L+DCD + DS CNG
Sbjct: 409 PVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNG 459
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAA 276
GLM++A++ GGL E +YPY + + C F+++ VA F + +E +
Sbjct: 460 GLMDNAYKAIKDIGGLEYEAEYPYKA--KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQE 517
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKP 333
L+ GP+++ INA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K P
Sbjct: 518 WLLTKGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLP 576
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
YWI+KNSWG WGE GYY++ RG N CGV M ++ A
Sbjct: 577 YWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVLA 615
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 191/327 (58%), Gaps = 30/327 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F+ + YAS EH+ RF IF+ NL + + K + +A +G+T+F+D+T AE+R
Sbjct: 1500 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYR- 1558
Query: 119 TYLGLRRKLRLPKD----------ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
+ GL +PK A + + DLP FDWR+ GAV VK+QGSCGSC
Sbjct: 1559 AHTGLV----VPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSC 1614
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS G +EG + + T KL S SEQ+L+DCD D+GC GG M+ AF+
Sbjct: 1615 WAFSAVGNVEGLHQIKTKKLESYSEQELIDCD---------KVDNGCGGGYMDDAFKAIE 1665
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
+ GGL E DYPY + +C F++S V + +E IA L+KNGP+A+ +
Sbjct: 1666 QLGGLELENDYPYEAKAQ-KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGL 1724
Query: 289 NAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
NA MQ Y GG+S P+ +C+ + +DHGVL+VGYG Y P+ K PYWIIKNSWG W
Sbjct: 1725 NANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKEY-PMFNKTLPYWIIKNSWGPRW 1783
Query: 346 GENGYYKICRGRNVCGVDSMVSTVAAA 372
GE GYY+I RG N CGV M S+ A
Sbjct: 1784 GEQGYYRIYRGDNSCGVSEMASSAILA 1810
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 187/316 (59%), Gaps = 28/316 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F + +NK Y+S+E ++ R +IFK NLRR K D A HGITQF+DLT EF
Sbjct: 30 FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKND-EAQHGITQFADLTHEEFADM 88
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLG + +LR + P P DW KGAV PVK+QGSCGSCW+FSTTG++EG
Sbjct: 89 YLGYKPQLRNSQAKVSLSSTPFT-APTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEG 147
Query: 180 ANFLATGK-LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
L + L S SEQQLVDCD + D GCNGGLM++AF Y L++ L E
Sbjct: 148 QYVLQLKQNLTSFSEQQLVDCDTK--------EDQGCNGGLMDNAFTY-LESAKLETESA 198
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF------SVVSLDEDQIAANLVKNGPLAVAINAVY 292
YPYT D +CK+++S VA+F V+ E+ + L GPL+VAINA
Sbjct: 199 YPYTAVD--GSCKYNQSLGVVGVASFVDIEQGKTVADTENTMGVALDNIGPLSVAINANN 256
Query: 293 MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Q Y GG+S P IC+ L+HGVL+VG GS K +W +KNSWG SWGE GY+
Sbjct: 257 LQFYAGGISNPLICNPNGLNHGVLIVGLGSE-------NGKDFWKVKNSWGASWGEKGYF 309
Query: 352 KICRGRNVCGVDSMVS 367
+I RG+ CG++ VS
Sbjct: 310 RIVRGKGKCGINRAVS 325
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 190/323 (58%), Gaps = 22/323 (6%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F+ + YAS EH+ RF IF+ NL + + K + +A +G+T+F+D+T AE+R
Sbjct: 643 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYR- 701
Query: 119 TYLGL------RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+ GL R + A + + DLP FDWR+ GAV VK+QGSCGSCW+FS
Sbjct: 702 AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSCGSCWAFS 761
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
G +EG + + T KL S SEQ+L+DCD D+GC GG M+ AF+ + GG
Sbjct: 762 AVGNVEGLHQIKTKKLESYSEQELIDCD---------KVDNGCGGGYMDDAFKAIEQLGG 812
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E DYPY + +C F++S V + +E IA L+KNGP+A+ +NA
Sbjct: 813 LELENDYPYEAKAQ-KSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANA 871
Query: 293 MQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y GG+S P+ +C+ + +DHGVL+VGYG Y P+ K PYWIIKNSWG WGE G
Sbjct: 872 MQFYRGGISHPWHPLCNHKSIDHGVLIVGYGIKEY-PMFNKTLPYWIIKNSWGPRWGEQG 930
Query: 350 YYKICRGRNVCGVDSMVSTVAAA 372
YY+I RG N CGV M S+ A
Sbjct: 931 YYRIYRGDNSCGVSEMASSAILA 953
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H + ++ L A+H F F+ +F + Y S E R IF+ NL+ + +
Sbjct: 290 EKKTHKKHSHRALDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMG 349
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVG 157
SA +GIT+F+D+T +E++ GL ++ ++P +LP +FDWR+K AV
Sbjct: 350 SAKYGITEFADMTSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVT 408
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
VK+QGSCGSCW+FS TG +EG + + TG L SEQ+L+DCD + DS CNG
Sbjct: 409 QVKNQGSCGSCWAFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNG 459
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAA 276
GLM++A++ GGL E +YPY + + C F+++ VA F + +E +
Sbjct: 460 GLMDNAYKAIKDIGGLEYEAEYPYKA--KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQE 517
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKP 333
L+ NGP+++ INA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K P
Sbjct: 518 WLLTNGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSEY-PNFHKTLP 576
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
YWI+KNSWG WGE GYY++ RG N CGV M ++ A
Sbjct: 577 YWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVLA 615
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 191/313 (61%), Gaps = 27/313 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL 121
FK K+ K Y E+ + RF IFK+N+ +A +Q + SA +G+T +SDLT EF RT+L
Sbjct: 123 FKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHL 181
Query: 122 GLRRKLRLPKDADQAPI---LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+P P N++P +FDWREKGAV VK+QG CGSCW+FSTTG +E
Sbjct: 182 --TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVE 239
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD D GCNGGL ++A+E +K GGLM E++
Sbjct: 240 SQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKMGGLMLEDN 290
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C +A + + ++ DE ++AA L N ++V +NA+ +Q Y
Sbjct: 291 YPYDA--KNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQH 348
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGENGY+++ R
Sbjct: 349 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGENGYFRMYR 402
Query: 356 GRNVCGVDSMVST 368
G CG++++ ++
Sbjct: 403 GDGTCGINTVATS 415
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 191/313 (61%), Gaps = 27/313 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL 121
FK K+ K Y E+ + RF IFK+N+ +A +Q + SA +G+T +SDLT EF RT+L
Sbjct: 161 FKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHL 219
Query: 122 GLRRKLRLPKDADQAPI---LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+P P N++P +FDWREKGAV VK+QG CGSCW+FSTTG +E
Sbjct: 220 --TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVE 277
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD D GCNGGL ++A+E +K GGLM E++
Sbjct: 278 SQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKMGGLMLEDN 328
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C +A + + ++ DE ++AA L N ++V +NA+ +Q Y
Sbjct: 329 YPYDA--KNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQH 386
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGENGY+++ R
Sbjct: 387 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGENGYFRMYR 440
Query: 356 GRNVCGVDSMVST 368
G CG++++ ++
Sbjct: 441 GDGTCGINTVATS 453
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 190/313 (60%), Gaps = 28/313 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL 121
FK K+ K Y E + RF IFK+N+ +A +Q + SA +G+T +SDLT EF RT+L
Sbjct: 161 FKLKYRKQY--HETDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHL 218
Query: 122 GLRRKLRLPKDADQAPI---LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+P P N++P +FDWREKGAV VK+QG CGSCW+FSTTG +E
Sbjct: 219 --TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVE 276
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD D GCNGGL ++A+E +K GGLM E++
Sbjct: 277 SQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKMGGLMLEDN 327
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C +A + + ++ DE ++AA L N ++V +NA+ +Q Y
Sbjct: 328 YPYDA--KNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQH 385
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGENGY+++ R
Sbjct: 386 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGENGYFRMYR 439
Query: 356 GRNVCGVDSMVST 368
G CG++++ ++
Sbjct: 440 GDGTCGINTVATS 452
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 191/317 (60%), Gaps = 27/317 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + KAYA++++ RF IFK NL RA ++Q + +A +G+TQFSDLTP
Sbjct: 28 ARELYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EF YLG R R+ D+ + PA DWR+KGAVGPV+DQGSCGSCW+FS T
Sbjct: 87 EFAAMYLGSRIDERV----DRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSVT 142
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
+EG FL TG+LVSLS+QQLVDCD D GC+GG ++ + GGL
Sbjct: 143 ANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGLE 193
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
+ YPYT + AC+ D+SK+ A + + V+ DE++ AA L ++GP++ +NA +Q
Sbjct: 194 LQSAYPYTSWKQ--ACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQ 251
Query: 295 TYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+ P +CS L+H VL VGY + PYW ++NSWG WGENGY+
Sbjct: 252 FYQSGILHPSKAMCSPEGLNHAVLTVGYDTE-------HGVPYWTVRNSWGTRWGENGYF 304
Query: 352 KICRGRNVCGVDSMVST 368
+I RG CG+D + ++
Sbjct: 305 RIYRGDGTCGIDRLTTS 321
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 193/320 (60%), Gaps = 27/320 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK-LDPSATHGITQFSDLTPA 114
+ + FK K+ K Y E+ + RF IFK+N+ +A +Q + SA +G+T +SDLT
Sbjct: 16 VDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTD 74
Query: 115 EFRRTYLGLRRKLRLPKDADQAPI---LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF RT+L +P P N++P +FDWREKGAV VK+QG CGSCW+F
Sbjct: 75 EFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAF 132
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
STTG +E F TGKL+SLSEQQLVDCD D GCNGGL ++A+E +K G
Sbjct: 133 STTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKMG 183
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GLM E++YPY + C +A + + ++ DE ++AA L N ++V +NA+
Sbjct: 184 GLMLEDNYPYDA--KNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 241
Query: 292 YMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Q Y G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGEN
Sbjct: 242 LLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGEN 295
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY+++ RG CG++++ ++
Sbjct: 296 GYFRMYRGDGSCGINTVATS 315
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 187/320 (58%), Gaps = 30/320 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F TG L++LSEQQLVDCDH D GCNGG + K G
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG D C ++SK A V + +V+ L E A L + GPL+ A+NAV
Sbjct: 194 GLELASDYPYTGVD--GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAV 251
Query: 292 YMQTYIGGV--SCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Q Y+GG+ P++C+ L+H VL VGYG+ PYWI+KNSWG +GE
Sbjct: 252 LLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGI-------PYWIVKNSWGVGFGEK 304
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY++I RG CG++ +VST
Sbjct: 305 GYFRIFRGAGTCGINLVVST 324
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 183/318 (57%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A + +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P +C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPRLCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 190/322 (59%), Gaps = 27/322 (8%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
LLG F F K+ K Y+SQEE + R IF+ NL+ A + Q LD SA +G+T+FSD
Sbjct: 170 QLLG---QFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSD 226
Query: 111 LTPAEFRRTYLG-LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
LT EFR TYL L + L + AP T + +DWR+ GAV PVK+QG CGSCW
Sbjct: 227 LTEEEFRSTYLNPLLSQWTLHRGMKPAPPAKTPAPDS-WDWRDHGAVSPVKNQGMCGSCW 285
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS TG +EG FL G L+SLSEQ+LVDCD D C GGL ++A+E K
Sbjct: 286 AFSVTGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEK 336
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL E DY YTG C F K+AA + + + DE +IAA L +NGP++VA+N
Sbjct: 337 LGGLESETDYSYTG--HKQKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALN 394
Query: 290 AVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y GVS P+ C+ +DH VLLVGYG P+W IKNSWGE +G
Sbjct: 395 AFAMQFYKKGVSHPWKIFCNPWMIDHAVLLVGYGERNGI-------PFWAIKNSWGEDYG 447
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GYY + RG N CG++ M S+
Sbjct: 448 EQGYYYLQRGSNACGINRMGSS 469
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 194/309 (62%), Gaps = 23/309 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
++ FK + K Y + +++ RF+IFK+NL +A +Q L+ SA +G+T +SDLT EF R
Sbjct: 157 YAQFKLTYRKQY-HETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSR 215
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
T+L + ++ +P D+P +FDWREKGAV VK+QG CGSCW+FSTTG +E
Sbjct: 216 THLTAPWRASSKRNT-ISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIE 274
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD S D GCNGGL ++A+E ++ GGLM E++
Sbjct: 275 SQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRMGGLMLEDN 325
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C + +AA + + ++ DE ++A L + ++V +NA+ +Q Y
Sbjct: 326 YPYDA--KNEKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRH 383
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGE GY+++ R
Sbjct: 384 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGEKGYFRMYR 437
Query: 356 GRNVCGVDS 364
G CG+++
Sbjct: 438 GDGTCGINT 446
>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/346 (41%), Positives = 188/346 (54%), Gaps = 48/346 (13%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL--------DPSATHGITQ 107
A ++F FKK + K YA++EEH R+ IF N+ + + P A +GITQ
Sbjct: 11 ALNYFQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAGKPVAQYGITQ 70
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILP-----TNDLPADFDWREKGAVGPVKDQ 162
F D++P EF R L K KD + P P + LP FDWRE GAV VKDQ
Sbjct: 71 FMDMSPNEFARVKLLPPTK---QKDINHTPTAPKEKYQIDALPESFDWREHGAVTAVKDQ 127
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
SCGSCW+FST +EGA FLA L S QQLVDCD + + GC GG
Sbjct: 128 ASCGSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCD---------NLNCGCFGGFPFI 178
Query: 223 AFEYTLKAGGLMREEDYPY-------------------TGTDRGHACKFDKSKIAASVAN 263
A +Y K GGL E YPY +G C ++ A VA
Sbjct: 179 AMQYIQKRGGLATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRTCSVQNYQLVAKVAG 238
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAG 323
+ VS +ED IAA LVKNGPL++ +NA+++Q Y G+S P C +DH VLLVG+G+
Sbjct: 239 YENVSQNEDDIAAYLVKNGPLSICLNAMWLQFYHSGISDPMYCPPDIDHAVLLVGFGTHT 298
Query: 324 YAPIRLKEKP-YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
L EK YWI+KNSWGESWGE GY+++ RG++ CG+++MV+
Sbjct: 299 N---WLGEKTNYWIVKNSWGESWGEKGYFRLIRGKDKCGINTMVAN 341
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 195/326 (59%), Gaps = 26/326 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+H F F+ K+ + YA+ EH R IF+ +L+ Q+L+ SA +GIT+F+D+T
Sbjct: 290 DHLFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTI---QELNANEQGSAKYGITEFADMT 346
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWS 170
E+ + GL ++ A ++P +LP +FDWR+K AV VK+QG CGSCW+
Sbjct: 347 STEYAQR-AGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSCWA 405
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TG +EGA + TG L SEQ+L+DCD S DS CNGGLM++A++
Sbjct: 406 FSVTGNIEGAYAIKTGDLQEFSEQELLDCD---------SKDSACNGGLMDNAYKAIKDI 456
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GGL E +YPY G + C F+++ V+ F + +E + L+ NGP+++ IN
Sbjct: 457 GGLEYESEYPYEGKKK--QCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGIN 514
Query: 290 AVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WG
Sbjct: 515 ANAMQFYRGGVSHPWSPLCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWG 573
Query: 347 ENGYYKICRGRNVCGVDSMVSTVAAA 372
E GYY++ RG N CGV M ++ A
Sbjct: 574 EQGYYRVYRGDNTCGVSEMATSALLA 599
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 188/327 (57%), Gaps = 45/327 (13%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F + K Y ++ E RF +FK N + QK + +A +G T+FSD+T EF+
Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFK- 232
Query: 119 TYLGLRRKLRLPKDADQAPILPTN--------------DLPADFDWREKGAVGPVKDQGS 164
K+ LP +Q P+ P DLP FDWREKGAV VK+QG+
Sbjct: 233 -------KIMLPYQWEQ-PVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGN 284
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FSTTG +EGA F+A KLVSLSEQ+LVDCD S D GCNGGL ++A+
Sbjct: 285 CGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCD---------SMDQGCNGGLPSNAY 335
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
+ ++ GGL E+ YPY G RG C + IA + + DE ++ LV GP+
Sbjct: 336 KEIIRMGGLEPEDAYPYDG--RGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPI 393
Query: 285 AVAINAVYMQTYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
++ +NA +Q Y GV P+ C L+HGVL+VGYG G KPYWI+KNSW
Sbjct: 394 SIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG-------RKPYWIVKNSW 446
Query: 342 GESWGENGYYKICRGRNVCGVDSMVST 368
G +WGE GY+K+ RG+NVCGV M ++
Sbjct: 447 GPNWGEAGYFKLYRGKNVCGVQEMATS 473
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 198/339 (58%), Gaps = 20/339 (5%)
Query: 41 EILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP- 99
E +H + ++ +H F F+ +F + Y S E R IF+ NL+ +
Sbjct: 290 EKKTHKKHSHRAFDKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMG 349
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVG 157
SA +GIT+F+D+T +E++ GL ++ A ++P +LP +FDWR+K AV
Sbjct: 350 SAKYGITEFADMTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVT 408
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
VK+QGSCGSCW+FS TG +EG + TG+L SEQ+L+DCD + DS CNG
Sbjct: 409 QVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNG 459
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAA 276
GLM++A++ GGL E +YPY + + C F+++ VA F + +E +
Sbjct: 460 GLMDNAYKAIKDIGGLEYEAEYPYKA--KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQE 517
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKP 333
L+ NGP+++ INA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K P
Sbjct: 518 WLLTNGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLP 576
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
YWI+KNSWG WGE GYY++ RG N CGV M ++ A
Sbjct: 577 YWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVLA 615
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 196/334 (58%), Gaps = 20/334 (5%)
Query: 46 HESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHG 104
H+ ++ +H F F+ +F + Y S E R IF+ NL+ + SA +G
Sbjct: 294 HKKHSHRFDKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYG 353
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPVKDQ 162
IT+F+D+T +E++ GL ++ A ++P +LP +FDWR+K AV VK+Q
Sbjct: 354 ITEFADMTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQ 412
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
GSCGSCW+FS TG +EG + TG+L SEQ+L+DCD + DS CNGGLM++
Sbjct: 413 GSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDN 463
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKN 281
A++ GGL E +YPY + + C F+++ VA F + +E + L+ N
Sbjct: 464 AYKAIKDIGGLEYEAEYPYKA--KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLAN 521
Query: 282 GPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GP+++ INA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+K
Sbjct: 522 GPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYWIVK 580
Query: 339 NSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
NSWG WGE GYY++ RG N CGV M ++ A
Sbjct: 581 NSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVLA 614
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/337 (40%), Positives = 199/337 (59%), Gaps = 26/337 (7%)
Query: 46 HESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SA 101
H+ ++ +H F F+ +F + Y S E R IF+ NL+ ++L+ SA
Sbjct: 155 HKKHSHRFDKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTI---EELNANEMGSA 211
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPV 159
+GIT+F+D+T +E++ GL ++ A ++P +LP +FDWR+K AV V
Sbjct: 212 KYGITEFADMTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQV 270
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
K+QGSCGSCW+FS TG +EG + TG+L SEQ+L+DCD + DS CNGGL
Sbjct: 271 KNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGL 321
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANL 278
M++A++ GGL E +YPY + + C F+++ VA F + +E + L
Sbjct: 322 MDNAYKAIKDIGGLEYEAEYPYKA--KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWL 379
Query: 279 VKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYW 335
+ NGP+++ INA MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYW
Sbjct: 380 LANGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNFHKTLPYW 438
Query: 336 IIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
I+KNSWG WGE GYY++ RG N CGV M ++ A
Sbjct: 439 IVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVLA 475
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 197/328 (60%), Gaps = 39/328 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSD 110
+LLG F F K+NK Y+SQEE D R IFK NL+ A + Q LD SA +G+T+FSD
Sbjct: 173 ELLGL---FKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSD 229
Query: 111 LTPAEFRRTYLG-------LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
LT EFR TYL LRR ++ P ++P PA +DWR+ GAV PVK+QG
Sbjct: 230 LTEEEFRLTYLNPLLSQWTLRRPMK-PASPARSPA------PASWDWRDHGAVSPVKNQG 282
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FS TG +EG FL GKL+SLSEQ+LVDCD D C GGL ++A
Sbjct: 283 LCGSCWAFSVTGNIEGQWFLKHGKLLSLSEQELVDCD---------GLDHACRGGLPSNA 333
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+E GGL E DY Y+G C F K+AA + + + DE+++AA L +NGP
Sbjct: 334 YEAIEGLGGLEAENDYTYSG--HKQKCSFATEKVAAYINSSVELPSDENEMAAWLAENGP 391
Query: 284 LAVAINAVYMQTYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VA+NA MQ Y GVS P+ +C+ +DH VLLVGYG P+W IKNS
Sbjct: 392 VSVALNAFAMQFYKKGVSHPWMILCNPWMIDHAVLLVGYGERNGI-------PFWAIKNS 444
Query: 341 WGESWGENGYYKICRGRNVCGVDSMVST 368
WGE +GE GYY + +G N CG++ M S+
Sbjct: 445 WGEDYGEEGYYYLYKGSNACGINKMGSS 472
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 191/317 (60%), Gaps = 27/317 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + KAYA+ E+ RF IFK NL RA ++Q + +A +G+TQFSDLT
Sbjct: 28 ARELYEQFKRDYGKAYAN-EDDQKRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EF YLG R R+ D+ + PA DWREKGAVGPV+ QGSCGSCW+FS T
Sbjct: 87 EFAAMYLGSRIDERV----DRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSVT 142
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
+EG FL TG+LVSLS+QQLVDCD D GC+GG ++ + GGL
Sbjct: 143 ANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGLE 193
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
+ YPYTG ++ AC+ D+SK+ A + + V+ +E++ AA L ++GP++ +NA +Q
Sbjct: 194 LQSAYPYTGWEQ--ACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQ 251
Query: 295 TYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+ P Y CS L+H VL VGY + + PYW ++NSWG WGENGY+
Sbjct: 252 FYRYGILHPSEYACSPEGLNHAVLTVGYDTE-------RGVPYWTVRNSWGTRWGENGYF 304
Query: 352 KICRGRNVCGVDSMVST 368
+I RG CG+D + ++
Sbjct: 305 RIYRGDGTCGIDRLTTS 321
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 188/322 (58%), Gaps = 23/322 (7%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFS 109
+++L F F +NK Y+ QEE R IF NL++A Q++D +A +G+T++S
Sbjct: 157 DEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGTAEYGVTKYS 216
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
DLT EFR YL + P + I+P P +DWR+ GAV VK+QG CGSCW
Sbjct: 217 DLTEDEFRSLYLNPLLSSK-PLYQMKKAIVPNMSAPDQWDWRDHGAVTEVKNQGMCGSCW 275
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS G +EG FL G LVSLSEQ+LVDCD D C GGL ++A+E K
Sbjct: 276 AFSVIGNIEGQWFLKKGSLVSLSEQELVDCD---------GVDHACAGGLPSNAYEAIEK 326
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E++Y Y G + C F SK++A + + + DE++IAA L +NGP+++A+N
Sbjct: 327 LGGIETEQEYSYEG--HKNTCSFSTSKVSAYINSSVEIPKDENEIAAWLAQNGPISIALN 384
Query: 290 AVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A MQ Y G+S P+ +C+ +DH VLLVGYG P+W IKNSWG WG
Sbjct: 385 AFAMQFYRKGISHPFRILCNPWMIDHAVLLVGYGER-------NGTPFWAIKNSWGTDWG 437
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GYY + RG CG+++M S+
Sbjct: 438 EQGYYYLYRGTGACGMNTMCSS 459
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 182/318 (57%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A + +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 192/309 (62%), Gaps = 23/309 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
++ FK + K Y + +++ RF+IFK+NL +A +Q L+ SA +G+T +SDLT EF R
Sbjct: 20 YAQFKLTYRKQY-HETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSR 78
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
T+L + ++ P D+P +FDWREKGAV VK+QG CGSCW+FSTTG +E
Sbjct: 79 THLTAPWRASSKRNT-IPPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIE 137
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD S D GCNGGL ++A+E ++ GGLM E++
Sbjct: 138 SQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRMGGLMLEDN 188
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C +AA + + ++ DE ++A L + ++V +NA+ +Q Y
Sbjct: 189 YPYDA--KNEKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRH 246
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGE GY+++ R
Sbjct: 247 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGEKGYFRMYR 300
Query: 356 GRNVCGVDS 364
G CG+++
Sbjct: 301 GDGTCGINT 309
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 182/318 (57%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A V +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++HGVL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHGVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 145/325 (44%), Positives = 187/325 (57%), Gaps = 40/325 (12%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL-----PTNDLPAD---FDWREKGAVGPVKDQGSCG 166
EF+ YL +R PI+ P D+ D FDWRE GAVGPV DQG CG
Sbjct: 87 EFKTRYLRMRF---------DGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCG 137
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS G +EG F TG L++LSEQQLVDCDH D GCNGG +
Sbjct: 138 SCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGE 188
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
K GGL DYPYTG D C ++SK A V +V+ L E A L + GPL+
Sbjct: 189 IEKMGGLELASDYPYTGVD--GICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSS 246
Query: 287 AINAVYMQTYIGGV--SCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
A+NAV +Q Y+GG+ P++C+ L+H VL VGYG+ PYWI+KNSWG
Sbjct: 247 ALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGI-------PYWIVKNSWGV 299
Query: 344 SWGENGYYKICRGRNVCGVDSMVST 368
+GE GY++I RG CG++ +VST
Sbjct: 300 GFGEKGYFRIFRGAGTCGINLVVST 324
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 192/323 (59%), Gaps = 31/323 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA---ARHQKLDPSATHGITQFSDLTPAE 115
F FK + YAS EH+ R+ IF+ NL + RH++ + +G+T+F+D+T AE
Sbjct: 1477 QFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHER--GTGKYGVTKFADMTTAE 1534
Query: 116 FRRTYLGLRRKLRLPKDAD---QAPILPTN----DLPADFDWREKGAVGPVKDQGSCGSC 168
+R + GL +PK + PI + LP FDWR+ GAV VK+QG+CGSC
Sbjct: 1535 YR-AHTGLI----VPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSC 1589
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS G +EG + + T KL + SEQ+L+DCD + D+GCNGG M+ AF+
Sbjct: 1590 WAFSAIGNIEGLHQIKTKKLEAYSEQELIDCD---------TVDNGCNGGYMDDAFKAIE 1640
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
K GGL E++YPY + C F+K+ V + +E IA L++NGP+A+ +
Sbjct: 1641 KLGGLELEDEYPYQAKAQ-KTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGL 1699
Query: 289 NAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
NA MQ Y GG+S P+ +CS +++DHGVL+VGYG Y P+ K PYW IKNSWG W
Sbjct: 1700 NANAMQFYRGGISHPWHLLCSHKQIDHGVLIVGYGVKEY-PLFNKTLPYWTIKNSWGPKW 1758
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GYY+I RG N CGV M S+
Sbjct: 1759 GEQGYYRIYRGDNSCGVSEMASS 1781
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 189/318 (59%), Gaps = 27/318 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F + K Y+++ E RF FK N + QK + +A +G T+FSD+T EF++
Sbjct: 172 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQ 231
Query: 119 TYLGLRRKLRL-PKDA----DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
T L + + + P D + + DLP FDWR+KGAV VK+QG+CGSCW+FST
Sbjct: 232 TMLPYQWEQPVYPMDQADFEKEGITISEEDLPESFDWRDKGAVTQVKNQGNCGSCWAFST 291
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG +EGA FLA KLVSLSEQ+LVDCD D GCNGGL ++A++ ++ GGL
Sbjct: 292 TGNVEGAWFLAKNKLVSLSEQELVDCD---------GVDQGCNGGLPSNAYKEIIRMGGL 342
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E+ YPY G +G C + IA + + DE ++ LV GP+++ +NA +
Sbjct: 343 EPEDAYPYDG--KGETCHLVRKDIAVYINGSIELPHDEVEMQKWLVTKGPISIGLNANTL 400
Query: 294 QTYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV P+ C L+HGVL+VGYG G KPYWI+KNSWG +WGE+GY
Sbjct: 401 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG-------RKPYWIVKNSWGPTWGESGY 453
Query: 351 YKICRGRNVCGVDSMVST 368
+K+ RG+NVCGV M ++
Sbjct: 454 FKLYRGKNVCGVQEMATS 471
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 187/323 (57%), Gaps = 20/323 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
EH F F+ +F + Y S E R IF+ NL+ + SA +GIT+F+D+T E
Sbjct: 311 EHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSTE 370
Query: 116 FRRTYLGLRRKLRLPKDADQAPILP--TNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
++ GL ++ ++P + +LP +FDWR K AV VK+QG CGSCW+FS
Sbjct: 371 YKER-TGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSCWAFSV 429
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG +EG L G+L SEQ+L+DCD + DS CNGGLM++A++ GGL
Sbjct: 430 TGNIEGLYALKYGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 480
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY 292
E +YPY + C F+K+ V +F + +E + LV NGP+++ INA
Sbjct: 481 EYEAEYPYEAKKK--QCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGINANA 538
Query: 293 MQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WGE G
Sbjct: 539 MQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY-PNYHKTLPYWIVKNSWGPRWGEQG 597
Query: 350 YYKICRGRNVCGVDSMVSTVAAA 372
YY++ RG N CGV M ++ A
Sbjct: 598 YYRVYRGDNTCGVSEMATSAVLA 620
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 182/318 (57%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A + +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P +C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPRLCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 441
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 181/318 (56%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQGSCGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GGLM++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G CK K+ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V + S+
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 189/319 (59%), Gaps = 29/319 (9%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFS 109
L + + FK K NK+Y S E RF IF+ NLR+ H + + + G+T+F+
Sbjct: 17 LNDKEEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFT 76
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
DLT EF L L + R + + P DLP+ FDWR+KGAV VKDQG CGSCW
Sbjct: 77 DLTEKEFL-DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCW 135
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FSTTG++E A+FL TG LVSLSEQ LVDC + +C GC GG M+ A EY ++
Sbjct: 136 TFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKD-------TC-YGCGGGWMDKALEY-IE 186
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAI 288
GG+M E+DYPY G D C+FD SK+AA ++NF+ + DE+ + + GP++VAI
Sbjct: 187 KGGIMSEKDYPYEGVDDN--CRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAI 244
Query: 289 NA-VYMQTYIGGVSCPYICSRRLD---HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
+A Q Y+ G+ CS D HGVL+VGYG+ K YWIIKNSWG +
Sbjct: 245 DASATFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTEN-------GKDYWIIKNSWGVN 297
Query: 345 WGENGYYKICRGR-NVCGV 362
WG +GY ++ R + N CG+
Sbjct: 298 WGMDGYIRMSRNKNNQCGI 316
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 194/309 (62%), Gaps = 23/309 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
++ FK + K Y + +++ RF+IFK+NL +A +Q L+ SA +G+T +SDLT EF R
Sbjct: 157 YAQFKLTYRKQY-HETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSR 215
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
T+L + ++ +P D+P +FDWR+KGAV VK+QG CGSCW+FSTTG +E
Sbjct: 216 THLTAPWRASSKRNT-ISPRREVGDIPNNFDWRKKGAVTEVKNQGMCGSCWAFSTTGNIE 274
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
F TGKL+SLSEQQLVDCD + D GCNGGL ++A+E ++ GGLM E++
Sbjct: 275 SQWFRKTGKLLSLSEQQLVDCD---------NLDDGCNGGLPSNAYESIIRMGGLMLEDN 325
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
YPY + C + +AA + + ++ DE ++A L + ++V +NA+ +Q Y
Sbjct: 326 YPYDA--KNEKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRH 383
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P+ CS+ LDH VLLVGYG + K +P+WI+KNSWG WGE GY+++ R
Sbjct: 384 GISHPWWIFCSKYLLDHAVLLVGYG------VSEKNEPFWIVKNSWGVEWGEKGYFRMYR 437
Query: 356 GRNVCGVDS 364
G CG+++
Sbjct: 438 GDGTCGINT 446
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 186/319 (58%), Gaps = 31/319 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA+ ++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 28 ARELYEQFKRDYGKVYANDDDQ-KRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTPE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF YL P + + PT P DWRE GAVGPV++QGSCGSCW+FS
Sbjct: 87 EFAAKYLSR------PMNDQVERVRPTGLKAAPERMDWREWGAVGPVENQGSCGSCWAFS 140
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
G +EG FL TG+LVSLS+QQLVDCD D GC GG +A+ ++ GG
Sbjct: 141 VAGNVEGQWFLKTGQLVSLSKQQLVDCD---------VMDYGCGGGWPTNAYMEIMRMGG 191
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L + DYPY G + C +K K+ A + + V+ E++ AA L ++GPL+ A+NA Y
Sbjct: 192 LELQSDYPYVGVQQ--QCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGY 249
Query: 293 MQTYIGGVSCPYI--CS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Q Y G+S P CS L+H VL VGY + PYWIIKNSWG WGENG
Sbjct: 250 LQFYQSGISHPSYEECSPASLNHAVLTVGYDTE-------NGVPYWIIKNSWGTGWGENG 302
Query: 350 YYKICRGRNVCGVDSMVST 368
Y+++ RG CG++ M+++
Sbjct: 303 YFRLYRGDGTCGINRMITS 321
>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
Length = 367
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 181/318 (56%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQG+CGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GGLM++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G CK K+ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V + S+
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 181/318 (56%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A V +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 184/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + DSGCNGGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 450
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 184/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + DSGCNGGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 184/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + DSGCNGGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 449
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 184/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + DSGCNGGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 190/318 (59%), Gaps = 27/318 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F + K Y+++ E RF FK N + QK + SA +G T+FSD+T EF++
Sbjct: 174 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEFKQ 233
Query: 119 TYLGLRRKLRLPKDAD-----QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
T L + + + A+ + + +DLP FDWR+ GAV VK+QG+CGSCW+FST
Sbjct: 234 TMLPYQWEQPVYPMAEADFEKEGVTISEDDLPDSFDWRDHGAVTQVKNQGNCGSCWAFST 293
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG +EGA +LA KLVSLSEQ+LVDCD S D GCNGGL ++A++ ++ GGL
Sbjct: 294 TGNVEGAWYLAKKKLVSLSEQELVDCD---------SVDQGCNGGLPSNAYKEIMRMGGL 344
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E+ YPY G +G C + IA + + DE +I LV GP+++ +NA +
Sbjct: 345 EPEDAYPYDG--KGETCHIVRKDIAVYINGSVELPHDEVKIQKWLVTKGPISIGLNANTL 402
Query: 294 QTYIGGVSCPY--ICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV P+ C L+HGVL+VGYG G KPYWI+KNSWG +WGE+GY
Sbjct: 403 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDG-------RKPYWIVKNSWGPTWGESGY 455
Query: 351 YKICRGRNVCGVDSMVST 368
+++ RG+NVCGV M ++
Sbjct: 456 FRLYRGKNVCGVQEMATS 473
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFETRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A V +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 142/323 (43%), Positives = 183/323 (56%), Gaps = 38/323 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL-----PTNDLPAD---FDWREKGAVGPVKDQGSCG 166
EF+ YL +R PI+ P D+ D FDWRE GAVGPV DQG CG
Sbjct: 87 EFKTRYLRMRF---------DGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCG 137
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS G + G F TG L++LSEQQLVDCD+ D GC+GG +
Sbjct: 138 SCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTA 188
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
K GGL DYPYTG G C DKSK A + +++ L E A L GPL+
Sbjct: 189 IQKMGGLELASDYPYTGV--GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSS 246
Query: 287 AINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
A+NA +Q Y GG+ P +C ++H VL VGYG KPYWI+KNSWGE +
Sbjct: 247 ALNADTLQLYKGGIMRPRLCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDF 299
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY++I RG CG++S+V+T
Sbjct: 300 GEEGYFRIYRGDGTCGINSIVTT 322
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 183/319 (57%), Gaps = 35/319 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R +IF N+ RA Q LD +A +G+T+FSDLT EFR
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 119 TYL------GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL GL +K+RL K D + P ++DWR KGAV VK+QG CGSCW+FS
Sbjct: 222 FYLNPLLKEGLGKKMRLAKPVD-------DPAPPEWDWRNKGAVTKVKNQGMCGSCWAFS 274
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+LVDCD + D C GGL ++A+ GG
Sbjct: 275 VTGNVEGQWFLKQGDLLSLSEQELVDCD---------TLDKACMGGLPSNAYSAIKTLGG 325
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E+DY Y G C F K+ + + +S DE ++AA L K GP+++AINA
Sbjct: 326 LETEDDYSYHG--HLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPISIAINAFG 383
Query: 293 MQTYIGGVSCP--YICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y G+S P +CS +DH VLLVGYG+ + P+W IKNSWG WGE G
Sbjct: 384 MQFYRRGISRPLRLLCSPWFIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEEG 436
Query: 350 YYKICRGRNVCGVDSMVST 368
YY + RG CGV+ M S+
Sbjct: 437 YYYLHRGSRACGVNVMASS 455
>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 452
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 180/319 (56%), Gaps = 25/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQGSCGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD S D+GC GG M++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCD---------SKDNGCGGGFMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G CK ++ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGVDSMVSTV 369
+I +G N C V + S+
Sbjct: 312 IRIEKGTNQCLVAQLASSA 330
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + F K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFTLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A V +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEKGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 189/322 (58%), Gaps = 18/322 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
+H F F+ +F + Y + E R IF+ NL+ + SA +GIT+F+D+T E
Sbjct: 320 DHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTE 379
Query: 116 FR-RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
++ RT L R + + A + P +FDWR+K AV PVK+QGSCGSCW+FS T
Sbjct: 380 YKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCWAFSVT 439
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + TG+L SEQ+L+DCD + DS CNGGLM++A++ GGL
Sbjct: 440 GNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGLE 490
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E +YPY + C F+++ V+ F + +E + L+ +GP+++ +NA M
Sbjct: 491 YEAEYPYEA--KKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAM 548
Query: 294 QTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WGE GY
Sbjct: 549 QFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 607
Query: 351 YKICRGRNVCGVDSMVSTVAAA 372
Y++ RG N CGV M ++ A
Sbjct: 608 YRVYRGDNTCGVSEMATSAVLA 629
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 189/322 (58%), Gaps = 18/322 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
+H F F+ +F + Y + E R IF+ NL+ + SA +GIT+F+D+T E
Sbjct: 318 DHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTE 377
Query: 116 FR-RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
++ RT L R + + A + P +FDWR+K AV PVK+QGSCGSCW+FS T
Sbjct: 378 YKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCWAFSVT 437
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + TG+L SEQ+L+DCD + DS CNGGLM++A++ GGL
Sbjct: 438 GNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGLE 488
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E +YPY + C F+++ V+ F + +E + L+ +GP+++ +NA M
Sbjct: 489 YEAEYPYEA--KKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAM 546
Query: 294 QTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WGE GY
Sbjct: 547 QFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 605
Query: 351 YKICRGRNVCGVDSMVSTVAAA 372
Y++ RG N CGV M ++ A
Sbjct: 606 YRVYRGDNTCGVSEMATSAVLA 627
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 178/294 (60%), Gaps = 21/294 (7%)
Query: 80 RFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI 138
RF IF+ N+++ + A +G+TQFSDL EFRR YL + L D +A I
Sbjct: 2 RFKIFRENMKKINTLNDNELGDAEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRAKI 61
Query: 139 LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVD 198
P D PA FDWR+ AV PVK+QG CGSCW+FSTT +EG + KLVSLSEQ+LVD
Sbjct: 62 -PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQELVD 120
Query: 199 CDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIA 258
CD D GC GGL +A+E ++ GGL E+ YPY D CKF +A
Sbjct: 121 CD---------KLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAEDE--KCKFTVGDVA 169
Query: 259 ASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVL 315
+ + +S +E +AA L KNGP+++ INA MQ Y+GGVS P ++CS LDHGVL
Sbjct: 170 VYINSSVNISSNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPFSFLCSPDELDHGVL 229
Query: 316 LVGYGS-AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
+VGYG+ G+ + PYWI+KNSWG SWG GYY + RG VCG++ M ++
Sbjct: 230 IVGYGTKKGW----FSDSPYWIVKNSWGASWGVQGYYLVYRGDGVCGLNKMPTS 279
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 189/322 (58%), Gaps = 18/322 (5%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
+H F F+ +F + Y + E R IF+ NL+ + SA +GIT+F+D+T E
Sbjct: 168 DHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTE 227
Query: 116 FR-RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
++ RT L R + + A + P +FDWR+K AV PVK+QGSCGSCW+FS T
Sbjct: 228 YKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCWAFSVT 287
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG + TG+L SEQ+L+DCD + DS CNGGLM++A++ GGL
Sbjct: 288 GNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGLE 338
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E +YPY + C F+++ V+ F + +E + L+ +GP+++ +NA M
Sbjct: 339 YEAEYPYEA--KKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAM 396
Query: 294 QTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GGVS P+ +CS++ LDHGVL+VGYG + Y P K PYWI+KNSWG WGE GY
Sbjct: 397 QFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDY-PNFHKTLPYWIVKNSWGPRWGEQGY 455
Query: 351 YKICRGRNVCGVDSMVSTVAAA 372
Y++ RG N CGV M ++ A
Sbjct: 456 YRVYRGDNTCGVSEMATSAVLA 477
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LS QQLVDCD+ D GC+GG + K G
Sbjct: 143 SVIGNVVGQWFRETGHLLALSGQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A V +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|16076439|emb|CAC94444.1| cysteine proteinase [Betula pendula]
Length = 133
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 113/134 (84%), Positives = 125/134 (93%), Gaps = 1/134 (0%)
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
DHECDPEE G+CDSGC+GGLM +AFEYTLKAGGL RE+DYPYTGTDRG +CKFDKSKIAA
Sbjct: 1 DHECDPEEYGACDSGCSGGLMTTAFEYTLKAGGLEREKDYPYTGTDRG-SCKFDKSKIAA 59
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGY 319
SV+NFSVVS+DEDQIAANLVKNGPLA+ INA +MQTY+ GVSCPYIC RRLDHGVLLVGY
Sbjct: 60 SVSNFSVVSIDEDQIAANLVKNGPLAIGINAAFMQTYMKGVSCPYICGRRLDHGVLLVGY 119
Query: 320 GSAGYAPIRLKEKP 333
GSAG++PIR KEKP
Sbjct: 120 GSAGFSPIRFKEKP 133
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/325 (44%), Positives = 187/325 (57%), Gaps = 40/325 (12%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL-----PTNDLPAD---FDWREKGAVGPVKDQGSCG 166
EF+ YL +R PI+ P D+ D FDWRE GAVGPV DQG CG
Sbjct: 87 EFKTRYLRMRF---------DGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCG 137
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS G +EG F TG L++LSEQQLVDCDH + GCNGG +
Sbjct: 138 SCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH---------LEKGCNGGYPPKTYGE 188
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
K GGL DYPYTG D C ++SK A V + +V+ L E A L + GPL+
Sbjct: 189 IEKMGGLELASDYPYTGVD--GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSS 246
Query: 287 AINAVYMQTYIGGV--SCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
A+NAV +Q Y+GG+ P++C+ L+H VL VGYG+ PYWI+KNS G
Sbjct: 247 ALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGI-------PYWIVKNSLGV 299
Query: 344 SWGENGYYKICRGRNVCGVDSMVST 368
+GE GY++I RG CG++ +VST
Sbjct: 300 GFGEKGYFRIFRGAGTCGINLVVST 324
>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 184/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVK QG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKVQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + DSGCNGGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S++LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDAESFMDYNGGILTSC---TSKQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 184/322 (57%), Gaps = 35/322 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 113 FKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 172
Query: 119 TYLG------LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL L +K+RL K + P ++DWR+KGAV VK+QG CGSCW+FS
Sbjct: 173 MYLNPLLKEELGKKMRLVK-------FVGDPAPPEWDWRKKGAVTKVKNQGMCGSCWAFS 225
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+LVDCD D C GGL ++A+ GG
Sbjct: 226 VTGNVEGQWFLKRGDLLSLSEQELVDCD---------KVDKACMGGLPSNAYSAIKTLGG 276
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E+DY Y+G C F K + + +S +E ++AA L KNGP+++AINA
Sbjct: 277 LETEDDYSYSG--HLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGPISIAINAFG 334
Query: 293 MQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y G+S P +CSR +DH VLLVGYG+ + P+W IKNSWG WGE G
Sbjct: 335 MQFYRHGISRPLRPLCSRWFIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEEG 387
Query: 350 YYKICRGRNVCGVDSMVSTVAA 371
YY + RG CGV+ M S+
Sbjct: 388 YYYLHRGSGACGVNVMASSAVV 409
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q+++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWRE GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G + G F TG L++LSEQ LVDCD+ D GC+GG K G
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQPLVDCDY---------LDGGCDGGYPPQTNTAIQKMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG G C DKSK A + +++ L E A L GPL+ A+NA
Sbjct: 194 GLELASDYPYTGV--GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNAD 251
Query: 292 YMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y GG+ P +C ++H VL VGYG KPYWI+KNSWGE +GE GY
Sbjct: 252 TLQLYKGGIMRPRLCDPAGVNHAVLTVGYGVQ-------NGKPYWIVKNSWGEDFGEEGY 304
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++S+V+T
Sbjct: 305 FRIYRGDGTCGINSIVTT 322
>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
Length = 450
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 183/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + D GC GGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY + PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDS-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 191/322 (59%), Gaps = 29/322 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA---THGITQFSDLT 112
AE H++ FK K+Y +E R IF+ NL +++ S T G+ +F+D+T
Sbjct: 24 AEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDA--DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
EF LGL + ++ D+ + + + DLPA+ DW +KG V VK+QG CGSCW+
Sbjct: 84 NTEFSNMLLGLGGRNKIAGDSVFESSHV---QDLPAEVDWTQKGYVTEVKNQGQCGSCWA 140
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTG+LEG F TGKLVSLSEQ LVDC + GCNGGLM+ AF Y K
Sbjct: 141 FSTTGSLEGQVFKKTGKLVSLSEQNLVDCS-------TSEGNQGCNGGLMDQAFTYIKKN 193
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E YPYTG+D C+F ++K+ A+V+ F V S DE+ + + GP++VAI+
Sbjct: 194 GGIDTEAAYPYTGSD--GTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAID 251
Query: 290 A--VYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A ++ Q Y GGV P+ CS LDHGVL+VGYG+ G K YW++KNSWG SWG
Sbjct: 252 ASSIFFQFYRGGVYNPWFCSSTELDHGVLVVGYGTEG-------GKDYWLVKNSWGSSWG 304
Query: 347 ENGYYKICRG-RNVCGVDSMVS 367
GY K+ R +N CG+ + S
Sbjct: 305 LKGYIKMVRNKKNRCGIATQAS 326
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 183/323 (56%), Gaps = 38/323 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA++++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 23 ARELYEQFKRDYGKVYANEDDQ-KRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPE 81
Query: 115 EFRRTYLGLRRKLRLPKDADQAP-ILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF YL P + DQ + PT P DWR KGAV V++QGSCGSCW+F
Sbjct: 82 EFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 135
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ + G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 186
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY G + C +K K+ A + + V+ +E+ AA L ++GPL+ +NAV
Sbjct: 187 GLESESDYPYVGVE--QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAV 244
Query: 292 YMQTYIGGV------SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG W
Sbjct: 245 ALQYYQSGVLKPTFEECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDW 294
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY+++ RG CG++ M ++
Sbjct: 295 GEKGYFRLFRGDCTCGINRMATS 317
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 186/323 (57%), Gaps = 38/323 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA++++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 23 ARELYEQFKRDYGKVYANEDDQ-KRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPE 81
Query: 115 EFRRTYLGLRRKLRLPKDADQAP-ILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF YL P ++DQ + PT P DWR KGAV PV++QG CGSCW+F
Sbjct: 82 EFAAKYLSP------PLNSDQVERVQPTGLKAAPERMDWRAKGAVTPVENQGECGSCWAF 135
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST G +EG F+ TG+LVSLS+QQLVDCD + GCNGG +S++ + G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDMAAE---------GCNGGWPSSSYLEIMDMG 186
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY G ++ C +K K+ A + + V+ E++ L ++GPL+ +NAV
Sbjct: 187 GLESENDYPYVGVEQ--TCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAV 244
Query: 292 YMQTYIGGV------SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+Q Y G+ CP L+H VL VGY G + PYWIIKNSWG W
Sbjct: 245 ALQHYQSGILHPSHKDCP---DDDLNHAVLTVGYDREG-------DMPYWIIKNSWGTDW 294
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY+++ RG VCG++ M ++
Sbjct: 295 GEKGYFRLFRGDCVCGINRMATS 317
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 189/324 (58%), Gaps = 30/324 (9%)
Query: 56 AEHHFSLFK------KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQF 108
A HH +FK K++NK+Y + +E ++R+ +F N+ RA QK D + +G T+
Sbjct: 45 ATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRYGFTKL 104
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
SDLT E + Y +++ + +A I N LP FDWR KGAV VKDQ CG+C
Sbjct: 105 SDLTDQEVKSFY-AMKKWPQQLYPTKKANIPQLNSLPQSFDWRSKGAVTAVKDQKRCGAC 163
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+F+TTG +EG +L GKL SLSEQ+LVDCD D GC GGL +A+ +
Sbjct: 164 WAFATTGNIEGQWYLNKGKLYSLSEQELVDCD---------KIDEGCKGGLPLNAYHSIM 214
Query: 229 -KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
+ GGL E+DYPY + CK +KS+ + + VS +E +AA LV +GP+A+
Sbjct: 215 NRLGGLETEKDYPYVA--KNGKCKLNKSEEVVYINSSVKVSTNETDLAAWLVAHGPVAIG 272
Query: 288 INAVYMQTYIGGVSCPYI--CS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
IN+V M Y GG++ P C+ + LDHGVL+VGYG K PYWIIKNSWG
Sbjct: 273 INSVNMLHYKGGIAHPTNKDCNPKLLDHGVLIVGYGEE-------KSTPYWIIKNSWGTD 325
Query: 345 WGENGYYKICRGRNVCGVDSMVST 368
WGE GYY++ RG CG++ ++
Sbjct: 326 WGEKGYYRVVRGIGACGLNKSATS 349
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 202/354 (57%), Gaps = 24/354 (6%)
Query: 24 LIDDVDQLIRQVTDGGDE--ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRF 81
L +V+ L R + D+ +L N + +F F +FNK Y +++ ++
Sbjct: 28 LFTNVNHLERYMDSKFDKNLLLKLLPEMNAKEARSWENFKQFMVEFNKWYETEKLTAEKY 87
Query: 82 TIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPIL 139
IFK+N+ A R Q+ + +A +G T F+D+TP EFR+T+L ++ PK + +
Sbjct: 88 NIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEFRKTHLNFNPNNVKKPK---RMANI 144
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
P +++ DWR+ AV VKDQG+CGSCW+F T +EGA + T +L+SLSEQQLVDC
Sbjct: 145 PKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVANIEGAWAVKTAQLISLSEQQLVDC 204
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
D D GC GGL +A+ ++ GGL +EEDY YT R CKF+ +K A
Sbjct: 205 DR---------LDDGCEGGLPVNAYLEIIRLGGLEKEEDYKYTA--RSGKCKFNHTKSAV 253
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLL 316
+ + V+ DED IA + +NGP+AV +NA M Y G++ P +CS ++HGV +
Sbjct: 254 YINDTVVLPEDEDAIARYVSENGPVAVGLNADAMMFYRSGIAHPSRLMCSPDGINHGVTI 313
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVA 370
VGY PYWIIKNSWG +WGE GYY + RG+ VCG+D M S+V
Sbjct: 314 VGYDVKESL---FWSTPYWIIKNSWGPNWGEKGYYYLYRGKGVCGIDQMASSVV 364
>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
Length = 454
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 179/318 (56%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DW KGAV PVKDQG+CGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GGLM++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G CK K+ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V S+
Sbjct: 312 IRIEKGTNQCLVAQRASS 329
>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax Y486]
Length = 447
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 178/318 (55%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 23 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 82
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQGSCGSCWSFS G
Sbjct: 83 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 142
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GG M++AFE+ +K +G +
Sbjct: 143 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 193
Query: 234 MREEDYPYTGTDRGHA-CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY D C ++ A++ + DED IA L NGP+AVA++A
Sbjct: 194 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 253
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 254 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 303
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V + S+
Sbjct: 304 IRIEKGTNQCLVAQLASS 321
>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
Length = 450
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 178/318 (55%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y G K + + T PA DWREKGAV PVKDQG CGSCW+FST G
Sbjct: 98 RARYRNGASYFAAAQKRVRKTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A LVSLSEQ LV CD + D GC GGLM++AF + + + G +
Sbjct: 158 NIEGQWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A++A
Sbjct: 209 FTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Y GG+ SC S +LDHGVLLVGY PYWIIKNSW WGE+GY
Sbjct: 269 FMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGY 318
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C ++ VS+
Sbjct: 319 IRIEKGTNQCLMNQAVSS 336
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 137/323 (42%), Positives = 183/323 (56%), Gaps = 38/323 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA++++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 28 ARELYEQFKRGYGKVYANEDDQ-KRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSF 171
EF YL P + DQ + L P DWR KGAV V++QGSCGSCW+F
Sbjct: 87 EFAAKYLSA------PVNDDQVKRMRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 140
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST G +EG F+ TG+LVSLS+QQLVDCD GCNGG S++ + G
Sbjct: 141 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 191
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY G ++ C +K K+ A + + V+ +E+ AA L ++GPL+ +NAV
Sbjct: 192 GLESESDYPYVGVEQ--TCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAV 249
Query: 292 YMQTYIGGV------SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG W
Sbjct: 250 ALQHYQSGVLKPTFDECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDW 299
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY+++ RG CG++ M ++
Sbjct: 300 GEKGYFRLFRGDCTCGINRMATS 322
>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 450
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 182/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + D GC GGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
Length = 451
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 182/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + D GC GGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
Length = 450
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 182/323 (56%), Gaps = 35/323 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FKKK+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EF
Sbjct: 38 EMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEF 97
Query: 117 RR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +Y +K RL K + + T PA DWREKGAV PVKDQG CGSCW+
Sbjct: 98 RARYRNGASYFAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPVKDQGQCGSCWA 152
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST G +EG +A LVSLSEQ LV CD + D GC GGLM++AF + + +
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNS 203
Query: 231 --GGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
G + E YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A
Sbjct: 204 NGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIA 263
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A Y GG+ SC S +LDHGVLLVGY PYWIIKNSW W
Sbjct: 264 VDATSFMDYNGGILTSC---TSEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMW 313
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE+GY +I +G N C ++ VS+
Sbjct: 314 GEDGYIRIEKGTNQCLMNQAVSS 336
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 138/346 (39%), Positives = 189/346 (54%), Gaps = 28/346 (8%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
LI + T G E + + GA +HF+ F ++ +K Y ++ E RF IFK NL
Sbjct: 35 LIDKKTKGSIEFARLGQHISPKDFGAWNHFTSFIERHDKVYRNESEALKRFGIFKRNLEI 94
Query: 91 AARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL------PTND 143
Q+ D +A +GI QF+DL+P EF++T+L + P ++ L P
Sbjct: 95 IRSAQENDKGTAIYGINQFADLSPEEFKKTHLP--HTWKQPDHPNRIVDLAAEGVDPKEP 152
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP FDWRE GAV VK +G C +CW+FS TG +EG FLA KLVSLS QQL+DCD
Sbjct: 153 LPESFDWREHGAVTKVKTEGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDCD--- 209
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
D GCNGG A++ ++ GGL E+ YPY + C+ S IA +
Sbjct: 210 ------VVDEGCNGGFPLDAYKEIVRMGGLEPEDKYPYEA--KAEQCRLVPSDIAVYING 261
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS-RRLDHGVLLVGYGSA 322
+ DE+++ A LVK GP+++ I +Q Y GGVS P C + HG LLVGYG
Sbjct: 262 SVELPHDEEKMRAWLVKKGPISIGITVDDIQFYKGGVSRPTTCRLSSMIHGALLVGYGVE 321
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
K PYWIIKNSWG +WGE+GYY++ RG N C ++ ++
Sbjct: 322 -------KNIPYWIIKNSWGPNWGEDGYYRMVRGENACRINRFPTS 360
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 184/323 (56%), Gaps = 38/323 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA++++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 23 ARELYEQFKRDYGKVYANEDDQ-KRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPE 81
Query: 115 EFRRTYLGLRRKLRLPKDADQAP-ILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF YL R + DQ + PT P DWREKGAV V++QGSCGSCW+F
Sbjct: 82 EFAAKYL------RAAVNNDQVERVRPTGLKAAPERMDWREKGAVTAVENQGSCGSCWAF 135
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F+ TG+LVSLS+QQLVDCD + GCNGG S++ G
Sbjct: 136 SAAGNVEGQWFIKTGQLVSLSKQQLVDCDRVAE---------GCNGGWPVSSYLEIKHMG 186
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY G ++ C +K K+ A + + V+ E++ AA L ++GPL+ +NAV
Sbjct: 187 GLESESDYPYVGAEQ--TCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAV 244
Query: 292 YMQTYIGGV------SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+Q Y GV CP L+H VL VGY G + PYWIIKNSWG W
Sbjct: 245 ALQHYQSGVLNPTYEECP---DTELNHAVLTVGYDKEG-------DMPYWIIKNSWGTDW 294
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
GE GY+++ RG CG++ M ++
Sbjct: 295 GEKGYFRLFRGDYTCGINRMATS 317
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 179/313 (57%), Gaps = 22/313 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y+SQEE + R IF+ N++ A Q L+ SA +GIT+FSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YL K + I + P +DWR+ GAV PVK+QG CGSCW+FS TG +E
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIE 294
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TG+L+SLSEQ+LVDCD D C GGL ++A+E GGL E D
Sbjct: 295 GQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETETD 345
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
Y YTG +C F K+AA + + + DE +IAA L +NGP++ A+NA MQ Y
Sbjct: 346 YSYTG--HKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRK 403
Query: 299 GVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GVS P + +DH VLLVG+G P+W IKNSWGE +GE GYY + R
Sbjct: 404 GVSHPLKIFCNPWMIDHAVLLVGFGQRNGV-------PFWAIKNSWGEDYGEQGYYYLYR 456
Query: 356 GRNVCGVDSMVST 368
G +CG+ M S+
Sbjct: 457 GSGLCGIHKMCSS 469
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 179/313 (57%), Gaps = 22/313 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y+SQEE + R IF+ N++ A Q L+ SA +GIT+FSDLT EFR
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YL K + I + P +DWR+ GAV PVK+QG CGSCW+FS TG +E
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIE 294
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TG+L+SLSEQ+LVDCD D C GGL ++A+E GGL E D
Sbjct: 295 GQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETETD 345
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
Y YTG +C F K+AA + + + DE +IAA L +NGP++ A+NA MQ Y
Sbjct: 346 YSYTG--HKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRK 403
Query: 299 GVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GVS P + +DH VLLVG+G P+W IKNSWGE +GE GYY + R
Sbjct: 404 GVSHPLKIFCNPWMIDHAVLLVGFGQRNGV-------PFWAIKNSWGEDYGEQGYYYLYR 456
Query: 356 GRNVCGVDSMVST 368
G +CG+ M S+
Sbjct: 457 GSGLCGIHKMCSS 469
>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
Length = 467
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 181/323 (56%), Gaps = 37/323 (11%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR- 117
F+ FK+++ + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 118 RTYLGL------RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
R + G R++ R+P D + D PA DWR++GAV PVKDQG CGSCW+F
Sbjct: 97 RHHSGAAHFAAGRKRARVPVD------VGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAF 150
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK-- 229
S G +EG FLA L SLSEQ LV CD + DSGC+GGLMNSAFE+ ++
Sbjct: 151 SAIGNVEGQWFLAGNALTSLSEQMLVSCD---------TMDSGCDGGLMNSAFEWIVEHH 201
Query: 230 AGGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
G + EE Y Y +G C+ + A + + DE ++A L NGPLAVA+
Sbjct: 202 NGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLAVAV 261
Query: 289 NAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
+A Y GGV SC S LDHGVLLVGY + AP PYWI+KNSWG WG
Sbjct: 262 DASSWMFYTGGVLTSC---VSNELDHGVLLVGYNDSA-AP------PYWIVKNSWGTLWG 311
Query: 347 ENGYYKICRGRNVCGVDSMVSTV 369
E+GY +I +G N C V S+
Sbjct: 312 EDGYVRIAKGTNQCLVKEEASSA 334
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like [Strongylocentrotus purpuratus]
Length = 453
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 184/312 (58%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQE---EHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
F F F + Y + E+++R+++F N+ + + +A +G T+F+D+T AE
Sbjct: 156 FDKFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAE 215
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
FR+ G +K + K A +P +P ++DWR GAV PVK+QG CGSCW+FS G
Sbjct: 216 FRKLQSGPLKKTGIKKQA----AIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIG 271
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+EG + G+L+SLSEQ+LVDCD D GC GG M+ A+E +K GG M
Sbjct: 272 NMEGQWQIKKGELISLSEQELVDCD---------KVDGGCEGGEMSDAYEAIIKLGGAMS 322
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
EE YPY G + CKF+ + + + + +S +E ++A L +GP+++ INA+ MQ
Sbjct: 323 EEKYPYRGEN--EKCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPISIGINALMMQF 380
Query: 296 YIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKE-KPYWIIKNSWGESWGENGYY 351
Y GG++ P+ CS LDHGVL+VGY +K+ +PYWI+KNSWG+ WGE GYY
Sbjct: 381 YFGGIAHPWKIFCSPDSLDHGVLIVGYS--------VKDGEPYWIVKNSWGKDWGEEGYY 432
Query: 352 KICRGRNVCGVD 363
+ RG CG++
Sbjct: 433 LVYRGDGTCGLN 444
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 197/354 (55%), Gaps = 34/354 (9%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D+ D + +VTD ++ + E ++L + F F KKF + Y+S E RF +
Sbjct: 103 DESDTVNMKVTDPVIDLQNWQEGKKTEMLW--NSFLDFIKKFKREYSSVAEQLDRFKKYM 160
Query: 86 ANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI----LP 140
NL + Q + +A +G+TQFSD++P EF++T L R+ + + + L
Sbjct: 161 QNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQKTMLPSLWWDRVVSNGVEYDLKKFNLT 220
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
N+LP FDWR KG V PVK+QGSCGSCW+FS TG +EG + TGKL+SLSEQ+L+DCD
Sbjct: 221 FNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCD 280
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
D GCNGGL +AF + GGL E+ YPY R C +S IA +
Sbjct: 281 R---------IDKGCNGGLPINAFREIQRMGGLEPEDQYPYKA--RNGTCHLIRSAIAVT 329
Query: 261 VANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV------SCPYICSRRLDHGV 314
+ + + +E + A +V+ GPL+V I+A + Y G+ CP +DHGV
Sbjct: 330 IDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPSRSRCP---PSGIDHGV 386
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
L+ GYG PYW IKNSWG+ WGE+GY+++ G++VCGV +VS+
Sbjct: 387 LITGYGVENGL-------PYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSS 433
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 197/354 (55%), Gaps = 34/354 (9%)
Query: 26 DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFK 85
D+ D + +VTD ++ + E ++L + F F KKF + Y+S E RF +
Sbjct: 138 DESDTVNMKVTDPVIDLQNWQEGKKTEMLW--NSFLDFIKKFKREYSSVAEQLDRFKKYM 195
Query: 86 ANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI----LP 140
NL + Q + +A +G+TQFSD++P EF++T L R+ + + + L
Sbjct: 196 QNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQKTMLPSLWWDRVVSNGVEYDLKKFNLT 255
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
N+LP FDWR KG V PVK+QGSCGSCW+FS TG +EG + TGKL+SLSEQ+L+DCD
Sbjct: 256 FNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCD 315
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
D GCNGGL +AF + GGL E+ YPY R C +S IA +
Sbjct: 316 R---------IDKGCNGGLPINAFREIQRMGGLEPEDQYPYKA--RNGTCHLIRSAIAVT 364
Query: 261 VANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV------SCPYICSRRLDHGV 314
+ + + +E + A +V+ GPL+V I+A + Y G+ CP +DHGV
Sbjct: 365 IDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAYYKSGILHPSRSRCP---PSGIDHGV 421
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
L+ GYG PYW IKNSWG+ WGE+GY+++ G++VCGV +VS+
Sbjct: 422 LITGYGVENGL-------PYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSS 468
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 184/320 (57%), Gaps = 35/320 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +NK+YA+ E R IF NL A + Q+LD SA +G+T+FSDLT EFR
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213
Query: 119 TYLGLR------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+YL R LR P A + P PA +DWR+ GAV VK+QG+CGSCW+FS
Sbjct: 214 SYLNPLLSSLPGRALR-PGPATRGPA------PASWDWRDHGAVTGVKNQGACGSCWAFS 266
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L++LSEQ+LVDCD + D C GGL ++A+ K GG
Sbjct: 267 VTGNVEGQWFLRRGALLALSEQELVDCD---------TLDQACGGGLPSNAYTAIEKLGG 317
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E+DY Y G R C F K + + +S DE+++A L +NGP+++A+NA
Sbjct: 318 LETEKDYSYEG--RKERCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVSIALNAFA 375
Query: 293 MQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y GVS P+ +CS +DH VLLVGYG P+W IKNSWG WGE G
Sbjct: 376 MQFYRRGVSHPFRPLCSPWFIDHAVLLVGYG-------HRSGIPFWAIKNSWGPDWGEEG 428
Query: 350 YYKICRGRNVCGVDSMVSTV 369
YY + RG CGV++M S+
Sbjct: 429 YYYLYRGARACGVNAMASSA 448
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 184/316 (58%), Gaps = 29/316 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 161 FKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEFRT 220
Query: 119 TYLG-LRRKLRLPKDADQAPILPTNDLPA--DFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL L ++LR + + P+ + PA ++DWR KGAV VKDQG CGSCW+FS TG
Sbjct: 221 IYLNPLLKELR----SKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMCGSCWAFSVTG 276
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+EG FL G L+SLSEQ+LVDCD D C GGL ++A+ GGL
Sbjct: 277 NVEGQWFLKRGDLLSLSEQELVDCDK---------LDKACLGGLPSNAYSAIKTLGGLET 327
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E+DY Y G C F K + + +S +E ++AA L KNGP+++AINA MQ
Sbjct: 328 EDDYGYNG--HLQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPISIAINAFGMQF 385
Query: 296 YIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY
Sbjct: 386 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGTDWGEEGYYY 438
Query: 353 ICRGRNVCGVDSMVST 368
+ RG CGV+ M S+
Sbjct: 439 LHRGSGACGVNIMASS 454
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 149/377 (39%), Positives = 201/377 (53%), Gaps = 34/377 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M +KT++ +V +V+F+A ++ + D IR V+DG E+ E + + +LG H
Sbjct: 1 MSAKTILSSVVLVVLFAASAAANIGFDESNPIRMVSDGLREV----EESVSQILGQSRHV 56
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y + EE RF+IFK NL K S G+ QF+DLT EF+
Sbjct: 57 LSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQ 116
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
RT LG + + LP DWRE G V PVKDQG CGSCW+FSTTGAL
Sbjct: 117 RTKLGAAQNCSATLKGSHK--VTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGAL 174
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL E+
Sbjct: 175 EAAYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEK 227
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSL---DEDQIAANLVKNGPLAVAINAVY-M 293
YPYTG D CKF + V N ++L DE + A LV+ P+++A ++
Sbjct: 228 AYPYTGKDE--TCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSF 283
Query: 294 QTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C ++H VL VGYG PYW+IKNSWG WG+ GY
Sbjct: 284 RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGV-------PYWLIKNSWGADWGDKGY 336
Query: 351 YKICRGRNVCGVDSMVS 367
+K+ G+N+CG+ + S
Sbjct: 337 FKMEMGKNMCGIATCAS 353
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 178/316 (56%), Gaps = 21/316 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD D GC GGLM+ AF++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G AC + A + + + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q+Y GGV I S LDHGVLLVGY + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVST 368
I +G N C + ++ S+
Sbjct: 321 IEKGTNQCLMKNLPSS 336
>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax Y486]
Length = 389
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 177/318 (55%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQG+CGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GG M++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPYTGTDRGHA-CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
+ YPY D C ++ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTGKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V + S+
Sbjct: 312 IRIEKGTNQCLVAQLASS 329
>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment, partial [Trypanosoma vivax Y486]
Length = 323
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 174/312 (55%), Gaps = 25/312 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F+ FK+K+ ++Y + E R +F+ N+RR+ + +P AT G+T FSDLTP EF
Sbjct: 31 EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90
Query: 117 RRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R Y R + + + +P PA DWR KGAV PVKDQG CGSCWSFS G
Sbjct: 91 RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIG 150
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG A L SLSEQ LV CD + D+GC GG M++AFE+ +K +G +
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 201
Query: 234 MREEDYPYTGTDRGHA-CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY D C ++ A++ + DED IA L NGP+AVA++A
Sbjct: 202 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATT 261
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV SC S L+HGVLLVGY + + PYWIIKNSW SWGE GY
Sbjct: 262 FMSYSGGVVTSC---TSEALNHGVLLVGYNDS-------SKPPYWIIKNSWSSSWGEKGY 311
Query: 351 YKICRGRNVCGV 362
+I +G N C V
Sbjct: 312 IRIEKGTNQCLV 323
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 179/317 (56%), Gaps = 23/317 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG--GL 233
+EG + L SLSEQ LV CD E D GC GGLM++AF++ + + +
Sbjct: 158 NIEGQWKVTGHNLTSLSEQMLVSCDTE---------DLGCAGGLMDNAFKWIVSSNRHNV 208
Query: 234 MREEDYPYTGTDRGHA--CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
EE YPY + G+ C+ + A + + + DE+ IA L KNGP+A+A+++
Sbjct: 209 FTEESYPY-ASKGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVAIAVDST 267
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q+Y GGV I S++LDHGVLLVGY + PYWIIKNSW + WGE GY
Sbjct: 268 SFQSYTGGVLTSCI-SKQLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYI 319
Query: 352 KICRGRNVCGVDSMVST 368
+I +G N C V + ++
Sbjct: 320 RIEKGTNQCLVKNYATS 336
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/366 (38%), Positives = 207/366 (56%), Gaps = 30/366 (8%)
Query: 15 VFSAVSSGTLI-------DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKF 67
V + + +GTLI D+ D+ + + ++L S + + F F K F
Sbjct: 145 VLNELKNGTLITITNTSEDNFDRKL-MLAYNSVKLLKFIRSQSEEERTLWMQFKEFLKTF 203
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRK 126
K Y S++E R+ IFK N++ QK + +A +G+T F+DLTP EFR+ YL + K
Sbjct: 204 KKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFRKFYLSPQWK 263
Query: 127 L-RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+LP+ + +P + +DWRE AV VK+QG CGSCW+F+T +EG +
Sbjct: 264 RDQLPQ---RKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIANVEGVWAVKK 320
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G+LVSLSEQ+LVDCD + D GC+GG ++A++ ++ GGL E +Y Y G +
Sbjct: 321 GELVSLSEQELVDCD---------TLDQGCSGGYPSNAYKEIIRLGGLTTETNYSYDG-N 370
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP-- 303
+G C+F + + + DE +IAA + +NGP+AV INA M Y G++ P
Sbjct: 371 QG-TCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYRHGIAHPWR 429
Query: 304 YICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
++CS LDHGV +VGY + K KPYWIIKNSWG WGE GYY + RG VCGV
Sbjct: 430 FLCSPDALDHGVAIVGYDVEKQSK---KPKPYWIIKNSWGTHWGEGGYYMLYRGAGVCGV 486
Query: 363 DSMVST 368
+ MV++
Sbjct: 487 NKMVTS 492
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 143/363 (39%), Positives = 197/363 (54%), Gaps = 38/363 (10%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAE--H 58
MGS V + L+++++ + ++ I D+ HH + N+ AE
Sbjct: 1 MGSVKVTILLLAMMIGVSYAADMSIISYDE-------------KHHITAENERSDAEVAR 47
Query: 59 HFSLFKKKFNKAYASQ----EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPA 114
+ + +K K S EE D RF IFK NLR H + S G+T+F+DLT
Sbjct: 48 IYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNE 107
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
E+R YLG + K R+ K +D+ + +P DWR++GAV VKDQGSCGSCW+FST
Sbjct: 108 EYRSIYLGAKSKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTI 167
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GA+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+
Sbjct: 168 GAVEGINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGID 219
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
EEDYPY D G + K+ ++ + V + + + N P++VAI A
Sbjct: 220 TEEDYPYKAAD-GRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRA 278
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GV IC LDHGV+ VGYG+ K YWI++NSWG SWGE+GY K
Sbjct: 279 FQLYSSGVF-DGICGTELDHGVVAVGYGTE-------NGKDYWIVRNSWGGSWGESGYIK 330
Query: 353 ICR 355
+ R
Sbjct: 331 MAR 333
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 178/318 (55%), Gaps = 25/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CDP E C GG M++AF + + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQTLVS----CDPTE-----YACEGGFMDNAFRWIISSNKGKV 208
Query: 234 MREEDYPYTGTDRG-HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY+ R AC + A+++++ + DE+ IA L KNGP++V ++A
Sbjct: 209 FTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDENAIAEWLAKNGPVSVIVDATS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q+Y GGV SC S+ L+H VLLVGY + PYWIIKNSW E WGE GY
Sbjct: 269 FQSYTGGVLTSC---LSKILNHAVLLVGYDDTS-------KPPYWIIKNSWSEKWGEKGY 318
Query: 351 YKICRGRNVCGVDSMVST 368
+I +G N C V S+
Sbjct: 319 IRIEKGTNQCLVQEYASS 336
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 181/317 (57%), Gaps = 19/317 (5%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTP 113
+ AE F F ++ K+YAS EE R IF+ NL R + A +G+ +F+DLTP
Sbjct: 26 MTAEQQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGARYGVNKFADLTP 84
Query: 114 AEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ TYL R K A A + T LP+ FDWR+KGAV P KDQG CG W+FS
Sbjct: 85 KEFKATYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG--WAFSV 142
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
T A+E FL+ KLVSL+ QQ+VDCD G+ D GC+GG +A+EY +KAGGL
Sbjct: 143 TEAIESQWFLSGRKLVSLAPQQIVDCDQ-------GNGDYGCDGGDPPTAYEYVIKAGGL 195
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL--DEDQIAANLVKNGPLAVAINAV 291
EE YPYT D C F S + A ++N++ ++ +E ++ L GPL++ ++A
Sbjct: 196 DTEESYPYTAED--GQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLSICVDAS 253
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYG-SAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q YIGGV +C LDH V++ GY G+ ++ W I+NSWGE WG GY
Sbjct: 254 SWQYYIGGVITS-LCEDSLDHCVMITGYSVQEGWDFMKYD---VWNIRNSWGEDWGYGGY 309
Query: 351 YKICRGRNVCGVDSMVS 367
+ RG N+CGV V+
Sbjct: 310 LYVQRGSNLCGVGDEVT 326
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 177/321 (55%), Gaps = 28/321 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K Y S ++ + RF IFK NL RA R Q ++ +A +G+TQFSDLT
Sbjct: 28 ARALYEEFKLKYKKTY-SNDDDELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL ++R + P D+ D FDWR+ GAVGPV DQG CGSCW+F
Sbjct: 87 EFKTRYL----RMRFDEPIVNEDPTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWAF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F TG L+ LSEQQL+DCDH D GC+GG + + G
Sbjct: 143 SVIGNVEGQWFRKTGDLLGLSEQQLIDCDHS---------DQGCDGGYPPQTYSAIEEMG 193
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPYTG D C D+SK A V + + E A +L + GPL+ +NAV
Sbjct: 194 GLELRSDYPYTGKD--GICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAV 251
Query: 292 YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y G+ P C+ L+H VL VGYG PYWI+KNSWG+ +GE GY
Sbjct: 252 LLQLYKRGIMRPRWCNPAELNHAVLTVGYGME-------HRMPYWIVKNSWGKRFGEKGY 304
Query: 351 YKICRGRNVCGVDSMVSTVAA 371
++I RG CG++ V+T
Sbjct: 305 FRIYRGDGTCGINRAVTTAVV 325
>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
Length = 383
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 172/316 (54%), Gaps = 21/316 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ANL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTAFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYW+IKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSA-------AVPYWVIKNSWTTQWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVSTVA 370
+G N C V S+ A
Sbjct: 320 KGSNQCLVKEEASSAA 335
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 185/322 (57%), Gaps = 29/322 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLT 112
E + FK +K Y + EE RF IF+ N+++ H KL S G+ QFSDL
Sbjct: 53 EQAWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLK 112
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EF + Y GL++ KD + L N+L P DWR+KG V VK+QG CGSCWS
Sbjct: 113 HEEFVK-YNGLKKTSL--KDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWS 169
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTG+LEG +F +GKLVSLSE QLVDC E GCNGGLM++AF+Y
Sbjct: 170 FSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNE-------GCNGGLMDNAFKYIKSV 222
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAAS-VANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EEDYPY + CKFD +K+AA+ V S E + + + GP++VAI+
Sbjct: 223 GGLESEEDYPY--KPKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAID 280
Query: 290 AVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q+Y GGV P S +LDHGVL VGYG+ + YWI+KNSWG WG
Sbjct: 281 ASHSSFQSYAGGVYDEPECSSEQLDHGVLCVGYGTDDQG------QDYWIVKNSWGAEWG 334
Query: 347 ENGYYKICRG-RNVCGVDSMVS 367
E+GY K+ R +N CG+ + S
Sbjct: 335 EDGYVKMSRNKKNQCGIATQAS 356
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 178/314 (56%), Gaps = 23/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +NK+YA+ E R IF NL A + Q+LD SA +G+T+FSDLT EFR
Sbjct: 270 FKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTEEEFRM 329
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YL LP A + PA +DWR+ GA+ K+QG CGSCW+FS TG +E
Sbjct: 330 FYLNPLLS-SLPGRALRPAPRARGPAPASWDWRDHGALTAAKNQGMCGSCWAFSVTGNVE 388
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G FL G L++LSEQ+LVDCD + D C GGL ++A+ GGL E+D
Sbjct: 389 GQWFLRRGALLTLSEQELVDCD---------TLDQACGGGLPSNAYTAIETLGGLETEKD 439
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
Y Y G R C F K A + + +S DE +IAA L +NGP+++A+NA MQ Y
Sbjct: 440 YSYEG--RKERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVSIALNAFAMQFYRR 497
Query: 299 GVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GVS P+ +CS +DH VLLVGYG P+W IKNSWG WGE GYY + R
Sbjct: 498 GVSHPFRPLCSPWFIDHAVLLVGYGDR-------SGIPFWAIKNSWGPDWGEEGYYYLYR 550
Query: 356 GRNVCGVDSMVSTV 369
G CG+++M S+
Sbjct: 551 GARACGMNTMASSA 564
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 182/320 (56%), Gaps = 32/320 (10%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+ + K YA++++ RF IFK NL RA + Q D +A +G+TQFSDLTP
Sbjct: 23 ARELYEQFKRDYGKVYANEDDQ-KRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPE 81
Query: 115 EFRRTYLGLRRKLRLPKDADQAP-ILPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF YL P + DQ + PT P DWR KGAV V++QGSCGSCW+F
Sbjct: 82 EFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 135
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST G +EG F+ TG+LVSLS+QQLVDCD D GCNGG S++ + G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHMG 186
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL ++DYPY G C +K ++ A + + + ED AA L ++GPL+ +NA+
Sbjct: 187 GLESQDDYPYAGVK--EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAI 244
Query: 292 YMQTYIGGVSCPYI--CSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Q Y G+ P CS L+H VL VGY G + PYWIIKNSW WGE
Sbjct: 245 TLQYYQSGIIHPSYEECSPVDLNHAVLTVGYDKEG-------DMPYWIIKNSWNVEWGEK 297
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY+++ RG CG++ M ++
Sbjct: 298 GYFRLYRGDGTCGINRMPTS 317
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 194/338 (57%), Gaps = 45/338 (13%)
Query: 24 LIDDVDQ----LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQ-EEHD 78
L+++ D+ ++R TDG +IL F +F +K+ + Y+S +E++
Sbjct: 145 LLNEFDKHTTNMVRPTTDGDVKIL----------------FDMFLEKYPRTYSSSSDEYN 188
Query: 79 HRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYL-GLRRKLRLPKDADQA 136
RF IFK N + +++ +A +GIT+F D++ E+ RT G R L +P +
Sbjct: 189 ERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYHRTLAPGFTRPL-VPIQTLNS 247
Query: 137 PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQL 196
L T ++P DWR+ GAV VK+QGSCGSCW+FSTTG +EG FL KL+SLSEQ+L
Sbjct: 248 AELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQEL 307
Query: 197 VDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSK 256
VDCD + DSGC GGL ++A++ K GGL E+DYPY G G C +S
Sbjct: 308 VDCD---------TLDSGCGGGLPSNAYKSIEKLGGLEPEKDYPYVG--EGEKCAIKQSD 356
Query: 257 IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICS-RRLDHG 313
V N + DE ++AA L +NGP+++ INA MQ Y GG+S P+ C+ + LDHG
Sbjct: 357 FKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLMQFYWGGISHPWKIFCNPKSLDHG 416
Query: 314 VLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
VL+VGYG+ P+WIIKNSWG WGE Y
Sbjct: 417 VLIVGYGTE-------NGTPFWIIKNSWGPDWGEEEEY 447
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 71/112 (63%), Gaps = 11/112 (9%)
Query: 115 EFRRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
E+ RT G R L +P + L T ++P DWR+ GAV VK+QGSCGSCW+FST
Sbjct: 446 EYHRTLAPGFTRPL-VPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFST 504
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
TG +EG FL KL+SLSEQ+LVDCD + DSGC GGL ++A++
Sbjct: 505 TGNVEGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYK 547
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 318 GYGSAGYAPIRLKEK--PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
G S Y I E P+WIIKNSWG WGE GYY+I RG CG+++M ++
Sbjct: 540 GLPSNAYKSIEKLENGTPFWIIKNSWGPDWGEEGYYRIYRGDGSCGLNNMATS 592
>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 344
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 177/322 (54%), Gaps = 31/322 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK KFNK Y ++ EH F +K + +HQ +P+A G T+FSD++P EF
Sbjct: 33 FEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENK 92
Query: 120 YL-------------GLRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
L G++ K K Q + +DLP FDWR+KG + P K Q +C
Sbjct: 93 MLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTC 152
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCW+F+TTG +E L G+L+ SEQ L+DCD + + GC GGLM A++
Sbjct: 153 GSCWTFATTGVIESQYALKYGELLHFSEQMLLDCD---------NINQGCRGGLMTDAYQ 203
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
+ ++GG+ + Y ++ C FDK+K+ A V ++ + +E+ I LVKNGP+A
Sbjct: 204 FLQQSGGIQTADTYG-DYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVA 262
Query: 286 VAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
V INA +Q Y GG+ P C +++H VL+VGYG + PYW+IKN WG W
Sbjct: 263 VGINARTLQFYEGGIVDPKNCDDKINHAVLIVGYGVE-------EGIPYWLIKNQWGAEW 315
Query: 346 GENGYYKICRGRNVCGVDSMVS 367
G G++K+ RG+ CG+ + S
Sbjct: 316 GIKGFFKLIRGKKQCGIHTYAS 337
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/340 (38%), Positives = 187/340 (55%), Gaps = 32/340 (9%)
Query: 40 DEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP 99
D ++ E N + F F KKF + Y+S EE RF I+ N+ A + Q +
Sbjct: 139 DLAMNSQEWQNEEKKTLWSDFMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEK 198
Query: 100 -SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKG 154
+A +G T+FSD+T EF++ L R+ + + L +LP+ FDWR +G
Sbjct: 199 GTAIYGATKFSDMTAEEFQKIMLPSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEG 258
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
V PVKDQGSCGSCW+FS TG +E + TGKL+SLSEQ+L+DCD D G
Sbjct: 259 VVTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKLISLSEQELIDCD---------VIDKG 309
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
CNGGL +AF + GGL E+ YPY + C +++IA S+ + + +E +
Sbjct: 310 CNGGLPINAFREIKRMGGLEPEDQYPYEA--KNGTCHLVRAQIAVSIDDAVEIPRNETVM 367
Query: 275 AANLVKNGPLAVAINAVYMQTYIGGV------SCPYICSRRLDHGVLLVGYGSAGYAPIR 328
A + + GPL+V I+A + Y G+ CP +++HGVL+ GYG
Sbjct: 368 KAWIAQRGPLSVGIDAELLSYYKSGILHPSKSRCP---PSKINHGVLITGYGIEN----- 419
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
PYW IKNSWGE WGENGY+++ RG+N+CGV +VS+
Sbjct: 420 --NLPYWTIKNSWGEQWGENGYFQLMRGKNICGVSDLVSS 457
>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
Length = 274
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 129/284 (45%), Positives = 174/284 (61%), Gaps = 22/284 (7%)
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
RR ++ D AT+G + F+DLT EFR+ YL + A I P P F
Sbjct: 5 RRIQEKEQGD--ATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASI-PIETPPDAF 61
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR+ AV PVK+QGSCGSCW+FS TG +EG + KL+SLSEQ+LVDCD
Sbjct: 62 DWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDK------- 114
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
D GCNGGL A++ ++ GGL E+DYPY G +G C F+K+++ ++ +S
Sbjct: 115 --VDLGCNGGLPLQAYKEIMRIGGLETEKDYPYEG--KGDKCVFEKAEVEVNITGAVNIS 170
Query: 269 LDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP--YICS-RRLDHGVLLVGYG-SAGY 324
+ED + A L KNGP+++ +NA MQ Y+GGVS P ++CS LDHGVL+ GYG G+
Sbjct: 171 SNEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCSPSSLDHGVLITGYGIKQGW 230
Query: 325 APIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
+ + P+W IKNSWGESWGE GYY + RG VCGV+ M ++
Sbjct: 231 ----MSDSPFWAIKNSWGESWGEKGYYLLYRGAGVCGVNQMPTS 270
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 129/332 (38%), Positives = 184/332 (55%), Gaps = 32/332 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
D+ A+H+F F +NK YA + ++RF IF NL KL+ SA + I +FSDL
Sbjct: 24 DIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNLEYINEKNKLNDSAIYNINKFSDL 83
Query: 112 TPAEFRRTYLGL--RRKLRLPKDADQ--------APILPTNDLPADFDWREKGAVGPVKD 161
+ E Y GL R+ + K AP ++LP +FDWR + VKD
Sbjct: 84 SKNELLTKYTGLTSRKPSNMVKSTSNFCNVIHLDAPPDARDELPQNFDWRVNNKMTSVKD 143
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG+CGSCW+ + G LE + L++LSEQQL+DCD S + C+GGLM+
Sbjct: 144 QGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD---------SANMACDGGLMH 194
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVK 280
+AFE + AGGLM E DYPY GT +G CK D K A SV++ + +E+ + L+
Sbjct: 195 TAFEQLMNAGGLMEEIDYPYQGT-KG-ICKIDNKKFALSVSSCKRYIFQNEENLKKELIT 252
Query: 281 NGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
GP+A+AI+A + TY G+ + C L+H VLLVGYG+ G YW +KN
Sbjct: 253 TGPIAMAIDAASISTYSKGI--IHFCENLGLNHAVLLVGYGTEGGV-------SYWTLKN 303
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
SWG WGE+GY+++ R N CG+++ ++ A
Sbjct: 304 SWGSDWGEDGYFRVKRNINACGLNNQLAASAT 335
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 192/353 (54%), Gaps = 34/353 (9%)
Query: 5 TVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFK 64
TV+LFL +VV SA+ + D + HH ++ + + +
Sbjct: 2 TVILFLAMIVVSSAMDMSIISYDKN---------------HHTVSSRSDVEVSRLYEEWV 46
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
K KA S E D RF IFK NLR H + S G+T+F+DLT E+R YLG R
Sbjct: 47 VKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR 106
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
K + K + + + +P DWR++GAV VKDQGSCGSCW+FST GA+EG N +
Sbjct: 107 LKRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIV 166
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ EEDYPY G
Sbjct: 167 TGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGV 218
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIGGVSC 302
D G + K+ ++ ++ V + ++ + + P++VAI Q Y G+
Sbjct: 219 D-GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIF- 276
Query: 303 PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
IC LDHGV+ VGYG+ K YWI+KNSWG SWGE+GY ++ R
Sbjct: 277 DGICGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIRMER 322
>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
Length = 265
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 168/280 (60%), Gaps = 26/280 (9%)
Query: 103 HGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD-------QAPILPTNDLPADFDWREKGA 155
+GIT F+D+T AE+R+ R L +P+D D +A I +LP FDWRE GA
Sbjct: 2 YGITHFADMTSAEYRQ-----RTGLVIPRDEDRNHVGNPKAEIDENMELPESFDWRELGA 56
Query: 156 VGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGC 215
V PVK+QG+CGSCW+FS G +EG + + T L SEQ+L+DCD + DS C
Sbjct: 57 VSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AVDSAC 107
Query: 216 NGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIA 275
GG M+ A++ K GGL E +YPY + C F+ +++ V + +E +A
Sbjct: 108 QGGYMDDAYKAIEKIGGLELESEYPYLAKKQ-KTCHFNSTEVHVRVKGAVDLPKNETAMA 166
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEK 332
LV NGP+++ +NA MQ Y GG+S P+ +CS++ LDHGVL+VGYG Y P+ K
Sbjct: 167 QYLVANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEY-PMFNKTM 225
Query: 333 PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
PYWI+KNSWG WGE GYY+I RG N CGV M S+ A
Sbjct: 226 PYWIVKNSWGPKWGEQGYYRIFRGDNTCGVSEMASSAVLA 265
>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
Length = 266
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 123/272 (45%), Positives = 163/272 (59%), Gaps = 15/272 (5%)
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
+A +G T FSD + AE++ G LR + +P DLP +FDWR V PV
Sbjct: 3 TAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNARLRQAAIPEIDLPDEFDWRNHSVVTPV 62
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQGSCGSCW+FS TG +EG + G L+SLSEQ+LVDCD DSGCNGGL
Sbjct: 63 KDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCD---------KLDSGCNGGL 113
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
+A++ GGL E DYPY G + + CKF+ + V +S +E ++A L+
Sbjct: 114 PENAYKAIHDIGGLETESDYPYNGHE--NKCKFNSNITRVQVTGGVEISTNETEMAQWLI 171
Query: 280 KNGPLAVAINAVYMQTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWI 336
+NGP+++ INA MQ Y GGVS P+ R +DHGVL+VGYG + Y P K PYWI
Sbjct: 172 QNGPISIGINANAMQYYRGGVSHPWKVLCRPGGIDHGVLIVGYGVSQY-PKFNKTLPYWI 230
Query: 337 IKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
+KNSWG WGE GYY++ RG CG++ M ++
Sbjct: 231 VKNSWGTRWGEQGYYRVFRGDGTCGLNQMCTS 262
>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
Length = 467
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 173/314 (55%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR- 117
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYKSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R + G + A + +PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN AFE+ ++ G +
Sbjct: 157 ESQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNDAFEWIVQENDGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
EE YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EESYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVAVAVDATSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + AP+ PYWIIKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDS--APV-----PYWIIKNSWTTLWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 174/310 (56%), Gaps = 25/310 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK ++ RA +P AT G+TQFSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TYL G + K + + T P DWR+KGAV PVKDQG CGSCW+FS G
Sbjct: 98 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD+ D GC GG ++ A ++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDN---------MDYGCRGGFLDRALKWIVSSNKGNV 208
Query: 234 MREEDYPYTGTDRG-HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPY TD C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Y GGV SC S L+HGVLLVGY + + PYWIIKNSWG+ WGE GY
Sbjct: 269 FLDYTGGVLTSCS---SDALNHGVLLVGYDDS-------SKPPYWIIKNSWGKKWGEEGY 318
Query: 351 YKICRGRNVC 360
++ +G N C
Sbjct: 319 IRVEKGTNQC 328
>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
Full=Major cysteine proteinase; Flags: Precursor
gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
Length = 467
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTAFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYW+IKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTQWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTAQWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 182/314 (57%), Gaps = 35/314 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAE 115
F FK KFNK Y S EE RF++F N+ RH H + QF+DLT E
Sbjct: 30 FDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNEE 89
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
+R+ YL L ++ + + N DWR+KGAV P+K+QG CGSCWSFSTTG
Sbjct: 90 YRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTG 147
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
++EGA+ +ATG LVSLSEQQLVDC + GCNGGLM++AF+Y + GGL
Sbjct: 148 SVEGAHAIATGNLVSLSEQQLVDCSGSFG-------NQGCNGGLMDNAFKYIISNGGLDT 200
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA--VY 292
E+DYPYT D G K +SK A S++ + V +EDQ+AA V+ GP++VAI A
Sbjct: 201 EQDYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVSVAIEADQQS 258
Query: 293 MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV S P C LDHGVL+VGY S YWI+KNSWG SWG+ GY
Sbjct: 259 FQMYSSGVFSGP--CGTNLDHGVLVVGYTS-----------DYWIVKNSWGASWGDQGYI 305
Query: 352 KICRGRN---VCGV 362
+ RG + +CG+
Sbjct: 306 MMKRGVSSAGICGI 319
>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 426
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 171/316 (54%), Gaps = 21/316 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSA-------AVPYWIIKNSWTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVSTVA 370
+G N C V S+ A
Sbjct: 320 KGLNQCLVKEEASSAA 335
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 187/346 (54%), Gaps = 47/346 (13%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAAR-HQKLDPSATHGITQFSDLTPAEFRR 118
F+ + KFN+ Y+S E +R++IFK+N+ + K D G+ F+D+T E+R+
Sbjct: 36 FTEWTLKFNRQYSSSE-FSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
TYLG R D +L DL P DWR K AV P+KDQG CGSCWSFSTTG
Sbjct: 95 TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+ EGA+ L T KLVSLSEQ LVDC PEE + GC+GGLMN+AF+Y +K G+
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDC---SGPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
E YPYT + G C F+KS I A++ + ++ + N ++GP++VAI+A +
Sbjct: 208 ESSYPYTA-ETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSF 266
Query: 294 QTYIGGVSCPYICS-RRLDHGVLLVGYGSAG----------------------------- 323
Q Y G+ CS LDHGVL+VGYG G
Sbjct: 267 QLYTSGIYYEPKCSPTELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDD 326
Query: 324 -YAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+R K YWI+KNSWG SWG GY + + R N CG+ S+ S
Sbjct: 327 SSDSVRPKANNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372
>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
Length = 500
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 70 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 129
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 130 RYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 189
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 190 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 240
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 241 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 300
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 301 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEEGYIRIA 352
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 353 KGLNQCLVKEEASS 366
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 196/371 (52%), Gaps = 34/371 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M +KTV+ +V +++ +A ++ + D IR V+DG E+ E T + +LG H
Sbjct: 1 MSAKTVLSSVVLVILIAASAAADIGFDELNPIRMVSDGLREV----EETVSQILGQSRHV 56
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y + EE RF+IFK NL K S G+ QF+DLT EF+
Sbjct: 57 LTFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQ 116
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
RT LG + L LP DWRE G V PVKDQG CGSCW+FSTTGAL
Sbjct: 117 RTKLGAAQNCSATLKGSHK--LTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGAL 174
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE
Sbjct: 175 EAAYHQAFGKGISLSEQQLVDCAGAYN-------NYGCNGGLPSQAFEYIKSNGGLDTEE 227
Query: 238 DYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-M 293
YPY G D CKF + V N ++ + DE + A LV+ P+++A ++
Sbjct: 228 AYPYIGKD--GTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSF 283
Query: 294 QTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C ++H VL VGYG PYW+IKNSWG WG+ GY
Sbjct: 284 RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVE-------DGVPYWLIKNSWGADWGDKGY 336
Query: 351 YKICRGRNVCG 361
+K+ G+N+CG
Sbjct: 337 FKMEMGKNMCG 347
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 173/310 (55%), Gaps = 25/310 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK ++ RA +P AT G+TQFSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TYL G + K + + T P DWR+KGAV PVKDQ CGSCW+FS G
Sbjct: 98 RATYLNGAKYYAAALKRPRKVVTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD+ D GC GGLM+ A ++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDN---------MDDGCQGGLMDRALKWIVSSNKGNV 208
Query: 234 MREEDYPYTGTDRG-HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPY TD C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Y GGV SC S L+H VLLVGY + + PYWIIKNSWG+ WGE GY
Sbjct: 269 FLDYTGGVLTSCS---SDALNHDVLLVGYDDSS-------KPPYWIIKNSWGKKWGEEGY 318
Query: 351 YKICRGRNVC 360
++ +G N C
Sbjct: 319 IRVEKGTNQC 328
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 139/357 (38%), Positives = 192/357 (53%), Gaps = 34/357 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
+ S TV+LFL +VV SA+ + D + HH ++ +
Sbjct: 4 LNSATVILFLTMIVVSSAMDMSIISYDKN---------------HHTVSSRSDAEVSRLY 48
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
+ K KA S E D RF IFK NLR H + S G+T+F+DLT E+R Y
Sbjct: 49 EEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMY 108
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
LG R K + K + + + + +P DWR++GAV VKDQGSCGSCW+FST GA+EG
Sbjct: 109 LGSRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGI 168
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ EEDYP
Sbjct: 169 NKIVTGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYP 220
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIG 298
Y G D G + K+ ++ + V + ++ + + P++VAI Q Y
Sbjct: 221 YKGVD-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDS 279
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ IC LDHGV+ VGYG+ K YWI+KNSWG SWGE+GY ++ R
Sbjct: 280 GIF-DGICGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIRMER 328
>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 173/313 (55%), Gaps = 21/313 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+EG +A +L SLSEQ LV CD + D GC GL + AF++ L G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCELGLKDPAFQWILWSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A ++N + LDED IA L + GP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV I SRRL++G LLVGY + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SRRLNYGALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSM 365
I +G N C V ++
Sbjct: 321 IEKGTNQCLVKNL 333
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 133/322 (41%), Positives = 183/322 (56%), Gaps = 34/322 (10%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLT 112
L ++ F + K K+Y+S E R IF L +H + + + T G+ +FSDLT
Sbjct: 35 LEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLT 94
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
AEFR ++G K + P+ D+ P + + LP DWR+KGAV P+KDQG CGSC
Sbjct: 95 NAEFRAMHVG---KFKRPRYQDRLPAEDEDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSC 151
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS ++E A+FLAT +LVSLSEQQL+DCD + D+GC+GGLM +AF++ +
Sbjct: 152 WAFSAIASIESAHFLATKELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVV 202
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKI---AASVANFSVVSLDEDQIAANLVKNGPLA 285
K GG+ E YPYTG+ +C +K I A + F VV+ D V P+
Sbjct: 203 KNGGVTTEASYPYTGS--VGSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVT 260
Query: 286 VAI--NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
V+I + Q Y G+ C LDHGVLL+GYG+ G PYWIIKNSWG
Sbjct: 261 VSICGSDENFQNYKSGILSGQ-CGDSLDHGVLLIGYGTEG-------GMPYWIIKNSWGT 312
Query: 344 SWGENGYYKICR--GRNVCGVD 363
SWGE+G+ KI R G +CG++
Sbjct: 313 SWGEDGFMKIERKDGDGICGMN 334
>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 169/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EGSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYW+IKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTQWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|559532|emb|CAA57675.1| cysteine proteinase [Zea mays]
Length = 145
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 109/139 (78%), Positives = 123/139 (88%), Gaps = 5/139 (3%)
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E+DYPYTG+D CKFDKSKI ASV NFSVVS+DE QI+AN +K+GPLA+ INA YMQT
Sbjct: 3 EKDYPYTGSDG--KCKFDKSKIVASVQNFSVVSVDEAQISANRIKHGPLAIGINAAYMQT 60
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
YIGGVSCPYIC R LDHGVLLVGYG++G+AP+RLK+KPYWIIKNSWGE+WGENGYYKICR
Sbjct: 61 YIGGVSCPYICGRHLDHGVLLVGYGASGFAPMRLKDKPYWIIKNSWGENWGENGYYKICR 120
Query: 356 G---RNVCGVDSMVSTVAA 371
G RN CGVDSMVSTV+A
Sbjct: 121 GSNVRNKCGVDSMVSTVSA 139
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 184/322 (57%), Gaps = 27/322 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F FK + YAS +E RF IF AN+++AA + +P AT G +F+D++ EF
Sbjct: 22 EVLFRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEF 81
Query: 117 RRTYLGLRR----KLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSF 171
+ + R R PK+ N + DWR KGAV PVK+QGSCGSCWSF
Sbjct: 82 QTRHNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSF 141
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA- 230
STTG +EG + +ATG+LVSLSEQ+LV CD + D GC+GGLM++AF + L A
Sbjct: 142 STTGNIEGQHAIATGQLVSLSEQELVSCD---------TVDDGCSGGLMDNAFGWLLSAH 192
Query: 231 -GGLMREEDYPY-TGTDRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAV 286
G + E YPY +G AC F+ + + A++ +F + E +AA + K GPL++
Sbjct: 193 NGQITTEASYPYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSI 252
Query: 287 AINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
++A Q+YIGG+ + ++DHGVL+VG+ PYWIIKNSW WG
Sbjct: 253 GVDASSWQSYIGGI-LSHCSDVQIDHGVLIVGFDDTA-------STPYWIIKNSWSSMWG 304
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GY ++ +G N CG+ S S+
Sbjct: 305 EQGYIRVAKGSNQCGLTSFPSS 326
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 127/332 (38%), Positives = 184/332 (55%), Gaps = 32/332 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
D+ A+H+F F +NK Y + ++RF IFK NL KL+ SA + I +FSDL
Sbjct: 24 DIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNKLNDSAIYNINKFSDL 83
Query: 112 TPAEFRRTYLGL--RRKLRLPKDADQ--------APILPTNDLPADFDWREKGAVGPVKD 161
+ E Y GL ++ + + AP ++LP +FDWR + VKD
Sbjct: 84 SKNELLTKYTGLTSKKPSNMVRSTSNFCNVIHLDAPPDVHDELPQNFDWRVNNKMTSVKD 143
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG+CGSCW+ + G LE + L++LSEQQL+DCD S + C+GGLM+
Sbjct: 144 QGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD---------SANMACDGGLMH 194
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVK 280
+AFE + AGGLM E DYPY GT +G CK D K A SV++ + +E+ + L+
Sbjct: 195 TAFEQLMNAGGLMEEIDYPYQGT-KG-VCKIDNKKFALSVSSCKRYIFQNEENLKKELIT 252
Query: 281 NGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
GP+A+AI+A + TY G+ + C L+H VLLVGYG+ G YW +KN
Sbjct: 253 MGPIAMAIDAASISTYSKGI--IHFCENLGLNHAVLLVGYGTEGGV-------SYWTLKN 303
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVSTVAA 371
SWG WGE+GY+++ R N CG+++ ++ A
Sbjct: 304 SWGSDWGEDGYFRVKRNINACGLNNQLAASAT 335
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 180/317 (56%), Gaps = 23/317 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PV+D+ C S W+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ L+ CD D GC GGLM+ AF++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLLSCDTRED---------GCGGGLMDRAFQWIVSSNKGNV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY TD G + +KS + A ++++ + DE+ IA L KNGP+A+A+ A
Sbjct: 209 FTEQSYPYASTD-GDVPRCNKSGKVVGAKISDYVDLPQDENAIAEWLAKNGPVAIAVEAT 267
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Q Y GGV I S +LDHGVLLVGY + PYWIIKNSWG+ WGE GY
Sbjct: 268 SLQRYTGGVLTSCI-SEQLDHGVLLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYI 319
Query: 352 KICRGRNVCGVDSMVST 368
+I +G N C + + S+
Sbjct: 320 RIEKGTNQCLMKNYASS 336
>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
Length = 467
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 169/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AV ++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY ++
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEGGYIRVA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 190/353 (53%), Gaps = 34/353 (9%)
Query: 5 TVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFK 64
TV+LFL +VV SA+ + D + HH ++ + +
Sbjct: 2 TVILFLTMIVVSSAMDMSIISYDKN---------------HHTVSSRSDAEVSRLYEEWL 46
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
K KA S E D RF IFK NLR H + S G+T+F+DLT E+R YLG R
Sbjct: 47 VKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR 106
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
K + K + + + + +P DWR++GAV VKDQGSCGSCW+FST GA+EG N +
Sbjct: 107 LKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIV 166
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ EEDYPY G
Sbjct: 167 TGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGV 218
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIGGVSC 302
D G + K+ ++ + V + ++ + + P++VAI Q Y G+
Sbjct: 219 D-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIF- 276
Query: 303 PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
IC LDHGV+ VGYG+ K YWI+KNSWG SWGE+GY ++ R
Sbjct: 277 DGICGTDLDHGVVAVGYGTE-------NGKDYWIVKNSWGTSWGESGYIRMER 322
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 179/309 (57%), Gaps = 26/309 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTPAE 115
F FK K K Y +Q E RF IFK NLR +H L S GI +F+D+T E
Sbjct: 25 FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
FR +L L + P +L +P DWR KG V VKDQG+CGSCW+FS TG
Sbjct: 85 FR-AFLTLSSSKK-PHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+ E A + GKLVSLSEQQLVDC + ++GCNGG ++ F Y +K+ GL
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDCSTD--------INAGCNGGYLDETFTY-VKSKGLEA 193
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPY GTD +CK+ SK+ V+ S+ S DE+ + + GP++VAI+A Y+
Sbjct: 194 ESTYPYKGTD--GSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLS 251
Query: 295 TYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
+Y G+ CS L+HGVL+VGYG++ K YWI+KNSWG S+GE+GY+++
Sbjct: 252 SYESGIYEDDWCSPSELNHGVLVVGYGTS-------NGKKYWIVKNSWGGSFGESGYFRL 304
Query: 354 CRGRNVCGV 362
RG+N CGV
Sbjct: 305 LRGKNECGV 313
>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
Length = 467
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 169/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD D GC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 184/336 (54%), Gaps = 38/336 (11%)
Query: 52 DLLGAEHHFSL------FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSA 101
D++G + +F+L F + + Y EH+ RF IF N R ++H + S
Sbjct: 52 DVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSY 111
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKD 161
T GI +FSD T E +R R L +D + I P++ DWR KGAV PVK+
Sbjct: 112 TMGINEFSDKTDEELKRLRC-FRGSLNASRDGSKY-ITIAAPPPSEIDWRNKGAVTPVKN 169
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG+CGSCW+FS TGA+EG NFLATG LVSLSEQQLVDC E ++ CNGGLM+
Sbjct: 170 QGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG-------NNACNGGLMD 222
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHA---CKFDKSKIAASVANFSVVSLDEDQIA--- 275
+AF+Y + G+ E YPY + G A C+F+ + V + + L Q++
Sbjct: 223 NAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGY--IDLPRGQVSELK 280
Query: 276 ANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEK 332
+ GP++VAINA +Y GV CS LDHGVLLVGYG
Sbjct: 281 QAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEE-------NGI 333
Query: 333 PYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
PYW+IKNSWG WGENGY KI R N+CGV SM S
Sbjct: 334 PYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMAS 369
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/303 (42%), Positives = 173/303 (57%), Gaps = 24/303 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ + K KAY E + RF IFK NL+ H + + G+ +F+DLT E+R
Sbjct: 46 YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADLTNEEYRAI 105
Query: 120 YLGLRR--KLRLPKDADQAP---ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YLG R K R K + +P ++P LP DWRE GAV PVKDQ SCGSCW+FST
Sbjct: 106 YLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTV 165
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
A+EG N + TG+L+SLSEQ+LVDCD E D GCNGGLM+ AF++ +K GGL
Sbjct: 166 AAVEGINQIVTGELISLSEQELVDCDTE--------YDMGCNGGLMDYAFDFIIKNGGLD 217
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
E+DYPYTG D G KS S+ + V +++ V + P++VA+ A
Sbjct: 218 TEKDYPYTGFD-GECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRA 276
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Q Y+ G+ C LDHG++ VGYG+ YWI++NSWG SWGENGY +
Sbjct: 277 LQLYVSGIFTGE-CGTALDHGIVAVGYGTE-------NGTDYWIVRNSWGSSWGENGYIR 328
Query: 353 ICR 355
+ R
Sbjct: 329 MER 331
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 184/320 (57%), Gaps = 32/320 (10%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLT 112
L ++ F + K K+Y+S E R IF L +H + + + T G+ +FSDLT
Sbjct: 31 LEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLT 90
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
AEFR ++G K + P+ D+ P + + LP DWR+KGAV P+KDQG CGSC
Sbjct: 91 NAEFRAMHVG---KFKRPRYQDRLPAEDEDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSC 147
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS ++E A+FLAT +LVSLSEQQL+DCD + D+GC+GGLM +AF++ +
Sbjct: 148 WAFSAIASIESAHFLATKELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVV 198
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
K GG+ E YPYTG+ +C +K+K A + F VV+ D V P+ V+
Sbjct: 199 KNGGVTTEAAYPYTGS--VGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVS 256
Query: 288 I--NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
I + Q Y G+ C LDHGVLL+GYG+ G PYWIIKNSWG SW
Sbjct: 257 ICGSDENFQNYKSGI-LSGKCDDSLDHGVLLIGYGTEG-------GMPYWIIKNSWGTSW 308
Query: 346 GENGYYKICR--GRNVCGVD 363
GE+G+ KI R G +CG++
Sbjct: 309 GEDGFMKIERKDGDGMCGMN 328
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 183/322 (56%), Gaps = 30/322 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK+K+ K Y + ++ ++RF++FK NL RA + Q ++ +A +G+TQF DLT
Sbjct: 303 ARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTSQ 361
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YLG + + D + P+ + D FDWR+ GAVGPV DQG CGSCW+F
Sbjct: 362 EFQIQYLGFKYE----DMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCWAF 417
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST G +EG FL TG+L+SLSEQQL+DCD + D GCNGG + +K G
Sbjct: 418 STIGNIEGQWFLKTGELLSLSEQQLIDCD---------NVDEGCNGGYPPKTYGAVIKMG 468
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL DYPY C D+ K+ + + V +E A L GPL+ A+NA
Sbjct: 469 GLELNSDYPYKAL--AEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNAN 526
Query: 292 YMQTYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
++ Y G+ + S R L+H VL VGYG+ PYW +KNSWG ++GE+
Sbjct: 527 PLKFYKTGIMHLPVASCFPRALNHAVLTVGYGTE-------NGLPYWTVKNSWGTAFGED 579
Query: 349 GYYKICRGRNVCGVDSMVSTVA 370
GY++I RG CG++ +VST A
Sbjct: 580 GYFRIYRGGGTCGINRLVSTAA 601
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 115/206 (55%), Gaps = 23/206 (11%)
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
+FDWR+ GAVGPV +QG CGSCW+FS G +EG FL +G+L+ LS QQ++DCDH
Sbjct: 42 NFDWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCDH----- 96
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSV 266
D GCNGG + + GGL + DY Y C D+SK A V N SV
Sbjct: 97 ----VDHGCNGGYPPQVYRQVNQMGGLQLDADYSYKAAV--GKCHTDRSKFRAYV-NSSV 149
Query: 267 VSLDEDQIAANLVKN-GPLAVAINAVYMQTYIGGV--SCPYICSR-RLDHGVLLVGYGSA 322
+ +Q AN +K GPLA +NA +Q Y G+ P C+ +L+H VL VGYG+
Sbjct: 150 ILSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPGQLNHAVLTVGYGTE 209
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGEN 348
+ PYWI+KNSW +GE
Sbjct: 210 -------QGMPYWIVKNSWSRGFGEQ 228
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/352 (42%), Positives = 196/352 (55%), Gaps = 41/352 (11%)
Query: 33 RQVTDGGDEILSHHESTNNDLLGAEHH------FSLFKKKFNKAYASQEEHDHRFTIFKA 86
R+ D + + S + N D L + F F + +N+ Y S+EE R ++F +
Sbjct: 130 RRTEDRNETLKSTLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFAS 189
Query: 87 NLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLG--LR----RKLRLPKDADQAPIL 139
N+ RA + Q LD +A +GIT+FSDLT EFR YL LR +K++L K +
Sbjct: 190 NMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRTIYLNPLLRSEPGKKMQLAKPVE----- 244
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
+ P +DWR KGAV VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DC
Sbjct: 245 --DPAPPQWDWRSKGAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDC 302
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
D D C GGL ++A+ GGL EEDY Y G AC F K
Sbjct: 303 DK---------LDKACLGGLPSNAYSAIKNLGGLETEEDYTYQG--HMQACNFSAQKAKV 351
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRRL-DHGVLL 316
+ + +S +E ++AA L K GP++VAINA MQ Y G++ P +CS L DH VLL
Sbjct: 352 YINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRRGIAHPLRPLCSPWLIDHAVLL 411
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
VGYG+ P+W IKNSWG WGE GYY + RG VCGV++M S+
Sbjct: 412 VGYGNRS-------ATPFWAIKNSWGADWGEEGYYYLYRGSGVCGVNTMASS 456
>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 168/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AF + ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFGWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 ENSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW WGE+GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSWTAQWGEDGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/377 (39%), Positives = 196/377 (51%), Gaps = 34/377 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M KT++ +V +++ +A ++ + D IR V+DG EI E + +LG H
Sbjct: 1 MSVKTILPSVVLVILIAASAAADIGFDESNPIRMVSDGLREI----EESVVQILGQSRHV 56
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y + EE RF+IFK NL K S G+ QF+DLT EF+
Sbjct: 57 LSFARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLSYKLGVNQFADLTWQEFQ 116
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R LG + L LP DWRE G V PVKDQG CGSCW+FSTTGAL
Sbjct: 117 RNKLGAAQNCSATLKGSHK--LTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGAL 174
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE
Sbjct: 175 EAAYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEE 227
Query: 238 DYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-M 293
YPYTG D CK+ + V N ++ + DE + A LV+ P+++A V
Sbjct: 228 AYPYTGKD--GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVVKSF 283
Query: 294 QTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C ++H VL VGYG PYW+IKNSWG WG+ GY
Sbjct: 284 RLYKSGVYTDSHCGNTPMDVNHAVLAVGYGIEDGV-------PYWLIKNSWGADWGDKGY 336
Query: 351 YKICRGRNVCGVDSMVS 367
+K+ G+N+CG+ + S
Sbjct: 337 FKMEMGKNMCGIATCAS 353
>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 172/310 (55%), Gaps = 25/310 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK ++ RA +P AT G+TQFSD++P E
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEL 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TYL G + K + + T P DWR+KGAV PVKDQ CGSCW+FS TG
Sbjct: 98 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFSATG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD+ D GC GGLM+ A ++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDN---------MDDGCQGGLMDRALKWIVSSNKGNV 208
Query: 234 MREEDYPYTGTDRG-HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPY TD C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDASS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Y GGV SC S L+H VLLVGY + PYWIIKNSWG+ WGE GY
Sbjct: 269 FLDYKGGVLTSCS---SDALNHDVLLVGYDDTS-------KPPYWIIKNSWGKKWGEEGY 318
Query: 351 YKICRGRNVC 360
++ +G N C
Sbjct: 319 IRVEKGTNQC 328
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 142/370 (38%), Positives = 195/370 (52%), Gaps = 34/370 (9%)
Query: 8 LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFK 64
L+ + + S+G+ D + + + V+DG E+ E++ ++G H F+ F
Sbjct: 9 FLLILIACVAGASAGSSFADQNPIKQVVSDGLREL----EASVLQVIGQTRHSLAFARFA 64
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
++ K+Y + EE RF+IF +L+ H K S T G+ +F+DLT EFR+ LG
Sbjct: 65 HRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEFRKHRLGAA 124
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
+ + L LP DWRE G V PVK+QG CGSCW+FSTTGALE A A
Sbjct: 125 QNCSATLKGNHK--LTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEAAYVQA 182
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
GK + LSEQQLVDC + + GCNGGL + AFEY GGL EE YPYTG
Sbjct: 183 FGKAIFLSEQQLVDCARAYN-------NFGCNGGLPSQAFEYIKANGGLDTEEAYPYTGV 235
Query: 245 DRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGV 300
D CKF I V N ++ + DE + A V+ P++VA V + Y GV
Sbjct: 236 D--GVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVR--PVSVAFEVVSGFRLYKSGV 291
Query: 301 SCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
C ++H V+ VGYG + PYW+IKNSWG WG+NGY+K+ G+
Sbjct: 292 YTSDTCGNTPMDVNHAVVAVGYGVE-------NDVPYWLIKNSWGADWGDNGYFKMEMGK 344
Query: 358 NVCGVDSMVS 367
N+CGV + S
Sbjct: 345 NMCGVATCAS 354
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 175/316 (55%), Gaps = 22/316 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD D GC GGLM+ AF++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A + + + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q+Y GGV I S LDHGVLLVGY + PYWIIKNSW + WGE GY
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYSA 320
Query: 353 ICRGRNVCGVDSMVST 368
+ R N C + ++ S+
Sbjct: 321 L-RRHNQCLMKNLPSS 335
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 181/310 (58%), Gaps = 17/310 (5%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSA-THGITQFSDLTPAEF 116
+ L+K+ K Y+S++E +R TI++AN + H D T + F+DL +EF
Sbjct: 22 EWELWKRTNGKDYSSEKEELYRQTIWEANKKIVLEHNANADKWGWTLEMNAFADLESSEF 81
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y G RR R +A + + N LP DWR KGAV PVK+Q CGSCW+FSTTG+
Sbjct: 82 AAMYNGYRRSAR-KSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAFSTTGS 140
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG FL G L SLSEQQLVDC + + GC GGLM++AF+Y GG+ E
Sbjct: 141 LEGQTFLKKGTLPSLSEQQLVDCSDKYG-------NHGCQGGLMDNAFKYIEANGGIDSE 193
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY--M 293
YPY + C+F +S +AA+ + + D+ + V N GP++VA++A +
Sbjct: 194 ASYPYEA--KNGKCRFQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSF 251
Query: 294 QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GV P +CS RLDHGVL VGYG+ + + +EKPYW++KNSWG WG+ GY+K
Sbjct: 252 QLYAAGVYDPLLCSSTRLDHGVLAVGYGTEP-SGLFHEEKPYWLVKNSWGPDWGQQGYFK 310
Query: 353 ICRGRNVCGV 362
I R N CG+
Sbjct: 311 IVRKDNKCGI 320
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus familiaris]
Length = 490
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 187/320 (58%), Gaps = 36/320 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE + R ++F N+ RA + Q LD +A +GIT+FSDLT EFR
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251
Query: 119 TYLG--LR----RKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
YL LR +K+RL K +D AP P ++DWR KGAV VKDQG CGSCW+F
Sbjct: 252 IYLNPLLRENRGKKMRLAKSISDHAP-------PPEWDWRSKGAVTKVKDQGMCGSCWAF 304
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ + G
Sbjct: 305 SVTGNVEGQWFLKEGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYSAIMTLG 355
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E+DY Y G AC F K + + +S +E ++AA L K GP++VAINA
Sbjct: 356 GLETEDDYSYQG--HLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPISVAINAF 413
Query: 292 YMQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG WGE
Sbjct: 414 GMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGI-------PFWAIKNSWGTDWGEE 466
Query: 349 GYYKICRGRNVCGVDSMVST 368
GYY + RG CGV++M S+
Sbjct: 467 GYYYLHRGSGACGVNTMASS 486
>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
Length = 467
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 168/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EF
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFWS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EGSYPYASGEGISPPCTTSGHTVGATITGHVEIPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYW+IKNSW WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWVIKNSWTTHWGEGGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V VS+
Sbjct: 320 KGSNQCLVKEGVSS 333
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 138/365 (37%), Positives = 192/365 (52%), Gaps = 40/365 (10%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKK 66
VL +S + SA + D + DE+++ +E + K
Sbjct: 13 VLLFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEE--------------WLVK 58
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-- 124
K Y + E + RF +FK NLR H + + G+ F+DLT E+R TYLG R
Sbjct: 59 QGKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGG 118
Query: 125 -RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
++ RL K +D+ LP DWR++GAV VKDQGSCGSCW+FST A+EG N +
Sbjct: 119 MKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKI 178
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ EEDYPY
Sbjct: 179 VTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLA 230
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVS 301
D G + K+ ++ ++ V ++ + V N P++VAI A Q Y G+
Sbjct: 231 RD-GRCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIF 289
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN--- 358
C +LDHGV VGYG+ K YWI++NSWG+SWGENGY ++ R N
Sbjct: 290 SGR-CGTQLDHGVAAVGYGTE-------NGKDYWIVRNSWGKSWGENGYLRMARSINSPT 341
Query: 359 -VCGV 362
+CG+
Sbjct: 342 GICGI 346
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 180/326 (55%), Gaps = 28/326 (8%)
Query: 54 LGAEHH-FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQF 108
LG H + FK F K Y + EE RF IF+ L R H + S G+ QF
Sbjct: 47 LGPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQF 106
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDAD--QAPILPTNDLPADFDWREKGAVGPVKDQGSCG 166
SD++ E+ R + GLRR R + + L DWR+KG V PVK+QG CG
Sbjct: 107 SDMSHDEYLR-HNGLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCG 165
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCWSFSTTG+LEG +F TGKL+SLSEQQLVDC E GCNGGLM++AFEY
Sbjct: 166 SCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNE-------GCNGGLMDNAFEY 218
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLA 285
GGL E+DYPYT + C KS A+ + V S DED + L GP++
Sbjct: 219 IKSIGGLEGEDDYPYTA--KQGKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPIS 276
Query: 286 VAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
VAI+A + Q+Y GGV C S+ LDHGVL VGYG+ YW++KNSWG
Sbjct: 277 VAIDASHASFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGG------DYWLVKNSWG 330
Query: 343 ESWGENGYYKICRGR-NVCGVDSMVS 367
E WGE GY K+ R + N CG+ + S
Sbjct: 331 EMWGEEGYIKMSRNKDNQCGIATQAS 356
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 179/312 (57%), Gaps = 27/312 (8%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG--- 122
K K+Y + E + RF IFK NLR H ++ + G+ +F+DLT E+R YLG
Sbjct: 60 KHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSRYLGRRD 119
Query: 123 -LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
RR LR + +D+ DLP DWREKGAV PVKDQG+CGSCW+FST A+EG N
Sbjct: 120 ETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAAVEGIN 179
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ATG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ EEDYPY
Sbjct: 180 QIATGDLISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 231
Query: 242 TGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIG 298
D C + K+ S+ + V ++++ V N P++VAI A Q Y
Sbjct: 232 RAAD--TTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQS 289
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
GV C +LDHGV+ VGYG+ YWI++NSWG +WGE+GY K+ RN
Sbjct: 290 GVFTGQ-CGTQLDHGVVAVGYGTENSV-------DYWIVRNSWGPNWGESGYIKL--ERN 339
Query: 359 VCGVDSMVSTVA 370
+ G ++ +A
Sbjct: 340 LAGTETGKCGIA 351
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 139/344 (40%), Positives = 193/344 (56%), Gaps = 37/344 (10%)
Query: 35 VTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH 94
V GD I + + LL + F+ + K K Y++ EE HRF ++K NL RH
Sbjct: 22 VVANGDVIRMPTDVGKDQLLAGQ--FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRH 79
Query: 95 QKLDPSATHGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTNDLPADFDW 150
+ + S G+T+F+DLT EFRR Y G R R+L+ ++A + ++ P DW
Sbjct: 80 SEKNLSYWLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDW 139
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
REKGAV VKDQGSCGSCW+FS G++EG N + TG +SLS Q+LVDCD +
Sbjct: 140 REKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK-------- 191
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA---NFSVV 267
+ GCNGGLM+ AF++ ++ GG+ E+DYPY G D + D +K+ A V ++ V
Sbjct: 192 YNQGCNGGLMDYAFDFVIQNGGIDTEKDYPYQGYDG----RCDVNKMNARVVTIDSYEDV 247
Query: 268 SLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
++++ V P++VAI A Q Y GGV C LDHGVL VGYGS
Sbjct: 248 PENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVFTGR-CGTDLDHGVLAVGYGSE--- 303
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICR------GRNVCGVD 363
K YWI+KNSWGE WGE+GY ++ R G +CG++
Sbjct: 304 ----KGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLCGIN 343
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 145/377 (38%), Positives = 198/377 (52%), Gaps = 34/377 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M +T++ V L++ +A ++ ++ D IR V+D E+ E + +LG H
Sbjct: 2 MSVRTILPSAVLLILIAASTAESIGFDESNPIRMVSDRLREV----EESVVQILGQSRHV 57
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y + EE RF+IFK NL K S G+ QF+D+T EF+
Sbjct: 58 ISFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQEFQ 117
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
RT LG + L LP DWRE G V PVKDQG CGSCW+FSTTGAL
Sbjct: 118 RTKLGAAQNCSATLKGTHK--LTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGAL 175
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE
Sbjct: 176 EAAYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEE 228
Query: 238 DYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-M 293
YPYTG D CK+ + V N ++ + DE + A LV+ P+++A ++
Sbjct: 229 AYPYTGED--GTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVR--PVSIAFEVIHSF 284
Query: 294 QTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C + ++H VL VGYG PYW+IKNSWG WG+ GY
Sbjct: 285 RLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIE-------DGVPYWLIKNSWGADWGDKGY 337
Query: 351 YKICRGRNVCGVDSMVS 367
+K+ G+N+CG+ + S
Sbjct: 338 FKMEMGKNMCGIATCAS 354
>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
Length = 490
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 183/322 (56%), Gaps = 39/322 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 193 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRT 252
Query: 119 TYLGLR------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL RK+RL K P P ++DWR+KGAV VKDQG CGSCW+FS
Sbjct: 253 IYLNPLLQEEPGRKMRLAKSVSSLP-------PPEWDWRKKGAVTKVKDQGMCGSCWAFS 305
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D GC GGL ++A+ GG
Sbjct: 306 VTGNVEGQWFLKQGTLLSLSEQELLDCDK---------VDKGCMGGLPSNAYSAIKTLGG 356
Query: 233 LMREEDYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
L EEDY Y RGH C F+ K + + +S +E ++AA L + GP++VAINA
Sbjct: 357 LETEEDYSY----RGHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINA 412
Query: 291 VYMQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
MQ Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG WGE
Sbjct: 413 FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIKNSWGTDWGE 465
Query: 348 NGYYKICRGRNVCGVDSMVSTV 369
GYY + RG CGV+ M S+
Sbjct: 466 EGYYYLYRGSGACGVNIMASSA 487
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 182/323 (56%), Gaps = 28/323 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDP---SATHGITQFSDLT 112
+ ++ +K + K Y S EE R I++ NL +H K D + G+ QF+DL
Sbjct: 25 DEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLK 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCW 169
EF G R K A + LP+N +LP DWR KG V PVKDQG CGSCW
Sbjct: 85 NEEFVAMMTGFRVN-GTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCW 143
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FSTTG+LEG +F ATGKLVSLSEQ LVDC + E GC+GGLM+ AF+Y +K
Sbjct: 144 AFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNE-------GCDGGLMDQAFQYIIK 196
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAI 288
AGG+ EE YPY D C F K+ I A+V ++ V+ D + V + GP++VAI
Sbjct: 197 AGGIDTEESYPYKAVDG--ECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAI 254
Query: 289 NAVYM--QTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+A +M Q Y GV + P S LDHGVL VGYG+ YWI+KNSW E+W
Sbjct: 255 DASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGT------DYWIVKNSWAETW 308
Query: 346 GENGYYKICRGR-NVCGVDSMVS 367
G NGY + R + N CG+ + S
Sbjct: 309 GMNGYLWMSRNKDNQCGIATQAS 331
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 147/362 (40%), Positives = 191/362 (52%), Gaps = 34/362 (9%)
Query: 11 VSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKF 67
+ L++F+A +S + D I+ V+D E+ E T +LG H FS F ++
Sbjct: 11 ILLILFAAAASKEIGFDESNPIKMVSDNLHEL----EDTVVQILGQSRHVLSFSRFTHRY 66
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K Y S EE RF++FK NL K S + QF+DLT EF+R LG +
Sbjct: 67 GKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNC 126
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
T +P DWRE G V PVK+QG CGSCW+FSTTGALE A A GK
Sbjct: 127 SATLKGSHKITEAT--VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
+SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG D G
Sbjct: 185 GISLSEQQLVDCAGTFN-------NFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 248 HACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCP 303
CKF I V N ++ + DE + A LV+ P++VA V+ + Y GV
Sbjct: 238 --CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGVFTS 293
Query: 304 YICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
C ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ G+N+C
Sbjct: 294 NTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC 346
Query: 361 GV 362
GV
Sbjct: 347 GV 348
>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
Length = 443
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 186/321 (57%), Gaps = 31/321 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR-R 118
FS FK + Y S E RF IF AN+++AA + +P AT G +F+D++ EF+ R
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPA----DFDWREKGAVGPVKDQGSCGSCWSFSTT 174
+ A ++ A DWR KGAV VK+QGSCGSCWSFSTT
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGG 232
G +EG N +ATG LVSLSEQ+LV CD + D+GCNGGLM++AF + + + G
Sbjct: 145 GNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDNAFGWLISTRGGQ 195
Query: 233 LMREEDYPY-TGTDRGHACKF--DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
+ E YPY +G AC + D + A+++NF ++ E+ +AA + GPL++ ++
Sbjct: 196 IATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVD 255
Query: 290 AVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
A Q+Y GG+ CP + ++DHGVL+VGY AP PYWIIKNSW +WGE
Sbjct: 256 ASTWQSYAGGIITYCPDV---QIDHGVLIVGYDDT--AP-----TPYWIIKNSWTANWGE 305
Query: 348 NGYYKICRGRNVCGVDSMVST 368
+GY ++ +G N+CG+ S S+
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSS 326
>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
Length = 271
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 168/289 (58%), Gaps = 29/289 (10%)
Query: 87 NLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
L A R Q+++ +A +G+TQFSDLT EF+ YL ++R + P D+
Sbjct: 1 QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVT 56
Query: 146 AD---FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE 202
D FDWRE GAVGPV DQG CGSCW+FS G +EG F TG L++LSEQQLVDCDH
Sbjct: 57 MDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH- 115
Query: 203 CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
D GCNGG + K GGL DYPYTG D C ++SK A V
Sbjct: 116 --------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD--GICYMNQSKFVAYVN 165
Query: 263 NFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV--SCPYICSRR-LDHGVLLVGY 319
+ +V+ L E A L + GPL+ A+NAV +Q Y+GG+ P++C+ L+H VL VGY
Sbjct: 166 DSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGY 225
Query: 320 GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
G+ PYWI+KNSWG +GE GY++I RG CG++ +VST
Sbjct: 226 GTE-------FGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGINLVVST 267
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 131/296 (44%), Positives = 172/296 (58%), Gaps = 23/296 (7%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
K K Y + E D RF IFK NLR H D + G+ +F+DLT E+R TY G++
Sbjct: 58 KHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKT 117
Query: 126 ---KLRLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
K +L K +D+ + LP DWRE+GAV VKDQGSCGSCW+FSTTG++EG N
Sbjct: 118 IDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVN 177
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ TG L+S+SEQ+LV+CD S + GCNGGLM+ AFE+ +K GG+ EEDYPY
Sbjct: 178 KIVTGDLISVSEQELVNCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPY 229
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGG 299
TG D G K K+ ++ ++ V ++++ V N P+AVAI A Q Y G
Sbjct: 230 TGKD-GKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSG 288
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
+ C LDHGVL GYG+ K YW++KNSWG WGE GY K+ R
Sbjct: 289 IFTG-SCGTALDHGVLAAGYGTE-------DGKDYWLVKNSWGAEWGEGGYLKMER 336
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 188/330 (56%), Gaps = 33/330 (10%)
Query: 51 NDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ 107
N L +H+ F+ + + ++AY E D R+ FK NL + S G+
Sbjct: 17 NRLFSEQHYQNQFTNWMVRLDRAYDVFEFQD-RYNAFKNNLDLIHKWNSQGHSTVLGVNH 75
Query: 108 FSDLTPAEFRRTYLGLRRKL-RLPKDADQAPILPTNDL----PADFDWREKGAVGPVKDQ 162
+DL+ E+R YLG++ RLP+ QA + N + A DWR GAVG VKDQ
Sbjct: 76 LADLSNEEYRNLYLGVKVDASRLPQ---QAASIKLNKVFAPVAASLDWRSSGAVGRVKDQ 132
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
G CGSCWSFSTTG++EGAN +ATG SLSEQQL+DC + E GCNGGLM++
Sbjct: 133 GQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNE-------GCNGGLMDA 185
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKN 281
A +Y + GGL EE YPYT +D + CKF+ + I A ++++ V E +AA L K
Sbjct: 186 AMKYVIAQGGLDTEESYPYTMSDS-YTCKFNPANIGAKISSYIDVQRGSETDLAAKLNK- 243
Query: 282 GPLAVAINAVY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GP++VAI+A + Q Y GV CS LDHGVL VGYG+ G YWI+K
Sbjct: 244 GPVSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGVLAVGYGTEG-------SSNYWIVK 296
Query: 339 NSWGESWGENGYYKICRGR-NVCGVDSMVS 367
NSWG +WG +GY + + + N CG+ SM S
Sbjct: 297 NSWGPNWGLSGYIWMAKDKSNHCGISSMAS 326
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 132/302 (43%), Positives = 171/302 (56%), Gaps = 23/302 (7%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR- 124
K K+Y + E + RF IFK NLR H + G+ +F+DLT E+R YLG R
Sbjct: 52 KHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGART 111
Query: 125 ---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
R+L K +D+ + LP DWREKGAV VKDQGSCGSCW+FST A+EG N
Sbjct: 112 GSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGIN 171
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ EEDYPY
Sbjct: 172 QIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPY 223
Query: 242 TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM--QTYIGG 299
D G ++ K+ ++ ++ V ++ +Q V N P++VAI A M Q Y G
Sbjct: 224 NARD-GRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESG 282
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
V C LDHGV VGYG+ YWI+KNSWG SWGE+GY ++ R
Sbjct: 283 VFTGN-CGTALDHGVTAVGYGTE-------NSVDYWIVKNSWGSSWGESGYIRMERNTGA 334
Query: 360 CG 361
G
Sbjct: 335 TG 336
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena thermophila SB210]
Length = 389
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 138/355 (38%), Positives = 190/355 (53%), Gaps = 42/355 (11%)
Query: 45 HHESTNN-DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAAR-HQKLDPSAT 102
H+ ST +L + FS FK + K Y EE RF IF+ NL + +Q + +A
Sbjct: 24 HYNSTKQLNLTQVKQLFSKFKAEHKKFYNFLEEQ-RRFEIFRQNLDIISELNQVEEGTAE 82
Query: 103 HGITQFSDLTPAEFRRTYL---GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
+GITQFSD+T EF+ L R + I + D P +DWR+ GAV PV
Sbjct: 83 YGITQFSDMTTEEFKSQILIPSTYARNFTGSRYHGFQKI--SQDAPTSYDWRDHGAVTPV 140
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
K+QG+ G+CW+FSTTG +EG FLA LVSLSE+Q+VDCD +P G D G GG
Sbjct: 141 KNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEPST-GHADCGVFGGW 199
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRG-------------------------HACKFDK 254
AF+Y + AGGL EE YPY + G + C+ +
Sbjct: 200 PYLAFDYVINAGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYPCRQGQ 259
Query: 255 SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSR-RLDHG 313
IAA + ++ +S DED I L + GPL+VA++A Y+Q Y G+S P CS+ L+H
Sbjct: 260 VPIAAKIEDWKALSKDEDSIKQQLFEIGPLSVALDASYLQFYKKGISAPKFCSKTTLNHA 319
Query: 314 VLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
VLL GYG +W +KNSWG WGE GY+++ RG +CG+++ V+T
Sbjct: 320 VLLTGYGIDNGV-------EFWNVKNSWGAKWGEQGYFRLKRGVGMCGINTQVAT 367
>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
Length = 335
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 172/312 (55%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ +E R F+ NL HQ +P A GIT+F DL+ EF
Sbjct: 30 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 89
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWREKGAV PVKDQG CGSCW+FS G
Sbjct: 90 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 149
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E +LAT L+SLSEQ+LV CD D GCNGGLM AF++ L + G +
Sbjct: 150 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMGQAFDWLLNNRNGAV 200
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S I A + + +ED +AA L NGP+A+A++A
Sbjct: 201 YTGASYPYV-SGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDA 259
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GGV SC ++L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 260 SAFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWVIKNSWGENWGEK 309
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 310 GYVRVRKGTNEC 321
>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
Length = 313
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 174/321 (54%), Gaps = 35/321 (10%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTP 113
L E+ F+ F+ ++ K Y + E R +F N+ A + D T G T F+D+T
Sbjct: 17 LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTN 76
Query: 114 AEFRRTYL-GLRRKLRLPKDADQAPILPTNDLPAD-FDWREKGAVGPVKDQGSCGSCWSF 171
EF + L G K ++ K P P + A+ DWREKGAV PVK+Q SCGSCW+F
Sbjct: 77 TEFAVSKLCGCMLKPKMTK-----PATPIMEPAAEAVDWREKGAVTPVKNQASCGSCWAF 131
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TGA+EG NF+A G+L+SLSEQQLVDCDH+ SGC GGLM AFEY K
Sbjct: 132 SATGAMEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGLMTYAFEYA-KKK 181
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA- 290
G+ +EEDYPY D CK DK + V + V GP++VA+ A
Sbjct: 182 GMCKEEDYPYHAVDED--CKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEAD 239
Query: 291 -VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+ Q Y GGV C L+HGVL VGYG+ YWI+KNSWGESWG+ G
Sbjct: 240 SIVFQMYTGGVIDSSACGTSLNHGVLAVGYGA-----------DYWIVKNSWGESWGDKG 288
Query: 350 YYKIC---RGRNVCGVDSMVS 367
Y KI G +CG++ M S
Sbjct: 289 YLKIKYTESGAGICGINQMNS 309
>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
Length = 467
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 169/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 97 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD DSGC+GGLMN+AFE+ ++ G +
Sbjct: 157 ECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA L NGP+AVA++A
Sbjct: 208 EDSYPYASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVAVAVDASSWM 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNS WGE GY +I
Sbjct: 268 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSRTTQWGEEGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 320 KGSNQCLVKEEASS 333
>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
Length = 443
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 186/321 (57%), Gaps = 31/321 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR-R 118
FS FK + Y S E RF IF AN+++AA + +P AT G +F+D++ EF+ R
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPA----DFDWREKGAVGPVKDQGSCGSCWSFSTT 174
+ A ++ A DWR KGAV VK+QGSCGSCWSFSTT
Sbjct: 85 HNAARHYAAAKARRAKHTKSFTKEEIKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGG 232
G +EG N +ATG LVSLSEQ+LV CD + D+GCNGGLM++AF + + + G
Sbjct: 145 GNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDNAFGWLISTRGGQ 195
Query: 233 LMREEDYPY-TGTDRGHACKF--DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
+ E YPY +G AC + D + A+++NF ++ E+ +AA + GPL++ ++
Sbjct: 196 IATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVD 255
Query: 290 AVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
A Q+Y GG+ CP + ++DHGVL+VGY AP PYWIIKNSW +WGE
Sbjct: 256 ASTWQSYAGGIITYCPDV---QIDHGVLIVGYDDT--AP-----TPYWIIKNSWTANWGE 305
Query: 348 NGYYKICRGRNVCGVDSMVST 368
+GY ++ +G N+CG+ S S+
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSS 326
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 135/327 (41%), Positives = 177/327 (54%), Gaps = 31/327 (9%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR 90
L+R TD G+E L F + K K Y+S EEH HR+ ++K NL
Sbjct: 29 LLRMTTDLGNERL------------LSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEY 76
Query: 91 AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDW 150
RH + + S G+T+F+D+T EFRR Y G R ++ P DW
Sbjct: 77 IQRHSEKNRSYWLGLTKFADITNDEFRRQYTGTRIDRSKRSKRKTGFRYADSEAPESVDW 136
Query: 151 REKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS 210
R+KGAV VKDQGSCGSCW+FS G++EG N + TG+ VSLSEQ+LVDCD E
Sbjct: 137 RKKGAVTTVKDQGSCGSCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLE-------- 188
Query: 211 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD 270
+ GCNGGLM+ AF++ L+ GG+ E DYPY G D G K+ ++ + V +
Sbjct: 189 YNQGCNGGLMDYAFDFILENGGIDTENDYPYKGLD-GRCDNNKKNAHVVTIDGYEDVPEN 247
Query: 271 EDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
+++ V P++VAI A Q Y GGV C LDHGVL VGYGS G
Sbjct: 248 DEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVFTGE-CGTDLDHGVLAVGYGSEG----- 301
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICR 355
YWI+KNSWGE WGE+GY ++ R
Sbjct: 302 --SLDYWIVKNSWGEYWGESGYLRMQR 326
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 181/314 (57%), Gaps = 27/314 (8%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
T+ +++G ++ + K KAY E + RF IFK NL+ H + S G+ +F
Sbjct: 39 TDEEVMGI---YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLNRF 95
Query: 109 SDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQG 163
+DLT E+R +LG + R+ K A + + +D LP DWRE GAV P+KDQG
Sbjct: 96 ADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQG 155
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
SCGSCW+FST A+EG N +ATG+++ LSEQ+LVDCD + D+GCNGGLM+ A
Sbjct: 156 SCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDR--------TYDAGCNGGLMDYA 207
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
FE+ + GG+ EEDYPY G D G K+ S+ ++ V ++ V + P
Sbjct: 208 FEFIINNGGIDTEEDYPYRGVD-GTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQP 266
Query: 284 LAVAINAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
++VAI A Q Y+ GV C R LDHGV++VGYG+ A +WI++NSW
Sbjct: 267 VSVAIEASGRAFQLYLSGVFTGE-CGRALDHGVVVVGYGTDNGA-------DHWIVRNSW 318
Query: 342 GESWGENGYYKICR 355
G SWGENGY ++ R
Sbjct: 319 GTSWGENGYIRMER 332
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 174/320 (54%), Gaps = 27/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTPA 114
+ FK K+Y S E RF IF N ARH + S G+ QF DL P
Sbjct: 26 QWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + + P N LP DWREKGAV PVK+QG CGSCW+FS
Sbjct: 86 EFARMFNGYRGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFS 145
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG+LEG +FL TG LVSLSEQ LVDC E G + GC GGLM++AF+Y GG
Sbjct: 146 TTGSLEGQHFLKTGVLVSLSEQNLVDC-----SETFG--NHGCEGGLMDNAFQYIKANGG 198
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ F + ED + + GP++VAI+A
Sbjct: 199 IDTEKSYPYEAED--GECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDAS 256
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV C S +LDHGVL+VGYG K YW++KNSW ESWG+N
Sbjct: 257 HSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVE-------DGKKYWLVKNSWAESWGDN 309
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY K+ R + N CG+ S S
Sbjct: 310 GYIKMSRDKDNQCGIASAAS 329
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 184/351 (52%), Gaps = 34/351 (9%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTI 83
D IR V+DG E+ E + + +LG H F+ F ++ K Y + EE RF+I
Sbjct: 27 DESNPIRMVSDGLREV----EESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSI 82
Query: 84 FKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
FK NL K S G+ QF+DLT EF+RT LG + +
Sbjct: 83 FKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHK--VTEAA 140
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP DWRE G V PVKDQG CGSCW+FSTTGALE A A GK +SLSEQQLVDC
Sbjct: 141 LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAF 200
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
+ + GCNGGL + AFEY GGL E+ YPYTG D CKF + V N
Sbjct: 201 N-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDE--TCKFSAENVGVQVLN 251
Query: 264 FSVVSL---DEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLL 316
++L DE + A LV+ P+++A ++ + Y GV C ++H VL
Sbjct: 252 SVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLA 309
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
VGYG PYW+IKNSWG WG+ GY+K+ G+N+CG+ + S
Sbjct: 310 VGYGVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCAS 353
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 180/321 (56%), Gaps = 28/321 (8%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTP 113
H+ L+K+ NK Y+ EEH R T ++ NL++ H H G+ +++D+T
Sbjct: 26 QHWKLWKEANNKRYSDAEEHVRRAT-WEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTV 84
Query: 114 AEFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF + G +R + D+ + LP DWR+KG V VKDQG CGSCW+F
Sbjct: 85 TEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCWAF 144
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
STTGALEG +F TGKLVSLSEQ LVDC + + GCNGGLM+ AFEY +
Sbjct: 145 STTGALEGQHFKQTGKLVSLSEQNLVDCSGK-------QGNMGCNGGLMDQAFEYIKENN 197
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA 290
G+ E+ YPY D + C+F + + A+ F+ + S DE + + GP++VAI+A
Sbjct: 198 GIDTEDSYPYEAVD--NQCRFKAANVGATDTGFTDITSKDESALQQAVATVGPISVAIDA 255
Query: 291 VY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+ Q Y GV CS+ RLDHGVL VGYG+ K YW++KNSWGE WG+
Sbjct: 256 GHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGTD-------SGKDYWLVKNSWGEGWGD 308
Query: 348 NGYYKICRG-RNVCGVDSMVS 367
GY K+ R RN CG+ + S
Sbjct: 309 KGYIKMTRNKRNQCGIATAAS 329
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 172/310 (55%), Gaps = 25/310 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPA 114
HF FK K K Y +Q E RF IF+ NLR+ A +++ S T GI +F+D+T A
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRA 84
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ L + K + A + L +P DWR + V P+KDQ CGSCWSF+
Sbjct: 85 EFK-AMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAV 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G+ EGA L+TGKL SEQQLVDC + + GC+GG ++ F Y ++ GL
Sbjct: 144 VGSTEGAYALSTGKLTRFSEQQLVDCTTD--------LNYGCDGGYLDDTFPY-IQTNGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E DYPYTG D +C +D SK+ V+++ V +E + + GP+A+AINA +
Sbjct: 195 ELESDYPYTGYD--GSCSYDSSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDL 252
Query: 294 QTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ C LDHGVL VGY S YW+IKNSWG WGE+GY++
Sbjct: 253 QFYFSGIIDDKYCDPEWLDHGVLAVGYNSE-------NGLDYWLIKNSWGADWGESGYFR 305
Query: 353 ICRGRNVCGV 362
RG+N+CGV
Sbjct: 306 FLRGQNICGV 315
>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 176/311 (56%), Gaps = 27/311 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK ++ RA +P AT G+TQFSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TYL G + + + + T P DWR+KGAV PVKDQGSCGSCW+F+ TG
Sbjct: 98 RATYLNGAKYYAAALERPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + + C GG + AF++ + + G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCRGGFADRAFKWIVSSNKGNV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
EE YPY TD G+ +KS + A ++ + DE+ IA L +NGP+A+A++A
Sbjct: 209 FTEESYPYASTD-GYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAS 267
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GGV SC S L H VLLVGY + PYWIIKNSW + WGE G
Sbjct: 268 TFLDYKGGVLTSCS---SEGLSHDVLLVGYNDT-------SKPPYWIIKNSWDKEWGEEG 317
Query: 350 YYKICRGRNVC 360
Y +I +G N+C
Sbjct: 318 YIRIEKGTNLC 328
>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
Length = 379
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 154/366 (42%), Positives = 202/366 (55%), Gaps = 38/366 (10%)
Query: 17 SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNND-------LLGAEHHFSLFKKKFNK 69
SA + G+++ + L + D G+E S S N+ + F F +N+
Sbjct: 34 SAFTQGSVM--ISSLSQPHPDNGNETFSPVFSLLNEDPLPQDLTVKMASIFRNFVITYNR 91
Query: 70 AYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLR 128
Y S+EE R +IF N+ RA + Q LD +A +G+T+FSDLT EFR YL +
Sbjct: 92 TYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLREE 151
Query: 129 LPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +EG FL G
Sbjct: 152 PGKKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGT 209
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
L+SLSEQ+L+DCD D C GGL +SA+ GGL E+DY Y RG
Sbjct: 210 LLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETEDDYSY----RG 256
Query: 248 H--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY- 304
H AC F K + + +S +E ++AA L K GP++VAINA MQ Y G+S P
Sbjct: 257 HMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLR 316
Query: 305 -ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
+CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY + RG CGV
Sbjct: 317 PLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGTDWGEKGYYYLHRGSGACGV 369
Query: 363 DSMVST 368
++M S+
Sbjct: 370 NTMASS 375
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 182/325 (56%), Gaps = 29/325 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+ + FK NK Y S+ E R IF N A+H KL S GI +++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 113 PAEFRRTYLGLRRK---LRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGS 167
EF + G R LR + D LP + LP DWR+KGAV PVKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CWSFS TG+LEG +F +GKLVSLSEQ LVDC E+ G ++GCNGGLM++AF Y
Sbjct: 144 CWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
GG+ E+ YPY D C + K+K A + S +ED++ + + GP++V
Sbjct: 197 KANGGIDTEQAYPYKAEDE--KCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 287 AINAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI+A + Q Y GGV CS +LDHGVL+VGYG+ YW++KNSWG+
Sbjct: 255 AIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGT------DYWLVKNSWGK 308
Query: 344 SWGENGYYKICRGR-NVCGVDSMVS 367
SWG+ GY K+ R R N CG+ + S
Sbjct: 309 SWGDQGYIKMARNRNNNCGIATEAS 333
>gi|18419649|gb|AAL69389.1|AF462226_1 putative cysteine proteinase [Narcissus pseudonarcissus]
Length = 136
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 103/125 (82%), Positives = 115/125 (92%)
Query: 247 GHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC 306
G CK DKSKIAASV+NFSVVS+DE+QIAANLV++GPLA+ INA +MQTYIGGVSCPYIC
Sbjct: 5 GAVCKLDKSKIAASVSNFSVVSIDEEQIAANLVQHGPLAIGINAAFMQTYIGGVSCPYIC 64
Query: 307 SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMV 366
+ LDHGVLLVGYGS+G+APIR KEKPYWIIKNSWGE+WGE GYYKIC+GRNVCGVDSMV
Sbjct: 65 GKHLDHGVLLVGYGSSGWAPIRFKEKPYWIIKNSWGENWGEKGYYKICKGRNVCGVDSMV 124
Query: 367 STVAA 371
STV A
Sbjct: 125 STVTA 129
>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
Length = 460
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 181/317 (57%), Gaps = 31/317 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y SQEE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRT 222
Query: 119 TYLGLRRKLRLPKDA---DQAPILPTNDLPA-DFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YL L KDA + P P D+P +DWR KGAV VKDQG CGSCW+FS T
Sbjct: 223 IYLN-----PLLKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVT 277
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL
Sbjct: 278 GNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLPSNAYSAIRTLGGLE 328
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+DY Y G R C F K + + +S +E ++AA L KNGP+++AINA MQ
Sbjct: 329 TEDDYSYRG--RLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFGMQ 386
Query: 295 TYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG WGE GYY
Sbjct: 387 FYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAI-------PFWAIKNSWGTDWGEEGYY 439
Query: 352 KICRGRNVCGVDSMVST 368
+ RG CGV+ M S+
Sbjct: 440 YLHRGSGACGVNIMASS 456
>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
Length = 489
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 186/317 (58%), Gaps = 32/317 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 193 FRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 119 TYLGLRRKLRLP-KDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
TYL LR P K QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG
Sbjct: 253 TYLN--PLLREPGKKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGN 308
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG FL G L+SLSEQ+L+DCD D C GGL +SA+ GGL E
Sbjct: 309 VEGQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETE 359
Query: 237 EDYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
+DY Y RGH AC F K + + +S +E ++AA L K GP++VAINA MQ
Sbjct: 360 DDYSY----RGHMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQ 415
Query: 295 TYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY
Sbjct: 416 FYRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYY 468
Query: 352 KICRGRNVCGVDSMVST 368
+ RG CGV++M S+
Sbjct: 469 YLHRGSGACGVNTMASS 485
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/327 (40%), Positives = 185/327 (56%), Gaps = 34/327 (10%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFS 109
+G ++ +++FKK++NK Y ++EE R ++++NL H H G+ ++
Sbjct: 21 VGLDNEWNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYG 79
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPI-LPTN---DLPADFDWREKGAVGPVKDQGSC 165
D+T EF +T G R + + AP+ +P N DLP DWR KG V P+K+QG C
Sbjct: 80 DMTNEEFTKTMNGYRMRNK----TSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQC 135
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCWSFS TG+LEG F TGKLVSLSEQ LVDC + + GC GGLM+ AF
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKK-------QGNHGCEGGLMDDAFT 188
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPL 284
Y G+ E YPY D C+F + + A+ F + + DE+ + + GP+
Sbjct: 189 YIKANNGIDTEASYPYKARD--GKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPI 246
Query: 285 AVAINAVYM--QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+VAI+A +M Q Y GV + CS+ +LDHGVL VGYG+ K YW++KNSW
Sbjct: 247 SVAIDASHMSFQLYRTGVYHDWFCSQTKLDHGVLAVGYGTE-------DSKDYWLVKNSW 299
Query: 342 GESWGENGYYKICRG-RNVCGVDSMVS 367
GESWG+ GY ++ R RN CG+ + S
Sbjct: 300 GESWGQKGYIQMSRNRRNNCGIATSAS 326
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 177/321 (55%), Gaps = 42/321 (13%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA---THGITQFSDLTPAE 115
+F FK+ F K Y +EH R IF+ NL + ++ T GITQF+D++ AE
Sbjct: 165 NFEHFKEHFGKTYEG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAE 223
Query: 116 FRRTYLGLR---------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCG 166
FR+TYLGLR RKL+ AD DLP DWR+KGAV PVKDQG CG
Sbjct: 224 FRQTYLGLRMNASTIAKLRKLQREVVADD------RDLPEAVDWRDKGAVSPVKDQGQCG 277
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FST+GA+EG +FL G+L+SLSEQQ+VDC D GCNGG A EY
Sbjct: 278 SCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDCSW---------LDFGCNGGQPMLAMEY 328
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLA 285
GGL E YPY G G +C DK AA + F + E + + K GP++
Sbjct: 329 VRFNGGLELETAYPYKGV--GGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPIS 386
Query: 286 VAINAVY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
V ++A Q Y G+ P CS LDH VL VGYG++ + YW++KNSW
Sbjct: 387 VGMDASGEDFQHYKSGIYNPESCSSIGLDHAVLAVGYGTS-------DDGDYWLVKNSWN 439
Query: 343 ESWGENGYYKICRGR-NVCGV 362
SWGE GY+K+ R + N CG+
Sbjct: 440 TSWGEKGYFKLPRNKGNKCGI 460
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 175/317 (55%), Gaps = 34/317 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F F+ K+ K Y S E ++R + N+ + + S T G+T F+D+T EF
Sbjct: 24 EKLFQTFEAKYGKNYLSSE-REYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF 82
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
+ L G +K P + QA +L N DWREKGAV PVK+QGSCGSCW+FS TG
Sbjct: 83 ATSKLCGCMKK---PLNHKQARVL-NNMAVESIDWREKGAVTPVKNQGSCGSCWAFSATG 138
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALEG NF+ATGKLVSLSEQQLVDCD E D+GC GG M++AFEY +K GL
Sbjct: 139 ALEGGNFVATGKLVSLSEQQLVDCDTE---------DAGCGGGFMDTAFEYVMKK-GLCT 188
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYM 293
EEDYPY D CK D+ S+ + V ++ + P++VAI A
Sbjct: 189 EEDYPYHAKDED--CKDDQCTSVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVF 246
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GGV +C L+HGVL VGY K Y I+KNSWG SWG+ GY KI
Sbjct: 247 QMYTGGVLDSDMCGTSLNHGVLAVGYA-----------KEYIIVKNSWGASWGDKGYVKI 295
Query: 354 C---RGRNVCGVDSMVS 367
+G +CG++ S
Sbjct: 296 AHRDQGEGICGINMAAS 312
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 171/308 (55%), Gaps = 21/308 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC GGLM+ + ++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
+ YPY +G + C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GGV I S+ LDH VLLVGY + PYWIIKNSW + WGE GY +
Sbjct: 269 FLGYKGGVLTSCI-SKGLDHDVLLVGYNDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 353 ICRGRNVC 360
I +G N C
Sbjct: 321 IEKGTNQC 328
>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
Length = 473
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 185/320 (57%), Gaps = 35/320 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R ++F N+ RA + Q LD +A +GIT+FSDLT EFR
Sbjct: 176 FKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTEEEFRT 235
Query: 119 TYLG--LR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL LR +K+RL K P +P D+DWR KGAV VKDQG CGSCW+FS
Sbjct: 236 IYLNPLLREDPGQKMRLGK-------APKGPVPPDWDWRTKGAVTKVKDQGMCGSCWAFS 288
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D C GG+ ++A+ GG
Sbjct: 289 VTGNVEGQWFLNRGTLLSLSEQELLDCD---------KVDKACMGGVPSNAYSAIKTLGG 339
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EEDY Y G AC F K + + +S +E ++AA L KNGP++VAINA
Sbjct: 340 LETEEDYSYHG--HLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISVAINAFG 397
Query: 293 MQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y G++ P +CS L DH VL+VGYG+ + P+W IKNSWG WGE G
Sbjct: 398 MQFYRHGIAHPLRPLCSPWLIDHAVLIVGYGNR-------SDVPFWAIKNSWGTDWGEEG 450
Query: 350 YYKICRGRNVCGVDSMVSTV 369
YY + RG CGV++M S+
Sbjct: 451 YYYLHRGSGACGVNTMASSA 470
>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 124/320 (38%), Positives = 174/320 (54%), Gaps = 21/320 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+EG +A +L SLSEQ LV CD + D GC GG + AF++ L G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTD---------DFGCRGGFSDPAFKWILWSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G CK + A ++N + DED I L + GP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCKMSGKVVGAKISNRLYLPEDEDMITEWLARKGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q+Y GGV I S+ +++G LLVGY + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SKEMNYGALLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVSTVAAA 372
I +G N C V ++ S+ +
Sbjct: 321 IEKGTNQCLVKNLPSSAVVS 340
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 183/316 (57%), Gaps = 28/316 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAEFRR 118
FKK+ + Y EE + RF IFK NL+ H K S GI QF+D+ EFR
Sbjct: 45 FKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR- 103
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y GLRR ++ + L L P + DWR+KG V VK+QG CGSCWSFSTTG+
Sbjct: 104 MYNGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG +F +GKLVSLSEQQLVDC + E GCNGGLM+ AFEY + GG+ E
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNE-------GCNGGLMDQAFEYIITNGGIETE 216
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
E+YPY R C F KS++AA+ + V S DE + ++ + GP+++AI+A +
Sbjct: 217 EEYPYDA--RQERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSF 274
Query: 294 QTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV P S LDHGVL+VGYG+ + YW++KNSWG +WG GY K
Sbjct: 275 QLYSGGVYDEPKCSSTELDHGVLVVGYGTD-------DGQDYWLVKNSWGTTWGLEGYVK 327
Query: 353 ICRGR-NVCGVDSMVS 367
+ R + N CGV + S
Sbjct: 328 MSRNQDNQCGVATQAS 343
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+ + FK NK Y S E R IF N A+H KL S GI +++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 113 PAEFRRTYLGLRRK---LRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGS 167
EF + G R LR + D LP + LP DWR+KGAV PVKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CWSFS TG+LEG +F +GKLVSLSEQ LVDC E+ G ++GCNGGLM++AF Y
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
GG+ E+ YPY D C + K+K A + S +ED++ + + GP++V
Sbjct: 197 KANGGIDTEQAYPYKAEDE--KCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 287 AINAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI+A + Q Y GGV CS +LDHGVL+VGYG+ YW++KNSWG+
Sbjct: 255 AIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGT------DYWLVKNSWGK 308
Query: 344 SWGENGYYKICRGR-NVCGVDSMVS 367
SWG+ GY K+ R R N CG+ + S
Sbjct: 309 SWGDQGYIKMARNRDNNCGIATEAS 333
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 187/332 (56%), Gaps = 40/332 (12%)
Query: 53 LLG-AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSAT---HGITQF 108
LLG A ++ L+KK K+Y EEH R +K+ + A + + D T G+ +F
Sbjct: 11 LLGLASANWDLYKKVHGKSYGHDEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKF 70
Query: 109 SDLTPAEFRRTYLGL--------RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVK 160
+D+T EFR + GL R R K+ L LP DWREKG V PVK
Sbjct: 71 TDMTSEEFR-NFKGLKFDATKTKRNGTRFQKE------LLGEALPTQVDWREKGYVTPVK 123
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTG+LEG +F ATGKLVSLSEQ LVDC ++GCNGGLM
Sbjct: 124 NQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRV-------EGNNGCNGGLM 176
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLV 279
++ F Y + GG+ EE YPYTG D C F+++ + A V F V DE + A +
Sbjct: 177 DNGFTYIQQNGGIDTEESYPYTGKD--GDCAFNENSVGARVKGFVDVPQRDEAALQAAVA 234
Query: 280 KNGPLAVAINAV--YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A Q Y GV CS +LDHGVL+VGYG+ YW+
Sbjct: 235 SVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQLDHGVLVVGYGTENGV-------DYWL 287
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG++GY K+ R + N CG+ SM S
Sbjct: 288 VKNSWGPTWGQDGYIKMMRNKENQCGIASMAS 319
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 171/308 (55%), Gaps = 21/308 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 33 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 92
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 93 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIG 152
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC GGLM+ + ++ + + G +
Sbjct: 153 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 203
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
+ YPY +G + C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 204 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 263
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GGV I S+ LDH VLLVGY + PYWIIKNSW + WGE GY +
Sbjct: 264 FLGYKGGVLTSCI-SKGLDHDVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 315
Query: 353 ICRGRNVC 360
I +G N C
Sbjct: 316 IEKGTNQC 323
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 194/354 (54%), Gaps = 33/354 (9%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGD-EILSHHESTNNDLLGAEHHFSLFK 64
V+LFL + V SAV + D + D E++S +E+ A++ SL +
Sbjct: 2 VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLVE 61
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
K D RF IFK NLR H K + S G+T+F+DLT E+R YLG +
Sbjct: 62 K------------DRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAK 109
Query: 125 RKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+ + + Q D LP DWR+KGAV VKDQGSCGSCW+FST GA+EG N +
Sbjct: 110 MEKKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQI 169
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ ++DYPY G
Sbjct: 170 VTGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 221
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVS 301
D G + K+ ++ ++ V ++ V + P++VAI A Q Y G+
Sbjct: 222 VD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIF 280
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C +LDHGV+ VGYG+ K YWI++NSWG+SWGE+GY K+ R
Sbjct: 281 -DGTCGTQLDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLKMAR 326
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 171/308 (55%), Gaps = 21/308 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC GGLM+ + ++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
+ YPY +G + C + A ++ + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GGV I S+ LDH VLLVGY + PYWIIKNSW + WGE GY +
Sbjct: 269 FLGYKGGVLTSCI-SKGLDHDVLLVGYDDT-------SKPPYWIIKNSWSKGWGEEGYIR 320
Query: 353 ICRGRNVC 360
I +G N C
Sbjct: 321 IEKGTNQC 328
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 181/325 (55%), Gaps = 29/325 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+ + FK NK Y S E R IF N A+H KL S GI +++D+
Sbjct: 24 QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 113 PAEFRRTYLGLRRK---LRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGS 167
EF + G R LR + D LP + LP DWR+KGAV PVKDQG CGS
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CWSFS TG+LEG +F +GKLVSLSEQ LVDC E+ G ++GCNGGLM++AF Y
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
GG+ E+ YPY D C + K+K A + S +ED++ + + GP++V
Sbjct: 197 KANGGIDTEQAYPYKAEDE--KCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 287 AINAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI+A + Q Y GGV CS +LDHGVL+VGYG+ YW++KNSWG+
Sbjct: 255 AIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGT------DYWLVKNSWGK 308
Query: 344 SWGENGYYKICRGR-NVCGVDSMVS 367
SWG+ GY K+ R R N CG+ + S
Sbjct: 309 SWGDQGYIKMARNRDNNCGIATEAS 333
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 171/308 (55%), Gaps = 26/308 (8%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR 128
KAY + E + RF IFK NLR H + S G+ +F+DLT E+R +LG +++
Sbjct: 56 KAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMK 115
Query: 129 ---LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+D+ + LP DWREKGAV PVKDQG CGSCW+FST A+EG N + T
Sbjct: 116 ERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVT 175
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G+L+SLSEQ+LVDCD S + GCNGGLM+ F++ + GG+ EEDYPY D
Sbjct: 176 GELISLSEQELVDCDK--------SYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVD 227
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCP 303
G +F K+ S+ + V D++ V N P++VAI A Q Y GV
Sbjct: 228 -GTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTG 286
Query: 304 YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV---- 359
+ C LDHGV+ VGYG+ YW ++NSWG WGENGY K+ R N
Sbjct: 287 H-CGTNLDHGVVAVGYGTENGVD-------YWTVRNSWGPKWGENGYIKLERNINATSGK 338
Query: 360 CGVDSMVS 367
CG+ SM S
Sbjct: 339 CGIASMAS 346
>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
Length = 440
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 174/320 (54%), Gaps = 21/320 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD D GC GG + AF++ + + G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A + + + DE+ IA L K GP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q+Y GGV I S LDHGVLLVGY + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SEHLDHGVLLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVSTVAAA 372
I +G N C + ++ S+ +
Sbjct: 321 IEKGTNQCLMKNLPSSAVVS 340
>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 172/312 (55%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ +E R F+ NL HQ +P A GIT+F DL+ EF
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWREKGAV PVKDQG CGSCW+FS G
Sbjct: 98 YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E +LAT L+SLSEQ+LV CD D GCNGGLM AF++ L + G +
Sbjct: 158 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S I A + + +ED +AA L NGP+A+A++A
Sbjct: 209 YTGASYPYV-SGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GGV SC ++L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 268 SAFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 318 GYVRVRKGTNEC 329
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 174/316 (55%), Gaps = 27/316 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLTPAEFRR 118
FK + NKAY+S E RF IF N A+H K S + +F DL P EF +
Sbjct: 30 FKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAK 89
Query: 119 TYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
G R K + P ND LP DWR+KGAV PVK+QG CGSCW+FSTTG+
Sbjct: 90 MVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGS 149
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG +F TGKLVSLSEQ LVDC + + GCNGGLM++ F+Y GG+ E
Sbjct: 150 LEGQHFRKTGKLVSLSEQNLVDCSDDFG-------NQGCNGGLMDNGFQYIKANGGIDTE 202
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
E +PYT D CKF K+ + A+ A F + ED + + GP++VAI+A +
Sbjct: 203 ESHPYTAQDGD--CKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSF 260
Query: 294 QTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GV P S +LDHGVL VGYG K YW++KNSWG WG+NGY
Sbjct: 261 QLYSQGVYDEPDCSSSQLDHGVLTVGYGVK-------NGKKYWLVKNSWGGDWGDNGYIL 313
Query: 353 ICRGR-NVCGVDSMVS 367
+ R + N CG+ S S
Sbjct: 314 MSRDKDNQCGIASSAS 329
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 174/310 (56%), Gaps = 25/310 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPA 114
HF FK K K Y +Q E RF IF+ NLR+ A +++ S T GI +F+D+T A
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRA 84
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ L + K + A + L +P DWR + V P+KDQ CGSCW+F+
Sbjct: 85 EFKAM-LATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAV 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G+ EGA L+TGKL SEQQLVDC + + GC+GG ++ F Y ++ GL
Sbjct: 144 VGSTEGAYALSTGKLTRFSEQQLVDCTTD--------LNYGCDGGYLDDTFPY-IQTNGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E DYPYTG D G+ C ++ SK+ V+++ V +E + + GP+A+AINA +
Sbjct: 195 ELESDYPYTGYD-GY-CSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDL 252
Query: 294 QTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ C LDHGVL VGY S + YW+IKNSWG WGE+GY++
Sbjct: 253 QFYFSGIIDDKYCDPEYLDHGVLAVGYDSE-------NGRDYWLIKNSWGADWGESGYFR 305
Query: 353 ICRGRNVCGV 362
RG+N+CGV
Sbjct: 306 FLRGQNICGV 315
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 181/324 (55%), Gaps = 26/324 (8%)
Query: 40 DEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP 99
D LSH +S+ + + +K KAY E RF IFK NLR H +
Sbjct: 8 DNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNR 67
Query: 100 SATHGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTND-LPADFDWREKG 154
+ G+T+F+DLT E+R +LG R R+L K+ + D LP DWR KG
Sbjct: 68 TYKVGLTKFADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKG 127
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV P+KDQGSCGSCW+FST A+EG N + TG+L+SLSEQ+LVDCD ++G
Sbjct: 128 AVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDR--------FYNAG 179
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQ 273
CNGGLM+ AF++ + GGL E+DYPY G D C DK K A S+ F V +++
Sbjct: 180 CNGGLMDYAFQFIINNGGLDTEKDYPYLGND--DTCDRDKMKTKAVSIDGFEDVLPFDEK 237
Query: 274 IAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKE 331
V + P++VAI A + +Q Y GV C LDHGV++VGYG+ K
Sbjct: 238 ALQKAVAHQPVSVAIEASGMALQFYQSGVFTGE-CGTALDHGVVVVGYGTE-------KG 289
Query: 332 KPYWIIKNSWGESWGENGYYKICR 355
YW+++NSWG WGE+GY K+ R
Sbjct: 290 LDYWLVRNSWGTEWGEHGYIKMQR 313
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 125/307 (40%), Positives = 171/307 (55%), Gaps = 26/307 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTPAE 115
F FK + K+Y +Q E RF IF+AN+ +H L S I QF+DLT E
Sbjct: 26 FETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTDLTQEE 85
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
F+ YLGL K L L ++P DWR G V VK+QGSCGSCWSF+ TG
Sbjct: 86 FK-AYLGLHVKPVLNNTIQYE--LKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSFALTG 142
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+ EGA + +LVSLSEQQLVDC S + GCNGG +++ F Y ++ GL
Sbjct: 143 STEGAYYRKHKQLVSLSEQQLVDCST--------SINYGCNGGFLDATFPY-IEQYGLQT 193
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E YPYTG D +CK+D SK+ ++N+ + E ++ + GP+A+ ++A Y+ +
Sbjct: 194 ESSYPYTGVDG--SCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSS 251
Query: 296 YIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y G+ C + L+H VL+VGYGS + YWI+KNSWG WGE GY+++
Sbjct: 252 YSSGIYAANKCTTTNLNHAVLVVGYGSQ-------NGQNYWIVKNSWGSGWGEQGYFRLL 304
Query: 355 RGRNVCG 361
RG N CG
Sbjct: 305 RGSNECG 311
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 136/337 (40%), Positives = 190/337 (56%), Gaps = 33/337 (9%)
Query: 43 LSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSAT 102
++ E+T N+ A + + + K Y E + RF IFK NL+ H + P+ T
Sbjct: 27 VTATETTRNEA-EARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSI-PNRT 84
Query: 103 H--GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
+ G+T+F+DLT EFR YL + + R+P ++ + LP DWR KGAV PV
Sbjct: 85 YEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPV 144
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQGSCGSCW+FS GA+EG N + TG+L+SLSEQ+LVDCD S + GC GGL
Sbjct: 145 KDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDT--------SYNDGCGGGL 196
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANL 278
M+ AF++ ++ GG+ EEDYPY TD + C DK ++ + V ++++
Sbjct: 197 MDYAFKFIIENGGIDTEEDYPYIATDV-NVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKA 255
Query: 279 VKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
+ N P++VAI A Q Y GV C LDHGV+ VGYGS G + YWI
Sbjct: 256 LANQPISVAIEAGGRAFQLYTSGVFTG-TCGTSLDHGVVAVGYGSEG-------GQDYWI 307
Query: 337 IKNSWGESWGENGYYKICRGRNV------CGVDSMVS 367
++NSWG +WGE+GY+K+ RN+ CGV M S
Sbjct: 308 VRNSWGSNWGESGYFKL--ERNIKESSGKCGVAMMAS 342
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 185/323 (57%), Gaps = 26/323 (8%)
Query: 42 ILSHHES--TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQK 96
I+S+H++ T + + +++++ K K Y + E + RF IFK NL +H
Sbjct: 28 IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 87
Query: 97 LDPSATHGITQFSDLTPAEFRRTYLGLR--RKLRLPKDADQAPILPTNDLPADFDWREKG 154
+ + T G+ +F+DLT EFR YLG R K RLPK +D+ + LP DWR++G
Sbjct: 88 ENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRVGDSLPDSVDWRKEG 147
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV VKDQG CGSCW+FST A+EG N + TG L++LSEQ+LVDCD S + G
Sbjct: 148 AVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDT--------SYNEG 199
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
CNGGLM+ AFE+ + GG+ E+DYPY G D G + K+ S+ ++ V +++
Sbjct: 200 CNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDVPENDETA 258
Query: 275 AANLVKNGPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK 332
V N P++VAI Q Y GV C LDHGV VGYG+ K K
Sbjct: 259 LKKAVANQPVSVAIEGGGRNFQLYNSGVFTGE-CGTSLDHGVAAVGYGTE-------KGK 310
Query: 333 PYWIIKNSWGESWGENGYYKICR 355
YWI++NSWG+SWGE+GY ++ R
Sbjct: 311 DYWIVRNSWGKSWGESGYIRMER 333
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 183/314 (58%), Gaps = 24/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + +K ++AY S EE R+ FK N+ + + G+T+F+DLT E+++
Sbjct: 33 FIGWMRKHDRAY-SHEEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKH 91
Query: 120 YLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YLG++ ++ +A Q + P DWREKGAV VKDQG CGSCWSFSTTGA+E
Sbjct: 92 YLGIKVNVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAVE 151
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
GA+ + +G +VSLSEQ LVDC + + GC GGLM +AFEY + GG+ E
Sbjct: 152 GAHQIKSGNMVSLSEQNLVDCSGQYG-------NQGCEGGLMVNAFEYIIDNGGIATESS 204
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVYM--QT 295
YPYT CKF KS A++ + + +ED + A L K P++VAI+A +M Q
Sbjct: 205 YPYTAAQG--RCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQ-PVSVAIDASHMSFQL 261
Query: 296 YIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV P S LDHGVL VGYG+ L+ K Y+IIKNSWG +WG++GY +
Sbjct: 262 YSSGVYDEPACSSEALDHGVLAVGYGT-------LEGKDYYIIKNSWGPTWGQDGYIFMS 314
Query: 355 R-GRNVCGVDSMVS 367
R +N CGV +M S
Sbjct: 315 RNAQNQCGVATMAS 328
>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
Length = 327
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 185/318 (58%), Gaps = 26/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K+Y S ++ ++RF +FK NL R + Q ++ +A +G+TQFSDLT
Sbjct: 27 ARQLYEEFKLKYKKSY-SNDDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQ 85
Query: 115 EFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ YL R K +P D + P + + +FDWR GAVGPV DQG CGSCW+FS
Sbjct: 86 EFKVRYL--RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ L GGL
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
+ DYPY G R C+ SK+ + ++ DE A L + GPL+ A+NA+++
Sbjct: 195 QLDSDYPYEG--REGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252
Query: 294 QTYIGGV--SCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ P +C ++ L+H VL VGYG G RL PYW +KNSW +GENGY
Sbjct: 253 QFYTEGILHPLPALCDAQSLNHAVLTVGYGKEG----RL---PYWTVKNSWSTMFGENGY 305
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++++VST
Sbjct: 306 FRIYRGDGTCGINTLVST 323
>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
Length = 316
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 185/318 (58%), Gaps = 26/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K+Y S ++ ++RF +FK NL R + Q ++ +A +G+TQFSDLT
Sbjct: 16 ARQLYEEFKLKYKKSY-SNDDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQ 74
Query: 115 EFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ YL R K +P D + P + + +FDWR GAVGPV DQG CGSCW+FS
Sbjct: 75 EFKVRYL--RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 132
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ L GGL
Sbjct: 133 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 183
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
+ DYPY G R C+ SK+ + ++ DE A L + GPL+ A+NA+++
Sbjct: 184 QLDSDYPYEG--REGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 241
Query: 294 QTYIGGV--SCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ P +C ++ L+H VL VGYG G RL PYW +KNSW +GENGY
Sbjct: 242 QFYTEGILHPLPALCDAQSLNHAVLTVGYGKEG----RL---PYWTVKNSWSTMFGENGY 294
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++++VST
Sbjct: 295 FRIYRGDGTCGINTLVST 312
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 224 bits (570), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 177/324 (54%), Gaps = 28/324 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLR----RAARHQKLDPSATHGITQFSDL 111
A F+ FK ++ K Y S +R ++K N + R+++ + + + +D+
Sbjct: 19 AASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADM 78
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQA-PILPTND--LPADFDWREKGAVGPVKDQGSCGSC 168
P EF T+LG R LR + P D + + DWR+KGA+ PVKDQG CGSC
Sbjct: 79 HPREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSC 138
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS+TGALE FL G+ VSLSEQ L+DC ++GC GGLM AF+Y
Sbjct: 139 WAFSSTGALEAHTFLKKGRRVSLSEQNLIDCS-------LNYGNNGCEGGLMEQAFQYVR 191
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVA 287
G+ EE YPY G D C+F K+ + A+ A F ++ S DE + + GPL++A
Sbjct: 192 DNDGIDTEEAYPYEGED--SECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPLSIA 249
Query: 288 INAV--YMQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A Q Y GV P S +LDHGVLLVGYG K++ YW++KNSW E
Sbjct: 250 IDASNPSFQFYSEGVYYEPECSSAQLDHGVLLVGYGVE-------KDQKYWLVKNSWSEQ 302
Query: 345 WGENGYYKICRGR-NVCGVDSMVS 367
WGENGY K+ R + N CG+ + S
Sbjct: 303 WGENGYIKMARNKDNNCGIATQAS 326
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 185/323 (57%), Gaps = 26/323 (8%)
Query: 42 ILSHHES--TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQK 96
I+S+H++ T + + +++++ K K Y + E + RF IFK NL +H
Sbjct: 19 IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 78
Query: 97 LDPSATHGITQFSDLTPAEFRRTYLGLR--RKLRLPKDADQAPILPTNDLPADFDWREKG 154
+ + T G+ +F+DLT EFR YLG R K RLPK +D+ + LP DWR++G
Sbjct: 79 ENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRVGDSLPDSVDWRKEG 138
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV VKDQG CGSCW+FST A+EG N + TG L++LSEQ+LVDCD S + G
Sbjct: 139 AVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDT--------SYNEG 190
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
CNGGLM+ AFE+ + GG+ E+DYPY G D G + K+ S+ ++ V +++
Sbjct: 191 CNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDVPENDETA 249
Query: 275 AANLVKNGPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK 332
V N P++VAI Q Y GV C LDHGV VGYG+ K K
Sbjct: 250 LKKAVANQPVSVAIEGGGRNFQLYNSGVFTGE-CGTSLDHGVAAVGYGTE-------KGK 301
Query: 333 PYWIIKNSWGESWGENGYYKICR 355
YWI++NSWG+SWGE+GY ++ R
Sbjct: 302 DYWIVRNSWGKSWGESGYIRMER 324
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 144/377 (38%), Positives = 191/377 (50%), Gaps = 34/377 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M + L ++ + + SS + DD + + V+D L E++ +LG H
Sbjct: 1 MARTSFSLLIILIACVAGASSASTFDDENPIRTVVSDA----LREFETSILSVLGDSRHA 56
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y + EE RF IF NL+ H K S T G+ F+D T EFR
Sbjct: 57 LSFARFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFR 116
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R LG + + L LP DWR G V PVKDQG CGSCW+FSTTGAL
Sbjct: 117 RHRLGAAQNCSATTKGNHK--LTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGAL 174
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GC+GGL + AFEY GGL EE
Sbjct: 175 EAAYKQAFGKGISLSEQQLVDCAGAFN-------NFGCSGGLPSQAFEYVKYNGGLDTEE 227
Query: 238 DYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-M 293
YPYTG + CKF + V N ++ + DE + A V+ P++VA V
Sbjct: 228 AYPYTG--KNGECKFSSENVGVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFQVVNGF 283
Query: 294 QTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C R ++H VL VGYG PYW+IKNSWG WG++GY
Sbjct: 284 RLYKEGVYTSDTCGRTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDSGY 336
Query: 351 YKICRGRNVCGVDSMVS 367
+K+ G+N+CGV + S
Sbjct: 337 FKMEMGKNMCGVATCAS 353
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 177/318 (55%), Gaps = 23/318 (7%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEF 116
H+ FK + NK Y S E R IF+ N + H K + G+ F DLT E+
Sbjct: 79 QHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEY 138
Query: 117 RRTYLGLRRKLRLPKDADQ--APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
R YLG RR P A + D+P DWR++G V PVK+QG CGSCW+FS
Sbjct: 139 RERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAV 198
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G+LEG +F +TGKLVSLSEQ LVDC PE +SGCNGG M+ AFEY G+
Sbjct: 199 GSLEGQHFKSTGKLVSLSEQNLVDCS---TPE----GNSGCNGGWMDQAFEYVKDNHGID 251
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E+ YPY GTD +C F I A++ F V DE+ + + GP++VAI+A M
Sbjct: 252 TEDSYPYVGTD--GSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSM 309
Query: 294 --QTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GGV + P+ + LDHGVL+VGYG + + K +W++KNSWG WG GY
Sbjct: 310 LFQFYRGGVYNVPWCSTSELDHGVLVVGYGK------QFQGKDFWMVKNSWGVGWGIYGY 363
Query: 351 YKICRGR-NVCGVDSMVS 367
++ R + N CG+ S S
Sbjct: 364 IEMSRNKGNQCGIASKAS 381
>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 172/312 (55%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ +E R F+ NL HQ +P A GIT+F DL+ EF
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWREKGAV PVKDQG CGSCW+FS G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E +LAT L+SLSEQ+LV CD D GCNGGLM AF++ L + G +
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S I A + + +ED +AA L NGP+A+A++A
Sbjct: 209 YTGVSYPYV-SGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GGV SC ++L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 268 SAFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 318 GYVRVRKGTNEC 329
>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
Length = 327
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 185/318 (58%), Gaps = 26/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K+Y S ++ ++RF +FK NL R + Q ++ +A +G+TQFSDLT
Sbjct: 27 ARQLYEEFKLKYKKSY-SNDDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQ 85
Query: 115 EFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ YL R K +P D + P + + +FDWR GAVGPV DQG CGSCW+FS
Sbjct: 86 EFKVRYL--RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ L GGL
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFKQILGMGGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
+ DYPY G R C+ SK+ + ++ DE A L + GPL+ A+NA+++
Sbjct: 195 QLDSDYPYEG--REGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252
Query: 294 QTYIGGV--SCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ P +C ++ L+H VL VGYG G RL PYW +KNSW +GENGY
Sbjct: 253 QFYTEGILHPLPALCDAQSLNHAVLTVGYGKEG----RL---PYWTVKNSWSTMFGENGY 305
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++++VST
Sbjct: 306 FRIYRGDGTCGINTLVST 323
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 183/318 (57%), Gaps = 32/318 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAE 115
+ +K + ++Y + +E + R IF+ NLR +H + + G+T+F+DLT E
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 116 FRRTYLGLR-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+R TYLG+R R+ +++ ++DLP DWR+KGAV VKDQGSCGSCW+
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQGSCGSCWA 166
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST A+EG N + TG L+SLSEQ+LVDCD + GCNGGLM+ AFE+ +
Sbjct: 167 FSTIAAVEGINHIVTGDLISLSEQELVDCDT--------YYNQGCNGGLMDYAFEFIISN 218
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GG+ +EDYPYTG D G ++ K+ ++ ++ V +++++ V N P++VAI A
Sbjct: 219 GGIDTDEDYPYTGRD-GSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEA 277
Query: 291 --VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Q Y G+ Y C LDHGV +GYGS K YWI+KNSWG WGE+
Sbjct: 278 GGRAFQLYESGIFTGY-CGTELDHGVTAIGYGSE-------NGKYYWIVKNSWGSDWGES 329
Query: 349 GYYKICRGRNV----CGV 362
GY ++ R N CG+
Sbjct: 330 GYIRMERNINSATGKCGI 347
>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
Length = 328
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 179/316 (56%), Gaps = 33/316 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E+HF L+ ++NK Y EE+ HR IF N RR H + + T G+ QFSDLT AEF
Sbjct: 25 EYHFKLWMSQYNKVY-DMEEYYHRLQIFIENKRRIDYHNEGNHKFTMGLNQFSDLTFAEF 83
Query: 117 RRTYLGLRRKLRLPKD--ADQAPILPTN-DLPADFDWREKG-AVGPVKDQGSCGSCWSFS 172
R+++L L P++ A + + +N P DWR+KG V VK+QGSCGSCW+FS
Sbjct: 84 RKSFL-----LTEPQNCSATKGSHVSSNGPYPESVDWRKKGNYVTAVKNQGSCGSCWTFS 138
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG LE +ATGKL+ LSEQQLVDC + + GCNGGL + AFEY G
Sbjct: 139 TTGCLESVTAIATGKLLQLSEQQLVDCAQAFN-------NHGCNGGLPSQAFEYIKFNKG 191
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVA--IN 289
+M E+DYPYT D CKF AA V + ++ DE + + + P+++A +
Sbjct: 192 IMTEDDYPYTAHDD--TCKFKTDLAAAFVKDVVNITKYDEMGMVDAVARFNPVSLAYEVT 249
Query: 290 AVYMQTYIGGVSCPYICSRRLD---HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
+ +M Y GGV C D H VL VGYG K PYWI+KNSWG SWG
Sbjct: 250 SDFMH-YDGGVYTSKECHNTTDTVNHAVLAVGYGEE-------KGTPYWIVKNSWGSSWG 301
Query: 347 ENGYYKICRGRNVCGV 362
GY+ I RG+N+CG+
Sbjct: 302 MKGYFFIERGKNMCGL 317
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 186/360 (51%), Gaps = 23/360 (6%)
Query: 14 VVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYAS 73
+ S S TLI I T I+ + + F + K +KAY S
Sbjct: 1 MALSTFSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRS 60
Query: 74 QEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDA 133
EE HRF IF NL+ K S G+ +F+DL+ EF+ YLGLR + + +
Sbjct: 61 IEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSS 120
Query: 134 DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
DLP DWR KGAV PVK+QGSCGSCW+FST A+EG N + TG L SLSE
Sbjct: 121 RGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
Q+L+DCD S ++GC GGLM+ AF+Y + GL +EEDYPY + G +
Sbjct: 181 QELIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYL-MEEGRCIREK 231
Query: 254 KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSCPYICSRRLD 311
+ +++ + V +++Q + + P++VAI A Q Y GG+ C ++D
Sbjct: 232 EQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGR-CGTQMD 290
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCGVDSMVS 367
HGV VGYGS+ + Y I+KNSWG WGENGY ++ R +CG++ M S
Sbjct: 291 HGVTAVGYGSS-------EGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMAS 343
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 132/358 (36%), Positives = 194/358 (54%), Gaps = 41/358 (11%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKK 65
+++F++ + F+A + + DV+++ + V DE A F F +
Sbjct: 1 MIVFVLCAISFTAAAPQNDVSDVEKVRKPVFYSMDE--------------APILFENFIR 46
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
++NK Y S+E+ + RF IF NL+R +A HGI +F+DL+ EF++ Y G +
Sbjct: 47 EYNKKYDSKEKEE-RFKIFVNNLKRINDLNHKSTNAVHGINKFTDLSKEEFKKFYTGFKP 105
Query: 126 KLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
D + P + ++ P FDWR+KG V VK+QG+CGSCW+FST G +E N +
Sbjct: 106 DKSFLDDNIKKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAI 165
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
G LV LSEQQLVDCD S D C+ GL ++A +Y L + G + E+ YPY G
Sbjct: 166 KHGNLVELSEQQLVDCD---------SKDEACDSGLPDNAQQY-LVSHGAISEQSYPYKG 215
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV--- 300
C +D S++ ++NF V L E Q+A L PL++ I A + TY G+
Sbjct: 216 --YAANCTYDSSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYTKGILVN 273
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
C S+ L+H VLLVGYG+ G +WI+KNSWG +WGE GY++I RG N
Sbjct: 274 ECEQ--SQDLNHAVLLVGYGNEG-------GTNFWILKNSWGTNWGEGGYFRIKRGVN 322
>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 357
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 145/360 (40%), Positives = 189/360 (52%), Gaps = 34/360 (9%)
Query: 11 VSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKF 67
+ L++F+A +S + D I+ V+D E+ E T +LG H FS F ++
Sbjct: 11 ILLILFAAAASKEIGFDESNPIKMVSDNLHEL----EDTVVQILGQSRHVLSFSRFTHRY 66
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K Y S EE RF++FK NL K S + QF+DLT EF+R LG +
Sbjct: 67 GKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNC 126
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
T +P DWRE G V PVK+QG CGSCW+FSTTGALE A A GK
Sbjct: 127 SATLKGSHKITEAT--VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
+SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG D G
Sbjct: 185 GISLSEQQLVDCAGTFN-------NFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 248 HACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCP 303
CKF I V N ++ + DE + A LV+ P++VA V+ + Y GV
Sbjct: 238 --CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGVFTS 293
Query: 304 YICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
C ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ G+N+C
Sbjct: 294 NTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC 346
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 174/312 (55%), Gaps = 25/312 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
FK KF ++Y +EE R +F N++ + T G+ QF+DLT EF +TY+G
Sbjct: 22 FKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKTYMG 81
Query: 123 LRRKLRLPKDADQAP--ILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
++ + DA + LP DW +GAV PVK+QG CGSCWSFSTTG+LEGA
Sbjct: 82 FKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGSLEGA 141
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N ++TGKLVSLSEQQ VDC + GCNGGLM+SAF+Y +A L E+ YP
Sbjct: 142 NEISTGKLVSLSEQQFVDCAGTYG-------NQGCNGGLMDSAFKYA-EANALCTEQSYP 193
Query: 241 YTGTD---RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
Y GTD + +C +K SV+ + VS D +Q + V P+++AI A Q
Sbjct: 194 YKGTDGSCQASSCSTGLAK--GSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQL 251
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GGV C LDHGVL VGYG+ L YW +KNSWG +WG +GY + R
Sbjct: 252 YSGGV-LTGACGASLDHGVLAVGYGT-------LSGTDYWKVKNSWGSTWGMSGYVLLQR 303
Query: 356 GRNVCGVDSMVS 367
G+ G ++S
Sbjct: 304 GKGGSGECGLLS 315
>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
Length = 377
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 145/360 (40%), Positives = 189/360 (52%), Gaps = 34/360 (9%)
Query: 11 VSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKF 67
+ L++F+A +S + D I+ V+D E+ E T +LG H FS F ++
Sbjct: 11 ILLILFAAAASKEIGFDESNPIKMVSDNLHEL----EDTVVQILGQSRHVLSFSRFTHRY 66
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K Y S EE RF++FK NL K S + QF+DLT EF+R LG +
Sbjct: 67 GKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNC 126
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
T +P DWRE G V PVK+QG CGSCW+FSTTGALE A A GK
Sbjct: 127 SATLKGSHKITEAT--VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGK 184
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
+SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG D G
Sbjct: 185 GISLSEQQLVDCAGTFN-------NFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237
Query: 248 HACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCP 303
CKF I V N ++ + DE + A LV+ P++VA V+ + Y GV
Sbjct: 238 --CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR--PVSVAFEVVHEFRFYKKGVFTS 293
Query: 304 YICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
C ++H VL VGYG + PYW+IKNSWG WG+NGY+K+ G+N+C
Sbjct: 294 NTCGNTPMDVNHAVLAVGYGVE-------DDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC 346
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 181/345 (52%), Gaps = 34/345 (9%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTI 83
D IR V+DG E+ E + + +LG H F+ F ++ K Y + EE RF+I
Sbjct: 27 DESNPIRMVSDGLREV----EESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSI 82
Query: 84 FKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
FK NL K S G+ QF+DLT EF+RT LG + +
Sbjct: 83 FKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHK--VTEAA 140
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP DWRE G V PVKDQG CGSCW+FSTTGALE A A GK +SLSEQQLVDC
Sbjct: 141 LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAF 200
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
+ + GCNGGL + AFEY GGL E+ YPYTG D CKF + V N
Sbjct: 201 N-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDE--TCKFSAENVGVQVLN 251
Query: 264 FSVVSL---DEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLL 316
++L DE + A LV+ P+++A ++ + Y GV C ++H VL
Sbjct: 252 SVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLA 309
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
VGYG PYW+IKNSWG WG+ GY+K+ G+N+CG
Sbjct: 310 VGYGVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMCG 347
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 196/350 (56%), Gaps = 38/350 (10%)
Query: 34 QVTDGGDEILS------HHESTNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIF 84
+VTD +E LS + E D + S+FK+ +N+ Y S+EE + R ++F
Sbjct: 64 KVTDDKNETLSSVLPLLNKEPLPQDF--SVRMVSIFKEFVTTYNRTYESKEEAEWRMSVF 121
Query: 85 KANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
N+ RA + Q LD +A +GIT+FSDLT EFR YL + K D A + +
Sbjct: 122 SNNVMRAQKIQALDRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMDLAKSI-GDS 180
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
P ++DWR KGAV VKDQG CGSCW+FS TG +EG FL G L+SLSEQ+L+DCD
Sbjct: 181 APPEWDWRNKGAVTQVKDQGMCGSCWAFSVTGNVEGQWFLKRGALLSLSEQELLDCDK-- 238
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHA--CKFDKSKIAASV 261
D C GGL ++A+ GGL E+DY Y RGH C F K +
Sbjct: 239 -------VDKACLGGLPSNAYSAIKTLGGLETEDDYSY----RGHVQTCSFSSKKARVYI 287
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRRL-DHGVLLVG 318
+ +S +E ++ A L +NGP++VAINA MQ Y G+S P +CS L DH VLLVG
Sbjct: 288 NDSVELSQNEQKLVAWLAQNGPISVAINAFGMQFYRRGISHPLRPLCSPWLIDHAVLLVG 347
Query: 319 YGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YG+ P+W IKNSWG WGE GYY + RG CGV++M S+
Sbjct: 348 YGNRSGI-------PFWAIKNSWGTDWGEEGYYYLHRGSGACGVNTMASS 390
>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
Length = 597
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 148/378 (39%), Positives = 205/378 (54%), Gaps = 43/378 (11%)
Query: 9 FLVSLVVFSAVSSGTLID-DVDQLIRQVTDGGDE-------ILSHHESTNNDLLGAEHHF 60
L S V + L+ D ++ +VTD G+E +L+ + + + F
Sbjct: 241 LLCSFEVLDELGKHMLLRRDCGPVVTKVTDDGNEALNSGLPLLTKDPLSQDFSVKMASIF 300
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRT 119
F +N+ Y ++EE R ++F +N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 301 KNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTKFSDLTEEEFRTI 360
Query: 120 YLG--LR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
YL LR +K+ L K + P ++DWR+ GAV VKDQG CGSCW+FS
Sbjct: 361 YLNPLLREVPGKKMHLAKSIG-------DPAPPEWDWRKNGAVTKVKDQGMCGSCWAFSV 413
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL
Sbjct: 414 TGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSAIKNLGGL 464
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E+DY Y G AC F K + + +S +E ++AA L K GP++VAINA M
Sbjct: 465 ETEDDYSYQG--HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGM 522
Query: 294 QTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G++ P +CS L DH VL+VGYG+ E P+W IKNSWG WGE GY
Sbjct: 523 QFYRHGIAHPLRPLCSPWLIDHAVLIVGYGNRS-------EVPFWAIKNSWGTDWGEKGY 575
Query: 351 YKICRGRNVCGVDSMVST 368
Y + RG CGV++M S+
Sbjct: 576 YYLHRGSGSCGVNTMASS 593
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 147/378 (38%), Positives = 196/378 (51%), Gaps = 35/378 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVD-QLIRQVTDGGDEILSHHESTNNDLLGAEHH 59
M S +L V+L++ AVS+ I + IR V D E+ E + +LG H
Sbjct: 1 MMSVRTILPSVALLILIAVSTAESIGFYESNPIRMVFDRLLEV----EESVVQILGQTRH 56
Query: 60 ---FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
F+ F ++ K Y + EE RF+IFK NL K S G+ QF+D+T EF
Sbjct: 57 VLSFARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQEF 116
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+RT LG + L LP DWRE G V PVKDQG CGSCW+FSTTGA
Sbjct: 117 QRTKLGAAQNCSATLKGTHK--LTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGA 174
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LE A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL E
Sbjct: 175 LEAAYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTE 227
Query: 237 EDYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY- 292
E YPYTG D CK+ + V N ++ + DE + A L++ P+++A ++
Sbjct: 228 EAYPYTGED--GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLR--PVSIAFEVIHS 283
Query: 293 MQTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+ Y GV C + ++H VL VGYG PYW+IKNSWG WG+ G
Sbjct: 284 FRLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIE-------DGVPYWLIKNSWGADWGDKG 336
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+K+ G+N+CG+ + S
Sbjct: 337 YFKMEMGKNMCGIATCAS 354
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 175/318 (55%), Gaps = 25/318 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFR 117
F +K+ F K+Y+ E +R +++AN H S T G+ F+DLT EF+
Sbjct: 29 EFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFK 88
Query: 118 RTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
R YLG + L P+ + +PT + LP DWR G V PVKDQG CGSCWSFSTT
Sbjct: 89 RFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G++EG + TG+LVSLSEQ LVDC + GCNGGLM+ AF+Y + G+
Sbjct: 149 GSVEGQHARKTGQLVSLSEQNLVDCSK-------AQGNQGCNGGLMDDAFQYIITNKGID 201
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAV-- 291
E YPYT D CKF+ + + A++++F ++ + N V GP++VAI+A
Sbjct: 202 TEASYPYTAKD--GTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKN 259
Query: 292 YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV CS LDHGVL GYG++ PYW++KNSWG SWG+ GY
Sbjct: 260 SFQLYTSGVYNEKKCSSTSLDHGVLAAGYGTS-------NGTPYWLVKNSWGSSWGQAGY 312
Query: 351 YKICR-GRNVCGVDSMVS 367
+ R N CG+ + S
Sbjct: 313 IWMSRNANNQCGIATSAS 330
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 179/322 (55%), Gaps = 29/322 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLT 112
E + FK +K+Y EE RF IF+ N+ R +H KL S G+ QF+DL
Sbjct: 76 EQAWKEFKILHDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLE 135
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
AEF + GL K+ + + L N++ P DWR KG V VK+QG+CGSCW+
Sbjct: 136 YAEFV-NFNGL--KMTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWA 192
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TG+LEG F GKLV LSE QLVDC E GCNGG M +AF+Y
Sbjct: 193 FSATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNE-------GCNGGFMENAFKYVKSV 245
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E DYPY R C FDK+K+ A+V+ V S E + + + GP++VAI+
Sbjct: 246 GGIESESDYPYKARQR--TCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAID 303
Query: 290 AVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y GGV +CS RL+HGVL VGYG++ L+ K YWI+KNSWG WG
Sbjct: 304 AGHSSFQLYAGGVYDEPLCSTSRLNHGVLCVGYGTS------LQGKDYWIVKNSWGVRWG 357
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY K+ R + N CG+ S S
Sbjct: 358 VEGYIKMSRNKNNQCGIASEAS 379
>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 479
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 172/311 (55%), Gaps = 24/311 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPA 114
A HF FKK+ K++ + HRF FK N++ A +P A + ++ +F+ LTP
Sbjct: 38 ASAHFMHFKKQHGKSFGEEAVEGHRFNAFKENMQTAVYLNAQNPHAHYDVSGKFAALTPQ 97
Query: 115 EFRRTYLG---LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF + YL R+L+ K+ L A DWREKGAV VKDQG CGSCW+F
Sbjct: 98 EFAKQYLNPDYYTRQLKAHKERAHVYEGVRGGLSA-VDWREKGAVTEVKDQGLCGSCWAF 156
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK-- 229
S G +EG L+ LVSLSEQ LV CD + D GCNGGLM+ A+ + +K
Sbjct: 157 SAIGNIEGQWALSGNTLVSLSEQMLVSCD---------TVDMGCNGGLMDQAWAWIIKNH 207
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
+G + E YPYT D A K+ A ++ + DED I A L KNGP+++A++
Sbjct: 208 SGAVYTEVSYPYTSGDGSTASCLSTGKVGARISGQVSLPQDEDAIEAWLEKNGPISIAVD 267
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
A Q Y GGV + L+HGVLLVGY ++ PYWI+KNSWG SWGE+G
Sbjct: 268 ATTWQLYFGGVVSNCF-AYNLNHGVLLVGYNNSA-------NPPYWIVKNSWGTSWGEHG 319
Query: 350 YYKICRGRNVC 360
Y ++ +G N C
Sbjct: 320 YIRLAKGSNQC 330
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 175/323 (54%), Gaps = 28/323 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR------AARHQKLDPSATHGITQFSD 110
E F F+ K+NK Y S E + FK +L+R A+ K+D G+ +F+D
Sbjct: 27 ETQFREFQLKYNKHYESHE-FAQKLATFKNSLKRIQELNDMAKRAKVDTE--FGVNKFAD 83
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILP---TNDLPADFDWREKGAVGPVKDQGSCGS 167
L+ EF YL + + AP ++LP FDWR +GAV PVKDQG CGS
Sbjct: 84 LSKEEFANYYLN-KGGMESTDSETYAPDYSDKEISNLPTSFDWRTQGAVTPVKDQGQCGS 142
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CWSFSTTG +EG FLA L LSEQ LVDC + D GCNGGLM A++Y
Sbjct: 143 CWSFSTTGNVEGQWFLAGNDLTGLSEQNLVDCSTKND---------GCNGGLMPLAYDYI 193
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
++ G+ E YPY + + C+F+ + I A + + VS +E Q+ NLV NGPL++A
Sbjct: 194 VENNGIDTEASYPYLAIQQKN-CQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLSIA 252
Query: 288 INAVYMQTYIGGVSCPY--ICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+A Q Y G+ IC + LDHG+L+VGYG + +WIIKNSW W
Sbjct: 253 ADAAEWQYYKKGIFSGIFGICGKNLDHGILIVGYGQ---QTTEFGTELFWIIKNSWSTDW 309
Query: 346 GENGYYKICRGRNVCGVDSMVST 368
G +G+ I RG CG++ V++
Sbjct: 310 GLSGFMLIKRGTGECGINLAVTS 332
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 129/306 (42%), Positives = 174/306 (56%), Gaps = 25/306 (8%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR 128
KAY S EE HRF +FK NL+ + K S G+ +F+DL+ EF+ +LGL +
Sbjct: 56 KAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFP 115
Query: 129 LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKL 188
K ++ DLP DWR+KGAV PVK+QGSCGSCW+FST A+EG N + G L
Sbjct: 116 RKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNL 175
Query: 189 VSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGH 248
SLSEQQL+DCD S ++GCNGGLM+ AFE+ + GGL +EEDYPY + G
Sbjct: 176 TSLSEQQLIDCD--------TSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYL-MEEGT 226
Query: 249 ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGV-SCPYI 305
+ + +++ + V +++Q + + PL+VAI+A Q Y GGV S P
Sbjct: 227 CDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGP-- 284
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCG 361
C LDHGV VGYGS+ Y I+KNSWG WGE GY ++ R +CG
Sbjct: 285 CGTDLDHGVAAVGYGSSSGI-------DYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCG 337
Query: 362 VDSMVS 367
++ M S
Sbjct: 338 INKMAS 343
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 172/315 (54%), Gaps = 27/315 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ F +F K+Y S+EE R+ IF NLR H K T + F+D T EF+R
Sbjct: 55 FARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNKKRLPYTLSVNHFADWTWEEFKRH 114
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
LG + + + L LP DWR++G V VKDQGSCGSCW+FSTTGALE
Sbjct: 115 RLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSSVKDQGSCGSCWTFSTTGALEA 172
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
A A GK +SLSEQQLVDC + + GC+GGL + AFEY GGL EE Y
Sbjct: 173 AYAQAFGKSISLSEQQLVDCAGPFN-------NFGCHGGLPSQAFEYIKYNGGLETEEAY 225
Query: 240 PYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQT 295
PYTG D CKF +A V N ++ + DE + A V+ P++VA V
Sbjct: 226 PYTGKD--GVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFQVVNGFHF 281
Query: 296 YIGGVSCPYIC---SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GV C S+ ++H VL VGYG PYW+IKNSWGESWGENGY+K
Sbjct: 282 YENGVFTSDTCGSTSQDVNHAVLAVGYGVENGV-------PYWLIKNSWGESWGENGYFK 334
Query: 353 ICRGRNVCGVDSMVS 367
+ G+N+CGV + S
Sbjct: 335 MELGKNMCGVATCAS 349
>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
gambiense DAL972]
Length = 404
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 174/313 (55%), Gaps = 35/313 (11%)
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR------TY 120
+ K Y +E RF F+ N+ +A +P AT G+T FSD+T EFR +Y
Sbjct: 2 YGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASY 61
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
+K RL K + + T PA DWREKGAV P+KDQG CGSCW+F + G +EG
Sbjct: 62 FAAAQK-RLRKTVN----VTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEGQ 116
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMREED 238
+A LVSLSEQ LV CD + D GC GGLM++AF + + + G + E
Sbjct: 117 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 167
Query: 239 YPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
YPY +G C+ + +I A++ + + DED IAA L +NGPLA+A++A Y
Sbjct: 168 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYN 227
Query: 298 GGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GG+ SC S +LDHGVLLVGY PYWIIKNSW WGE+GY +I +
Sbjct: 228 GGILTSCT---SEQLDHGVLLVGYNDN-------SNPPYWIIKNSWSNMWGEDGYIRIEK 277
Query: 356 GRNVCGVDSMVST 368
G N C ++ VS+
Sbjct: 278 GTNQCLMNQAVSS 290
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 180/344 (52%), Gaps = 34/344 (9%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTI 83
D IR V+DG E+ E + + +LG H F+ F ++ K Y + EE RF+I
Sbjct: 27 DESNPIRMVSDGLREV----EESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSI 82
Query: 84 FKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
FK NL K S G+ QF+DLT EF+RT LG + +
Sbjct: 83 FKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKGSHK--VTEAA 140
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP DWRE G V PVKDQG CGSCW+FSTTGALE A A GK +SLSEQQLVDC
Sbjct: 141 LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAF 200
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
+ + GCNGGL + AFEY GGL E+ YPYTG D CKF + V N
Sbjct: 201 N-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDE--TCKFSAENVGVQVLN 251
Query: 264 FSVVSL---DEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLL 316
++L DE + A LV+ P+++A ++ + Y GV C ++H VL
Sbjct: 252 SVNITLGAEDELKHAVGLVR--PVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLA 309
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
VGYG PYW+IKNSWG WG+ GY+K+ G+N+C
Sbjct: 310 VGYGVEDGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMC 346
>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
Length = 462
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 184/320 (57%), Gaps = 37/320 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
Query: 119 TYLG--LRR----KLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSF 171
YL L++ K+ L K NDL P ++DWR+KGAV VKDQG CGSCW+F
Sbjct: 225 IYLNPLLQKESGGKMSLAKS--------INDLAPPEWDWRKKGAVTEVKDQGMCGSCWAF 276
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ G
Sbjct: 277 SVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYTAIKNLG 327
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E+DY Y G AC F + + +S DE++IAA L + GP++VAINA
Sbjct: 328 GLETEDDYGYQG--HVQACNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINAF 385
Query: 292 YMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE
Sbjct: 386 GMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGRDWGEE 438
Query: 349 GYYKICRGRNVCGVDSMVST 368
GYY + RG CGV++M S+
Sbjct: 439 GYYYLYRGSGACGVNTMASS 458
>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
Length = 491
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 185/321 (57%), Gaps = 37/321 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R +IF N+ RA + Q LD +A +GIT+FSDLT EFR
Sbjct: 194 FKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDLTEEEFRT 253
Query: 119 TYLG--LR----RKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSF 171
YL LR +K+R+ K P D P ++DWR KGAV VK+QG CGSCW+F
Sbjct: 254 IYLNPLLREDPGKKMRVAK--------PVGDPAPPEWDWRNKGAVTNVKNQGMCGSCWAF 305
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ G
Sbjct: 306 SVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------MDKACLGGLPSNAYSAIKNLG 356
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL EEDY Y G + AC F K + + +S +E ++AA L K GP++VAINA
Sbjct: 357 GLETEEDYSYQG--QMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKGPISVAINAF 414
Query: 292 YMQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y G+S P +C+ L DH VL+VGYG+ + P+W IKNSWG WGE
Sbjct: 415 GMQFYRHGISRPLRPLCTPWLIDHAVLIVGYGNR-------SDIPFWAIKNSWGTDWGEQ 467
Query: 349 GYYKICRGRNVCGVDSMVSTV 369
GYY + RG CGV++M S+
Sbjct: 468 GYYYLHRGSGACGVNTMASSA 488
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 182/320 (56%), Gaps = 30/320 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
+ ++ K+ KAY + E + RF IFK NL+ +H + +PS G+ +F+DL+ E+R
Sbjct: 49 YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108
Query: 119 TYLGLR-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
YLG R R L PK A + +DLP DWREKGAV PVKDQG CGSCW+FST
Sbjct: 109 AYLGTRMDGKRRLLGGPKSA-RYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFST 167
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
GA+EG N + TG L SLSEQ+LVDCD + GCNGGLM+ AFE+ +K GG+
Sbjct: 168 VGAVEGINQIVTGNLTSLSEQELVDCDK--------VYNQGCNGGLMDYAFEFIMKNGGI 219
Query: 234 MREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA-- 290
EEDYPY D C + K+ ++ + V ++++ V N P++VAI A
Sbjct: 220 DTEEDYPYKAVD--SMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGG 277
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV C +LDHGV+ VGYG+ YW+++NSWG +WGENGY
Sbjct: 278 RAFQLYQSGVFTG-SCGTQLDHGVVAVGYGTENGV-------DYWVVRNSWGPAWGENGY 329
Query: 351 YKICRGRNVCGVDSMVSTVA 370
++ RNV ++ +A
Sbjct: 330 IRM--ERNVASTETGKCGIA 347
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 181/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 82 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 142 IYLNPLLRKEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 250
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + V+S +E ++AA L K GP++VAINA MQ Y
Sbjct: 251 DYSYQG--HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAINAFGMQFYR 308
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 309 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 361
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 362 RGSGACGVNTMASS 375
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 177/316 (56%), Gaps = 27/316 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ +K K+NK Y+SQEE R ++ +NL+ T + +F+DL P EF
Sbjct: 19 WEQWKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSH 78
Query: 120 YLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y GLRR+ P + P D LP DWR KG V VK+QG CGSCW+FS TG+
Sbjct: 79 YNGLRRR---PHTSSGEPCTLGEDVSALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGS 135
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG +F ATGKLVSLSEQ LVDC + GCNGGL + AF+Y +K GG+ E
Sbjct: 136 LEGQHFNATGKLVSLSEQNLVDC-------SSAEGNEGCNGGLPDDAFKYVIKNGGIDTE 188
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM-- 293
YPY D C + + I ++ +++ + S E Q+ GP+ V I+A ++
Sbjct: 189 ASYPYVARDE--KCHYSSANIGSTCSSYVDIESKSEAQLQVASATVGPIPVGIDASHLGF 246
Query: 294 QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV +CS+ RLDHGVL+VGYG KEK YW++KNSWG +WG +G
Sbjct: 247 QLYDGGVYHSDLCSQTRLDHGVLVVGYGV-------YKEKDYWMVKNSWGTNWGISGDMM 299
Query: 353 ICRGR-NVCGVDSMVS 367
+ R R N CG+ +M S
Sbjct: 300 MSRNRDNNCGIATMAS 315
>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 524
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 179/323 (55%), Gaps = 27/323 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK ++ RA +P AT G+TQFSD++P EF
Sbjct: 117 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 176
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TYL G + K + + T P DWR+KGAV PVKDQGSCGSCW+F+ G
Sbjct: 177 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIG 236
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + + C GG + AF++ + + G +
Sbjct: 237 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCGGGFADRAFKWIVSSNKGNV 287
Query: 234 MREEDYPYTGTDRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E YPY D G+ +KS + A ++ + DE+ IA L +NGP+A+A++A
Sbjct: 288 FTERSYPYASID-GYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAS 346
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GGV SC S+ ++H VLLVGY + PYWIIKNSW + WGE G
Sbjct: 347 TFLDYKGGVLTSC---SSKHVNHEVLLVGYNDTS-------KPPYWIIKNSWDKEWGEEG 396
Query: 350 YYKICRGRNVCGVDSMVSTVAAA 372
Y +I +G N+C + +V +
Sbjct: 397 YIRIEKGTNLCLMKEYARSVVVS 419
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 192/353 (54%), Gaps = 30/353 (8%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGD-EILSHHESTNNDLLGAEHHFSLFK 64
++L L + V A+ + D + I V+ D E+ +E+ EH K
Sbjct: 9 MILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAEVERIYEA-----WMVEHG----K 59
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
KK N+ E+ D RF IFK NLR H + S G+T+F+DLT E+R YLG +
Sbjct: 60 KKMNQNGLGAEK-DQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRSMYLGAK 118
Query: 125 RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
R+ K +D+ + LP DWR++GAV VKDQGSCGSCW+FST GA+EG N +
Sbjct: 119 PVKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIV 178
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ E DYPY
Sbjct: 179 TGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAA 230
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSC 302
D G + K+ ++ ++ V + + + + P++VAI A Q Y GV
Sbjct: 231 D-GRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF- 288
Query: 303 PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
IC LDHGV+ VGYG+ K YWI++NSWG WGE+GY K+ R
Sbjct: 289 DGICGTELDHGVVAVGYGTE-------NGKDYWIVRNSWGNRWGESGYIKMAR 334
>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
Length = 460
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 181/321 (56%), Gaps = 39/321 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y SQEE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 119 TYLG--LR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL L+ R +RL + P P +DWR KGAV VKDQG CGSCW+FS
Sbjct: 223 IYLNPLLKDAPGRNMRLAQPVTDVP-------PPQWDWRNKGAVTDVKDQGMCGSCWAFS 275
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GG
Sbjct: 276 VTGNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLPSNAYSAIRTLGG 326
Query: 233 LMREEDYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
L E+DY Y RGH C F K + + +S +E ++AA L K GP++VAINA
Sbjct: 327 LETEDDYSY----RGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVAINA 382
Query: 291 VYMQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
MQ Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG +WGE
Sbjct: 383 FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIKNSWGTNWGE 435
Query: 348 NGYYKICRGRNVCGVDSMVST 368
GYY + RG CGV+ M S+
Sbjct: 436 EGYYYLHRGSGACGVNIMASS 456
>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 441
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 172/312 (55%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ +E R F+ NL HQ +P A GIT+F DL+ EF
Sbjct: 38 FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWREKGAV PVKDQG CGSCW+FS G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E +LAT L+SLSEQ+LV CD D GCNGGLM AF++ L + G +
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S I A + + +ED +AA L NGP+A+A++A
Sbjct: 209 YTGVSYPYV-SGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GGV SC ++L+HGVLLVGY G E PYW+IKNSWG++WGE
Sbjct: 268 SAFMSYTGGVLTSCD---GKQLNHGVLLVGYNMTG-------EVPYWLIKNSWGKNWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 318 GYVRVRKGTNEC 329
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 185/325 (56%), Gaps = 33/325 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F FK + YAS +E RF IF N+++AA + +P AT G +F+D+T EF
Sbjct: 22 EVLFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEF 81
Query: 117 RRTY----LGLRRKLRLPKDADQAPILPTNDLPA----DFDWREKGAVGPVKDQGSCGSC 168
+ + K R PK+ ++ A DWR KGAV PVK+QG+CGSC
Sbjct: 82 QTRHNAARHYAAAKARPPKNTK---TFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSC 138
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
WSFSTTG +EG + +ATG+LV++SEQ+LV CD D GCNGGLM++AF + +
Sbjct: 139 WSFSTTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLI 189
Query: 229 KA--GGLMREEDYPY-TGTDRGHACKF--DKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
A G + E +YPY +G AC + + A+++ F ++ E+ +AA + K+GP
Sbjct: 190 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGP 249
Query: 284 LAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
L++ ++A Q+Y GG+ Y ++DHGVL+VG+ PYWIIKNSW
Sbjct: 250 LSIGVDASTWQSYAGGIM-SYCPQDQIDHGVLIVGFDDTA-------STPYWIIKNSWTA 301
Query: 344 SWGENGYYKICRGRNVCGVDSMVST 368
+WGE GY ++ +G N CG+ S S+
Sbjct: 302 NWGEEGYIRVAKGSNQCGLTSHPSS 326
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 178/311 (57%), Gaps = 25/311 (8%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
+ L+K K+ K Y S E + R I+ N H +D S + +F+DLT EF
Sbjct: 27 EEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTAEEFS 86
Query: 118 RTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y G K R ++ + I +P DWR KG V PVK+Q CGSCW+FSTTG
Sbjct: 87 SIYNGYG-KGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTTG 145
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+LEGA+ TGKLVSLSEQ LVDCD + D GC GGLM +AF+Y + G+
Sbjct: 146 SLEGAHAKKTGKLVSLSEQNLVDCDKK---------DHGCQGGLMTTAFKYIEENKGIDT 196
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVA-NFSVVSLDEDQIAANLVKNGPLAVAINAVY-- 292
EE YPY + C+F K I A+V + S+++ D + + + + GP++VA++A +
Sbjct: 197 EESYPYKA--KNGRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSS 254
Query: 293 MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y G+ P IC SR+LDHGVL+VGYG + + YW++KNSWG++WG GY+
Sbjct: 255 FQLYKSGIYDPKICSSRKLDHGVLVVGYG-------KEDGEEYWLVKNSWGKNWGMEGYF 307
Query: 352 KICRGRNVCGV 362
KI +N+CG+
Sbjct: 308 KIASKKNLCGI 318
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 184/322 (57%), Gaps = 33/322 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK + YAS +E RF IF N+++AA + +P AT G +F+D+T EF+
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 69
Query: 120 Y----LGLRRKLRLPKDADQAPILPTNDLPA----DFDWREKGAVGPVKDQGSCGSCWSF 171
+ K R PK+ ++ A DWR KGAV PVK+QG+CGSCWSF
Sbjct: 70 HNAARHYAAAKARPPKNTK---TFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSF 126
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA- 230
STTG +EG + +ATG+LV++SEQ+LV CD D GCNGGLM++AF + + A
Sbjct: 127 STTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLISAH 177
Query: 231 -GGLMREEDYPY-TGTDRGHACKF--DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
G + E +YPY +G AC + + A+++ F ++ E+ +AA + K+GPL++
Sbjct: 178 KGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSI 237
Query: 287 AINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
++A Q+Y GG+ Y ++DHGVL+VG+ PYWIIKNSW +WG
Sbjct: 238 GVDASTWQSYAGGIMS-YCPQDQIDHGVLIVGFDDTA-------STPYWIIKNSWTANWG 289
Query: 347 ENGYYKICRGRNVCGVDSMVST 368
E GY ++ +G N CG+ S S+
Sbjct: 290 EEGYIRVAKGSNQCGLTSHPSS 311
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 146/374 (39%), Positives = 194/374 (51%), Gaps = 39/374 (10%)
Query: 9 FLVSLVV----FSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FS 61
L++LVV F++ +G + IRQV G L E+ ++G H F+
Sbjct: 6 LLLALVVAGGLFASALAGPATFADENPIRQVVSDG---LHELENAILQVVGKTRHALSFA 62
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
F ++ K Y S EE RF +F NL+ H K S G+ +F+DLT EFRR L
Sbjct: 63 RFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRL 122
Query: 122 GLRRKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
G + + + TN LP DWRE G V PVK+QG CGSCW+FSTTGALE A
Sbjct: 123 GAAQNCSATTKGN---LKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAA 179
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE YP
Sbjct: 180 YSQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYP 232
Query: 241 YTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTY 296
YTG + CKF + V N ++ + DE + A LV+ P+++A + + Y
Sbjct: 233 YTG--KNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR--PVSIAFEVIKGFKQY 288
Query: 297 IGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
GV C ++H VL VGYG PYW+IKNSWG WG+NGY+K+
Sbjct: 289 KSGVYTSTECGNTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDNGYFKM 341
Query: 354 CRGRNVCGVDSMVS 367
G+N+CG+ + S
Sbjct: 342 EMGKNMCGIATCAS 355
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 174/309 (56%), Gaps = 30/309 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTPAE 115
F FK K +K+Y++Q E R IF NLR H L S + QF+DLT E
Sbjct: 25 FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTDLTIDE 84
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
F+ YL L K L + P + T +P DWR +G V VKDQG CGSCW+FS
Sbjct: 85 FK-AYLTLHSKPTL----NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGSCWAFSVV 139
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G+ EGA + +TGKLVSLSEQQL+DC + + GC+GG + F Y ++ GL+
Sbjct: 140 GSTEGAYYKSTGKLVSLSEQQLIDC--------TTNVNDGCDGGYLEETFPY-VQQTGLV 190
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPYTG D C+ +S + V+ + ++ + D + A + GP++VA++A Y+
Sbjct: 191 SESSYPYTGRDGN--CRISESDVVTKVSKYVLLGGEADLLEA-VGSVGPVSVAMDATYIY 247
Query: 295 TYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
+Y GV +CS L+HGVL+VGYG+ K YW+IKNSWG +WGE GY K+
Sbjct: 248 SYASGVYESSLCSLYSLNHGVLVVGYGTQ-------DGKDYWLIKNSWGNTWGEQGYLKL 300
Query: 354 CRGRNVCGV 362
RG N CG+
Sbjct: 301 LRGTNECGI 309
>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
Length = 477
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 181/322 (56%), Gaps = 39/322 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y SQEE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 180 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 239
Query: 119 TYLG--LR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL L+ R +RL + P P +DWR KGAV VKDQG CGSCW+FS
Sbjct: 240 IYLNPLLKDAPGRNMRLAQPVTDVP-------PPQWDWRNKGAVTDVKDQGMCGSCWAFS 292
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GG
Sbjct: 293 VTGNVEGQWFLKRGTLLSLSEQELLDCDK---------TDKACLGGLPSNAYSAIRTLGG 343
Query: 233 LMREEDYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
L E+DY Y RGH C F K + + +S +E ++AA L K GP++VAINA
Sbjct: 344 LETEDDYSY----RGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVAINA 399
Query: 291 VYMQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
MQ Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG +WGE
Sbjct: 400 FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRS-------ATPFWAIKNSWGTNWGE 452
Query: 348 NGYYKICRGRNVCGVDSMVSTV 369
GYY + RG CGV+ M S+
Sbjct: 453 EGYYYLHRGSGACGVNIMASSA 474
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 173/314 (55%), Gaps = 23/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K +K Y S EE HRF IF NL+ K S G+ +F+DL+ EF+
Sbjct: 47 FESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGLR + + + DLP DWR KGAV PVK+QGSCGSCW+FST A+EG
Sbjct: 107 YLGLRVEFPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEG 166
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD S ++GC GGLM+ AF+Y + GL +EEDY
Sbjct: 167 INQIVTGNLTSLSEQELIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDY 218
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYI 297
PY + G + + +++ + V +++Q + + P++VAI A Q Y
Sbjct: 219 PYL-MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYK 277
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG- 356
GG+ C ++DHGV VGYGS+ + Y I+KNSWG WGENGY ++ R
Sbjct: 278 GGIFTGR-CGTQMDHGVTAVGYGSS-------EGTDYIIVKNSWGPKWGENGYIRMKRNT 329
Query: 357 ---RNVCGVDSMVS 367
+CG++ M S
Sbjct: 330 GKPEGLCGINQMAS 343
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 175/313 (55%), Gaps = 27/313 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ L+ + +AY +E RF++FK N H + + S G+ QF+DL+ EF+ T
Sbjct: 42 YELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 120 YLG--LRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YLG L K RL + + + DLP DWREKGAV VKDQGSCGSCW+FST A
Sbjct: 102 YLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVAA 161
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ + GGL E
Sbjct: 162 VEGINQIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINNGGLDSE 213
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQ 294
EDYPYT D G + K+ ++ ++ V ++++ N P++VAI A Q
Sbjct: 214 EDYPYTAYD-GSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGREFQ 272
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV C +LDHGV LVGYGS YW +KNSWG+SWGE G+ ++
Sbjct: 273 FYDSGVFTS-TCGTQLDHGVTLVGYGSE-------SGTDYWTVKNSWGKSWGEEGFIRLQ 324
Query: 355 RGRNV-----CGV 362
R V CG+
Sbjct: 325 RNIEVASTGMCGI 337
>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
Length = 338
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 179/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 41 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 101 IYLNTLLRKEPGNKMKQAK--SVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 158
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 159 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 209
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 210 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 267
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 268 HGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEKGYYYLH 320
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 321 RGSGACGVNTMASS 334
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 173/312 (55%), Gaps = 23/312 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP--SATHGITQFSDLTPAEFRRTY 120
+K + K+Y + +E R ++AN + H + T + QF DL +EF+ Y
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
G R K P DLPA DW +KG V PVK+QG CGSCWSFS TG++EG
Sbjct: 85 NGYRMSNAPRKGKPFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATGSMEGQ 144
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
+F ATG L+SLSEQ LVDC + GCNGGLM+ AFEY +K G+ E YP
Sbjct: 145 HFNATGTLMSLSEQNLVDC-------SAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYP 197
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLAVAINAVYM--QTYI 297
Y D CKF+ + + A+++ + V+ D E + + GP++VAI+A ++ Q Y
Sbjct: 198 YRAVDS--TCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYS 255
Query: 298 GGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GV P ICS LDHGVL VGYG+ G K YW++KNSWG SWG +GY ++ R
Sbjct: 256 SGVYDPLICSSTNLDHGVLAVGYGTDG-------SKDYWLVKNSWGASWGMSGYIEMVRN 308
Query: 357 R-NVCGVDSMVS 367
N CG+ + S
Sbjct: 309 HNNKCGIATSAS 320
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 170/294 (57%), Gaps = 20/294 (6%)
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
KKK N+ E+ D RF IFK NLR H + S G+T+F+DLT E+R YLG
Sbjct: 59 KKKMNQNGLGAEK-DQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGA 117
Query: 124 RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+ R+ K +D+ + LP DWR++GAV VKDQGSCGSCW+FST GA+EG N +
Sbjct: 118 KPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ E DYPY
Sbjct: 178 VTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKA 229
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVS 301
D G + K+ ++ ++ V + + + + P++VAI A Q Y GV
Sbjct: 230 AD-GRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF 288
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
+C LDHGV+ VGYG+ K YWI++NSWG WGE+GY K+ R
Sbjct: 289 -DGLCGTELDHGVVAVGYGTE-------NGKDYWIVRNSWGNRWGESGYIKMAR 334
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 184/348 (52%), Gaps = 50/348 (14%)
Query: 52 DLLGAEHHFSL------FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSA 101
D++G + +F+L F + + Y EH+ RF IF N R ++H + S
Sbjct: 52 DVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSY 111
Query: 102 THGITQFSD------------LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFD 149
T GI +FSD T E +R R L +D + I P++ D
Sbjct: 112 TMGINEFSDKVIGLIIHTICFQTDEELKRLRC-FRGSLNASRDGSKY-ITIAAPPPSEID 169
Query: 150 WREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPG 209
WR KGAV PVK+QG+CGSCW+FS TGA+EG NFLATG LVSLSEQQLVDC E
Sbjct: 170 WRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG----- 224
Query: 210 SCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHA---CKFDKSKIAASVANFSV 266
++ CNGGLM++AF+Y + G+ E YPY + G A C+F+ + V +
Sbjct: 225 --NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGY-- 280
Query: 267 VSLDEDQIA---ANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSR-RLDHGVLLVGYG 320
+ L Q++ + GP++VAINA +Y GV CS LDHGVLLVGYG
Sbjct: 281 IDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG 340
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
PYW+IKNSWG WGENGY KI R N+CGV SM S
Sbjct: 341 EE-------NGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMAS 381
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 174/310 (56%), Gaps = 28/310 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-----QKLDPSATHGITQFSDLTPA 114
F FK K NK Y + E R+ IF+A L H Q L+ + G+ +FSD T
Sbjct: 23 FQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLE-TYKKGVNKFSDWTQD 81
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF YLGL K K P + T +PA DWR +G V VK+QG CGSCW+FS
Sbjct: 82 EFN-AYLGLHPKP--AKLGKGIPYVKTGVSVPASVDWRTEGYVTGVKNQGDCGSCWAFSL 138
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG++EGA F +TGKLVSLSEQQLVDC + G+ + GC+GG + F Y ++ GL
Sbjct: 139 TGSVEGALFKSTGKLVSLSEQQLVDCTY-------GTVNFGCDGGYLEETFPY-IQETGL 190
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
E YPY D CKFD SK+ + ++ DE+ + GP++VA++A Y+
Sbjct: 191 EAEASYPYKARD--GTCKFDASKVVTKINDYVYWYGDEEALLEATATIGPISVAMDANYI 248
Query: 294 QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Y GV +CS L+HGVL+VGYGS YW++KNSW E WGE+GY K
Sbjct: 249 DSYASGVFSSRLCSSDDLNHGVLVVGYGSENGV-------NYWLVKNSWAEDWGESGYLK 301
Query: 353 ICRGRNVCGV 362
+ RG+N CG+
Sbjct: 302 LLRGQNECGI 311
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 182/316 (57%), Gaps = 29/316 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 84 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 144 IYLNPLLREEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 201
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 202 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 252
Query: 238 DYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
DY Y RGH AC F K + + +S +E ++AA L K GP++VAINA MQ
Sbjct: 253 DYSY----RGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQF 308
Query: 296 YIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY
Sbjct: 309 YRHGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DIPFWAIKNSWGTDWGEKGYYY 361
Query: 353 ICRGRNVCGVDSMVST 368
+ RG CGV++M S+
Sbjct: 362 LHRGSGACGVNTMASS 377
>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
Length = 456
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 179/315 (56%), Gaps = 24/315 (7%)
Query: 55 GAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPA 114
G F+ FK + K+Y S E +R +F+ +++ A H +P A G+T+FSDLT
Sbjct: 31 GLAAQFAAFKAEHGKSYTSAAEEGYRMRVFEESMKAAQAHAAANPHAKFGVTKFSDLTHE 90
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EF+ Y + P+ T P ++DWR+KGAV PVKDQG CGSCW+FSTT
Sbjct: 91 EFKTLYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTT 150
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GG 232
G +EG +A +L +LSEQ LV CD D GC+GGLM++AFE+ + G
Sbjct: 151 GNIEGQWAVAGNELTNLSEQMLVSCDAR---------DYGCSGGLMDNAFEWIVNQNDGF 201
Query: 233 LMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ EE YPY +G+ C K+ A++ + DE+++AA L NGP+++A++A
Sbjct: 202 VFTEESYPYASGSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPISIAVDAD 261
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+ Y GGV C +LDHGVLLVGY ++ PYWIIKNSWG +WGE+G
Sbjct: 262 SFKAYKGGVLTGCE---EGQLDHGVLLVGYN-------KVANPPYWIIKNSWGPNWGEHG 311
Query: 350 YYKICRGRNVCGVDS 364
Y ++ G N C ++S
Sbjct: 312 YIRVGFGTNQCNLNS 326
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ + + Y + E + RF +F+ NLR H +
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG+R R R + D+ DLP DWR KGAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGSCGSCW+FST A+EG N + TG ++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ EEDYPY GTD G K+ ++ ++ V + ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANSEKSLQ 258
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 259 KAVANQPISVAIEAGGRAFQLYNSGIFTG-TCGTALDHGVTAVGYGTE-------NGKDY 310
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 311 WIVKNSWGSSWGESGYVRMER 331
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 175/315 (55%), Gaps = 25/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K Y S EE RF IFK NL+ K+ + G+ +F+DL+ EF+
Sbjct: 47 FESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGL+ +++ + +LP DWR+KGAV PVK+QGSCGSCW+FST A+EG
Sbjct: 107 YLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEG 166
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + ++ GGL +EEDY
Sbjct: 167 INQIVTGNLTSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDY 218
Query: 240 PYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
PY + C+ K + +++ + V + +Q + N PL+VAI A Q Y
Sbjct: 219 PYIMEE--GTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFY 276
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GGV + C LDHGV VGYG+A K Y I+KNSWG WGE GY ++ R
Sbjct: 277 SGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYIIVKNSWGSKWGEKGYIRMRRN 328
Query: 357 ----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 329 IGKPEGICGIYKMAS 343
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 169/311 (54%), Gaps = 21/311 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
++ ++ + K YA EE D R +++ N++ RH + HG T F D T
Sbjct: 28 QWNQWRSTYKKVYAVNEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMAMNAFGDKTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR+ G + + P+ +P DW +KG V PVKDQG CGSCW+FS T
Sbjct: 87 EFRQLMNGFQSQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKDQGQCGSCWAFSAT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG F TGKLVSLSEQ LVDC + GCNGGLM++AF+Y GGL
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSWR-------EGNEGCNGGLMDNAFQYVKDNGGLD 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
EE YPYT TD C+++ AA+ F + E + + GP++VAI+A V
Sbjct: 198 SEESYPYTATDT-QDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPISVAIDAGQVS 256
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ C ++HGVL VGYG G P + K YW++KNSWG+SWG +GY K
Sbjct: 257 FQFYSSGIYFDPACRLTVNHGVLAVGYGFEGTDPDKNK---YWLVKNSWGKSWGADGYIK 313
Query: 353 ICRGRNV-CGV 362
I + RN CG+
Sbjct: 314 IAKDRNNHCGI 324
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 182/317 (57%), Gaps = 29/317 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 247 IYLNPLLREEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 238 DYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
DY Y RGH AC F K + + +S +E ++AA L K GP++VAINA MQ
Sbjct: 356 DYSY----RGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQF 411
Query: 296 YIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY
Sbjct: 412 YRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDIPFWAIKNSWGTDWGEKGYYY 464
Query: 353 ICRGRNVCGVDSMVSTV 369
+ RG CGV++M S+
Sbjct: 465 LHRGSGACGVNTMASSA 481
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 23/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F+ +K N+ YAS +E R I+ +NL H S T G+ +F DL EF
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 119 TYLGLR-RKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YLG+R + K + LP LP DWR G V PVK+QG CGSCWSFSTTG+
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG + TG LVSLSEQ LVDC + E GCNGGLM+ AFEY +K GG+ E
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNE-------GCNGGLMDDAFEYIIKNGGIDTE 193
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVYM-- 293
YPYT T CKF+ + I A+VA++ +++ E + + GP++VAI+A ++
Sbjct: 194 ASYPYTATTG--TCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINF 251
Query: 294 QTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GV CS +LDHGVL VGYG++ + K YW++KNSWG +WG+ GY
Sbjct: 252 QFYFTGVYNEKKCSTTQLDHGVLAVGYGTS------TEGKDYWLVKNSWGATWGKAGYIW 305
Query: 353 ICRGR-NVCGVDSMVS 367
+ R N CG+ + S
Sbjct: 306 MSRNADNQCGIATSAS 321
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ + + Y + E + RF +F+ NLR H +
Sbjct: 31 IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG+R R R + D+ DLP DWR KGAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQGSCGSCW+FST A+EG N + TG ++SLSEQ+LVDCD S + GCN
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ EEDYPY GTD G K+ ++ ++ V + ++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANSEKSLQ 258
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 259 KAVANQPISVAIEAGGRAFQLYNSGIFTG-TCGTALDHGVTAVGYGTE-------NGKDY 310
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 311 WIVKNSWGSSWGESGYVRMER 331
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 174/319 (54%), Gaps = 22/319 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
+ + +K + K YA+ EE D R +++ N++ RH + HG T F D+T
Sbjct: 26 DEQWYQWKSLYKKPYAANEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMTMNAFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + R+ P+ +P DW +KG V PVKDQG CGSCW+FS
Sbjct: 85 NEEFRQVMNGFQNQKRIQGKLLYEPVF--GHIPKSVDWTQKGYVTPVKDQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC + GCNGGLM++AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAFQYIKDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPYT D+ C+++ AA+ F + E + + GP++VA++A +
Sbjct: 196 LDSEESYPYTAMDK-QDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPISVAVDAGH 254
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ CS + L+HGVL+VGYG G I YW++KNSWG WG +G
Sbjct: 255 ESFQFYKSGIYYDSNCSSKDLNHGVLVVGYGFEG---IDSANNRYWLVKNSWGTGWGTDG 311
Query: 350 YYKICRGRNV-CGVDSMVS 367
Y K+ + RN CG+ + S
Sbjct: 312 YIKMAKDRNNHCGIATAAS 330
>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
Length = 302
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 179/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 5 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 65 IYLNTLLRKEPGNKMKQAK--SVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 122
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 123 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 173
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 174 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 231
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 232 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 284
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 285 RGSGACGVNTMASS 298
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 194/352 (55%), Gaps = 42/352 (11%)
Query: 34 QVTDGGDEILS------HHESTNNDL-LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKA 86
+VTD +E LS + E D + F F +N+ Y S+EE R ++F
Sbjct: 79 KVTDDKNETLSSVLPLLNKEPLPQDFSVKMASIFKEFVTTYNRTYESKEETQWRMSVFSN 138
Query: 87 NLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLG--LR----RKLRLPKDADQAPIL 139
N+ RA + Q LD +A +G+T+FSDLT EFR YL LR + +RL K
Sbjct: 139 NMMRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLREYRGKNMRLDKSTG----- 193
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
+ P+++DWR KGAV VK+QG CGSCW+FS TG +EG FL G L+SLSEQ+L+DC
Sbjct: 194 --DSAPSEWDWRRKGAVTKVKNQGMCGSCWAFSVTGNVEGQWFLKQGALLSLSEQELLDC 251
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
D D C GGL ++A+ GGL E+DY Y G R C F K
Sbjct: 252 DK---------VDKACLGGLPSNAYSAIKTLGGLETEDDYSYRG--RMQTCGFSPKKARV 300
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRRL-DHGVLL 316
+ + +S +E+ +AA L + GP++VAINA MQ Y G+S P +CS L DH VLL
Sbjct: 301 YINDSVELSQNEETLAAWLAEKGPISVAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLL 360
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
VGYG+ P+W IKNSWG WGE GYY + RG CGV++M S+
Sbjct: 361 VGYGNR-------SGTPFWAIKNSWGSDWGEEGYYYLHRGSGACGVNTMASS 405
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 185/327 (56%), Gaps = 39/327 (11%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLT 112
+ H+ LFK++ NK Y +++ R IF+AN+++ H L S G+ F+D+T
Sbjct: 23 DEHWELFKRQHNKTYLQKQDVGRR-AIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMT 81
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTND-----LPADFDWREKGAVGPVKDQGSCGS 167
P EF + R R + + L D +P DWR +G V PVK+QG CGS
Sbjct: 82 PDEFEKY-----RGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGS 136
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FSTTGALEG +F +G LVSLSEQ LVDC ++GCNGGLM++AF +
Sbjct: 137 CWAFSTTGALEGQHFRRSGDLVSLSEQMLVDC-------SAVYGNAGCNGGLMDNAFRFI 189
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQI--AANLVKNGPL 284
AGGL E+ YPYTG D C FD I A + F V S DE+ + AA +V GP+
Sbjct: 190 KDAGGLETEKSYPYTGKD--GTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVV--GPV 245
Query: 285 AVAINAV--YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+VAI+A Q Y GV CS LDHGVL+VGYG+ K YW++KNSW
Sbjct: 246 SVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTT------RDGKDYWLVKNSW 299
Query: 342 GESWGENGYYKICRGR-NVCGVDSMVS 367
G SWG++GY ++ R + N CG+ +M S
Sbjct: 300 GSSWGQSGYIQMSRNKENQCGIATMAS 326
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 123/297 (41%), Positives = 166/297 (55%), Gaps = 24/297 (8%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
K K Y +E + RF +FK NL H + + T G+ +F+D+T E+R YLG R
Sbjct: 42 KHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRT 101
Query: 126 --KLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
K R+ K + N LP DWR KGAVGP+KDQG+CGSCW+FST A+EG
Sbjct: 102 DAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGI 161
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG+ VSLSEQ+LVDCD E D GCNGGLM+ AF++ ++ GG+ EEDYP
Sbjct: 162 NNIVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYP 213
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIG 298
Y G D G + K + + V + + V + P++VAI A +Q Y
Sbjct: 214 YQGID-GTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQS 272
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C LDHGV++VGYG+ YW+++NSWG WGE+GY+K+ R
Sbjct: 273 GVFTGK-CGTALDHGVVVVGYGTENGV-------DYWLVRNSWGTGWGEDGYFKMER 321
>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
Length = 392
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 179/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 95 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 154
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 155 IYLNTLLRKEPGNKMKQAK--SVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 212
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 213 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 263
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 264 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 321
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 322 HGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEKGYYYLH 374
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 375 RGSGACGVNTMASS 388
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R +IF N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 253 IYLNPLLREEPSNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 310
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 311 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 361
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 362 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 419
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 420 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 472
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 473 RGSGACGVNTMASS 486
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 123/297 (41%), Positives = 166/297 (55%), Gaps = 24/297 (8%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
K K Y +E + RF +FK NL H + + T G+ +F+D+T E+R YLG R
Sbjct: 42 KHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRT 101
Query: 126 --KLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
K R+ K + N LP DWR KGAVGP+KDQG+CGSCW+FST A+EG
Sbjct: 102 DAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGI 161
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG+ VSLSEQ+LVDCD E D GCNGGLM+ AF++ ++ GG+ EEDYP
Sbjct: 162 NNIVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYP 213
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIG 298
Y G D G + K + + V + + V + P++VAI A +Q Y
Sbjct: 214 YQGID-GTCDETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQS 272
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C LDHGV++VGYG+ YW+++NSWG WGE+GY+K+ R
Sbjct: 273 GVFTGK-CGTALDHGVVVVGYGTENGV-------DYWLVRNSWGTGWGEDGYFKMER 321
>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
Length = 332
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 35 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ +P NDL P ++DWR+KGAV VK+QG CGSCW+FS TG +
Sbjct: 95 IYLN--PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 152
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 153 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYAAIKNLGGLETED 203
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G C F + + +S +E++IAA L + GP++VAINA MQ Y
Sbjct: 204 DYGYQG--HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 261
Query: 298 GGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE GYY +
Sbjct: 262 HGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDWGEEGYYYLY 314
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 315 RGSGACGVNTMASS 328
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 182/316 (57%), Gaps = 29/316 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 223 IYLNPLLREEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 280
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 281 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 331
Query: 238 DYPYTGTDRGH--ACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
DY Y RGH AC F K + + +S +E ++AA L K GP++VAINA MQ
Sbjct: 332 DYSY----RGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQF 387
Query: 296 YIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY
Sbjct: 388 YRHGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDIPFWAIKNSWGTDWGEKGYYY 440
Query: 353 ICRGRNVCGVDSMVST 368
+ RG CGV++M S+
Sbjct: 441 LHRGSGACGVNTMASS 456
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 185/329 (56%), Gaps = 31/329 (9%)
Query: 52 DLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
DL+ + LF++ K+ KAYAS EE HRF +FK NL K + G+ F
Sbjct: 55 DLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAF 114
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDAD---QAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
+DLT EF+ TYLGLR+ K D + + +D+PA DWR+KGAV VK+QG C
Sbjct: 115 ADLTHDEFKATYLGLRQP-ETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQC 173
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCW+FST A+EG N + TG L SLSEQ+LVDC + ++GCNGG+M++AF
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTD--------GNNGCNGGVMDNAFS 225
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
Y +GGL EE YPY + K + +++ + V +++Q + + PL+
Sbjct: 226 YIASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLS 285
Query: 286 VAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
VAI A + Q Y GGV + P C LDHGV VGYGS+ K + Y I+KNSWG
Sbjct: 286 VAIEASGRHFQFYSGGVFNGP--CGSELDHGVAAVGYGSS-------KGQDYIIVKNSWG 336
Query: 343 ESWGENGYYKICRG----RNVCGVDSMVS 367
WGE GY ++ RG +CG++ M S
Sbjct: 337 SHWGEKGYIRMKRGTGKPEGLCGINKMAS 365
>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
Length = 236
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/228 (48%), Positives = 144/228 (63%), Gaps = 19/228 (8%)
Query: 140 PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
PT LP FDWR+ G V VKDQG CGSCW+F+ TG +EG + T KLVSLSEQQL+DC
Sbjct: 17 PTQSLPGSFDWRQHGVVTEVKDQGMCGSCWAFAVTGNIEGQWYKKTKKLVSLSEQQLLDC 76
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAA 259
D + D CNGG A+E +K GGLM E+DYPY C + I+A
Sbjct: 77 DKK---------DEACNGGFPEWAYESIVKMGGLMSEKDYPYEA--HKETCNLKPNNISA 125
Query: 260 SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP--YICSRR-LDHGVLL 316
+ + +S DE ++AA L +NGP++V +NA ++Q Y GGVS P +CS + LDH VLL
Sbjct: 126 YINDSVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCSEQGLDHAVLL 185
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
VGYG + ++PYWI+KNSWG SWGE GY++I RG CG+++
Sbjct: 186 VGYGVTSFW-----QRPYWIVKNSWGRSWGEKGYFRIYRGDGTCGINA 228
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 178/312 (57%), Gaps = 23/312 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ + ++ K+YA+ EE +R+ +++ N H + S + +F DLT AEF +
Sbjct: 30 FADWMQEHQKSYAN-EEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKL 88
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
+ GL + + ++ I P LPADFDWR+KGAV VK+QG CGSCWSFSTTG+ EG
Sbjct: 89 FKGL--SITADQAKQESDIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEG 146
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFL G+L SLSEQ LVDC + GCNGGLM+ AFEY ++ G+ EE Y
Sbjct: 147 ANFLKHGRLTSLSEQNLVDC-------STSYGNHGCNGGLMDYAFEYIIRNKGIDTEESY 199
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYI 297
PY + C+++K + +++ V + N V P +VAI+A + Q Y
Sbjct: 200 PYHASQG--TCRYNKQHSGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYK 257
Query: 298 GGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GGV P S RLDHGVL VG+G +R K YW++KNSWG WG +GY ++ R
Sbjct: 258 GGVYDEPACSSSRLDHGVLAVGWG------VR-DGKDYWLVKNSWGADWGLSGYIEMSRN 310
Query: 357 R-NVCGVDSMVS 367
+ N CG+ + S
Sbjct: 311 KHNQCGIATAAS 322
>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
Length = 462
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ +P NDL P ++DWR+KGAV VK+QG CGSCW+FS TG +
Sbjct: 225 IYLN--PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 283 EGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAIKNLGGLETED 333
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G C F + + +S +E++IAA L + GP++VAINA MQ Y
Sbjct: 334 DYGYQG--HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 391
Query: 298 GGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE GYY +
Sbjct: 392 HGIAHPFRPLCSPWFIDHAVLLVGYGNRS-------NIPYWAIKNSWGSDWGEEGYYYLY 444
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 445 RGSGACGVNTMASS 458
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 179/321 (55%), Gaps = 29/321 (9%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFS 109
L E + FK K +K Y+ +EE+ R IF+ NL+ H + + H G+ QF+
Sbjct: 18 LSFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFA 76
Query: 110 DLTPAEFRRTYLG-LRRKLRLPKDADQAPI--LPTNDLPADFDWREKGAVGPVKDQGSCG 166
D+T AE+ +G L K +A +P + DWR+KG V +KDQG CG
Sbjct: 77 DMTHAEYLNQVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCG 136
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FSTTG+LEG + ATG LVSLSEQ LVDC + + GC GG M+ F+Y
Sbjct: 137 SCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQ-------EGNKGCEGGDMDQGFQY 189
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLA 285
++ G+ E+ YPY + H CKFD S I A++++F+ V S DED + GP++
Sbjct: 190 IIQNKGIDTEQCYPYKA--KNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPIS 247
Query: 286 VAINAVY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
V I+A + Q Y GV + CS +LDHGVL+VGYG+ G K YW++KNSWG
Sbjct: 248 VGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYGTYG-------SKDYWLVKNSWG 300
Query: 343 ESWGENGYYKICRGR-NVCGV 362
WG GY + R + N CGV
Sbjct: 301 TVWGNEGYIMMSRNKDNQCGV 321
>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
Length = 462
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ +P NDL P ++DWR+KGAV VK+QG CGSCW+FS TG +
Sbjct: 225 IYLN--PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 283 EGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAIKNLGGLETED 333
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G C F + + +S +E++IAA L + GP++VAINA MQ Y
Sbjct: 334 DYGYQG--HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 391
Query: 298 GGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE GYY +
Sbjct: 392 HGIAHPFRPLCSPWFIDHAVLLVGYGNRS-------NIPYWAIKNSWGSDWGEEGYYYLY 444
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 445 RGSGACGVNTMASS 458
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 247 IYLNTLLRKEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 356 DYSYQG--HMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 413
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 414 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 466
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 467 RGSGACGVNTMASS 480
>gi|371781445|emb|CCA95082.1| putative responsive to dehydration 19, partial [Ginkgo biloba]
Length = 130
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 104/129 (80%), Positives = 116/129 (89%), Gaps = 3/129 (2%)
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
Y LKAGGL +EEDYPYTGTD CKFD K+ A+V+NFSVVS+DEDQIAANLVKNGPL+
Sbjct: 4 YALKAGGLEKEEDYPYTGTD--GTCKFDDKKVVAAVSNFSVVSIDEDQIAANLVKNGPLS 61
Query: 286 VAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
V INAV+MQTYIGGVSCPYICS+R LDHGVLLVGYGSAGYAPIR+K+KPYWIIKNSWG +
Sbjct: 62 VGINAVFMQTYIGGVSCPYICSKRNLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGAN 121
Query: 345 WGENGYYKI 353
WGE GYYK+
Sbjct: 122 WGEQGYYKL 130
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 169/294 (57%), Gaps = 32/294 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAE 115
F FK F K Y S EE RF IF NL ARH H G+ QF+DLT E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
+R+ YL L ++ + + N DWR+KGAV P+K+QG CGSCWSFSTTG
Sbjct: 80 YRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTG 137
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
++EGA+ +ATG LVSLSEQQLVDC + GCNGGLM++AF+Y + GGL
Sbjct: 138 SVEGAHAIATGNLVSLSEQQLVDCSGSFG-------NQGCNGGLMDNAFKYIISNGGLDT 190
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA--VY 292
E+DYPYT D G K +SK A S++ + V +EDQ+AA V+ GP++VAI A
Sbjct: 191 EQDYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVSVAIEADQQS 248
Query: 293 MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
Q Y GV S P C LDHGVL+VGY S YWI+KNSWG SW
Sbjct: 249 FQMYSSGVFSGP--CGTNLDHGVLVVGYTS-----------DYWIVKNSWGASW 289
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 YTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 178/317 (56%), Gaps = 25/317 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
+ L+ + KAY E +RF++FK N +H +PS G+ QF+DL+ EF+
Sbjct: 44 YELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKA 103
Query: 119 TYLG--LRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
TYLG L K RL + + DLP DWREKGAV VKDQGSCGSCW+FST
Sbjct: 104 TYLGAKLDTKKRLSNSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVA 163
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
A+EG N + TG L SLSEQ+LVDCD S + GCNGGLM+ AF++ + GGL
Sbjct: 164 AVEGINQIVTGNLTSLSEQELVDCDT--------SYNQGCNGGLMDYAFQFIINNGGLDS 215
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YM 293
E+DYPY D G + K+ ++ ++ V ++++ N P++VAI A
Sbjct: 216 EDDYPYKAND-GSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAF 274
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GV C +LDHGV LVGYGS YWI+KNSWG+SWGE G+ ++
Sbjct: 275 QFYESGVFTS-TCGTQLDHGVTLVGYGSE-------SGTDYWIVKNSWGKSWGEKGFIRL 326
Query: 354 CRGRNVCGVDSMVSTVA 370
RN+ GV + + +A
Sbjct: 327 --QRNIEGVSTGMCGIA 341
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 247 IYLNTLLRKEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 356 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 413
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 414 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 466
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 467 RGSGACGVNTMASS 480
>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
Length = 462
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ +P NDL P ++DWR+KGAV VK+QG CGSCW+FS TG +
Sbjct: 225 IYLN--PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 283 EGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAIKNLGGLETED 333
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G C F + + +S +E++IAA L + GP++VAINA MQ Y
Sbjct: 334 DYGYQG--HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 391
Query: 298 GGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE GYY +
Sbjct: 392 HGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDWGEEGYYYLY 444
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 445 RGSGACGVNTMASS 458
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 190/343 (55%), Gaps = 35/343 (10%)
Query: 51 NDLLGAE----HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT 106
N LL +E + F + +F K Y E RF+IFK+N+ + G+
Sbjct: 168 NALLFSEEQYKNEFENWIDRFEKKY-DVSEFKKRFSIFKSNMDFVHSWNSKNSQTVLGLN 226
Query: 107 QFSDLTPAEFRRTYLGLRRK--LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGS 164
+DLT E+R+ YLG +K L P + + + + A DWR+KGAV P+KDQG
Sbjct: 227 HLADLTNLEYRQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQ 286
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCWSFSTTG++EGA+ + +G +V LSEQ LVDC + GCNGGLM+ AF
Sbjct: 287 CGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDC-------STSEGNMGCNGGLMDYAF 339
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GP 283
EY + G+ E YPYT + G CK++K+ A+++++ ++ + A+ VKN GP
Sbjct: 340 EYIITNNGIDTESSYPYTASS-GTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGP 398
Query: 284 LAVAINAVY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGY---------APIRLK- 330
++VAI+A + Q Y G+ CS LDHGVL+VGYGS + +R+K
Sbjct: 399 VSVAIDASHNSFQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDSDSRVHKGSQVRVKV 458
Query: 331 -----EKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
K YWI+KNSWG SWG+ G+ + + R N CG+ S S
Sbjct: 459 PKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCAS 501
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 179/328 (54%), Gaps = 30/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQ 107
+LL E H LFK K Y SQ E R I+ N + A+H +K + S + +
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTN-DLPADFDWREKGAVGPVKDQGS 164
F DL EFR G + K + A+ P N ++P DWREKGA+ PVKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FS+TGALEG F TGKL+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+Y G+ E YPY D C+++ +++ A + S +ED++ A + GP
Sbjct: 196 QYIKDNKGIDTENTYPYEAED--DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 253
Query: 284 LAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VAI+A + Q Y GV C S LDHGVL+VGYGS K YW++KNS
Sbjct: 254 VSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDN-------GKDYWLVKNS 306
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
W E WG+ GY KI R R N CGV + S
Sbjct: 307 WSEHWGDEGYIKIARNRKNHCGVATAAS 334
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 144/372 (38%), Positives = 189/372 (50%), Gaps = 32/372 (8%)
Query: 6 VVLFLVSLVVFSAVSSGTLIDDVDQL--IRQVTDGGDEILSHHESTNNDLLGAEH---HF 60
V L L + +V A+S G D+ IR V+DG E+ E +LG HF
Sbjct: 4 VTLVLSAALVLVAISCGAAASSFDESNPIRLVSDGLREL----EQQVVQVLGNSRRALHF 59
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
+ F ++ K Y S EE R+ IF N + K T + +F+D + EFRR
Sbjct: 60 ARFAHRYGKKYESVEEMKLRYEIFSENKKLIRSTNKKGLPYTLAVNRFADWSWEEFRRQR 119
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
LG + L LP +WRE+G V PVKDQG CGSCW+FSTTGALE A
Sbjct: 120 LGAAQNCSATTKGSHE--LTDAVLPESKNWREEGIVTPVKDQGHCGSCWTFSTTGALEAA 177
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
A K +SLSEQQLVDC + + GC+GGL + AFEY GGL E YP
Sbjct: 178 YVQAFRKQISLSEQQLVDCAGAFN-------NFGCHGGLPSQAFEYIKYNGGLDTEAAYP 230
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY-MQTYIG 298
Y GTD ACKF + V + ++L DE ++ + P++VA V + Y
Sbjct: 231 YVGTD--GACKFSAENVGVQVLDSVNITLGDEQELKHAVAFVRPVSVAFQVVKSFRIYKS 288
Query: 299 GVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C ++H VL VGYG G P+W+IKNSWGESWG+NGY+K+
Sbjct: 289 GVYTSDTCGSSPMDVNHAVLAVGYGEEGGV-------PFWLIKNSWGESWGDNGYFKMEF 341
Query: 356 GRNVCGVDSMVS 367
G+N+CGV + S
Sbjct: 342 GKNMCGVATCAS 353
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 175/316 (55%), Gaps = 32/316 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E HF + NKAY+ QE H R IF N RR +H + S T G+ QFSD+T AEF
Sbjct: 26 EQHFKSWMALHNKAYSVQEFH-QRLQIFTENKRRIEKHNGGNHSFTMGLNQFSDMTFAEF 84
Query: 117 RRTYLGLRRKLRLPKD--ADQAPILPTND-LPADFDWREKGA-VGPVKDQGSCGSCWSFS 172
R+ +L P++ A + + TN P DWR KG V PVK+QG+CGSCW+FS
Sbjct: 85 RKRFL-----WSEPQNCSATKGSYMKTNSPQPESIDWRTKGNYVTPVKNQGACGSCWTFS 139
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG LE + TGKLV LSEQQLVDC + + + GCNGGL + AFEY G
Sbjct: 140 TTGCLESVTAINTGKLVPLSEQQLVDCAWDFN-------NHGCNGGLPSQAFEYIKYNKG 192
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
LM E YPYT + CK+ AA V N ++ + DE + + + P++ A
Sbjct: 193 LMTESGYPYTAFEG--KCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVSFAFEVT 250
Query: 292 --YMQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
+M Y GGV C + +++H VL VGYG+ PYWI+KNSWG WG
Sbjct: 251 DDFMH-YKGGVYSSSRCHKTTDKVNHAVLAVGYGNNN------SSVPYWIVKNSWGPYWG 303
Query: 347 ENGYYKICRGRNVCGV 362
ENGY+ I RG+N+CG+
Sbjct: 304 ENGYFLIERGKNMCGL 319
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/364 (37%), Positives = 192/364 (52%), Gaps = 53/364 (14%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + T L L+S F ++S+ L D +R++ D
Sbjct: 1 MATATTSLALLSFF-FLSISASALSRRSDGEVREIYD----------------------- 36
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
L+ K KAY +E + RF IFK NL+ H + + G+ F+DLT E+R Y
Sbjct: 37 -LWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALY 95
Query: 121 LGLR----RKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
LG R R++ K A + + D LP DWR +GAV PVK+QGSCGSCW+FST
Sbjct: 96 LGTRSPPARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIA 155
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
A+EG N + TG+L+SLSEQ+LV CD + +SGCNGGLM+ AF++ + GGL
Sbjct: 156 AVEGINQIVTGELISLSEQELVSCDKK--------YNSGCNGGLMDYAFQFIIDNGGLDT 207
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYM 293
EEDYPY D G K+ S+ + V ++++ V + P++VAI A + +
Sbjct: 208 EEDYPYEAFD-GQCDPTRKNAKVVSIDAYEDVPANDEESLKKAVAHQPVSVAIEASGLAL 266
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK--PYWIIKNSWGESWGENGYY 351
Q Y GV C LDHGV+ VGYG KE YW+++NSWG SWGE+GY+
Sbjct: 267 QLYQSGVFTGK-CGSALDHGVVAVGYG---------KENGVDYWLVRNSWGTSWGEDGYF 316
Query: 352 KICR 355
K+ R
Sbjct: 317 KLER 320
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 150/372 (40%), Positives = 192/372 (51%), Gaps = 37/372 (9%)
Query: 8 LFLVS--LVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSL 62
LF VS L+V S +G++ DD + IR V+D E+ E +LG H F+
Sbjct: 5 LFFVSSLLLVLSCAVAGSVFDDSNP-IRMVSDRLREL----ELEVVRVLGQVPHALRFAR 59
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
F ++ K Y + EE RF IF +L K S G+ QF+D T EFR+ LG
Sbjct: 60 FAHRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFRKHRLG 119
Query: 123 LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
+ L LP DWR+ G V PVKDQG CGSCW+FSTTGALE A
Sbjct: 120 AAQNCSATTKGSHK--LTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYA 177
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
A GK +SLSEQQLVDC G + GCNGGL + AFEY GGL EE YPYT
Sbjct: 178 QAHGKGISLSEQQLVDCGR-------GFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYT 230
Query: 243 GTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIG 298
G D +CKF + V N ++ + DE + A V+ P++VA V + Y
Sbjct: 231 GVD--GSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVR--PVSVAFEVVSGFRLYSK 286
Query: 299 GVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C ++H VL VGYG PYW+IKNSWG +WG+NGY+K+
Sbjct: 287 GVYTSNSCGSTPMDVNHAVLAVGYGVE-------DGIPYWLIKNSWGGNWGDNGYFKMEM 339
Query: 356 GRNVCGVDSMVS 367
G+N+CGV + S
Sbjct: 340 GKNMCGVATCAS 351
>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
Length = 417
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 180/315 (57%), Gaps = 25/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ RA + Q LD +A +GIT+FSDLT EF
Sbjct: 120 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 179
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ +P NDL P ++DWR+KGAV VK+QG CGSCW+FS TG +
Sbjct: 180 IYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 237
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 238 EGQWFLNRGTLLSLSEQELLDCDK---------VDKACLGGLPSNAYAAIKNLGGLETED 288
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G C F + + +S +E++IAA L + GP++VAINA MQ Y
Sbjct: 289 DYGYQG--HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 346
Query: 298 GGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G++ P+ +CS +DH VLLVGYG+ PYW IKNSWG WGE GYY +
Sbjct: 347 HGIAHPFRPLCSPWFIDHAVLLVGYGNR-------SNIPYWAIKNSWGSDWGEEGYYYLY 399
Query: 355 RGRNVCGVDSMVSTV 369
RG CGV++M S+
Sbjct: 400 RGSGACGVNTMASSA 414
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 177/317 (55%), Gaps = 31/317 (9%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQFSDLTPAEFRR 118
F +K+ + Y S+ E + R IF N R + H +K + S + GI FSD T +E
Sbjct: 70 FLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNSELD- 128
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLP-ADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
G R + + Q +P + P A+ DWR KGAV PVK+QG CGSCW+FS TG +
Sbjct: 129 VLRGFRHSSKASRSGSQ--YIPFDAAPPAEVDWRTKGAVTPVKNQGDCGSCWAFSATGGI 186
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG ++LATGKLVSLSEQQLVDC S + GC+GGLM+ AFEY + G+ E
Sbjct: 187 EGQHYLATGKLVSLSEQQLVDCS---------SSNDGCDGGLMDLAFEYVKEHKGIDTEV 237
Query: 238 DYPYTGTDRGHA--CKFDKSKIAASVANFSVVSLDEDQIAANLVK-NGPLAVAINAVY-- 292
YPY + G+A C FD A +V + + ++ + V +GP++V INA
Sbjct: 238 HYPYVSGNTGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPS 297
Query: 293 MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+ + C+ LDHGVL+VGYG PYW+IKNSWGE WGENGY
Sbjct: 298 FMAYESGIYSDHRCNPHDLDHGVLVVGYGVD-------NGVPYWLIKNSWGEDWGENGYV 350
Query: 352 KICRGR-NVCGVDSMVS 367
+I R N+CGV +M S
Sbjct: 351 RILRNHNNLCGVATMAS 367
>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
Length = 354
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/374 (36%), Positives = 186/374 (49%), Gaps = 42/374 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + F + + + V G+ LI Q G D+ + A H+
Sbjct: 1 MARRNPFFFAIVVTILFVVCYGS------ALIAQTPLGVDDFI------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FKK+ K + E RF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL R KD + + DWREKG V PVK+QG CGSCW+F+TTG +
Sbjct: 103 YLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFATTGNI 162
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMR 235
EG L LVSLSEQ LV CD + D GCNGGLM A ++ + G +
Sbjct: 163 EGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHNGTVPT 213
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E+ YPYT D + A + + + DE++IAA + KNGP+AVA++A Q
Sbjct: 214 EDSYPYTSAGGTRPPCHDNGTVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQL 273
Query: 296 YIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV +C L+HGVL+VG+ R + PYWI+KNSWG SWGE GY ++
Sbjct: 274 YFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKGYIRLA 324
Query: 355 RGRNVCGVDSMVST 368
G N C + + V T
Sbjct: 325 MGSNQCLLKNYVVT 338
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 170/316 (53%), Gaps = 23/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT----QFSDLTPAE 115
+S +K K Y EE R ++K N++ +H H T F D+T E
Sbjct: 29 WSQWKATHGKLYGMDEE-GWRREVWKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEE 87
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
F++ GL+ + QAP+ +P+ DWREKG V PVKDQG CGSCW+FS TG
Sbjct: 88 FKQVMNGLQMQKHKKGKMFQAPLFAK--IPSSVDWREKGYVTPVKDQGPCGSCWAFSATG 145
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALEG F TGKLVSLSEQ LVDC + GCNGGLMN+AF+Y GGL
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSQ-------AEGNEGCNGGLMNNAFQYVKDNGGLDS 198
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
EE YPY D +CK+ AA+ F + E + + GP++V I+A +
Sbjct: 199 EESYPYHAQDE--SCKYKPQDSAANDTGFFDIPQQEKALMVAVATKGPISVGIDASHFTF 256
Query: 294 QTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ P S LDHGVL++GYG+ I K YWI+KNSWG +WG +GY K
Sbjct: 257 QFYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSIN---KTYWIVKNSWGANWGIDGYIK 313
Query: 353 ICRGR-NVCGVDSMVS 367
+ + R N CG+ +M S
Sbjct: 314 MAKDRKNHCGIATMAS 329
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 172/308 (55%), Gaps = 28/308 (9%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEF 116
H + + K KAY + E + RF IFK NLR H D S G+ +F+DLT E+
Sbjct: 46 HVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEY 105
Query: 117 RRTYLGLR------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
R +LG R + + K D+ +LPA DWREKGAV P+KDQG CGSCW+
Sbjct: 106 RAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCWA 165
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST GA+EG N + TG L SLSEQ+LVDCD + GCNGGLM+ AFE+ ++
Sbjct: 166 FSTVGAVEGINQIVTGNLTSLSEQELVDCDR--------GYNMGCNGGLMDYAFEFIVQN 217
Query: 231 GGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ EEDYPY D + C + K+ ++ + V ++++ V N P++VAI
Sbjct: 218 GGIDTEEDYPYHAKD--NTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIE 275
Query: 290 AVYM--QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
A M Q Y GV C LDHGV+ VGYG+ YW+++NSWG +WGE
Sbjct: 276 AGGMEFQLYQSGVFTGR-CGTNLDHGVVAVGYGTE-------NGTDYWLVRNSWGSAWGE 327
Query: 348 NGYYKICR 355
NGY K+ R
Sbjct: 328 NGYIKLER 335
>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
Length = 345
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 170/311 (54%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E A LVSLSEQQLV CD + D+GCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+A++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYQSGVLTSC---AGDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ GRN C
Sbjct: 319 YVRVAMGRNAC 329
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 170/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D+GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 HTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
Length = 354
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 134/366 (36%), Positives = 183/366 (50%), Gaps = 42/366 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + F + + + V G+ LI Q G D+ + A H+
Sbjct: 1 MARRNPFFFAIVVTILFVVCYGS------ALIAQTPLGVDDFI------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FKK+ K + E RF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL R KD + + DWREKG V PVK+QG CGSCW+F+TTG +
Sbjct: 103 YLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFATTGNI 162
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMR 235
EG L LVSLSEQ LV CD + D GCNGGLM A ++ + G +
Sbjct: 163 EGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMEQAMQWIINDHNGTVPT 213
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E+ YPYT D + A +A + + DE++IAA + KNGP+AVA++A Q
Sbjct: 214 EDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQL 273
Query: 296 YIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV +C L+HGVL+VG+ R + PYWI+KNSWG SWGE GY ++
Sbjct: 274 YFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKGYIRLA 324
Query: 355 RGRNVC 360
G N C
Sbjct: 325 MGSNQC 330
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/302 (41%), Positives = 175/302 (57%), Gaps = 24/302 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ ++ + KAY + E + RF IFK NLR H +D S G+ +F+DLT E++
Sbjct: 51 YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAM 110
Query: 120 YLG--LRRKLRLPKDADQAPILPT-NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+LG + RK R Q + +DLP + DWREKGAV PVKDQG CGSCW+FST GA
Sbjct: 111 FLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGA 170
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG+L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ E
Sbjct: 171 VEGINQIVTGELISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDTE 222
Query: 237 EDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYM 293
EDYPY +D + C + K+ ++ + V +++ V + P++VAI A
Sbjct: 223 EDYPYKASD--NICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAF 280
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GV C LDHGV+ VGYG+ YWI++NSWG +WGE+GY ++
Sbjct: 281 QLYKSGVFTGR-CGTELDHGVVAVGYGTENGV-------NYWIVRNSWGSAWGESGYIRM 332
Query: 354 CR 355
R
Sbjct: 333 ER 334
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 HTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 168/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T+F+DLT E+R+ YLG R R++
Sbjct: 69 DQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAK 128
Query: 130 PKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ +Q N ++P DWR+KGAV P+KDQG+CGSCW+FSTT A+EG N + TG+
Sbjct: 129 AKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGE 188
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
L+SLSEQ+LVDCD S + GCNGGLM+ AF++ +K GGL E+DYPY G G
Sbjct: 189 LISLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFG-G 239
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
F K+ S+ + V ++ + P++VAI A Q Y G+
Sbjct: 240 KCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS- 298
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C LDH V+ VGYGS YWI++NSWG WGE GY ++ R
Sbjct: 299 CGTNLDHAVVAVGYGSENGV-------DYWIVRNSWGPRWGEEGYIRMER 341
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 176/310 (56%), Gaps = 30/310 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR-R 125
K Y E + RF IF NL+ H + P+ T G+T+F+DLT EFR YL +
Sbjct: 52 KNYNGLGEKETRFEIFTDNLKYIEEHNSV-PNQTFEVGLTRFADLTNDEFRAIYLRSKME 110
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+ R+P ++ + LP DWR KGAV PVKDQG+CGSCW+FS GA+EG N + T
Sbjct: 111 RTRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKT 170
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G+L+SLSEQ+LVDCD S + GC GGLM+ AF++ ++ GG+ EEDYPYT TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATD 222
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCP 303
K+ ++ + V ++++ + N P++VAI A Q Y GV
Sbjct: 223 DNICNSDKKNSRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTG 282
Query: 304 YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV---- 359
C LDHGV+ VGYGS G + YWI++NSWG +WGE+GY+K+ RN+
Sbjct: 283 -TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFKL--ERNIKESS 332
Query: 360 --CGVDSMVS 367
CGV M S
Sbjct: 333 GKCGVAMMAS 342
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 181/350 (51%), Gaps = 33/350 (9%)
Query: 28 VDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIF 84
V+ IRQV G L E+ ++G H F F ++ K Y S EE RF +F
Sbjct: 29 VENPIRQVVSDG---LHELENGILQVVGQSRHALSFVRFAHRYGKRYESVEEIKQRFEVF 85
Query: 85 KANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDL 144
NL+ H K S G+ +F+DLT EFRR LG + + L L
Sbjct: 86 LDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQNCSATTKGNVK--LTNAVL 143
Query: 145 PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECD 204
P DWRE G V PVK+QG CGSCW+FSTTGALE A A GK +SLSEQQLVDC +
Sbjct: 144 PETKDWREDGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFN 203
Query: 205 PEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV--- 261
+ GCNGGL + AFEY GGL EE YPYTG + CKF + V
Sbjct: 204 -------NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG--KNGLCKFSSENVGVKVIDS 254
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLV 317
N ++ + DE + A LV+ P+++A + + Y GV C ++H VL V
Sbjct: 255 VNITLGAEDELKYAVALVR--PVSIAFEVIKGFKQYKSGVYSSTECGNTPMDVNHAVLAV 312
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
GYG PYW+IKNSWG WG++GY+K+ G+N+CG+ + S
Sbjct: 313 GYGVENGV-------PYWLIKNSWGADWGDDGYFKMEMGKNMCGIATCAS 355
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 280 IYLNSLLREEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 337
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 338 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 388
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 389 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 446
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 447 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 499
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 500 RGSGACGVNTMASS 513
>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 21/316 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC G M++AF++ + G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G AC + A++ + + +E+ IA L KNGP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV I S+ ++ LLVGY + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEVNSAALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVST 368
I +G N C + VS+
Sbjct: 321 IEKGTNQCRMKDYVSS 336
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 174/317 (54%), Gaps = 32/317 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRR 118
F + K K+Y+S E R IF L +H + + + T G+ +FSDLT AEFR
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 119 TYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
Y+G K + P+ D+ P + + LP DWR++GAV P+KDQG CGSCW+FS
Sbjct: 62 NYVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
++E A+FLAT +LVSLSEQQL+DCD + D GC GG AF++ ++ GG+
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVY 292
EE YPYTG +C +K+K+ + + V+ D V P+ V I +
Sbjct: 170 TEEAYPYTGF--AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQN 226
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ CS DH VL++GYG+ G PYWIIKNSWG SWGENG+ K
Sbjct: 227 FQNYRSGILSGQ-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGENGFMK 278
Query: 353 ICR--GRNVCGVDSMVS 367
I + G +CG++ S
Sbjct: 279 IKKKDGEGMCGMNGQSS 295
>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 21/316 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC G M++AF++ + G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G AC + A++ + + +E+ IA L KNGP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV I S+ ++ LLVGY + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEVNSAALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVST 368
I +G N C + VS+
Sbjct: 321 IEKGTNQCRMKDYVSS 336
>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
Length = 459
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 181/319 (56%), Gaps = 35/319 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y ++EE R +IF +N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221
Query: 119 TYLG--LRR----KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL L+ K+R K + P ++DWR KGAV VKDQG CGSCW+FS
Sbjct: 222 IYLNPLLKEEPGVKMRRAKSVG-------DSAPPEWDWRSKGAVTEVKDQGMCGSCWAFS 274
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GG
Sbjct: 275 VTGNVEGQWFLNRGALLSLSEQELLDCD---------KVDKACMGGLPSNAYSAIKTLGG 325
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E+DY Y G AC F K + + ++ +E ++AA L K GP++VAINA
Sbjct: 326 LETEDDYSYHG--HLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPISVAINAFG 383
Query: 293 MQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y G+S P +CS L DH VLLVGYG+ P+W IKNSWG WGE G
Sbjct: 384 MQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAV-------PFWAIKNSWGTDWGEEG 436
Query: 350 YYKICRGRNVCGVDSMVST 368
YY + RG CGV++M S+
Sbjct: 437 YYYLYRGSGACGVNTMASS 455
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 168/309 (54%), Gaps = 25/309 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYTSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDAS 268
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 269 SFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGYV 320
Query: 352 KICRGRNVC 360
++ G N C
Sbjct: 321 RVTMGVNAC 329
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 178/328 (54%), Gaps = 30/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQ 107
+LL E H LFK K Y SQ E R I+ N + A+H +K + S + +
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNK 78
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTN-DLPADFDWREKGAVGPVKDQGS 164
F DL EFR G + K + A+ P N +P DWREKGA+ PVKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQ 138
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FS+TGALEG F TGKLVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGP 283
+Y G+ E YPY D C+++ +++ A + S +ED++ A + GP
Sbjct: 192 QYIKDNKGIDTENTYPYEAEDD--VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 249
Query: 284 LAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VAI+A + Q Y GV C S LDHGVL+VGYGS K YW++KNS
Sbjct: 250 VSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSD-------NGKDYWLVKNS 302
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
W E WG+ GY K+ R R N CGV S S
Sbjct: 303 WSEHWGDEGYIKMARNRKNHCGVASAAS 330
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 173/317 (54%), Gaps = 28/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K+Y S EE HRF +F+ NL+ K S G+ +F+DL+ EF+R
Sbjct: 48 FESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRK 107
Query: 120 YLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YLGL K+ LPK D D LP DWR+KGAV VK+QG+CGSCW+FST A
Sbjct: 108 YLGL--KIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAA 165
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG L +LSEQ+L+DCD ++GCNGGLM+ AF + + GGL +E
Sbjct: 166 VEGINQIVTGNLTALSEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKE 217
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQ 294
EDYPY + G + + +++ + V D +Q + N PL+VAI A Q
Sbjct: 218 EDYPYV-MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQ 276
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GG+ + C LDHGV VGYG++ K Y +KNSWG WGE GY ++
Sbjct: 277 FYSGGIFNGH-CGTELDHGVAAVGYGTS-------KGVDYITVKNSWGSKWGEKGYIRMK 328
Query: 355 RG----RNVCGVDSMVS 367
R +CG+ M S
Sbjct: 329 RNVGKPEGICGIYKMAS 345
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 172/311 (55%), Gaps = 22/311 (7%)
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRRTY 120
K N +++E R +++ N++ +H + H + F DLT EF++
Sbjct: 33 KAANGKLYNKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVM 92
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
GL K++ P++ + +LP + P+ DWREKG V PVKDQG CGSCW+FS TGALEG
Sbjct: 93 NGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQ 150
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
F TGKLVSLSEQ LVDC ++GCNGGLM++AF Y GGL EE YP
Sbjct: 151 MFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDNGGLDSEESYP 203
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA---VYMQTYI 297
Y D CK+ + AA+ F+ + DE+ + ++ GP++VAI+A + Y
Sbjct: 204 YLAQD--GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYK 261
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
G P S LDHGVL+VGYGS + K YWI+KNSWG WG GY + + R
Sbjct: 262 GIYYDPNCSSEDLDHGVLVVGYGS---DEREAENKNYWIVKNSWGTQWGMQGYILMAKDR 318
Query: 358 -NVCGVDSMVS 367
N CG+ + S
Sbjct: 319 GNHCGIATSAS 329
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 168/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T+F+DLT E+R+ YLG R R++
Sbjct: 69 DQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAK 128
Query: 130 PKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ +Q N ++P DWR+KGAV P+KDQG+CGSCW+FSTT A+EG N + TG+
Sbjct: 129 AKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGE 188
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
L+SLSEQ+LVDCD S + GCNGGLM+ AF++ +K GGL E+DYPY G G
Sbjct: 189 LISLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFG-G 239
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
F K+ S+ + V ++ + P++VAI A Q Y G+
Sbjct: 240 KCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGS- 298
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C LDH V+ VGYGS YWI++NSWG WGE GY ++ R
Sbjct: 299 CGTNLDHAVVAVGYGSENGV-------DYWIVRNSWGPRWGEEGYIRMER 341
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 180/326 (55%), Gaps = 30/326 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLT 112
+ +S FK + +K Y S+ E R IF N + A+H KL G+ +++D+
Sbjct: 24 QEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADML 83
Query: 113 PAEFRRTYLGLRR-KLRLPKDADQAP----ILPTN-DLPADFDWREKGAVGPVKDQGSCG 166
EF T G + K + K +D I P N LP DWR+KGAV VKDQG CG
Sbjct: 84 HHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCG 143
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCWSFS TG+LEG +F TGKLVSLSEQ LVDC ++GCNGGLM++AF Y
Sbjct: 144 SCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYG-------NNGCNGGLMDNAFRY 196
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLA 285
GG+ E+ YPY D C + A+ F + +ED + A + GP++
Sbjct: 197 IKDNGGIDTEKSYPYLAEDE--KCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVS 254
Query: 286 VAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+AI+A + Q Y GV S P S+ LDHGVL+VGYG++ + YW++KNSWG
Sbjct: 255 IAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSD------DGQDYWLVKNSWG 308
Query: 343 ESWGENGYYKICRGR-NVCGVDSMVS 367
SWG NGY K+ R + N+CGV S S
Sbjct: 309 PSWGLNGYIKMARNQDNMCGVASQAS 334
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/273 (45%), Positives = 155/273 (56%), Gaps = 26/273 (9%)
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD---FDWREKGAV 156
+A +G+TQFSDLT EF+ YL ++R + P D+ D FDWRE GAV
Sbjct: 5 TAHYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVTMDNEKFDWREHGAV 60
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
GPV DQG CGSCW+FS G + G F TG L++LSEQQLVDCD+ D GC+
Sbjct: 61 GPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDY---------LDDGCD 111
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GG + K GGL DYPYTG G C DKSK A V +++ L E A
Sbjct: 112 GGYPPQTYTAIQKMGGLELASDYPYTGV--GGICHMDKSKFVAYVNGSTILPLSEKVQAQ 169
Query: 277 NLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYW 335
L GPL+ A+NA +Q Y GG+ P C ++H VL VGYG KPYW
Sbjct: 170 KLRAIGPLSSALNADTLQLYKGGIMRPKWCDPAGVNHAVLTVGYGVQ-------NGKPYW 222
Query: 336 IIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
I+KNSWGE +GE GY++I RG CG++S+V+T
Sbjct: 223 IVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTT 255
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 180/326 (55%), Gaps = 29/326 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDL 111
+ + FK K Y S+ E R IF N + A+H KL S G+ ++SD+
Sbjct: 23 VQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDM 82
Query: 112 TPAEFRRTYLGLRRK---LRLPK-DADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCG 166
EF T G R LR + D I P N +LP DWR+ GAV PVKDQG CG
Sbjct: 83 LNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQCG 142
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCWSFSTTG+LEG +F + KLVSLSEQ L+DC E+ G ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCS-----EKYG--NNGCNGGLMDNAFRY 195
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLA 285
GG+ E+ YPY D C + A+ F + S DE+++ A + GP++
Sbjct: 196 IKDNGGIDTEQSYPYKAEDE--KCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPIS 253
Query: 286 VAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
VAI+A + Q Y GV P S +LDHGVL+VGYG+ YW++KNSWG
Sbjct: 254 VAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDG------NDYWLVKNSWG 307
Query: 343 ESWGENGYYKICRGR-NVCGVDSMVS 367
+SWG+ GY K+ R R N CG+ + S
Sbjct: 308 DSWGDQGYIKMARNRDNNCGIATQAS 333
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 185/332 (55%), Gaps = 31/332 (9%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHG 104
T+ +L+GAE +S FK K Y S+ E +R I+ N + ARH + S
Sbjct: 41 THQELVGAE--WSAFKALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLA 98
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLR-LPKDAD---QAPILPTNDLPADFDWREKGAVGPVK 160
+ +F DL EF T G +R R P++ + + LP DWR+KGAV PVK
Sbjct: 99 MNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVK 158
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTG+LEG +F TG++VSLSEQ LVDC + ++GC GGLM
Sbjct: 159 NQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFG-------NNGCEGGLM 211
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVK 280
++AF+Y GG+ E YPY GTD C F+KS + A+ F + +Q+ V
Sbjct: 212 DNAFKYIKANGGIDTELSYPYNGTDG--ICHFEKSDVGATDTGFVDIPEGNEQLLKKAVA 269
Query: 281 N-GPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV P S LDHGVL+VGYG+ + YW+
Sbjct: 270 TVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYGTK-------DGQDYWL 322
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG++GY + R + N CG+ S S
Sbjct: 323 VKNSWGTTWGDDGYIYMTRNKENQCGIASSAS 354
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 120/297 (40%), Positives = 166/297 (55%), Gaps = 25/297 (8%)
Query: 75 EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDAD 134
EEH RF IFK N++ K D G+ +F+DL+ EF+ Y+G + LR ++
Sbjct: 62 EEHAERFEIFKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQ 121
Query: 135 QAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
+ N LPA DWR+KGAV VK+QG CGSCW+FST ++EG N++ TG LVSLS
Sbjct: 122 SGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLS 181
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG-TDRGHACK 251
EQQLVDC E +SGCNGGLM++AF+Y + GG++ E++YPYT + K
Sbjct: 182 EQQLVDCSTE---------NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTK 232
Query: 252 FDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGVSCPYICSRR 309
+ + F V + +Q V + P++VAI A Q Y GV C
Sbjct: 233 INSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGK-CGTA 291
Query: 310 LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCGV 362
LDHGV+ VGYG+ +P + YWI++NSWG WGE GY ++ +G CG+
Sbjct: 292 LDHGVVAVGYGT---SPEGIN---YWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCGI 342
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 167/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T+F+DLT E+R+ YLG R R++
Sbjct: 69 DQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAK 128
Query: 130 PKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ +Q N ++P DWR+KGAV P+KDQG+CGSCW+FSTT A+EG N + TG+
Sbjct: 129 AKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGE 188
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
L+SLSEQ+LVDCD S + GCNGGLM+ AF++ +K GGL E+DYPY G G
Sbjct: 189 LISLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFG-G 239
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
F K+ S+ + V ++ + P+ VAI A Q Y G+
Sbjct: 240 KCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGS- 298
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C LDH V+ VGYGS YWI++NSWG WGE GY ++ R
Sbjct: 299 CGTNLDHAVVAVGYGSENGV-------DYWIVRNSWGPRWGEEGYIRMER 341
>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
Length = 484
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 247 IYLNTLLRKEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
+G FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 305 KGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 356 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 413
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 414 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 466
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 467 RGSGACGVNTMASS 480
>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
Length = 327
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 183/318 (57%), Gaps = 26/318 (8%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K+Y S ++ ++RF +FK NL R + Q ++ +A +G+TQFSDLT
Sbjct: 27 ARQLYEEFKLKYKKSY-SNDDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQ 85
Query: 115 EFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ YL R K +P D + P + + +FDWR GAVGPV D+G CGSCW+FS
Sbjct: 86 EFKVRYL--RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDKGDCGSCWAFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G +EG F T L+ LSEQQL+DCD D GCNGG AF+ L GGL
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
+ DYPY G R C+ SK+ + ++ DE A L + GP + A+NA+ +
Sbjct: 195 QLDSDYPYEG--REGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPFSSALNALSL 252
Query: 294 QTYIGGV--SCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ P +C ++ L+H VL VGYG G RL PYW +KNSW +GENGY
Sbjct: 253 QFYTEGILHPLPALCDAQSLNHAVLTVGYGKEG----RL---PYWTVKNSWSTMFGENGY 305
Query: 351 YKICRGRNVCGVDSMVST 368
++I RG CG++++VST
Sbjct: 306 FRIYRGDGPCGINTLVST 323
>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
Length = 338
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 178/314 (56%), Gaps = 25/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 41 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 101 IYLNTLLRKEPGNKMKQAK--SVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 158
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL +
Sbjct: 159 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETVD 209
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 210 DYSYQG--HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYR 267
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 268 HGISRPLRPLCSPWLIDHAVLLVGYGNRS-------DVPFWAIKNSWGTDWGEKGYYYLH 320
Query: 355 RGRNVCGVDSMVST 368
RG CGV++M S+
Sbjct: 321 RGSGACGVNTMASS 334
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 176/315 (55%), Gaps = 25/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + K Y S EE HRF IFK NL+ K+ + G+ +F+DL+ EF+
Sbjct: 47 FESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGL+ +++ + +LP DWR+KGAV VK+QGSCGSCW+FST A+EG
Sbjct: 107 YLGLKVDYSRRRESPEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEG 166
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + ++ GGL +EEDY
Sbjct: 167 INQIVTGNLTSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDY 218
Query: 240 PYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
PY + G C+ K + +++ + V + +Q + N PL+VAI A Q Y
Sbjct: 219 PYI-MEEG-TCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFY 276
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GGV + C LDHGV VGYG++ K Y I+KNSWG WGE GY ++ R
Sbjct: 277 SGGVFDGH-CGSDLDHGVAAVGYGTS-------KGVNYIIVKNSWGSKWGEKGYIRMRRN 328
Query: 357 ----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 329 IGKPEGICGIYKMAS 343
>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
Length = 353
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 167/311 (53%), Gaps = 24/311 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPA 114
A H+ FKK+ K + E RF FK N++ A +P A + ++ +F+DLTP
Sbjct: 37 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 96
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + YL R KD + + DWREKG V PVK+QG CGSCW+F+
Sbjct: 97 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 156
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--A 230
TTG +EG L LVSLSEQ LV CD + D GCNGGLM A ++ +
Sbjct: 157 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHN 207
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
G + E+ YPYT D + A +A + + DE++IAA + KNGP+AVA++A
Sbjct: 208 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDA 267
Query: 291 VYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GGV +C L+HGVL+VG+ R + PYWI+KNSWG SWGE G
Sbjct: 268 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YIRLAMGSNQC 329
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 133/322 (41%), Positives = 183/322 (56%), Gaps = 37/322 (11%)
Query: 54 LGAEHHFSLFK---KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSD 110
+ AE F LFK K+ NK Y E H HR F N RR +H + S T + QFSD
Sbjct: 26 VSAEEKF-LFKSWMKQNNKKYHLSEYH-HRLHTFLENKRRIDKHNAGNHSFTMRLNQFSD 83
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCG 166
++ EF++TYL +RLP++ + P DWR+KG V PVK+QG CG
Sbjct: 84 MSFDEFKKTYL-----MRLPQNCSATKGSHVRRLGPYPESVDWRKKGNFVSPVKNQGGCG 138
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FSTTG LE A +ATGKL+SL+EQQLVDC + + + GCNGGL + AFEY
Sbjct: 139 SCWTFSTTGGLESAVAIATGKLLSLAEQQLVDCAQDFN-------NHGCNGGLPSQAFEY 191
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLA 285
+ G+M E+ YPY G D CKF +K A V + + + + DE+ + + + P++
Sbjct: 192 IMYNKGIMGEDTYPYEGKDG--TCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNPVS 249
Query: 286 VAINAV--YMQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
A ++ + G S P CS+ +++H VL VGYG + PYWI+KNS
Sbjct: 250 FAFEVTDDFLSYHKGIYSNP-KCSKSPDKVNHAVLAVGYG-------KENGIPYWIVKNS 301
Query: 341 WGESWGENGYYKICRGRNVCGV 362
WG SWG NGY+ I RG+N+CG+
Sbjct: 302 WGTSWGNNGYFLIERGKNMCGL 323
>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
Length = 353
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 167/311 (53%), Gaps = 24/311 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPA 114
A H+ FKK+ K + E RF FK N++ A +P A + ++ +F+DLTP
Sbjct: 37 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 96
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + YL R KD + + DWREKG V PVK+QG CGSCW+F+
Sbjct: 97 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 156
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--A 230
TTG +EG L LVSLSEQ LV CD + D GCNGGLM A ++ +
Sbjct: 157 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMEQAMQWIINDHN 207
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
G + E+ YPYT D + A +A + + DE++IAA + KNGP+AVA++A
Sbjct: 208 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDA 267
Query: 291 VYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GGV +C L+HGVL+VG+ R + PYWI+KNSWG SWGE G
Sbjct: 268 TTWQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YIRLAMGSNQC 329
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 181/320 (56%), Gaps = 27/320 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
E ++++FK K NK Y+ E+ R+ I++ NL++ H +L G +++D+T
Sbjct: 19 EANWAIFKAKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMT 77
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFRRT GLR L D + + LP DWR++G V VKDQG CGSCW+FS
Sbjct: 78 NEEFRRTLSGLRVDKELTP-GDFVSGMFKDSLPTAVDWRKEGYVTEVKDQGQCGSCWAFS 136
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG+LEG +F AT +LVSLSE LVDC + + GCNGGLM++AF+Y G
Sbjct: 137 TTGSLEGQHFKATKQLVSLSESNLVDCSKKWG-------NQGCNGGLMDNAFKYIADNKG 189
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY DR C F K+ + A+ + + S ED + + GP++VAI+A
Sbjct: 190 IDTEKSYPYKPEDR--KCNFKKANVGATDKLYKDITSGSEDALQEAVATIGPISVAIDAS 247
Query: 292 Y--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GGV CS + LDHGVL VGY S YWI+KNSWG+SWG +
Sbjct: 248 HDSFQLYSGGVYNEKACSTKTLDHGVLAVGYDSK-------NGDDYWIVKNSWGKSWGID 300
Query: 349 GYYKICRG-RNVCGVDSMVS 367
GY + R +N CG+ +M S
Sbjct: 301 GYIWMSRNKKNQCGIATMAS 320
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 130/331 (39%), Positives = 182/331 (54%), Gaps = 44/331 (13%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFS 109
LGA+ F FK + K Y +Q E RF IF N+R H L S GI +F+
Sbjct: 22 LGAK--FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFT 79
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN-------DLPADFDWREKGAVGPVKDQ 162
D++ EF K L A + P L T ++P+ DWR++G V VKDQ
Sbjct: 80 DMSQEEF---------KTMLTLSASRKPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQ 130
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
G CGSCW+FS TG+ EGA +GKLVSLSEQQL+DC C +GC+GG ++
Sbjct: 131 GDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDC---CT-----DTSAGCDGGSLDD 182
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKN 281
F+Y +K GL EE Y Y G D ACK++ + + V+ + S+ + DED + +
Sbjct: 183 NFKYVMK-DGLQSEESYTYKGED--GACKYNVASVVTKVSKYTSIPAEDEDALLEAVATV 239
Query: 282 GPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
GP++V ++A Y+ +Y G+ CS L+H +L VGYG+ K YWIIKNS
Sbjct: 240 GPVSVGMDASYLSSYDSGIYEDQDCSPAGLNHAILAVGYGTE-------NGKDYWIIKNS 292
Query: 341 WGESWGENGYYKICRGRNVCGV--DSMVSTV 369
WG SWGE GY+++ RG+N CG+ D++ T+
Sbjct: 293 WGASWGEQGYFRLARGKNQCGISEDTVYPTI 323
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 173/318 (54%), Gaps = 32/318 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-----GITQFSDLTPAEFR 117
+K + K Y S EE R I++ NL +H L H GI QF+DL EF
Sbjct: 31 WKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHN-LKYDLGHFTYDLGINQFTDLQNEEFV 89
Query: 118 RTYLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
G R K A + LP N +LP DWR KG V PVKDQG CGSCW+FSTT
Sbjct: 90 AMMTGFRVS-GTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G++EG +F ATGKLVSLSEQ LVDC D+GC+GG M+ AF+Y + AGG+
Sbjct: 149 GSVEGQHFKATGKLVSLSEQNLVDCSGR---------DAGCDGGFMDRAFQYIIDAGGID 199
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVYM 293
E YPY D C F K+ + A+V ++ V S E + + GP++VAI+A +M
Sbjct: 200 TEASYPYKAVDG--KCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHM 257
Query: 294 --QTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV C S LDHGVL VGYG++ YWI+KNSW E+WG NGY
Sbjct: 258 SFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSS------DGTDYWIVKNSWAETWGMNGY 311
Query: 351 YKICRGR-NVCGVDSMVS 367
+ R + N CG+ + S
Sbjct: 312 VWMSRNKDNQCGIATNAS 329
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 180/346 (52%), Gaps = 33/346 (9%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IRQ+ G L E+ ++G H F+ F ++ K Y + EE RF +F NL
Sbjct: 33 IRQIVSDG---LHELENGILQVVGKTRHALLFARFAHRYGKRYETVEEIKQRFEVFLDNL 89
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+ H K S G+ +F+D+T EFRR LG + + L LP
Sbjct: 90 KMIRSHNKKGLSYKLGVNEFTDITWDEFRRDRLGAAQNCSATTKGNLK--LTNVVLPETK 147
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVK+QG CGSCW+FSTTGALE A A GK +SLSEQQLVDC +
Sbjct: 148 DWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFN---- 203
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANFS 265
+ GCNGGL + AFEY GGL EE YPYTG + CKF + V N +
Sbjct: 204 ---NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG--KNGLCKFSSENVGVKVIDSVNIT 258
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
+ + DE + A LV+ P+++A + + Y GV C ++H VL VGYG
Sbjct: 259 LGAEDELKYAVALVR--PVSIAFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGV 316
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWG WG+NGY+K+ G+N+CG+ + S
Sbjct: 317 ENGV-------PYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 355
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 179/314 (57%), Gaps = 24/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ + + +K+Y S EE R+ +++ N + H + + ++ + +F DLT AEF +
Sbjct: 30 FAEWMRDNSKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGDLTNAEFNKL 88
Query: 120 YLGLRRKLRLPKDADQA-PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+ GL + A +P L ADFDWR+KGAV VK+QG CGSCWSFSTTG+ E
Sbjct: 89 FKGLAFDYSFHANKAAAEKAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSC-DSGCNGGLMNSAFEYTLKAGGLMREE 237
GANFL TG+L SLSEQ L+DC GS ++GCNGGLM+ AFEY + G+ E
Sbjct: 149 GANFLKTGRLTSLSEQNLIDC--------SGSYGNNGCNGGLMDYAFEYIINNKGIDTEA 200
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPY + C+++ + S+ +++ VS ++ N V P +VAI+A + Q
Sbjct: 201 SYPYQTAQ--YTCQYNPANSGGSLTSYTDVSSGDENALLNAVATEPTSVAIDASHNSFQF 258
Query: 296 YIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV CS +LDHGVL VG+G+ + YW++KNSWG WG GY K+
Sbjct: 259 YSGGVYYESACSSTQLDHGVLAVGWGTE-------DGQDYWLVKNSWGADWGLAGYIKMA 311
Query: 355 RGR-NVCGVDSMVS 367
R R N CG+ + S
Sbjct: 312 RNRSNNCGIATSAS 325
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 170/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAVKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 138/374 (36%), Positives = 196/374 (52%), Gaps = 40/374 (10%)
Query: 4 KTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLF 63
K ++LF V VF+ SG + VD +I VT L + +L A +F F
Sbjct: 2 KKIILFFV--FVFA---SGGFDNGVDAIIDYVTAAPQFKLQY------NLERAPQYFETF 50
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
+ K+ K YA E D+R+ IFK NL + + SA + I +F+DLT E + GL
Sbjct: 51 QTKYKKVYADDNERDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGL 110
Query: 124 RRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
+ K++ + I+ P+ FDWR+ + VKDQG CGSCW+FST LE
Sbjct: 111 GIRSPALKNSCEPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQY 170
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ + V LSEQQLVDCD + D GC GGL+++A+E + GGL EEDYPY
Sbjct: 171 AIKYNEHVDLSEQQLVDCD---------TIDMGCAGGLLHTAYEEIMAMGGLEYEEDYPY 221
Query: 242 TGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV 300
C+ K SV N + V ED++ L + GP+AVA++AV + Y GG+
Sbjct: 222 RSVQ--GPCRLQSDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGI 279
Query: 301 --SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
SC + L+H VLLVGYG P+W++KNSWG +GENG+ ++ R N
Sbjct: 280 ITSCK---NYGLNHAVLLVGYGIENGV-------PFWVLKNSWGSDYGENGFVRVKRNVN 329
Query: 359 VCGVDSMVSTVAAA 372
CG M++ +AA+
Sbjct: 330 SCG---MINELAAS 340
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 172/318 (54%), Gaps = 23/318 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E + +K K Y + E R I++ NL++ +H S T + DLT EF
Sbjct: 25 EQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQDEF 84
Query: 117 RRTYLGLRRKL-RLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
R Y G+R K A + P++ +P DWR++G V PVK+QG CGSCW+FSTT
Sbjct: 85 RYFYTGMRSHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G+LEG NF TGKLVSLSEQ LVDC ++GC GGLM+ AF+Y + GG+
Sbjct: 145 GSLEGQNFKKTGKLVSLSEQNLVDC-------STAYGNNGCQGGLMDYAFKYIKENGGID 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM 293
EE YPY R C+F KS I A F V DE+ + GP++VAI+A +M
Sbjct: 198 TEESYPYEA--RNDRCRFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPISVAIDAGHM 255
Query: 294 --QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV CS LDHGVL+VGYG+ + YW++KNSWGE WG GY
Sbjct: 256 SFQFYHSGVYNNAGCSSTSLDHGVLVVGYGT-------YQGSDYWLVKNSWGERWGMEGY 308
Query: 351 YKICRGR-NVCGVDSMVS 367
+ R + N CGV + S
Sbjct: 309 IMMSRNKNNQCGVATQAS 326
>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
Length = 354
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 170/319 (53%), Gaps = 24/319 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPA 114
A H+ FKK+ K + E RF FK N++ A +P A + ++ +F+DLTP
Sbjct: 38 ASAHYGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQ 97
Query: 115 EFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + YL R KD + + DWREKG V PVK+QG CGSCW+F+
Sbjct: 98 EFAKLYLNPNYYARHGKDYKEHVHVDDSVRSGVMSVDWREKGVVTPVKNQGMCGSCWAFA 157
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--A 230
TTG +EG L LVSLSEQ LV CD + D GCNGGLM A ++ +
Sbjct: 158 TTGNIEGQWALKNHSLVSLSEQVLVSCD---------NIDDGCNGGLMQQAMQWIINDHN 208
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
G + E+ YPYT D + A + + + DE++IAA + KNGP+AVA++A
Sbjct: 209 GTVPTEDSYPYTSAGGTRPPCHDNGTVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDA 268
Query: 291 VYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GGV +C L+HGVL+VG+ R + PYWI+KNSWG SWGE G
Sbjct: 269 TTRQLYFGGVVT--LCFGLSLNHGVLVVGFN-------RQAKPPYWIVKNSWGSSWGEKG 319
Query: 350 YYKICRGRNVCGVDSMVST 368
Y ++ G N C + + V T
Sbjct: 320 YIRLAMGSNQCLLKNYVVT 338
>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
Length = 459
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 178/313 (56%), Gaps = 23/313 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y +QEE R ++F N+ RA + Q LD +A +GIT+FSDLT EFR
Sbjct: 162 FKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRA 221
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YL K K A + + P ++DWR KGAV VK+QG CGSCW+FS TG +E
Sbjct: 222 IYLNPLLKENRNKMMHLAKSI-GDHAPPEWDWRTKGAVTNVKNQGMCGSCWAFSVTGNVE 280
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+D
Sbjct: 281 GQWFLKQGDLLSLSEQELLDCD---------KVDKACLGGLPSNAYLAIKNLGGLETEDD 331
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIG 298
Y Y+G C F K + + +S +E ++AA L K GP++VAINA MQ Y
Sbjct: 332 YSYSG--HLQTCSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINAFGMQFYRR 389
Query: 299 GVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+S P +CS L DH VLLVGYG+ P+W IKNSWG WGE GYY + R
Sbjct: 390 GISHPLRPLCSPWLIDHAVLLVGYGNRSGI-------PFWAIKNSWGTDWGEEGYYYLYR 442
Query: 356 GRNVCGVDSMVST 368
G CGV++M S+
Sbjct: 443 GSGACGVNAMASS 455
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 172/311 (55%), Gaps = 22/311 (7%)
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRRTY 120
K N +++E R +++ N++ +H + H + F DLT EF++
Sbjct: 33 KAANGKLYNKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVM 92
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
GL K++ P++ + +LP + P+ DWREKG V PVKDQG CGSCW+FS TGALEG
Sbjct: 93 NGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQ 150
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
F TGKLVSLSEQ LVDC ++GCNGGLM++AF Y GGL EE YP
Sbjct: 151 MFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDNGGLDSEESYP 203
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA---VYMQTYI 297
Y D CK+ + AA+ F+ + DE+ + ++ GP++VAI+A + Y
Sbjct: 204 YLAQD--GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPISVAIDASLDTFRFYYK 261
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
G P S LDHGVL+VGYGS + K YWI+KNSWG WG GY + + R
Sbjct: 262 GIYYDPNCSSEDLDHGVLVVGYGS---DEREAENKNYWIVKNSWGTQWGMQGYILMAKDR 318
Query: 358 -NVCGVDSMVS 367
N CG+ + S
Sbjct: 319 GNHCGIATSAS 329
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 258 AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV---YMQTYIGGVSCPYICSRRLDHGV 314
AA V V E+ + + GP++ AI A + G P S LDHGV
Sbjct: 391 AADVTGPVNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNCSSEDLDHGV 450
Query: 315 LLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
L+VGYGS + K YWI+KNSWG WG GY + R
Sbjct: 451 LVVGYGSD---EREAENKNYWIVKNSWGTDWGLQGYMLLVR 488
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 178/313 (56%), Gaps = 28/313 (8%)
Query: 67 FNKAYASQEEHDHRFT------IFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTY 120
F K + ++RF I++ N+ R H + + S + QF DLT AEF R +
Sbjct: 30 FAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF 89
Query: 121 LGLRRKL-RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
GL + K AP P +P++FDWR+KGAV VK+QG CGSCWSFSTTG+ EG
Sbjct: 90 KGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEG 149
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
ANFL TG+LVSLSEQ L+DC ++GCNGGLM+ AFEY + G+ E Y
Sbjct: 150 ANFLKTGRLVSLSEQNLIDCSVSYG-------NNGCNGGLMDYAFEYIINNRGIDTEASY 202
Query: 240 PYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
PY T C+++ + S+ ++ V S DE+ + VK P++VAI+A + Q Y
Sbjct: 203 PYQ-TAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKE-PVSVAIDASHNSFQFY 260
Query: 297 IGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GGV CS +LDHGVL+VG+GS + +W +KNSWG SWG NGY K+ R
Sbjct: 261 SGGVYYESACSSTQLDHGVLVVGWGSE-------NGQDFWWVKNSWGASWGLNGYIKMSR 313
Query: 356 GR-NVCGVDSMVS 367
+ N CG+ + S
Sbjct: 314 NQNNNCGIATAAS 326
>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
Length = 467
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 165/310 (53%), Gaps = 23/310 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK++ K Y S E R +FK NL A H +P A+ G+T FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPILPT---NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y + + P+ PA DWR +GAV +KDQG CGSCW+FST G
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFSTIG 156
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+EG LA L LSEQ LV CD+ D+GC+GGLM+SAF++ + G +
Sbjct: 157 NIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVGQNNGSV 207
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E Y Y +G C + A ++ + DED++AA L NGPLA+A++A
Sbjct: 208 YTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATS 267
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Y GGV + S +LDHGV+LVGY + PYWIIKNSWG WGE GY +
Sbjct: 268 FMSYTGGVLTNCV-SDQLDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGYIR 319
Query: 353 ICRGRNVCGV 362
I +G N C V
Sbjct: 320 IQKGTNQCLV 329
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 169/318 (53%), Gaps = 20/318 (6%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTP 113
G + + +K K+Y+ E R I++ NL + RH D S + DLT
Sbjct: 21 FGQDSEWVAWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTE 80
Query: 114 AEFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR YLG+R K + P+N +P+ DW +KG V VK+QG CGSCW+FS
Sbjct: 81 DEFRYFYLGVRAHHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFS 140
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG++EG +F TG LVSLSEQ L+DC ++GC GGLM++AF Y GG
Sbjct: 141 TTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYG-------NNGCQGGLMDNAFRYIESNGG 193
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAV 291
+ E YPY G +C F S + A V + + +Q + V GP++VA++A
Sbjct: 194 IDTESSYPYLGQQG--SCHFSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDAS 251
Query: 292 YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV PY S +LDHGVL++GYG+ + YW++KNSWG SWG GY
Sbjct: 252 QWQFYSSGVYDNPYCSSTQLDHGVLVIGYGN-------YNGQDYWLVKNSWGYSWGVEGY 304
Query: 351 YKICRGR-NVCGVDSMVS 367
+ R + N CG+ S S
Sbjct: 305 IMMSRNKNNQCGIASSAS 322
>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
Length = 443
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 170/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D+GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 YTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 HTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 175/320 (54%), Gaps = 26/320 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLT 112
+ + L+ K K Y ++EE R I++ NL +H + D S G+ ++ D+T
Sbjct: 24 DSEWQLYLKAHGKQYGAEEEARRR-VIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMT 82
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR T G + + + + P DLP DWR KG V P+K+QG CGSCWSFS
Sbjct: 83 NEEFRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG F TGKL SLSEQ LVDC + + GC GGLM+ AF+Y G
Sbjct: 143 ATGSLEGQTFKKTGKLPSLSEQNLVDCSQK-------QGNHGCQGGLMDDAFQYIKDNNG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
+ E YPY + C+F+ + + A+ + F+ + S E + + + GP+AVAI+A
Sbjct: 196 IDTESSYPYEA--KNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDAS 253
Query: 292 YM--QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+M Q Y GV + CS RLDHGVL VGYG+ K YW++KNSWGESWG+
Sbjct: 254 HMSFQLYKSGVYHEFFCSETRLDHGVLAVGYGTE-------SGKDYWLVKNSWGESWGQK 306
Query: 349 GYYKICRG-RNVCGVDSMVS 367
GY + R RN CG+ + S
Sbjct: 307 GYIMMSRNKRNNCGIATSAS 326
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 172/318 (54%), Gaps = 32/318 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-----GITQFSDLTPAEFR 117
+K + K Y S EE R I++ NL RH L H G+ QF+DL EF
Sbjct: 31 WKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHN-LKYDLGHFTYDLGMNQFADLQNKEFV 89
Query: 118 RTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
G R K A + LP N+ LP DWR KG V PVKDQG CGSCW+FS T
Sbjct: 90 AMMTGFRVN-GTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSAT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G+LEG +F TGKLVSLSEQ LVDC + + GCNGGLM+ AF+Y + AGG+
Sbjct: 149 GSLEGQHFKKTGKLVSLSEQNLVDCSDK---------NYGCNGGLMDRAFQYIIDAGGID 199
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVY- 292
EE YPY D C F + + A+V ++ V S E + + GP++VAI+A +
Sbjct: 200 TEESYPYIAMDGN--CHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHF 257
Query: 293 -MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV + P S LDHGVL VGYG+ + YWI+KNSW E+WG NGY
Sbjct: 258 SFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTT------IDGTDYWIVKNSWAETWGMNGY 311
Query: 351 YKICRGR-NVCGVDSMVS 367
+ R + N CG+ + S
Sbjct: 312 IWMSRNKDNQCGIATQAS 329
>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
Length = 441
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHARFGITKFFDLSEAEFATR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWR+ GAV PV DQG+CGSCW+FS G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVNDQGACGSCWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E ++ T L++LSEQ+LV CD D GCNGGLM AF++ L K G +
Sbjct: 158 NIESQWYVTTHSLITLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNKNGAV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S + A + + +ED +AA L NGP+A+A++A
Sbjct: 209 YTGASYPYV-SGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GG+ SC R+L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 268 SAFMSYTGGILTSCD---GRQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 318 GYVRVRKGTNEC 329
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 168/309 (54%), Gaps = 25/309 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT T + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYTSTFGYVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDAS 268
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Y GV I +L+HGVLLVGY G E PYW+IKNSWG+ WGE GY
Sbjct: 269 SFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGKDWGEKGYV 320
Query: 352 KICRGRNVC 360
++ G N C
Sbjct: 321 RVTMGVNAC 329
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E ++ + A ++ + + Y + E + R+ +F+ NLR H +
Sbjct: 31 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 87
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG R R R K + DLP DWR KGAV
Sbjct: 88 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAV 147
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGSCGSCW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCN 199
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ E+DYPY GTD G K+ ++ ++ V ++++
Sbjct: 200 GGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANDEKSLQ 258
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 259 KAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGKDY 310
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 311 WIVKNSWGSSWGESGYVRMER 331
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 HTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
Length = 441
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + YA+ E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHARFGITKFFDLSEAEFATR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + K A Q DL PA DWR+ GAV PVKDQG+CGSCW+ S G
Sbjct: 98 YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVKDQGACGSCWALSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+E ++ T L++LSEQ+LV CD D GCNGGLM AF++ L K G +
Sbjct: 158 NIESQWYVTTHSLITLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNKNGAV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
YPY + G + +S + A + + +ED +AA L NGP+A+A++A
Sbjct: 209 YTGASYPYV-SGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GG+ SC R+L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 268 SAFMSYTGGILTSCD---GRQLNHGVLLVGYNMTG-------EVPYWLIKNSWGENWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ +G N C
Sbjct: 318 GYVRVRKGTNEC 329
>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
Length = 320
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 182/316 (57%), Gaps = 29/316 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPA 114
A + FK K+ K+Y S ++ ++RF +FK NL R + Q ++ +A +G+TQFSDLT
Sbjct: 27 ARQLYEEFKLKYKKSY-SNDDDEYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQ 85
Query: 115 EFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF+ YL R K +P D + P + + +FDWR GAVGPV DQG CGSCW+FS
Sbjct: 86 EFKVRYL--RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
G +EG F T L+ LSEQQL+DCD D GCNGG AF L GGL
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFRQILGMGGL 194
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM 293
+ DYPY G R C+ SK+ + ++ DE A L + GPL+ A+NA+++
Sbjct: 195 QLDSDYPYEG--REGQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252
Query: 294 QTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q + P +C ++ L+H VL VGYG G RL PYW +KNSW +GENGY++
Sbjct: 253 QHPL-----PALCDAQSLNHAVLTVGYGKEG----RL---PYWTVKNSWSTMFGENGYFR 300
Query: 353 ICRGRNVCGVDSMVST 368
I RG CG++++VST
Sbjct: 301 IYRGDGTCGINTLVST 316
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 171/311 (54%), Gaps = 28/311 (9%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
K K+Y S EE HRF +F+ NL+ K S G+ +F+DL+ EF+R YLGL
Sbjct: 3 KHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL-- 60
Query: 126 KLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
K+ LPK D D LP DWR+KGAV VK+QG+CGSCW+FST A+EG N
Sbjct: 61 KIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQ 120
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ TG L +LSEQ+L+DCD ++GCNGGLM+ AF + + GGL +EEDYPY
Sbjct: 121 IVTGNLTALSEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV 172
Query: 243 GTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGV 300
+ G + + +++ + V D +Q + N PL+VAI A Q Y GG+
Sbjct: 173 -MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI 231
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---- 356
+ C LDHGV VGYG++ K Y +KNSWG WGE GY ++ R
Sbjct: 232 FNGH-CGTELDHGVAAVGYGTS-------KGVDYITVKNSWGSKWGEKGYIRMKRNVGKP 283
Query: 357 RNVCGVDSMVS 367
+CG+ M S
Sbjct: 284 EGICGIYKMAS 294
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 98 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 157
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 158 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 217
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 218 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 268
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 269 YTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 327
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 328 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 379
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 380 VRVVMGVNAC 389
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 191/337 (56%), Gaps = 34/337 (10%)
Query: 42 ILSHHESTNNDLLGAEHH----FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL 97
I+++ ++ N L+ + ++ + K K+Y + E + RF IFK NLR H
Sbjct: 27 IINYDQTHTNSLIRTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNA- 85
Query: 98 DPSATH--GITQFSDLTPAEFRRTYLGLRRKLRLPK----DADQAPILPTNDLPADFDWR 151
DP ++ G+ +F+DLT E+R YLG + + PK +D+ + +LP DWR
Sbjct: 86 DPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWR 145
Query: 152 EKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSC 211
EKGAV VKDQGSCGSCW+FS GA+EG N + TG+L++LSEQ+LVDCD S
Sbjct: 146 EKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDR--------SY 197
Query: 212 DSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDE 271
+ GC GGLM+ AF + +K GG+ + DYPYTG D G + ++ ++ ++ V + +
Sbjct: 198 NEGCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRD-GTCNQNKENAKVVTIDSYEDVPVYD 256
Query: 272 DQIAANLVKNGPLAVAINAVYM--QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRL 329
++ N P++VAI A M Q Y+ G+ C +DHGV++VGYGS
Sbjct: 257 EKALQKAAANQPISVAIEAGGMDFQLYVSGIFTGK-CGTAVDHGVVVVGYGSE------- 308
Query: 330 KEKPYWIIKNSWGESWGENGYYKICRG----RNVCGV 362
+ YWI++NSWG +WGE GY K+ R +CG+
Sbjct: 309 EGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSSGLCGI 345
>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 382
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 171/314 (54%), Gaps = 25/314 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+FS G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC GG + AF++ + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TNDFGCGGGFSDPAFKWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A + + + DE+ IA L KNGP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKNGPVAIAVDATS 268
Query: 293 MQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q+Y GGV SC S+ ++ VLLVGY + PYWIIKNSW + WGE GY
Sbjct: 269 FQSYTGGVLTSC---ISKEMNSAVLLVGYDDTS-------KPPYWIIKNSWSKGWGEKGY 318
Query: 351 YKICRGRNVCGVDS 364
+I +G N C V +
Sbjct: 319 IRIEKGTNQCLVKN 332
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 180/313 (57%), Gaps = 35/313 (11%)
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGLRRK 126
+K Y E D RF IF NL+ H + + S G+T+F+DLT EFR YL R K
Sbjct: 45 HKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYL--RSK 102
Query: 127 LRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+ +D+ ++ N LP + DWR KGAV PVKDQGSCGSCW+FS GA+EG N +
Sbjct: 103 MERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQI 162
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG+LVSLSEQ+LVDCD S ++GC GGLM+ AF++ + GG+ EEDYPYT
Sbjct: 163 KTGELVSLSEQELVDCDT--------SYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTA 214
Query: 244 TDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGV 300
TD + C DK ++ + V +E+ + L N P++VAI A Q Y GV
Sbjct: 215 TDD-NICNTDKKNTRVVTIDGYEDVPENENSLKKALA-NQPISVAIEAGGRGFQLYKSGV 272
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV- 359
C LDHGV+ VGYG++ + + YWII+NSWG +WGE+GY K+ RN+
Sbjct: 273 FTG-TCGTALDHGVVAVGYGTS-------EGQDYWIIRNSWGSNWGESGYIKL--QRNIK 322
Query: 360 -----CGVDSMVS 367
CGV M S
Sbjct: 323 DSSGKCGVAMMAS 335
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 175/317 (55%), Gaps = 32/317 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
F + K K+Y+S E R IF L +H L + + T G+ +FSDLT AEFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 119 TYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
Y+G K + P+ D+ P + + LP DWR++GAV P+KDQG CGSCW+FS
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
++E A+FLAT +LVSLSEQQL+DCD + D GC GG AF++ ++ GG+
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVY 292
EE YPYTG +C +K+K+ + + V+ D V P+ V I +
Sbjct: 170 TEEAYPYTGF--AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQN 226
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ + CS DH VL++GYG+ G PYWIIKNSWG SWGE+G+ +
Sbjct: 227 FQNYRSGILSGH-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMR 278
Query: 353 ICR--GRNVCGVDSMVS 367
I + G +CG++ S
Sbjct: 279 IKKEDGEGMCGMNGQSS 295
>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 169/312 (54%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEQQLVSCDDK---------DSGCGGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSS-GYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC I L+HGVLLVGY G E PYW+IKNSWGE WGEN
Sbjct: 268 SSFMSYESGVLTSCAGI---TLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 175/317 (55%), Gaps = 32/317 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
F + K K+Y+S E R IF L +H L + + T G+ +FSDLT AEFR
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 119 TYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
Y+G K + P+ D+ P + + LP DWR++GAV P+KDQG CGSCW+FS
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
++E A+FLAT +LVSLSEQQL+DCD + D GC GG AF++ ++ GG+
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVY 292
EE YPYTG +C +K+K+ + + V+ D V P+ V I +
Sbjct: 170 TEEAYPYTGF--AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQN 226
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ + CS DH VL++GYG+ G PYWIIKNSWG SWGE+G+ +
Sbjct: 227 FQNYRSGILSGH-CSNSRDHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMR 278
Query: 353 ICR--GRNVCGVDSMVS 367
I + G +CG++ S
Sbjct: 279 IKKKDGEGMCGMNGQSS 295
>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 315
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 177/324 (54%), Gaps = 38/324 (11%)
Query: 44 SHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH 103
+HH + + + A +S FK K+NK YA + +R IF NL+ + K +
Sbjct: 26 THHNTQEDQNIQA--LWSAFKTKYNKKYADPDFERYRIEIFTENLKVVESNTK-----NY 78
Query: 104 GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
GITQF D+T EF++TYL L+ K L +P ND + DW KGAV PVKDQG
Sbjct: 79 GITQFMDITREEFKQTYLTLKMKNGLKA----SPFAKFNDAGVEIDWTTKGAVTPVKDQG 134
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCWSFSTTGA+EGA FL+T KL SLSEQ LVDC + + GCNGGLM++A
Sbjct: 135 QCGSCWSFSTTGAVEGALFLSTKKLTSLSEQYLVDCSKD--------GNEGCNGGLMDTA 186
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
F++ + G+ E YPY D CK S S + + N ++ P
Sbjct: 187 FDF-ISQHGIPTEAAYPYKAVD--GTCKMTSGPYKIS----SHTDIQDCNDLLNKIQKQP 239
Query: 284 LAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
+A+A++A Q Y + C LDHGVLLVGY ++G YW +KNSWG
Sbjct: 240 IAIAVDANNFQYYQKDIFSD--CGTELDHGVLLVGYSASG---------KYWKVKNSWGP 288
Query: 344 SWGENGYYKICRGRNVCGVDSMVS 367
+WGE+G+ ++ G N CG+ +M S
Sbjct: 289 NWGESGFIRLAAG-NTCGLCNMAS 311
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 169/311 (54%), Gaps = 28/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK----IAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++
Sbjct: 209 HTEDSYPYV-SGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALD 267
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
A +Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE G
Sbjct: 268 ASSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQG 319
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 320 YVRVVMGVNAC 330
>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
Length = 467
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 168/314 (53%), Gaps = 21/314 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ANL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWRE+GAV VK+QG CGSCW+F+ G +
Sbjct: 97 RYHNGAAHFAAAEERARVPVDVEVVGAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNI 156
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
EG FLA L LSEQ LV CD+ +SGC GGL + AFE+ ++ G +
Sbjct: 157 EGQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGLSSKAFEWIVQENNGAVYT 207
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY + CK + A++ + DE QIAA+ GPL+VA++A
Sbjct: 208 EDSYPYHSCIGIKLPCKDSDRTVGATITGHVELPQDEAQIAASGAVKGPLSVAVDASSWF 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV + S+RL H VLLVGY + PYWIIKNSW WGE GY +I
Sbjct: 268 FYTGGVLTNCV-SKRLSHAVLLVGYNDSAAV-------PYWIIKNSWTTHWGEGGYIRIA 319
Query: 355 RGRNVCGVDSMVST 368
+G N C V VS+
Sbjct: 320 KGSNQCLVKEEVSS 333
>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
Length = 348
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHCRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 125/315 (39%), Positives = 175/315 (55%), Gaps = 25/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + K Y + EE RF IFK NL+ K+ + G+++F+DL+ EF
Sbjct: 48 FESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNK 107
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGL+ +++ + +LP DWR+KGAV PVK+QGSCGSCW+FST A+EG
Sbjct: 108 YLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEG 167
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + ++ GGL +EEDY
Sbjct: 168 INQIVTGNLTSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDY 219
Query: 240 PYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
PY + AC+ K + +++ + V + +Q + N PL+VAI A Q Y
Sbjct: 220 PYIMEE--GACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFY 277
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GGV + C LDHGV VGYG+A K Y +KNSWG WGE GY ++ R
Sbjct: 278 SGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYITVKNSWGSKWGEKGYIRMRRN 329
Query: 357 ----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 330 IGKPEGICGIYKMAS 344
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 177/317 (55%), Gaps = 32/317 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRR 118
F + K +K+Y+S E R +F L +H + + + T G+ +FSDLT AEFR
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 119 TYLGLRRKLRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
Y+G K + P+ D+ P + + LP DWR++GAV P+KDQG CGSCW+FS
Sbjct: 62 NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
++E A+FLAT +LVSLSEQQL+DCD + D GC GG + AF++ ++ GG+
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPDDAFKFVVENGGVT 169
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVY 292
EE YPYTG +C +K+K+ + + V+ D V P+ V I +
Sbjct: 170 TEEAYPYTGF--AGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQN 226
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ C+ R DH VL++GYG+ G PYWIIKNSWG SWGE+G+ K
Sbjct: 227 FQNYRSGILSGQCCNSR-DHAVLVIGYGTEG-------GMPYWIIKNSWGTSWGEDGFMK 278
Query: 353 ICR--GRNVCGVDSMVS 367
I + G +CG++ S
Sbjct: 279 IKKKDGEGMCGMNGQSS 295
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 138/360 (38%), Positives = 195/360 (54%), Gaps = 47/360 (13%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGD---EILSHHESTNNDLLGAEHHFSLF 63
+LFL + V SAV + D + T GG E++S +E+ L
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVS--TTGGRSEAEVMSIYEAW------------LV 55
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
K ++ S E D RF IFK NLR H + + S G+T+F+DLT E+R YLG
Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGA 115
Query: 124 R------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+ R+ L +A ++LP DWR+KGAV VKDQG CGSCW+FST GA+
Sbjct: 116 KMEKKGERRTSLRYEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAV 170
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ ++
Sbjct: 171 EGINQIVTGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDK 222
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
DYPY G D G + K+ ++ ++ V ++ V + P+++AI A Q
Sbjct: 223 DYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQL 281
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y G+ C +LDHGV+ VGYG+ K YWI++NSWG+SWGE+GY ++ R
Sbjct: 282 YDSGIF-DGSCGTQLDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLRMAR 333
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 177/308 (57%), Gaps = 34/308 (11%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANL----RRAARHQKLDPSATHGITQFSDLTPAEFRR 118
FK ++K+Y S+ R F+ANL + A H + S T G+ +F+DLT EF
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
Y+ + +P + P + + DWR KGAV P+K+QG CGSCWSFSTTG+ E
Sbjct: 61 LYVPSKFNRTMPYNTVYLPATSEDSV----DWRTKGAVTPIKNQGQCGSCWSFSTTGSTE 116
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSC-DSGCNGGLMNSAFEYTLKAGGLMREE 237
GA+ +ATG LVSLSEQQLVDC GS + GCNGGLM+ AF+Y + GL EE
Sbjct: 117 GAHAIATGNLVSLSEQQLVDCS--------GSFGNQGCNGGLMDDAFKYIISNKGLDTEE 168
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVY--MQ 294
DYPYT D G K ++K AA+++++S V +EDQ+AA + K GP++VAI A Q
Sbjct: 169 DYPYTAQD-GTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVSVAIEADQSGFQ 226
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV C LDHGVL+VGY YWI+KNSWG +WG GY +
Sbjct: 227 LYKSGV-FDGNCGTNLDHGVLVVGY-----------TDDYWIVKNSWGTTWGVEGYINMK 274
Query: 355 RGRNVCGV 362
RG + G+
Sbjct: 275 RGVSASGI 282
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 174/312 (55%), Gaps = 25/312 (8%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
T+ D++ + + K K+Y + E + RF IFK NLR H + + G+ +F
Sbjct: 45 TDEDVMAV---YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRF 101
Query: 109 SDLTPAEFRRTYLGLR---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
+DLT E+R YLG R ++ K +D+ + LP DWR+KGAV VKDQGSC
Sbjct: 102 ADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSC 161
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE
Sbjct: 162 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFE 213
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
+ + GG+ EEDYPY +D G ++ K+ ++ + V ++++ V N P++
Sbjct: 214 FIINNGGIDSEEDYPYKASD-GRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVS 272
Query: 286 VAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
VAI A Q Y G+ C LDHGV VGYG+ YWI+KNSWG
Sbjct: 273 VAIEAGGREFQLYQSGIFTGR-CGTALDHGVTAVGYGTENGV-------DYWIVKNSWGA 324
Query: 344 SWGENGYYKICR 355
SWGE GY ++ R
Sbjct: 325 SWGEEGYIRMER 336
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 170/317 (53%), Gaps = 22/317 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
++ ++ + K YA EE D R +++ N++ RH + HG T F D+T
Sbjct: 28 QWNQWRSTYKKPYAVNEE-DWRRAVWEKNVKMIERHNQEYSQGKHGFTMAMNAFGDMTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR+ G + + P+ +P DW +KG V PVK+QG CGSCW+FS T
Sbjct: 87 EFRQVMNGFQNQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKNQGQCGSCWAFSAT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG F TGKLVSLSEQ LVDC + GCNGGLM++AF+Y GGL
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAFQYVQDNGGLD 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY-- 292
EE YPY TD H C + AA+ F + E + + GP++VAI+A +
Sbjct: 198 SEESYPYLATDT-HTCNYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHES 256
Query: 293 MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y G+ P S+ LDHGVLLVGYG G + +WI+KNSWG SWG NGY
Sbjct: 257 FQFYKSGIYYEPGCSSKDLDHGVLLVGYGFEGKDS---ENNKFWIVKNSWGTSWGTNGYV 313
Query: 352 KICRGRNV-CGVDSMVS 367
K+ + +N CG+ + S
Sbjct: 314 KMAKDQNNHCGIATAAS 330
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 138/360 (38%), Positives = 195/360 (54%), Gaps = 47/360 (13%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGD---EILSHHESTNNDLLGAEHHFSLF 63
+LFL + V SAV + D + T GG E++S +E+ L
Sbjct: 10 ILFLAMVTVSSAVDMSIISYDEKHGVS--TTGGRSEAEVMSIYEAW------------LV 55
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
K ++ S E D RF IFK NLR H + + S G+T+F+DLT E+R YLG
Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGA 115
Query: 124 R------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+ R+ L +A ++LP DWR+KGAV VKDQG CGSCW+FST GA+
Sbjct: 116 KMEKKGERRTSLRYEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAV 170
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ ++
Sbjct: 171 EGINQIVTGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDK 222
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
DYPY G D G + K+ ++ ++ V ++ V + P+++AI A Q
Sbjct: 223 DYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQL 281
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y G+ C +LDHGV+ VGYG+ K YWI++NSWG+SWGE+GY ++ R
Sbjct: 282 YDSGIF-DGSCGTQLDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLRMAR 333
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 179/313 (57%), Gaps = 25/313 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R ++F N+ RA + Q LD +A +G+T+FSDLT EFR
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + QA + DL P ++DWR KGAV VKDQG CGSCW+FS TG +
Sbjct: 311 IYLNPLLRKEPGNKMKQAKSV--GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 368
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL G L+SLSEQ+L+DCD D C GGL ++A+ GGL E+
Sbjct: 369 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 419
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
DY Y G +C F K + + V+S +E ++AA L K GP++VAINA MQ Y
Sbjct: 420 DYSYQG--HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVAINAFGMQFYR 477
Query: 298 GGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
G+S P +CS L DH VLLVGYG+ + P+W IKNSWG WGE GYY +
Sbjct: 478 HGISRPLRPLCSPWLIDHAVLLVGYGNR-------SDVPFWAIKNSWGTDWGEKGYYYLH 530
Query: 355 RGRNVCGVDSMVS 367
G CGV++M S
Sbjct: 531 CGSEACGVNTMAS 543
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 173/315 (54%), Gaps = 23/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F +K +YA+ E R I++ANL +H S + +F+DLT EF
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 120 YLGLR-RKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YLGLR K + LP LP DWR G V P+KDQG CGSCWSFSTTG++
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG + TG+LVSLSEQ LVDC ++GCNGGLM+ AF+Y + G+ E
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDC-------SSAQGNAGCNGGLMDQAFQYIISNNGIDTES 194
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVY--MQ 294
YPYT D C+F+ + + A+VA++ + S E + + GP++VAI+A Q
Sbjct: 195 SYPYTAQD--GTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQ 252
Query: 295 TYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GV + P S +LDHGVL VGYG++G YW++KNSWG SWG++GY +
Sbjct: 253 FYSSGVYNEPACSSSQLDHGVLAVGYGTSG-------SSDYWLVKNSWGTSWGQSGYIWM 305
Query: 354 CRG-RNVCGVDSMVS 367
R N CG+ + S
Sbjct: 306 TRNSNNQCGIATAAS 320
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 138/360 (38%), Positives = 195/360 (54%), Gaps = 47/360 (13%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGD---EILSHHESTNNDLLGAEHHFSLF 63
+LFL + V SAV + D + T GG E++S +E+ L
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVS--TTGGRSEAEVMSIYEAW------------LV 55
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
K ++ S E D RF IFK NLR H + + S G+T+F+DLT E+R YLG
Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGA 115
Query: 124 R------RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+ R+ L +A ++LP DWR+KGAV VKDQG CGSCW+FST GA+
Sbjct: 116 KMEKKGERRTSLRYEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAV 170
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L++LSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ ++
Sbjct: 171 EGINQIVTGDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDK 222
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
DYPY G D G + K+ ++ ++ V ++ V + P+++AI A Q
Sbjct: 223 DYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQL 281
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y G+ C +LDHGV+ VGYG+ K YWI++NSWG+SWGE+GY ++ R
Sbjct: 282 YDSGIF-DGSCGTQLDHGVVAVGYGTE-------NGKDYWIVRNSWGKSWGESGYLRMAR 333
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/306 (41%), Positives = 175/306 (57%), Gaps = 27/306 (8%)
Query: 60 FSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
SL+++ K KAY + E D RF IFK NLR H + + G+ +F+DLT E+
Sbjct: 1 MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEY 60
Query: 117 RRTYLGLR-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
R YLG R R ++ +++ ++LP DWR + AV PVKDQG+CGSCW+F
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST GA+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ A+E+ + G
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAYEFIINNG 172
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN-- 289
G+ EEDYPY D G ++ K+ ++ ++ V +++ V N P++VAI
Sbjct: 173 GIDSEEDYPYRAVD-GTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGG 231
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y+ GV C LDHGV+ VGYGS +K YWI++NSWG SWGE G
Sbjct: 232 GREFQLYVSGVFTGR-CGTALDHGVVAVGYGS-------VKGHDYWIVRNSWGASWGEEG 283
Query: 350 YYKICR 355
Y ++ R
Sbjct: 284 YVRLER 289
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/306 (41%), Positives = 173/306 (56%), Gaps = 26/306 (8%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR- 124
K KAY S E + RF +FK NLR H + + G+ +F+DLT E+R YLG
Sbjct: 48 KHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALS 107
Query: 125 --RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
R+ +L K +D+ + LP DWR++GAV VKDQGSCGSCW+FS A+EG N
Sbjct: 108 GIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINK 167
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ TG L+SLSEQ+LVDCD+ S + GCNGGLM+ FE+ + GG+ EEDYPY
Sbjct: 168 IVTGDLISLSEQELVDCDN--------SYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYL 219
Query: 243 GTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGV 300
D G + K+ S+ ++ V ++ + V N P++VAI A Q Y GV
Sbjct: 220 ARD-GRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGV 278
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---- 356
C LDHGV+ VGYG+ + YWI++NSWG+SWGE+GY ++ R
Sbjct: 279 FSGR-CGTALDHGVVAVGYGTE-------NGQDYWIVRNSWGKSWGESGYLRMARNIRKP 330
Query: 357 RNVCGV 362
+CG+
Sbjct: 331 TGICGI 336
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 120/285 (42%), Positives = 167/285 (58%), Gaps = 22/285 (7%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR---RKLRLPKD 132
E + RF +FK NLR H + S G+ +F+DLT E+R YLG R ++ RL +
Sbjct: 70 EKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS 129
Query: 133 ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
+++ + LP DWR++GAV VKDQGSCGSCW+FST A+EG N + TG L+SLS
Sbjct: 130 SNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLS 189
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKF 252
EQ+LVDCD S + GCNGGLM+ AF++ + GG+ EEDYPY D G +
Sbjct: 190 EQELVDCDR--------SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARD-GTCDTY 240
Query: 253 DKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRL 310
K+ ++ N+ V +++++ V N P++VAI A Q Y G+ C L
Sbjct: 241 RKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGR-CGTAL 299
Query: 311 DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
DHGV VGYG+ K YWI++NSWG+SWGE+GY ++ R
Sbjct: 300 DHGVAAVGYGTE-------NGKDYWIVRNSWGKSWGESGYIRMER 337
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 174/312 (55%), Gaps = 25/312 (8%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
T+ D++ + + K K+Y + E + RF IFK NLR H + + G+ +F
Sbjct: 43 TDEDVMAV---YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRF 99
Query: 109 SDLTPAEFRRTYLGLR---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
+DLT E+R YLG R ++ K +D+ + LP DWR+KGAV VKDQGSC
Sbjct: 100 ADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSC 159
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE
Sbjct: 160 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFE 211
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLA 285
+ + GG+ EEDYPY +D G ++ K+ ++ + V ++++ V N P++
Sbjct: 212 FIINNGGIDSEEDYPYKASD-GRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVS 270
Query: 286 VAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
VAI A Q Y G+ C LDHGV VGYG+ YWI+KNSWG
Sbjct: 271 VAIEAGGREFQLYQSGIFTGR-CGTALDHGVTAVGYGTENGV-------DYWIVKNSWGA 322
Query: 344 SWGENGYYKICR 355
SWGE GY ++ R
Sbjct: 323 SWGEEGYIRMER 334
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ + + Y + E + R+ +F+ NLR H +
Sbjct: 26 IVSYGERSXEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 82
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG R R R K + DLP DWR KGAV
Sbjct: 83 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAV 142
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGSCGSCW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 143 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCN 194
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ E+DYPY GTD G K+ ++ ++ V ++++
Sbjct: 195 GGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANDEKSLQ 253
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 254 KAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGKDY 305
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 306 WIVKNSWGSSWGESGYVRMER 326
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 187
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 188 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 247
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 248 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 298
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + ++ E +AA L KNGP+A+A++A
Sbjct: 299 HTEDSYPYV-SGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 357
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++L+HGVLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 358 SSFMSYKSGVLTACI-GKQLNHGVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 409
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 410 VRVVMGVNAC 419
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 177/328 (53%), Gaps = 30/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQ 107
+LL E H LFK K Y SQ E R I+ N + A+H +K + S + +
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTN-DLPADFDWREKGAVGPVKDQGS 164
F DL EFR G + K + A+ P N ++P DWR KGA+ PVKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQ 142
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FS+TGALEG F TGKL+SLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGP 283
+Y G+ E YPY D + C+++ A F + S +ED++ A + GP
Sbjct: 196 QYIKDNKGIDTENTYPYEAED--NVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGP 253
Query: 284 LAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VAI+A + Q Y GV C S LDHGVL+VGYGS K YW++KNS
Sbjct: 254 VSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSDN-------GKDYWLVKNS 306
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
W E WG+ GY KI R R N CG+ + S
Sbjct: 307 WSEHWGDEGYIKIARNRKNHCGIATAAS 334
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/334 (38%), Positives = 185/334 (55%), Gaps = 34/334 (10%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEE-HDHRFTIFKANLRRAARHQKLDPSATH-GI 105
S+++DL G ++ + KF K AS DHRF FK N R H + + G+
Sbjct: 4 SSDSDLSG---EYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGL 60
Query: 106 TQFSDLTPAEFRRTYLGLRRKL------RLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
QFSDLT EFR+ +LGLR L ++P+D+D DLPA DWR+ GAV
Sbjct: 61 NQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAP 120
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQGSCG CW+F+TTGA+EG N + TG+LVSLSEQ+L+DCD + D GC+GGL
Sbjct: 121 KDQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKK--------ADKGCDGGL 172
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
M +A+++ ++ GGL E DYPY ++ K S++ A + + + ++Q V
Sbjct: 173 MENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRVVA-IDGYKAIPEGDEQALLLAV 231
Query: 280 KNGPLAVAINAVY--MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
P++VAI Q Y GV + C ++HGVL+VGYG+ YWI+
Sbjct: 232 AKQPVSVAIEGASKDFQHYASGVFTGH-CGEEINHGVLIVGYGTE-------DGLDYWIV 283
Query: 338 KNSWGESWGENGYYKICRGR----NVCGVDSMVS 367
KNSW +WG+ G+ K+ R +C ++++ S
Sbjct: 284 KNSWAATWGDGGFVKMQRNTGKRGGLCSINTLAS 317
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 177/324 (54%), Gaps = 31/324 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQE--EHDHRFTIFKANLRRAAR-HQKLDPSATHGITQF 108
DL E +SL++K S++ + D RF +FK N++ +QK D + + +F
Sbjct: 30 DLASEESLWSLYEKWRAHHAVSRDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKF 89
Query: 109 SDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGS 164
D+T EFR TY G + LR KDA + +DLP DWREKGAV VKDQG
Sbjct: 90 GDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQ 149
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FST A+EG N + T +LVSLSEQQLVDCD + +SGCNGGLM+ AF
Sbjct: 150 CGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK---------NSGCNGGLMDYAF 200
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
++ GGL E+ YPY + +C + + ++ + V + + V N P+
Sbjct: 201 DFIKNNGGLSSEDSYPYLAEQK--SCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPV 258
Query: 285 AVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+VAI A Q Y GV + C LDHGV VGYG + K YWI+KNSWG
Sbjct: 259 SVAIEASGYAFQFYSQGVFSGH-CGTELDHGVAAVGYG------VDDDGKKYWIVKNSWG 311
Query: 343 ESWGENGYYKICRG----RNVCGV 362
E WGE+GY ++ RG R CG+
Sbjct: 312 EGWGESGYIRMERGIKDKRGKCGI 335
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKACADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 STEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 178/311 (57%), Gaps = 30/311 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K Y E + RF IFK NL+ H + D + G+T+F+DLT EFR YL R+K+
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 128 RLPKDADQAP--ILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
KD+ + + D LP + DWR GAV VKDQG+CGSCW+FS GA+EG N +
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+L+SLSEQ+LVDCD G ++GC+GG+MN AFE+ +K GG+ ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 245 DRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGV 300
D G C DK+ ++ + V D+++ V + P++VAI A Q Y GV
Sbjct: 224 DLG-LCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN-- 358
C LDHGV++VGYGS + YWII+NSWG +WG++GY K+ R +
Sbjct: 283 MTG-TCGISLDHGVVVVGYGST-------SGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP 334
Query: 359 --VCGVDSMVS 367
CG+ M S
Sbjct: 335 FGKCGIAMMPS 345
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 174/317 (54%), Gaps = 26/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + F KAY + EE RF +FK NL+ K S G+ +F+DL+ EF++
Sbjct: 51 FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKM 110
Query: 120 YLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YLGL+ + + D+ P DWR+KGAV VK+QGSCGSCW+FST A
Sbjct: 111 YLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAA 170
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG L +LSEQ+L+DCD + ++GCNGGLM+ AFEY +K GGL +E
Sbjct: 171 VEGINKIVTGNLTTLSEQELIDCDT--------TYNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQ 294
EDYPY+ + + D+S+ + V + DE + L PL+VAI+A Q
Sbjct: 223 EDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQ 281
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV C LDHGV VGYGS+ K Y I+KNSWG WGE GY ++
Sbjct: 282 FYSGGV-FDGRCGVDLDHGVAAVGYGSS-------KGSDYIIVKNSWGPKWGEKGYIRLK 333
Query: 355 RG----RNVCGVDSMVS 367
R +CG++ M S
Sbjct: 334 RNTGKPEGLCGINKMAS 350
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/367 (35%), Positives = 188/367 (51%), Gaps = 44/367 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + +LF + + + V G+ LI Q D + A H+
Sbjct: 1 MARRNPLLFAIVVTILFVVCYGS------ALIAQTPPPVDNFV------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FKK+ KA+ E HRF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPA---DFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YL R KD + + + P+ DWR+KGAV PVK+QG CGSCW+FS G
Sbjct: 103 YLNPDYYARHLKDHKED-VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGN 161
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLM 234
+EG + LVSLSEQ LV CD + D GCNGGLM+ A + +++ G +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVF 212
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPYT D+ ++ A + F + DE++IA + K GP+AVA++A Q
Sbjct: 213 TEASYPYTSGGGTRPPCHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQ 272
Query: 295 TYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV +C + L+HGVL+VG+ + + PYWI+KNSWG SWGE GY ++
Sbjct: 273 LYFGGVVS--LCLAWSLNHGVLIVGFN-------KNAKPPYWIVKNSWGSSWGEKGYIRL 323
Query: 354 CRGRNVC 360
G N C
Sbjct: 324 AMGSNQC 330
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 175/320 (54%), Gaps = 26/320 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLT 112
+ + L+ K K Y ++EE R I++ NL +H + D S G+ ++ D+T
Sbjct: 24 DSEWQLYLKAHGKQYGAEEEARRR-VIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMT 82
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR T G + + + + P DLP DWR KG V P+K+QG CGSCWSFS
Sbjct: 83 NEEFRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG F TGKL SLSEQ LVDC + + GC GGLM+ AF+Y G
Sbjct: 143 ATGSLEGQTFKKTGKLPSLSEQNLVDCSQK-------QGNHGCQGGLMDDAFQYIKDNSG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
+ E YPY + C+F+ + + A+ + F+ + S E + + + GP++VAI+A
Sbjct: 196 IDTESSYPYEA--KNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDAS 253
Query: 292 YM--QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+M Q Y GV + CS RLDHGVL VGYG+ K YW++KNSWGESWG+
Sbjct: 254 HMSFQLYRSGVYHEFFCSETRLDHGVLAVGYGTE-------SGKDYWLVKNSWGESWGQK 306
Query: 349 GYYKICRG-RNVCGVDSMVS 367
GY + R RN CG+ + S
Sbjct: 307 GYIMMSRNKRNNCGIATSAS 326
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 175/328 (53%), Gaps = 30/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQ 107
+LL E H LFK K Y SQ E R I+ N + A+H +K + S + +
Sbjct: 21 NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNK 78
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTN-DLPADFDWREKGAVGPVKDQGS 164
F DL EFR G + K + A+ P N ++P DWREKGA+ PVKDQG
Sbjct: 79 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 138
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CG CW+FS+TGALEG F TGKLVSL EQ L+DC + E GCNGGLM+ AF
Sbjct: 139 CGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNE-------GCNGGLMDQAF 191
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGP 283
+Y G+ E YPY D C+++ A F + S +ED++ A + GP
Sbjct: 192 QYIKDNKGIDTENTYPYEAED--DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 249
Query: 284 LAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VAI+A + Q Y GV C S LDHGVL+VGYGS K YW++KNS
Sbjct: 250 VSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGSD-------NGKDYWLVKNS 302
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
W E WG+ GY KI R R N CGV + S
Sbjct: 303 WSEHWGDQGYIKIARNRKNHCGVATAAS 330
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 144/374 (38%), Positives = 192/374 (51%), Gaps = 39/374 (10%)
Query: 9 FLVSLVV----FSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF---S 61
L++LVV F++ +G + IRQV G L E+ ++G H +
Sbjct: 6 LLLALVVAGGLFASALAGPATFADENPIRQVVSDG---LHELENAILQVVGKTRHALSSA 62
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
F ++ K Y S EE RF +F NL+ H K S G+ +F+DLT EFRR L
Sbjct: 63 RFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRL 122
Query: 122 GLRRKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
G + + + TN LP WRE G V PVK+QG CGSCW+FSTTGALE A
Sbjct: 123 GAAQNCSATTKGN---LKVTNVVLPETKGWREAGIVSPVKNQGKCGSCWTFSTTGALEAA 179
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE YP
Sbjct: 180 YSQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYP 232
Query: 241 YTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTY 296
YTG + CKF + V N ++ + DE + A LV+ P+++A + + Y
Sbjct: 233 YTG--KNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR--PVSIAFEVIKGFKQY 288
Query: 297 IGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
GV C ++H VL VGYG PYW+IKNSWG WG+NGY+K+
Sbjct: 289 KSGVYTSTECGNTPMDVNHAVLAVGYGVENGV-------PYWLIKNSWGADWGDNGYFKM 341
Query: 354 CRGRNVCGVDSMVS 367
G+N+CG+ + S
Sbjct: 342 EMGKNMCGIATCAS 355
>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
Length = 462
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 183/319 (57%), Gaps = 35/319 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S+EE R T+F N+ +A + + LD +A +GIT+FSDLT EF
Sbjct: 165 FKDFMITYNRTYESREETQWRLTVFTRNMVKAQKIEALDRGTAQYGITKFSDLTEEEFYT 224
Query: 119 TYLG--LRRK----LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YL L++K + L K + + P ++DWR+KGAV VKDQG CGSCW+FS
Sbjct: 225 IYLNPLLQKKPGSKMSLAKSIN-------DPAPPEWDWRKKGAVTKVKDQGMCGSCWAFS 277
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG +EG FL G L+SLSEQ+L+DCD D C GG+ ++A+ GG
Sbjct: 278 VTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACLGGMPSNAYTAIKSLGG 328
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E+DY Y G AC F K + + +S +E ++AA L + GP++VAINA
Sbjct: 329 LETEDDYSYKG--YVQACNFSAQKAKVYINDSVELSKNESKMAAWLAQKGPISVAINAFG 386
Query: 293 MQTYIGGVSCPY--ICSRRL-DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
MQ Y G++ P +CS L DH VLLVGYG+ PYW IKNSWG +WGE G
Sbjct: 387 MQFYRHGIAHPLRPLCSPWLIDHAVLLVGYGNRS-------NTPYWAIKNSWGSNWGEEG 439
Query: 350 YYKICRGRNVCGVDSMVST 368
YY + RG CGV++M S+
Sbjct: 440 YYYLYRGSGACGVNTMASS 458
>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 359
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 167/309 (54%), Gaps = 25/309 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG CGSCW+FS+ G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAFSSVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPY-TGTDRGHAC-KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY +G C K + A + ++ E +AA L KNGP+A+A++A
Sbjct: 209 YTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS 268
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Y GV I ++++H VLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 269 SFMSYKSGVLTACI-GKQVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGYV 320
Query: 352 KICRGRNVC 360
++ G N C
Sbjct: 321 RVVMGVNAC 329
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 172/320 (53%), Gaps = 25/320 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ +KK +K Y + EE R I++ NL++ H H G+ F D+T
Sbjct: 28 HWDQWKKWHSKKYHATEE-GWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGDMTHE 86
Query: 115 EFRRTYLGLRRK--LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + K R P ++P DWREKG V PVKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGFKHKKDRRFRGSLFMEPNFI--EVPNKLDWREKGYVTPVKDQGECGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y G
Sbjct: 145 TTGALEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQNG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L EE YPY GTD C FD AA+ F + S E + + GP++VAI+A
Sbjct: 198 LDSEESYPYLGTD-DQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAG 256
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ C S LDHGVL VGYG G + K YWI+KNSW E+WG+
Sbjct: 257 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGE---DVDGKKYWIVKNSWSENWGDK 313
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 314 GYIYMAKDRHNHCGIATAAS 333
>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 359
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 168/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG CGSCW+FS+ G
Sbjct: 98 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAFSSVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G L
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S + A + + ++ E +AA L KNGP+A+A++A
Sbjct: 209 YTEDSYPYV-SGNGYLPECSNSSELVVGAQIDSHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I + ++H VLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKEVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 173/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E+ + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEEALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 178/311 (57%), Gaps = 30/311 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
K Y E + RF IFK NL+ H + D + G+T+F+DLT EFR YL R+K+
Sbjct: 53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110
Query: 128 RLPKDADQAP--ILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
KD+ + + D LP + DWR GAV VKDQG+CGSCW+FS GA+EG N +
Sbjct: 111 ERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+L+SLSEQ+LVDCD G ++GC+GG+MN AFE+ +K GG+ ++DYPY
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 245 DRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGV 300
D G C DK+ ++ + V D+++ V + P++VAI A Q Y GV
Sbjct: 224 DLG-LCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN-- 358
C LDHGV++VGYGS + YWII+NSWG +WG++GY K+ R +
Sbjct: 283 MTG-TCGISLDHGVVVVGYGST-------SGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP 334
Query: 359 --VCGVDSMVS 367
CG+ M S
Sbjct: 335 FGKCGIAMMPS 345
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 166/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L +LSEQQLV CD + DSGCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY + D + A + + + E +AA L K+GP+++A++A
Sbjct: 209 LTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYESGVLTSC---AGDALNHGVLLVGYNXTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 176/305 (57%), Gaps = 28/305 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
+ ++ K +AY + E + RF IFK NL+ H + +PS G+ +F+DL+ E+R
Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84
Query: 119 TYLGLR-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
YLG R R L PK +++ +DLP DWREKGAV PVKDQG CGSCW+FST
Sbjct: 85 VYLGTRMDGKGRLLGGPK-SERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFST 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
GA+EG N + TG L SLSEQ+LVDCD + + GCNGGLM+ AF++ ++ GG+
Sbjct: 144 VGAVEGINQIVTGNLTSLSEQELVDCDK--------TYNLGCNGGLMDYAFDFIIENGGI 195
Query: 234 MREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA-- 290
EEDYPY D C + K+ ++ + V ++++ V N P++VAI A
Sbjct: 196 DTEEDYPYKAID--SMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGG 253
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV C +LDHGV+ VGYG+ YWI++NSWG +WGENGY
Sbjct: 254 RGFQLYQSGVFTG-SCGTQLDHGVVTVGYGTE-------HGVDYWIVRNSWGPAWGENGY 305
Query: 351 YKICR 355
++ R
Sbjct: 306 IRMER 310
>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
Length = 467
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 167/312 (53%), Gaps = 23/312 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF+ FK++ K Y S E R +FK NL A H +P A+ G+T FSDLT EFR
Sbjct: 37 HFAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKD---ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y + + PA DWR +GAV VKDQG CGSCW+FST G
Sbjct: 97 RYHNAAAHFAAAQKRARVPVEVEVEVGGAPAAVDWRARGAVTAVKDQGECGSCWAFSTIG 156
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--KAGGL 233
+EG LA L SLSEQ LV CD+ D+GC+GGLM++AF++ + G +
Sbjct: 157 NIEGQWHLAGNPLTSLSEQMLVSCDNA---------DNGCDGGLMDNAFDWIVGKNNGTV 207
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E Y Y +G C + A ++ + DED++AA L NGPLA+A++A
Sbjct: 208 YTEASYSYVSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLAIAVDATS 267
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Y GGV I S +LDHGV+LVGY + PYWIIKNSWG WGE GY +
Sbjct: 268 FMSYTGGVLTNCI-SDQLDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEGGYIR 319
Query: 353 ICRGRNVCGVDS 364
I +G N C V++
Sbjct: 320 IQKGTNQCLVNN 331
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 168/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E + A L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 134/374 (35%), Positives = 196/374 (52%), Gaps = 40/374 (10%)
Query: 4 KTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLF 63
K ++LF V +V +SG L + V+ +I V + H +L A +F F
Sbjct: 2 KKIILFFVFVV-----ASGGLDNGVNAVIDYVA------AAPHFKLQYNLERAPQYFETF 50
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
+ K+ K YA E D+R+ IFK NL + + SA + I +F+DLT E + GL
Sbjct: 51 QTKYKKVYADDNERDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGL 110
Query: 124 RRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
K K+ I+ P+ FDWR+ + VKDQG CGSCW+FST LE
Sbjct: 111 GVKSPNLKNFCDPLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQY 170
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ + + LSEQQLVDCD + D GC GGL+++A+E + GG+ EEDYPY
Sbjct: 171 AIKYNEHIDLSEQQLVDCD---------TIDMGCAGGLLHTAYEEIMSMGGVEYEEDYPY 221
Query: 242 TGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV 300
C+ + K SV N + + ED++ L + GP+AVA++AV + Y GG+
Sbjct: 222 RSVQ--GPCRIENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGI 279
Query: 301 --SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
SC + L+H VLLVGYG+ P+W++KNSWG +GENG+ ++ R N
Sbjct: 280 ITSCK---NYGLNHAVLLVGYGTEN-------GIPFWVLKNSWGTDYGENGFVRVKRNVN 329
Query: 359 VCGVDSMVSTVAAA 372
CG M++ +AA+
Sbjct: 330 SCG---MINELAAS 340
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/364 (36%), Positives = 193/364 (53%), Gaps = 37/364 (10%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
S + L V SA+ + D + DE+++ +ES
Sbjct: 7 SMAIALLFALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYES-------------- 52
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRRTYL 121
+ K K+Y + E + RF IFK NLR H + + S G+ +F+DLT E+R TYL
Sbjct: 53 WLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYL 112
Query: 122 GLRRKLRLPK-DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
G + K +L K +D+ + LP DWR KGAV P+KDQGSCGSCW+FST A+EG
Sbjct: 113 GAKSKPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGI 172
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG+L++LSEQ+LVDCD S + GC+GGLM+ FE+ + GG+ ++DYP
Sbjct: 173 NQIVTGELITLSEQELVDCDK--------SYNEGCDGGLMDYGFEFIINNGGIDTDKDYP 224
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIG 298
Y G D ++ K+ ++ ++ V ++ ++ V + P++V I Q Y
Sbjct: 225 YLGRD-ARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDS 283
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
G+ C LDHGV +VGYG+ K K YWI++NSWG SWGE GY ++ RN
Sbjct: 284 GIFTG-KCGTALDHGVNVVGYGTE-------KGKDYWIVRNSWGSSWGEAGYIRM--ERN 333
Query: 359 VCGV 362
+ G
Sbjct: 334 LAGT 337
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWG+ WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGKDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 176/328 (53%), Gaps = 30/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQ 107
+LL E H LFK K Y SQ E R I+ N + A+H +K + S + +
Sbjct: 25 NLLADEWH--LFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTN-DLPADFDWREKGAVGPVKDQGS 164
F DL EFR G + K + A+ P N ++P DWREKGA+ PVKDQG
Sbjct: 83 FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FS+TGALEG F TGKLVSLSEQ L+DC + E GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGP 283
+Y G+ E YPY D C+++ A F + S +ED++ A + GP
Sbjct: 196 QYIKDNKGIDTENTYPYEAEDG--VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGP 253
Query: 284 LAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
++VAI+A + Q Y G C S LDHGVL+VGYGS + YW++KNS
Sbjct: 254 VSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGSD-------NGEDYWLVKNS 306
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
W E WG+ GY KI R R N CGV + S
Sbjct: 307 WSEHWGDEGYIKIARNRKNHCGVATAAS 334
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/329 (39%), Positives = 180/329 (54%), Gaps = 31/329 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQ 107
++LGAE +S FK K K+Y S+ E R I+ N + A+H + + + + +
Sbjct: 21 EVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNE 78
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN----DLPADFDWREKGAVGPVKDQG 163
F D+ EF T G +R + + P N LP DWR KGAV PVK+QG
Sbjct: 79 FGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQG 138
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FS TG+LEG +F +G +VSLSEQ LVDC + ++GC GGLM++A
Sbjct: 139 QCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFG-------NNGCEGGLMDNA 191
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNG 282
F+Y G+ E+ YPY GTD C F KS + A+ + F + E Q+ + G
Sbjct: 192 FKYIRANKGIDTEKSYPYNGTDG--TCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVG 249
Query: 283 PLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI+A + Q Y GV P S LDHGVL+VGYG+ L YW++KN
Sbjct: 250 PISVAIDASHESFQFYSDGVYDEPECDSESLDHGVLVVGYGT-------LNGTDYWLVKN 302
Query: 340 SWGESWGENGYYKICRG-RNVCGVDSMVS 367
SWG +WG+ GY ++ R +N CG+ S S
Sbjct: 303 SWGTTWGDEGYIRMSRNKKNQCGIASSAS 331
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 183/351 (52%), Gaps = 34/351 (9%)
Query: 27 DVDQLIRQVTDGGDEILSHHESTNNDLLGAEH---HFSLFKKKFNKAYASQEEHDHRFTI 83
D LI+ VT+ D + E++ +LG HF+ F ++ K Y S EE RF I
Sbjct: 26 DEANLIQSVTERIDSL----ETSLLGVLGQTRNALHFARFAHRYGKRYQSVEEMKLRFAI 81
Query: 84 FKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
F NL + GI +++D++ EFR + LG + + +
Sbjct: 82 FMENLELIRSTNRRGLPYKLGINRYADMSWEEFRASRLGAAQNCSATLKGNHK--MTDEL 139
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP DWRE G V PVKDQGSCGSCW+FSTTGALE A ATGK +SLSEQQLVDC +
Sbjct: 140 LPKTKDWREDGIVSPVKDQGSCGSCWTFSTTGALEAAYTQATGKGISLSEQQLVDCAYAF 199
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV-- 261
+ + GCNGGL + AFEY GGL EE YPY G + C F + V
Sbjct: 200 N-------NFGCNGGLPSQAFEYIKYNGGLDTEESYPYAGVN--GFCHFKPENVGVKVVE 250
Query: 262 -ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLL 316
N ++ + DE A LV+ P+++A V + Y GGV C R ++H VL
Sbjct: 251 SVNITLGAEDELLHAVGLVR--PVSIAFEVVSGFRFYKGGVYTSDTCGRTQMDVNHAVLA 308
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
VGYG PYW+IKNSWGE WG +GY+K+ G+N+CG+ + S
Sbjct: 309 VGYGVE-------NGVPYWLIKNSWGEEWGVDGYFKMELGKNMCGIATCAS 352
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 183/332 (55%), Gaps = 29/332 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
+ DL + LF+K K KAYAS EE HRF +FK NL+ + + S G+
Sbjct: 136 SEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTSYWLGL 195
Query: 106 TQFSDLTPAEFRRTYLGLR--RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
+F+DLT EF+ TYLGL R + + + + +DLP DWR KGAV VK+QG
Sbjct: 196 NEFADLTHEEFKATYLGLAPPAPARESRGSFKYEDVSADDLPKSVDWRTKGAVTEVKNQG 255
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FST A+EG N + TG L +LSEQ+L+DC + ++GCNGGLM+ A
Sbjct: 256 QCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVD--------GNNGCNGGLMDYA 307
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNG 282
F Y +GGL EE YPY + G KS+ A +++ + V +Q + +
Sbjct: 308 FSYIASSGGLHTEEAYPYL-MEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQ 366
Query: 283 PLAVAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI A + Q Y GGV P C +LDHGV VGYGS + K Y I++N
Sbjct: 367 PVSVAIEASGRHFQFYSGGVFDGP--CGTQLDHGVAAVGYGSD-----KGKGHDYIIVRN 419
Query: 340 SWGESWGENGYYKICR----GRNVCGVDSMVS 367
SWG WGE GY ++ R G +CG++ M S
Sbjct: 420 SWGAKWGEKGYIRMKRGTGKGEGLCGINKMAS 451
>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/312 (41%), Positives = 168/312 (53%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEQQLVSCDDK---------DSGCGGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSS-GYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGEN
Sbjct: 268 SSFMSYESGVLTSCA---GDTLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 178/326 (54%), Gaps = 30/326 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQ 107
D+ E H +FK K Y +Q E R IF N ++ A++++ + S +
Sbjct: 21 DIYPEEWH--VFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNH 78
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCG 166
F DL EF+ G + ++ + P+N +LP DWR+KGAV PVKDQG CG
Sbjct: 79 FGDLMVHEFKALMNGFKMSPDTKRNGEL--YFPSNSNLPKTVDWRQKGAVTPVKDQGQCG 136
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCWSFS TG+LEG FL TGKLVSLSEQ LVDC ++GC GGLM+ AF+Y
Sbjct: 137 SCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDC-------STSYGNNGCEGGLMDQAFQY 189
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS-VANFSVVSLDEDQIAANLVKNGPLA 285
G+ E YPY R + C+F K+K+ + + + + DE + L GP++
Sbjct: 190 VSDNKGIDTEASYPYEA--RENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPIS 247
Query: 286 VAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
VAI+A + Q Y GV + P S LDHGVL VGYG+ + YW++KNSWG
Sbjct: 248 VAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGTE-------NGQDYWLVKNSWG 300
Query: 343 ESWGENGYYKICRGR-NVCGVDSMVS 367
SWGENGY KI R N CG+ SM S
Sbjct: 301 PSWGENGYIKIARNHSNHCGIASMAS 326
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 174/316 (55%), Gaps = 26/316 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAE 115
+ F + K K+Y + +E D RF IF+ NL+ L+ S G+ +F+D+T E
Sbjct: 47 KEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEE 106
Query: 116 FRRTYLGLRR---KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+R YLG +R + + +D+ + + LP DWREKGAV VKDQGSCGSCW+FS
Sbjct: 107 YRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFS 166
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
T A+EG N LATG L+SLSEQ+LVDCD + + GCNGG M AF++ +K GG
Sbjct: 167 TIAAVEGVNQLATGNLISLSEQELVDCDRK--------INQGCNGGDMGYAFQFIIKNGG 218
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA-- 290
+ EEDYPYTG D + AS+ + V ++ ++ V N P++VAI A
Sbjct: 219 IDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGG 278
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ C LDHGV VGYG+ YWI+KNSWG+ WGE GY
Sbjct: 279 YDFQLYSSGIFTG-SCGTDLDHGVAAVGYGTENGV-------DYWIVKNSWGDYWGEKGY 330
Query: 351 YKICRG----RNVCGV 362
++ R +CG+
Sbjct: 331 VRMQRNVKAKTGLCGI 346
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E T+ + A ++ + + Y + + R+ +F+ NLR H +
Sbjct: 29 IVSYGERTDEE---ARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+ TYLG R R R K + DLP DWR KGAV
Sbjct: 86 VHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAADNEDLPESVDWRAKGAV 145
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGSCG+CW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 146 AEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCN 197
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ E+DYPY GTD G K+ ++ ++ V ++++
Sbjct: 198 GGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANDEKSLQ 256
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C RLDHGV VGYG+ K Y
Sbjct: 257 KAVANQPVSVAIEAAGTAFQLYSSGIFTG-SCGTRLDHGVTAVGYGTE-------NGKDY 308
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 309 WIVKNSWGSSWGESGYVRMER 329
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 184/325 (56%), Gaps = 37/325 (11%)
Query: 45 HHEST---NNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK-LDPS 100
HH+S+ +N+++ ++ + K +K Y E + RF IFK NLR H + +
Sbjct: 33 HHQSSWRSDNEVISM---YNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRT 89
Query: 101 ATHGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTND-LPADFDWREKGA 155
G+T+F+DLT E+R +LG + R+L K+ Q D LP DWR+ GA
Sbjct: 90 YKVGLTRFADLTNEEYRAKFLGTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGA 149
Query: 156 VGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGC 215
V +KDQGSCGSCW+FST A+EG N + TG+L+SLSEQ+LVDCD S ++GC
Sbjct: 150 VSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDR--------SYNAGC 201
Query: 216 NGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI---AASVANFSVVSLDED 272
NGGLM++AF++ + GG+ ++DYPY D K D +K+ A ++ F V ++
Sbjct: 202 NGGLMDNAFQFIINNGGIDTDKDYPYQAVD----GKCDTTKVKNKAVTIDGFEDVMAFDE 257
Query: 273 QIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLK 330
V + P++VAI A + +Q Y GV C LDHGV++VGYG+
Sbjct: 258 MALQKAVAHQPVSVAIEASGMALQFYQSGVFTGE-CGSALDHGVVIVGYGTE-------D 309
Query: 331 EKPYWIIKNSWGESWGENGYYKICR 355
YW+++NSWG WGENGY K+ R
Sbjct: 310 GIDYWLVRNSWGRDWGENGYIKMQR 334
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 167/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L +LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY + T AC + A + + + E +AA L K+GP+++A++A
Sbjct: 209 XTEDSYPYVSSTGDVPACTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC + L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYXSGVLTSC---AGKXLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDYAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 137/362 (37%), Positives = 189/362 (52%), Gaps = 41/362 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M T++L V SA+ + D +D +E++S +E
Sbjct: 36 MAMATILLLFTVFAVSSALDMSIISYDNAHAATSRSD--EELMSMYEQ------------ 81
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRRT 119
+ K K Y + E + RF IFK NLR H + D + G+ +F+DLT E+R
Sbjct: 82 --WLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAK 139
Query: 120 YLGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YLG + R+L AP + + LP DWR++GAV PVKDQG CGSCW+FS G
Sbjct: 140 YLGTKIDPNRRLGKTPSNRYAPRV-GDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIG 198
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
A+EG N + TG+L+SLSEQ+LVDCD + GCNGGLM+ AFE+ + GG+
Sbjct: 199 AVEGINKIVTGELISLSEQELVDCDT--------GYNEGCNGGLMDYAFEFIINNGGIDS 250
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYM 293
EEDYPY G D G + K+ S+ ++ V ++ V N P++VAI
Sbjct: 251 EEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREF 309
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y+ GV C LDHGV+ VGYG+A YWI++NSWG SWGE+GY ++
Sbjct: 310 QLYVSGVFTGR-CGTALDHGVVAVGYGTA-------NGHDYWIVRNSWGPSWGEDGYIRL 361
Query: 354 CR 355
R
Sbjct: 362 ER 363
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 135/351 (38%), Positives = 186/351 (52%), Gaps = 34/351 (9%)
Query: 24 LIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTI 83
++ D++ L EIL H + D+L HF+ F K+ K Y + EE HRF
Sbjct: 1 MVTDLEALASTSAGLFTEILGH----SRDVL----HFAGFAAKYKKEYKTVEELKHRFVT 52
Query: 84 FKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND 143
F +++ H K S + + +F+D+T EFR + L ++ + +L
Sbjct: 53 FLESVKLVETHNKGQHSYSLAVNEFADMTFEEFRDSRL-MKGEQNCSATVGN-HVLTGES 110
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP DWRE+G V VK+Q SCGSCW+FSTTGALE A+ ATGK+V LSEQQLVDC E
Sbjct: 111 LPKTKDWREEGIVSQVKNQASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEF 170
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
+ + GC GGL + AFEY GG+ E+ YPY D C+F K+ I A V
Sbjct: 171 N-------NFGCGGGLPSQAFEYIRYNGGIDTEDSYPYNAKDS--QCRFHKNTIGAQV-- 219
Query: 264 FSVVSLD---EDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICS---RRLDHGVLL 316
+ VV++ E Q+ + P++VA V+ + Y GGV C + ++H VL
Sbjct: 220 WDVVNITEGAETQLKHAIATMRPVSVAFEVVHDFRLYNGGVYTSLNCHTGPQTVNHAVLA 279
Query: 317 VGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
VGYG PYWIIKNSWG WG NGY+ + G+N+CGV + S
Sbjct: 280 VGYGEDENGV------PYWIIKNSWGADWGMNGYFNMEMGKNMCGVATCAS 324
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 180/321 (56%), Gaps = 30/321 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGITQFSDLTPAE 115
F+LFKK K Y ++ E +R IF N +R +H ++ S + +D+ E
Sbjct: 27 FTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHE 86
Query: 116 FRRTYLGLRRKLRLPKDADQAP--ILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+ YLG + + + Q+ I P + L + DWR KGAV PVK+QG CGSCW+FS
Sbjct: 87 YSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFS 146
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSC-DSGCNGGLMNSAFEYTLKAG 231
TTGALEG NF TGKLVSLSEQ LVDC GS ++GC GGLM++AF+Y +
Sbjct: 147 TTGALEGQNFRKTGKLVSLSEQNLVDC--------SGSYGNNGCEGGLMDNAFQYIKENH 198
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA 290
G+ E+ YPY G D C+F K+ I A+ + F + DE+ + + GP++VAI+A
Sbjct: 199 GIDTEKSYPYEGEDE--TCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDA 256
Query: 291 VY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+ Q Y GV P S LDHGVL+VGYG + YW++KNSWG WG+
Sbjct: 257 SHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGVE-------DNQKYWLVKNSWGTQWGD 309
Query: 348 NGYYKICRGR-NVCGVDSMVS 367
GY K+ R + N CG+ + S
Sbjct: 310 GGYIKMARDQDNNCGIATQAS 330
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 173/315 (54%), Gaps = 25/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + K Y + EE RF IFK NL+ K+ + G+ +F+DL+ EF
Sbjct: 48 FESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNK 107
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGL+ +++ + +LP DWR+KGAV PVK+QGSCGSCW+FST A+EG
Sbjct: 108 YLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEG 167
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + ++ GGL +EEDY
Sbjct: 168 INQIVTGNLTSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDY 219
Query: 240 PYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
PY + C+ K + +++ + V + +Q + N PL+VAI A Q Y
Sbjct: 220 PYIMEE--GTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFY 277
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GGV + C LDHGV VGYG+A K Y +KNSWG WGE GY ++ R
Sbjct: 278 SGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYITVKNSWGSKWGEKGYIRMRRN 329
Query: 357 ----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 330 IGKPEGICGIYKMAS 344
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 175/321 (54%), Gaps = 30/321 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K+Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + + G R + R + + P ND LP+ DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFAKIFNGYRGQ-RTSRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM++AF+Y G
Sbjct: 145 ATGSLEGQHFLKDGELVSLSEQNLVDCSQSFG-------NNGCEGGLMDNAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ EE YPY D C+F K + A+ F + ED + + GP++VAI+A
Sbjct: 198 IDAEESYPYEAMD--DKCRFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAG 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKE-KPYWIIKNSWGESWGE 347
+ Q Y GV P S LDHGVL VGYG +K+ K YW++KNSWG SWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEELDHGVLAVGYG--------VKDGKKYWLVKNSWGGSWGD 307
Query: 348 NGYYKICRGR-NVCGVDSMVS 367
NGY + R + N CG+ S S
Sbjct: 308 NGYILMSRDKNNQCGIASAAS 328
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 171/314 (54%), Gaps = 23/314 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ +K K Y +Q E D R +F N++ A H + I +FSDLT EF +T
Sbjct: 25 WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKT 83
Query: 120 YLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
Y G R ++ + + P N ++P + DWR++G V P+K+QG CGSCW+FSTTG+LE
Sbjct: 84 YNGYRLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLE 143
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G +F TGKLVSLSEQ L+DC + GC GG M+ AFEY G+ E
Sbjct: 144 GQHFRKTGKLVSLSEQNLIDC-------SAAEGNDGCGGGFMDDAFEYIKLNNGIDTEAS 196
Query: 239 YPYTGTDRGHACKFDKS-KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPY G R C++ K+ K A + ED + A + GP++VAI+A +
Sbjct: 197 YPYEG--RDDICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHM 254
Query: 296 YIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV CS+ LDHGVL+VGYG+ + YW++KNSWG WG NGY K+
Sbjct: 255 YHTGVYHEPECSQTVLDHGVLVVGYGTE-------NGEDYWLVKNSWGTDWGMNGYIKMS 307
Query: 355 RGR-NVCGVDSMVS 367
R R N CG+ + S
Sbjct: 308 RNRSNNCGIATNAS 321
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 168/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E + A L KNGP+++A++A
Sbjct: 209 STEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 171/316 (54%), Gaps = 21/316 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+F+ G
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD D GC G M++AF++ + + G +
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCRAGFMDTAFKWIVSSNNGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A++ + + +E+ IA L K GP+A+A++A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVAIAVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q+Y GGV I S+ ++ LLVGY + PYWIIKNSW + WGE GY +
Sbjct: 269 FQSYTGGVLTSCI-SKEVNSAALLVGYDDTS-------KPPYWIIKNSWSKGWGEEGYIR 320
Query: 353 ICRGRNVCGVDSMVST 368
I +G N C + VS+
Sbjct: 321 IEKGTNQCRMKEYVSS 336
>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
Length = 469
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 166/312 (53%), Gaps = 25/312 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK++ K Y S E R +FK NL A H +P A+ G+T FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEETFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPILPT-----NDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
Y + + P+ PA DWR +GAV +KDQG+C SCW+FST
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVEVGGAPAAVDWRARGAVTAIKDQGNCSSCWAFST 156
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--G 231
G +EG LA L LSEQ LV CD+ D+GC+GGLM+SAF++ ++ G
Sbjct: 157 IGNIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVEQNNG 207
Query: 232 GLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
+ E Y Y +G C + A ++ + DED++AA L NGPLA+A++A
Sbjct: 208 SVYTEASYSYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV + S +LDHGV+LVGY + PYWIIKNSWG WGE GY
Sbjct: 268 TSFMSYTGGVLTNCV-SDQLDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGY 319
Query: 351 YKICRGRNVCGV 362
+I +G N C V
Sbjct: 320 IRIQKGTNQCLV 331
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 177/322 (54%), Gaps = 28/322 (8%)
Query: 44 SHHESTNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DP 99
S H L E S++++ K K Y + E + RF IFK NLR H D
Sbjct: 40 SAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDR 99
Query: 100 SATHGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGA 155
+ G+ +F+DLT E+R YLG + R+L AP + + LP DWR++GA
Sbjct: 100 TYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRV-GDKLPDSVDWRKEGA 158
Query: 156 VGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGC 215
V PVKDQG CGSCW+FS GA+EG N + TG+L+SLSEQ+LVDCD + GC
Sbjct: 159 VPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT--------GYNQGC 210
Query: 216 NGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIA 275
NGGLM+ AFE+ + GG+ +EDYPY G D G + K+ S+ ++ V ++
Sbjct: 211 NGGLMDYAFEFIINNGGIDSDEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELAL 269
Query: 276 ANLVKNGPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
V N P++VAI Q Y+ GV C LDHGV+ VGYG+A K
Sbjct: 270 KKAVANQPVSVAIEGGGREFQLYVSGVFTGR-CGTALDHGVVAVGYGTA-------KGHD 321
Query: 334 YWIIKNSWGESWGENGYYKICR 355
YWI++NSWG SWGE+GY ++ R
Sbjct: 322 YWIVRNSWGSSWGEDGYIRLER 343
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 180/322 (55%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H++L+K +K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 25 DEHWNLWKDWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYSLGMNHFGDMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G KL+ + + + N L P DWR+KG V PVKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGY--KLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG +F TG LVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 142 FSTTGAMEGQHFRKTGTLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY GTD G C +D S +A+ F V S E + + GP++VAI+
Sbjct: 195 GGLDSEESYPYLGTDEG-PCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVSVAID 253
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E+WG
Sbjct: 254 AGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEG---KDVDGKKYWIVKNSWSENWG 310
Query: 347 ENGYYKICR-GRNVCGVDSMVS 367
+ GY + + +N CG+ + S
Sbjct: 311 DKGYIYMAKDKKNHCGIATAAS 332
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANGTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 186/337 (55%), Gaps = 30/337 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLF---KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD 98
ILS + +L A+ + + F KK NKAY E +D ++ FK N+ +
Sbjct: 13 ILSINVCAATNLFSAQTYQTSFLGWMKKHNKAYHHHEFND-KYQTFKDNMDFIHNWNSKE 71
Query: 99 PSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN----DLPADFDWREKG 154
G+ +F+DLT E+++TYLG+ + L A+Q P+ N P+ DWR+ G
Sbjct: 72 SDTVLGLNRFADLTNEEYKKTYLGMSINVNLR--ANQVPMNGLNFERFTGPSSIDWRQNG 129
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV VKDQG CGSCW+F+TTGA+EGA+ + TG +V+ SEQ LVDC ++G
Sbjct: 130 AVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYG-------NNG 182
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
C+GGLM SAF+Y + G+ EE YPYT T + C ++ + + +++ + V +
Sbjct: 183 CDGGLMTSAFKYIIDNDGIATEEAYPYTATQ--NRCVYNTTMLGTAISGYKDVPRGSESA 240
Query: 275 AANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKE 331
+ P+AVAI+A + Q Y GV CS RL+HGVL VGYG+ L+
Sbjct: 241 LTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGT-------LEG 293
Query: 332 KPYWIIKNSWGESWGENGYYKICR-GRNVCGVDSMVS 367
K Y+I+KNSW E+WG GY + R N CG+ +M S
Sbjct: 294 KDYYIVKNSWAETWGNQGYILMARNANNHCGIATMAS 330
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 131/331 (39%), Positives = 177/331 (53%), Gaps = 30/331 (9%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
T L E LF+ + +K Y S EE HRF +F+ NL + S G+
Sbjct: 37 TPEQLTSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGL 96
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQG 163
+F+DLT EF+ YLGL + K A DLP DWR+KGAV PVKDQG
Sbjct: 97 NEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQG 156
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FST A+EG N + TG L SLSEQ+L+DCD + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIA-ASVANFSVVSLDEDQIAANLVKNG 282
F+Y + GGL +E+DYPY + C+ K + +++ + V ++D+ + +
Sbjct: 209 FQYIISTGGLHKEDDYPYLMEE--GICQEQKEDVERVTISGYEDVPENDDESLVKALAHQ 266
Query: 283 PLAVAINAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
P++VAI A Q Y GGV C LDHGV VGYGS+ K Y I+KNS
Sbjct: 267 PVSVAIEASGRDFQFYKGGVFNGQ-CGTDLDHGVAAVGYGSS-------KGSDYVIVKNS 318
Query: 341 WGESWGENGYYKICRG----RNVCGVDSMVS 367
WG WGE G+ ++ R +CG++ M S
Sbjct: 319 WGPRWGEKGFIRMKRNTGKPEGLCGINKMAS 349
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 179/320 (55%), Gaps = 23/320 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
++H++L+K K+YA +EE R +++ NLR H H G+ QF D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + ++ AP + P DWR+KG V PVKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG ++ TGK++SLSEQ LVDC + GCNGGLM+ AF+Y GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDCSR-------AQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAV 291
+ E+ YPYT D C +D + +A+ F V+ + ++ N V + GP++VA++A
Sbjct: 196 IDSEDSYPYTAKDD-QECHYDPNYNSANDTGFVDVTSESEKDLMNAVASVGPVSVAVDAG 254
Query: 292 Y--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ P S LDHGVL+VGYG G K YWI+KNSW E WG +
Sbjct: 255 HQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE---DGKKYWIVKNSWSEKWGND 311
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY I + R N CG+ + S
Sbjct: 312 GYIYIAKDRHNHCGIATAAS 331
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 174/317 (54%), Gaps = 27/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + +KAY S EE HRF +F+ NL + S G+ +F+DLT EF+
Sbjct: 51 FESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGR 110
Query: 120 YLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YLGL + K A DLP DWR+KGAV PVKDQG CGSCW+FST A+
Sbjct: 111 YLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAV 170
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L SLSEQ+L+DCD + +SGCNGGLM+ AF+Y + GGL +E+
Sbjct: 171 EGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYAFQYIISTGGLHKED 222
Query: 238 DYPYTGTDRGHACKFDKSKIA-ASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQ 294
DYPY + C+ K + +++ + V ++D+ + + P++VAI A Q
Sbjct: 223 DYPYLMEE--GICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV C LDHGV VGYGS+ K Y I+KNSWG WGE G+ ++
Sbjct: 281 FYKGGVFNGK-CGTDLDHGVAAVGYGSS-------KGSDYVIVKNSWGPRWGEKGFIRMK 332
Query: 355 RG----RNVCGVDSMVS 367
R +CG++ M S
Sbjct: 333 RNTGKPEGLCGINKMAS 349
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/367 (35%), Positives = 188/367 (51%), Gaps = 44/367 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + +LF + + + V G+ LI Q D + A H+
Sbjct: 1 MARRNPLLFAIVVTILFVVCYGS------ALIAQTPPPVDNFV------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FKK+ KA+ E HRF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPA---DFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YL R K+ + + + P+ DWR+KGAV PVK+QG CGSCW+FS G
Sbjct: 103 YLNPDYYARHLKNHKED-VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGN 161
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLM 234
+EG + LVSLSEQ LV CD + D GCNGGLM+ A + +++ G +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVF 212
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPYT D+ ++ A + F + DE++IA + K GP+AVA++A Q
Sbjct: 213 TEASYPYTSGGGTRPPCHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQ 272
Query: 295 TYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV +C + L+HGVL+VG+ + + PYWI+KNSWG SWGE GY ++
Sbjct: 273 LYFGGVVS--LCLAWSLNHGVLIVGFN-------KNAKPPYWIVKNSWGSSWGEKGYIRL 323
Query: 354 CRGRNVC 360
G N C
Sbjct: 324 AMGSNQC 330
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 172/320 (53%), Gaps = 25/320 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDL 111
AE H +K + Y + EE + R I++ N+R H + HG + F D+
Sbjct: 1 AEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDM 57
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW+F
Sbjct: 58 TNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCWAF 115
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y + G
Sbjct: 116 SASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKENG 168
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL EE YPY D +CK+ A+ F + E + + GP++VA++A
Sbjct: 169 GLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDAS 226
Query: 292 Y--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 227 HPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWGME 283
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 284 GYIKIAKDRDNHCGLATAAS 303
>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
Length = 443
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 169/311 (54%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E LVSLSEQQLV CD + D+GCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWARVGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+A++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYQSGVLTSCA---GDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G+N C
Sbjct: 319 YVRVAMGKNAC 329
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/329 (38%), Positives = 182/329 (55%), Gaps = 27/329 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
++ DL + LF+ + K Y + EE RF +FK NL+ K+ + G+
Sbjct: 33 SSEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGL 92
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGS 164
+F+DL+ EF+ YLGL+ L +++ + + DLP DWR+KGAV PVK+QG
Sbjct: 93 NEFADLSHQEFKNKYLGLKVDLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQ 152
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FST A+EG N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF
Sbjct: 153 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDT--------TYNNGCNGGLMDYAF 204
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
+ +K GGL +EEDYPY + K + S++ ++ + V + +Q + N PL
Sbjct: 205 SFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEV-VTINGYHDVPQNNEQSLLKALANQPL 263
Query: 285 AVAINAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+VAI A Q Y GGV + C LDHGV VGYG++ K Y I+KNSWG
Sbjct: 264 SVAIEASGRDFQFYSGGVFDGH-CGSELDHGVSAVGYGTS-------KGLDYIIVKNSWG 315
Query: 343 ESWGENGYYKICRG----RNVCGVDSMVS 367
WGE G+ ++ R +CG+ M S
Sbjct: 316 AKWGEKGFIRMKRNIGKSEGICGLYKMAS 344
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K+Y + E + R+ F+ NLR H +
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQG CGSCW+FS A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AF++ + GG+ E+DYPY G D +K+ ++ ++ V+ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKV-VTIDSYEDVTPNSETSLQ 252
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 253 KAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDY 304
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI++NSWG+SWGE+GY ++ R
Sbjct: 305 WIVRNSWGKSWGESGYVRMER 325
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/367 (35%), Positives = 188/367 (51%), Gaps = 44/367 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + +LF + + + V G+ LI Q D + A H+
Sbjct: 1 MARRNPLLFAIVVTILFVVCYGS------ALIAQTPPAVDNFV------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FKK+ +KA+ E HRF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GSFKKRHSKAFGGDAEEGHRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPA---DFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YL KD + + + P+ DWR+KGAV PVK+QG CGSCW+FS G
Sbjct: 103 YLNPDYYTSHLKDHKED-VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGN 161
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLM 234
+EG + LVSLSEQ LV CD + D GCNGGLM+ A + +++ G +
Sbjct: 162 IEGQWAASGHSLVSLSEQMLVSCD---------NVDEGCNGGLMDQAMNWIMQSHNGSVF 212
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E YPYT D+ ++ A + F + DE++IA + K GP+AVA++A Q
Sbjct: 213 TEASYPYTSGGGTRPPCHDEGEVGAKITGFLSLPHDEERIADWVEKRGPVAVAVDATTWQ 272
Query: 295 TYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV +C + L+HGVL+VG+ + + PYWI+KNSWG SWGE GY ++
Sbjct: 273 LYFGGVVS--LCLAWSLNHGVLIVGFN-------KNAKPPYWIVKNSWGSSWGEKGYIRL 323
Query: 354 CRGRNVC 360
G N C
Sbjct: 324 AMGSNQC 330
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 183/331 (55%), Gaps = 29/331 (8%)
Query: 44 SHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKAN-LRRAARHQKLDPSAT 102
+H + +D++ A + L K K+Y + E + RF IFK N L ++ D S
Sbjct: 30 THAVGSTDDVIMAAYESWLVKH--GKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFK 87
Query: 103 HGITQFSDLTPAEFRRTYLGLRRK---LRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
G+ +F+DLT E+R Y G+R K ++ + + L LP DWRE GAV V
Sbjct: 88 LGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASV 147
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQG CGSCW+FST A+EG N +ATGKL++LSEQ+LVDCD S + GCNGGL
Sbjct: 148 KDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGL 199
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
M+ AF++ + GG+ + DYPYTG D G ++ K+ ++ ++ V +++
Sbjct: 200 MDDAFQFIINNGGIDSDADYPYTGRD-GQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAA 258
Query: 280 KNGPLAVAINAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
N P++VAI A Q Y G+ C LDHGV++VGYG+ K YWI+
Sbjct: 259 ANQPISVAIEASGRDFQFYDSGIFTG-KCGTDLDHGVVVVGYGTE-------NGKDYWIV 310
Query: 338 KNSWGESWGENGYYKICRG----RNVCGVDS 364
+NSWG WGE GY ++ RG +CG+ S
Sbjct: 311 RNSWGADWGEKGYLRMERGISSKAGICGITS 341
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 172/313 (54%), Gaps = 23/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y + EE + R +++ N+R H + HG T F D+T EFR+
Sbjct: 32 WKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
G R + Q P++ +P DWREKG V PVK+QG CGSCW+FS +G LE
Sbjct: 91 IVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G FL TGKL+SLSEQ LVDC H+ + GCNGGLM+ AF+Y + GGL EE
Sbjct: 149 GQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQYIKENGGLDSEES 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY D +CK+ A+ F + E + + GP++VA++A + +Q Y
Sbjct: 202 YPYEAKDG--SCKYRAEYAVANDTGFVDIPQQEKALMKPVATVGPISVAMDASHPSLQFY 259
Query: 297 IGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ P S+ LDHGVL+VGYG G + K YW++KNSWG+ WG +GY KI +
Sbjct: 260 SSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDK---YWLVKNSWGKEWGMDGYIKIAK 316
Query: 356 GRNV-CGVDSMVS 367
RN CG+ + S
Sbjct: 317 DRNNHCGLATAAS 329
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 172/313 (54%), Gaps = 23/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y + EE + R +++ N+R H + HG T F D+T EFR+
Sbjct: 32 WKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
G R + Q P++ +P DWREKG V PVK+QG CGSCW+FS +G LE
Sbjct: 91 IVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G FL TGKL+SLSEQ LVDC H+ + GCNGGLM+ AF+Y + GGL EE
Sbjct: 149 GQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQYIKENGGLDSEES 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY D +CK+ A+ F + E + + GP++VA++A + +Q Y
Sbjct: 202 YPYEAKDG--SCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFY 259
Query: 297 IGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ P S+ LDHGVL+VGYG G + K YW++KNSWG+ WG +GY KI +
Sbjct: 260 SSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDK---YWLVKNSWGKEWGMDGYIKIAK 316
Query: 356 GRNV-CGVDSMVS 367
RN CG+ + S
Sbjct: 317 DRNNHCGLATAAS 329
>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 443
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 169/311 (54%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E A LVSLSEQQLV CD + D+GCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+A++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYQSGVLTSCA---GDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVVMGLNAC 329
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 170/314 (54%), Gaps = 24/314 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDL 111
A F F++K+ K Y + E R IFK NL + H K S G+ QFSDL
Sbjct: 20 ATETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQNLVSYELGLNQFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQA-PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
T AEF+ +L K ++ P +W EKG V PVK+QG+CGSCW+
Sbjct: 80 TEAEFQALLTMSPLTDQLTKQMEKYNSEFDIKTAPVSVNWAEKGVVTPVKNQGNCGSCWT 139
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
F+TTG +E L TG LVSLSEQQL+DC+ ++GC+GG+++ A +Y +++
Sbjct: 140 FTTTGTIESRLALKTGSLVSLSEQQLLDCNR---------VNAGCDGGVLSYALQY-VES 189
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GL E++YPY + C +AA ++++ + V GP+AVA+NA
Sbjct: 190 AGLTTEDEYPYKAWN--GTCNSTHKPVAAYTKGYTLIYTRSESDLMKAVAEGPVAVALNA 247
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Q Y G+ P CS ++HG L+VGY PYWIIKNSWG +WGENGY
Sbjct: 248 DLLQYYSKGIFNPSACSSTVNHGGLVVGYEENA-------TLPYWIIKNSWGATWGENGY 300
Query: 351 YKICRGRNVCGVDS 364
+++ +G N+CG+ S
Sbjct: 301 FRMAKGYNLCGITS 314
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 179/326 (54%), Gaps = 30/326 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-QKLD---PSATHGITQ 107
+ L + + FK F+K+Y + E RF IF +NL R H Q + G+ +
Sbjct: 15 EALSDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNK 74
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
F+DLTP EF + LR K + ++QA DLPA+ DW ++GAV VK QGSCGS
Sbjct: 75 FADLTPEEFMERFRPLR-KTKPKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCGS 133
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FSTTG++E NF+ TGKL+SLSEQQLVDC +SGC GG M+ A EY
Sbjct: 134 CWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN---------NSGCAGGWMDIALEY- 183
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAV 286
++A G+M E+DYPY +R C+F+ SK A + ++ + DE + + GP++V
Sbjct: 184 IEADGIMSEDDYPY--EERNTTCRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVSV 241
Query: 287 AIN-AVYMQTYIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
AI + Q Y G+ C L H VL+ GYGS K YWI+KNSWG
Sbjct: 242 AIEVTIAFQLYARGILNDPQCKNTEGDLTHAVLVTGYGSQ-------DGKDYWIVKNSWG 294
Query: 343 ESWGENGYYKICR-GRNVCGVDSMVS 367
+G +GY ++ R N CG+ + S
Sbjct: 295 AEYGMDGYLRMSRNADNQCGIATRAS 320
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 168/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFCAR 97
Query: 120 YLG-----LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YL K P+ +A + +P DWREKGAV PVKDQG+CGSCW+FS
Sbjct: 98 YLNGAAYFAAAKRHTPQHYPKARA-DLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAV 156
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGG 232
G +EG +LA +LVSLSEQQLV CD D GC+GGLM AF++ L+ G
Sbjct: 157 GNIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGH 207
Query: 233 LMREEDYPY-TGTDRGHAC-KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
L E+ YPY +G C K + A + ++ E +AA L KNGP+A+A++A
Sbjct: 208 LYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I ++++H VLLVGY G E PYW+IKNSWG WGE GY
Sbjct: 268 SSFMSYKSGVLTACI-GKQVNHAVLLVGYDMTG-------EVPYWVIKNSWGGDWGEQGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVVMGVNAC 329
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K+Y + E + R+ F+ NLR H +
Sbjct: 26 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 82
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 83 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 142
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQG CGSCW+FS A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 194
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AF++ + GG+ E+DYPY G D +K+ ++ ++ V+ + +
Sbjct: 195 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKV-VTIDSYEDVTPNSETSLQ 253
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 254 KAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDY 305
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI++NSWG+SWGE+GY ++ R
Sbjct: 306 WIVRNSWGKSWGESGYVRMER 326
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 126/303 (41%), Positives = 168/303 (55%), Gaps = 24/303 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ + K KAY E RF IFK NLR H + + G+T+F+DLT E+R
Sbjct: 4 YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEYRAM 63
Query: 120 YLGLR----RKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
+LG R R+L K + D LP DWR KGAV P+KDQGSCGSCW+FST
Sbjct: 64 FLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFSTV 123
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
A+EG N + TG+L+SLSEQ+LVDCD + ++GCNGGLM+ AF++ + GGL
Sbjct: 124 AAVEGINQIVTGELISLSEQELVDCDR--------TYNAGCNGGLMDYAFQFIINNGGLD 175
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
E+DYPY G D K A S+ F V +++ V + P++VAI A +
Sbjct: 176 TEKDYPYVGDDD-KCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMA 234
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Q Y GV C LDHGV++VGY S YW+++NSWG WGE+GY K
Sbjct: 235 LQFYQSGVFTGE-CGTALDHGVVVVGYASE-------NGLDYWLVRNSWGTEWGEHGYIK 286
Query: 353 ICR 355
+ R
Sbjct: 287 MQR 289
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 124/304 (40%), Positives = 171/304 (56%), Gaps = 27/304 (8%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGLRRK- 126
K+Y + E + RF IFK NLR + D G+ +F+DLT E+R Y G++ K
Sbjct: 54 KSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKD 113
Query: 127 --LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
++ + + L LP DWRE GAV VKDQGSCGSCW+FST A+EG N +A
Sbjct: 114 LRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIA 173
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TGKL++LSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ + DYPYTG
Sbjct: 174 TGKLITLSEQELVDCDR--------SYNEGCNGGLMDYAFEFIINNGGIDTDVDYPYTGR 225
Query: 245 DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSC 302
D G ++ K+ ++ ++ V ++ N P++VAI A Q Y G+
Sbjct: 226 D-GKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGIFT 284
Query: 303 PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RN 358
C LDHGV++VGYG+ K YWI++NSWG WGENGY ++ RG
Sbjct: 285 G-KCGIALDHGVVVVGYGTE-------NGKDYWIVRNSWGADWGENGYLRMERGISSKTG 336
Query: 359 VCGV 362
+CG+
Sbjct: 337 ICGI 340
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 185/332 (55%), Gaps = 33/332 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQ 107
D++ E H FK + K Y + E R IF N + A+H + + + + +
Sbjct: 21 DVIKEEWH--TFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNK 78
Query: 108 FSDLTPAEFRRTYLG----LRRKLRL--PKDADQAPILPTN-DLPADFDWREKGAVGPVK 160
++D+ EFR T G L ++LR P I P + LP DWREKGAV VK
Sbjct: 79 YADMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVK 138
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
DQG CGSCW+FS+TGALEG +F TG LVSLSEQ LVDC + ++GCNGGLM
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYG-------NNGCNGGLM 191
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLV 279
++AF Y GG+ E+ YPY G D +C F+K + A+ F+ + +E ++A +
Sbjct: 192 DNAFRYIKDNGGIDTEKSYPYEGIDD--SCHFNKDSVGATDRGFADIPQGNEKKMAEAVA 249
Query: 280 KNGPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y G+ + P S+ LDHGVL+VGYG+ K YW+
Sbjct: 250 TIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESG------KDYWL 303
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ G+ K+ R N CG+ S S
Sbjct: 304 VKNSWGTTWGDKGFIKMARNEDNQCGIASASS 335
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 127/329 (38%), Positives = 179/329 (54%), Gaps = 28/329 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
++ DL + LF+ + K Y S EE HRF IFK NL+ K+ + G+
Sbjct: 34 SSEDLKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGL 93
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
+F+DL+ EF+ YLGL+ +++ + +LP DWR+KGAV VK+QGSC
Sbjct: 94 NEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSC 153
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCW+FST A+EG N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR--------TYNNGCNGGLMDYAFS 205
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPL 284
+ ++ GL +EEDYPY + C+ K + +++ + V + +Q + N PL
Sbjct: 206 FIVENDGLHKEEDYPYIMEE--GTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPL 263
Query: 285 AVAINAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+VAI A Q Y GGV + C LDHGV VGYG+A K Y +KNSWG
Sbjct: 264 SVAIEASGRDFQFYSGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYITVKNSWG 315
Query: 343 ESWGENGYYKICRG----RNVCGVDSMVS 367
WGE GY ++ R +CG+ M S
Sbjct: 316 SKWGEKGYIRMRRNIGKPEGICGIYKMAS 344
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 167/313 (53%), Gaps = 23/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y + EE R +++ N++ H HG T F D+T EFR+
Sbjct: 32 WKATHRRLYGTNEE-GWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+ R + + + P+L +LP DWR+KG V PVK+Q CGSCW+FS TGALE
Sbjct: 91 VMVCFRNQKHKNRKVFRGPLL--LNLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TGKLVSLSEQ LVDC H + GCNGG MN+AF+Y + GGL E
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSHP-------QGNQGCNGGFMNNAFQYVKENGGLDSEAS 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY D +CK+ A+ F V+ E ++ + GP++VA++A + Q Y
Sbjct: 202 YPYVAKD--GSCKYKPENSVANDTGFVVIPAHEKELMKAVATVGPISVAVDASHSSFQFY 259
Query: 297 IGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ C S+ LDHGVL+VGYG G YW+IKNSWG WG NGY KI +
Sbjct: 260 KSGIYFEQDCSSKNLDHGVLVVGYGFEG---TNSNNNNYWLIKNSWGPEWGSNGYIKIAK 316
Query: 356 GRNV-CGVDSMVS 367
RN CG+ + S
Sbjct: 317 DRNNHCGIATAAS 329
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 174/318 (54%), Gaps = 34/318 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI-----TQFSDLTPAEFRRTY 120
K +AYA E R +F+ N+ A + ++ +A+ QF+DLT AEFR T
Sbjct: 46 KHGRAYADDAEKARRLEVFRDNV---AFIESVNAAASQHKFWLEENQFADLTNAEFRATR 102
Query: 121 LGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
GLR R R P A + T DLPA DWR KGAV PVKDQG CG CW+FS A
Sbjct: 103 TGLRPSSSRGNRAPTSFRYANV-STGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVAA 161
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EGA LATGKLVSLSEQQLV CD + + D GC GGLM+ AF++ +K GGL E
Sbjct: 162 MEGAVKLATGKLVSLSEQQLVSCDVKGE-------DQGCEGGLMDDAFDFIIKNGGLAAE 214
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQ 294
DYPYT +D AA++ + V +++ V N P++VAI+ + Q
Sbjct: 215 SDYPYTASDD-KCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 273
Query: 295 TYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV S C+ LDH + VGYG A YW++KNSWG SWGE+GY ++
Sbjct: 274 FYKGGVLSGAAGCATELDHAITAVGYGVAS------DGTKYWLMKNSWGTSWGEDGYVRM 327
Query: 354 CRG----RNVCGVDSMVS 367
RG VCG+ M S
Sbjct: 328 ERGVADKEGVCGLAMMAS 345
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 184/332 (55%), Gaps = 29/332 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
+ DL E LF+K K KAYAS EE HRF +FK NL+ + + S G+
Sbjct: 35 SEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGL 94
Query: 106 TQFSDLTPAEFRRTYLGLRRK--LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQG 163
+F+DLT EF+ YLGL R + + + +DLP DWR+KGAV VK+QG
Sbjct: 95 NEFADLTHDEFKAAYLGLDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQG 154
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FST A+EG N + TG L +LSEQ+L+DC + +SGCNGGLM+ A
Sbjct: 155 QCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVD--------GNSGCNGGLMDYA 206
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNG 282
F Y +GGL EE YPY + G K++ A +++ + V +++Q + +
Sbjct: 207 FSYIASSGGLHTEEAYPYL-MEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQ 265
Query: 283 PLAVAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI A + Q Y GGV P C +LDHGV VGYGS + K Y I++N
Sbjct: 266 PVSVAIEASGRHFQFYSGGVFDGP--CGAQLDHGVAAVGYGSD-----KGKGHDYIIVRN 318
Query: 340 SWGESWGENGYYKICR----GRNVCGVDSMVS 367
SWG WGE GY ++ R G +CG++ M S
Sbjct: 319 SWGAQWGEKGYIRMKRGTSNGEGLCGINKMAS 350
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 172/319 (53%), Gaps = 30/319 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPAE 115
+ FK K Y +Q E R IF N +R A++++ + S + F DL E
Sbjct: 27 WETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHE 86
Query: 116 FRRTYLGLRRKLRLPKDADQAPI-LPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFST 173
+ G + P + I P+ND LP DWR+KGAV PVKDQG CGSCWSFS
Sbjct: 87 IKALMNGFKM---TPNTKREGKIYFPSNDKLPKSVDWRQKGAVTPVKDQGQCGSCWSFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG+LEG FL GKLVSLSEQ L+DC E ++GC GGLM+ AF+Y G+
Sbjct: 144 TGSLEGQIFLKKGKLVSLSEQNLMDCSKEYG-------NNGCEGGLMDKAFQYVSDNKGI 196
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY 292
E YPY D +AC+F K K+ + + + DE + L GP++VAI+A +
Sbjct: 197 DTESSYPYEARD--YACRFKKDKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASH 254
Query: 293 --MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GV + PY S LDHGVL VGYG+ + YW++KNSWG SWGE+G
Sbjct: 255 ESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGTE-------NGQDYWLVKNSWGPSWGESG 307
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y KI R N CG+ SM S
Sbjct: 308 YIKIARNHSNHCGIASMAS 326
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 173/319 (54%), Gaps = 22/319 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K NK Y +EE R +++ NL+ H H G+ QF D+T
Sbjct: 9 HWQLWKSWHNKDYHEREE-SWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTE 67
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EFR+ G K K + P+ + P DWREKG V PVKDQG CGSCW+FST
Sbjct: 68 EFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 127
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TGALEG +F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y GG+
Sbjct: 128 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 180
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPYT D C++ AA+ F + E + + GP++VAI+A +
Sbjct: 181 DSEESYPYTAKD-DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGH 239
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ CS LDHGVL+VGYG G + K YWI+KNSWGE WG+ G
Sbjct: 240 SSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEG---EDVDGKKYWIVKNSWGEKWGDKG 296
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y + + R N CG+ + S
Sbjct: 297 YIYMAKDRKNHCGIATAAS 315
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 178/332 (53%), Gaps = 31/332 (9%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHG 104
T+ +L+GAE +S FK K Y S+ E +R I+ N ARH + S
Sbjct: 20 THQELVGAE--WSAFKALHGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLA 77
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPT----NDLPADFDWREKGAVGPVK 160
+ ++ D+ EF T G RR R I P LP DWR+KGAV PVK
Sbjct: 78 MNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVK 137
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTG+LEG +F +G +VSLSEQ LVDC ++GC GGLM
Sbjct: 138 NQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC-------STAFGNNGCEGGLM 190
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVK 280
++AF+Y GG+ E+ YPY GTD C F KS + A+ F + + + V
Sbjct: 191 DNAFKYIKANGGIDTEKSYPYNGTDG--TCHFKKSDVGATDTGFVDIPEGNEHLLKKAVA 248
Query: 281 N-GPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV P S LDHGVL+VGYG+ ++ YW+
Sbjct: 249 TVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTK-------DDQDYWL 301
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ GY + R + N CG+ S S
Sbjct: 302 VKNSWGTTWGDGGYIYMTRNKDNQCGIASSAS 333
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 178/309 (57%), Gaps = 23/309 (7%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQK-LDPSATHGITQFSDLTPAEFRRTYLGLR 124
K +K Y + + RF IFK NLR H K ++ S G+ +F+DL+ E++ +LG R
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLGGR 72
Query: 125 R-KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+ R ++D+ ++LP DWREKGAV PVKDQG CGSCW+FST A+EG N +
Sbjct: 73 MVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 132
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
ATG L+SLSEQ+LVDCD + GCNGG M+ AFE+ +K GG+ E+DYPY G
Sbjct: 133 ATGDLISLSEQELVDCDK--------GFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKG 184
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVS 301
D G + K+ ++ F V ++++ V + P++VAI A Q Y G+
Sbjct: 185 VD-GQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIF 243
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
+C LDHGV+ VGYG+ K YWI++NSWG +WGENGY ++ RNV
Sbjct: 244 -NGLCGTDLDHGVVAVGYGTE-------DGKDYWIVRNSWGPNWGENGYIRL--ERNVAS 293
Query: 362 VDSMVSTVA 370
++ +A
Sbjct: 294 TNTGKCGIA 302
>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 337
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 187/328 (57%), Gaps = 26/328 (7%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAAR-HQKLDPSATHGIT 106
S ++D + AE F+++ KK+ K Y++ EE++ R ++ +N + +++ P + +
Sbjct: 24 SPSDDEVMAES-FNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIEQLNKEHGPHTEYELN 82
Query: 107 QFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCG 166
QFSDLT AEF++ YL + Q P+ + P DWREK + PVKDQG CG
Sbjct: 83 QFSDLTFAEFKKIYLTEPQHCSATNGNFQKPVNARD--PVAVDWREKNVITPVKDQGKCG 140
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FSTTG LE + + TG+L+SLSEQQLVDC + + GCNGGL + AFEY
Sbjct: 141 SCWTFSTTGCLEAHHAIKTGQLISLSEQQLVDCAGAFN-------NHGCNGGLPSQAFEY 193
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLA 285
GG+ E +Y YT D C+F+ S +AA+V++ ++ D E I + GP++
Sbjct: 194 IKYNGGIESESNYNYTAKDG--VCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVS 251
Query: 286 VAINAVY-MQTYIGGVSCPYI--CSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
+A Q Y GV I CS+ +++H VL+VGY +L E+ YWI+KN
Sbjct: 252 IAFEVTKSFQHYKKGVYQGEIEVCSQSPDKVNHAVLVVGYNQT-----KLGEE-YWIVKN 305
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVS 367
SW SWG +GY+ I RG N CG+ + S
Sbjct: 306 SWSASWGMDGYFWIRRGHNACGLATCAS 333
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 172/317 (54%), Gaps = 25/317 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + F KAY + EE RF +FK NL+ K S G+ +F+DL+ EF++
Sbjct: 51 FENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKM 110
Query: 120 YLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YLGL+ + + D+ P DWR+KGAV VK+QGSCGSCW+FST A
Sbjct: 111 YLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAA 170
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG L +LSEQ+L+DCD + ++GCNGGLM+ AFEY +K GGL +E
Sbjct: 171 VEGINKIVTGNLTTLSEQELIDCDT--------TYNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQ 294
EDYPY+ + + D+S+ + V + DE + L PL+VAI+A Q
Sbjct: 223 EDYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQ 281
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y G C LDHGV VGYGS+ K Y I+KNSWG WGE GY ++
Sbjct: 282 FYSGVSVFDGRCGVDLDHGVAAVGYGSS-------KGSDYIIVKNSWGPKWGEKGYIRLK 334
Query: 355 RG----RNVCGVDSMVS 367
R +CG++ M S
Sbjct: 335 RNTGKPEGLCGINKMAS 351
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 168/310 (54%), Gaps = 27/310 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ A F
Sbjct: 38 FEEFKRTYQRAYGTLTEEQRRLANFERNLELMREHQARNPHARFGITKFFDLSEAVFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A KLV LSEQQLV CDH D+GC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHKLVRLSEQQLVSCDH---------VDNGCGGGLMLQAFEWVLRNMNGTV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIA--ASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY +G C + S++A A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEKSYPYVSGNGDVPECS-NSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GV I +L+HGVLLVGY G E PYW+IKNSWGE WGE GY
Sbjct: 268 SSFMSYHSGVLTSCI-GEQLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKGY 319
Query: 351 YKICRGRNVC 360
++ G N C
Sbjct: 320 VRVTMGVNAC 329
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 175/319 (54%), Gaps = 29/319 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F K +NK Y + E + RF IF NL+ + +A +GI +FSDL+ EF +
Sbjct: 41 FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 99
Query: 120 YLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y GL+R+ + + LP + P FDWR+KG V +K+Q CGSCW+FS
Sbjct: 100 YTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVVSSIKNQKHCGSCWAFSAAAN 159
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+E + + TGKL+ +SEQQL+DCD DSGC+GGL A Y + A G M
Sbjct: 160 VESIHAIKTGKLIDVSEQQLLDCD---------KYDSGCSGGLPWDALRYFV-ANGAMSL 209
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVS-LDEDQIAANLVKNGPLAVAINAVYMQT 295
+ YPY + C++D SK+ + + + S + EDQI +L GPL++AI+ ++
Sbjct: 210 KSYPYVAKE--GKCRYDSSKVEIRLKGYKIFSKISEDQIKEHLYNIGPLSIAIDVSPIKP 267
Query: 296 YIGGV---SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y+GG+ C +C +++H VLLVGYG YWI+KNSWG +WGENGY++
Sbjct: 268 YVGGIVMEECHEVC--QVNHAVLLVGYGKEYSV-------EYWIVKNSWGPNWGENGYFR 318
Query: 353 ICRGRNVCGVDSMVSTVAA 371
+ RG N + S T A
Sbjct: 319 MERGVNCLLLTSTGITTAV 337
>gi|301769891|ref|XP_002920367.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281346353|gb|EFB21937.1| hypothetical protein PANDA_009084 [Ailuropoda melanoleuca]
Length = 333
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 27/319 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
H+S +K+ K Y EE R +++ N++ +H + H T F DLT
Sbjct: 28 HWSHWKEAHGKLYDKDEEGQRR-RVWEKNMKMIDQHNEEYSQGQHSFTMAMNAFGDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDAD--QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF++ L K++ P++ + QAP+ ++PA DWREKG V PVK QG C SCW+FS
Sbjct: 87 EFKQVLNDL--KIQKPEEGNVFQAPLFA--EIPASVDWREKGYVTPVKYQGHCQSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P + D GC GGLM++AF Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCSW------PQNND-GCRGGLMDNAFRYVKDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L E YPY G R +CK+ K AA++ F VS ED + + GP++ A+++
Sbjct: 196 LDSAESYPYLG--RNESCKYRPEKSAANLTTFWSVSNKEDGLMTTVATVGPVSAAVDSSL 253
Query: 293 --MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ P S RL+H VL+VGYG G + K YWIIKNSWG +WG G
Sbjct: 254 HSFQFYKKGIYYDPNCRSNRLNHAVLVVGYGFEGEES---ENKKYWIIKNSWGTNWGMKG 310
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y + + R N CG+ +M S
Sbjct: 311 YMLLAKDRDNHCGIATMAS 329
>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSVVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L +LSEQQLV CD DSGC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLTALSEQQLVSCD---------DMDSGCGGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY T G+ + S A + + ++ +E +AA L K+GP+++ ++A
Sbjct: 209 FTEDSYPYVST-FGYVPECTNSSQLVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GGV SC ++L+HGVLLVGY G E PYW+IKNSWGE+WGE
Sbjct: 268 SSFMSYHGGVLTSC---AGKQLNHGVLLVGYNMTG-------EVPYWVIKNSWGENWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
Length = 325
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/326 (36%), Positives = 180/326 (55%), Gaps = 27/326 (8%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F +NK Y +E +R+ IFK NL +++ A I +FSD+
Sbjct: 19 DLLKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVEDHAVFSINKFSDM 78
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ +E Y GL + ++ +A IL P N P +FDWR+ AV PV+ QG+CGSCW
Sbjct: 79 SKSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCW 138
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FST +E + K +SLS QQLVDCD + + GC GGL+++A E +
Sbjct: 139 AFSTLAGIESQYSIKYNKQISLSVQQLVDCD---------TSNMGCAGGLLHTALEQIIN 189
Query: 230 A-GGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVA 287
A GG+++EEDYPY G D+ C + A V + + ++E+++ L GP+ VA
Sbjct: 190 AGGGVLQEEDYPYKGVDK--QCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPIPVA 247
Query: 288 INAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
I+A + Y G+ +C Y L+H VLLVGYG PYW +KN+WG+ W
Sbjct: 248 IDAASIVDYSRGIIRTCTY---YGLNHAVLLVGYGVQ-------DGVPYWTLKNTWGDDW 297
Query: 346 GENGYYKICRGRNVCGVDSMVSTVAA 371
GE+GY+++ + N CG+ + +++ A
Sbjct: 298 GEHGYFRVRQNVNSCGIINDLASTAV 323
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 172/318 (54%), Gaps = 23/318 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K K+Y +EE R +++ NLR H H G+ QF D+T
Sbjct: 28 HWHLWKNWHKKSYLPKEE-GWRRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR+ G + + + AP + P DWREKG V PVKDQG CGSCW+FSTT
Sbjct: 87 EFRQLMNGYKNQKMIKGSTFLAP--NNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG ++ GKL+SLSEQ LVDC + GCNGGLM+ AF+Y GG+
Sbjct: 145 GALEGQHYRKAGKLISLSEQNLVDCSR-------AQGNQGCNGGLMDQAFQYVKDNGGID 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY- 292
E+ YPYT D C +D + +A+ F V S E + + GP++VA++A +
Sbjct: 198 SEDSYPYTAKDD-QECHYDPNYNSANDTGFVDVPSGSEKDLMKAVASVGPVSVAVDAGHK 256
Query: 293 -MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ P S LDHGVL+VGYG G + K YWI+KNSW E WG NGY
Sbjct: 257 SFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEG---EDVDGKRYWIVKNSWSEKWGNNGY 313
Query: 351 YKICRGR-NVCGVDSMVS 367
KI + R N CG+ + S
Sbjct: 314 IKIAKDRHNHCGIATAAS 331
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 169/312 (54%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSEQQLV CD + D+GC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEQQLVSCDDK---------DNGCAGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYV-SSTGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE+WGEN
Sbjct: 268 SSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 172/320 (53%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K+Y S+ E R+ IF N + A++ K S G+ QF DL P
Sbjct: 6 QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + + G + R + + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 66 EFAKMFNGYHGE-RKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 124
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL +GKLVSLSEQ L+DC E GC GGLM++AF+Y G
Sbjct: 125 ATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNE-------GCGGGLMDNAFKYIKANDG 177
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ EE YPY D C+F K + A+ F + ED + + GP++VAI+A
Sbjct: 178 IDTEESYPYEAMD--GDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDAS 235
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL VGYG K YW++KNSW E+WG+N
Sbjct: 236 HSSFQLYSEGVYDEPNCSSEELDHGVLAVGYGVK-------NGKKYWLVKNSWAETWGDN 288
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + R + N CG+ S S
Sbjct: 289 GYILMSRDKDNQCGIASSAS 308
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 174/318 (54%), Gaps = 34/318 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI-----TQFSDLTPAEFRRTY 120
K +AYA E R +F+ N+ A + ++ +A+ QF+DLT AEFR T
Sbjct: 11 KHGRAYADDAEKARRLEVFRDNV---AFIESVNAAASQHKFWLEENQFADLTNAEFRATR 67
Query: 121 LGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
GLR R R P A + T DLPA DWR KGAV PVKDQG CG CW+FS A
Sbjct: 68 TGLRPSSSRGNRAPTSFRYANV-STGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVAA 126
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EGA LATGKLVSLSEQQLV CD + + D GC GGLM+ AF++ +K GGL E
Sbjct: 127 MEGAVKLATGKLVSLSEQQLVSCDVKGE-------DQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQ 294
DYPYT +D AA++ + V +++ V N P++VAI+ + Q
Sbjct: 180 SDYPYTASDD-KCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 238
Query: 295 TYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV S C+ LDH + VGYG A YW++KNSWG SWGE+GY ++
Sbjct: 239 FYKGGVLSGAAGCATELDHAITAVGYGVAS------DGTKYWLMKNSWGTSWGEDGYVRM 292
Query: 354 CRG----RNVCGVDSMVS 367
RG VCG+ M S
Sbjct: 293 ERGVADKEGVCGLAMMAS 310
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K Y + E + R+ F+ NLR H +
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQG CGSCW+FS A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AF++ + GG+ E+DYPY G D +K+ ++ ++ V+ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKV-VTIDSYEDVTPNSETSLQ 252
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 253 KAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDY 304
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI++NSWG+SWGE+GY ++ R
Sbjct: 305 WIVRNSWGKSWGESGYVRMER 325
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 176/320 (55%), Gaps = 25/320 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K +K Y +EE R +++ NL++ H TH G+ F D+T
Sbjct: 27 HWDLWKSWHSKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHSFRLGMNHFGDMTHE 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G KL+ + + + N + P+ DWREKG V PVKDQG CGSCW+FS
Sbjct: 86 EFRQIMNGY--KLKTQRKFTGSLFMEPNFMTAPSAVDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG F TGKLVSLSEQ LVDC PE + GC GGLM+ AF+Y G
Sbjct: 144 TTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVTDNQG 196
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L E+ YPYTGTD C +D +A+ F V S E + + GP++VAI+A
Sbjct: 197 LDSEDSYPYTGTDD-QPCHYDPLYNSANDTGFVDVPSGKEHALMKAVASVGPVSVAIDAG 255
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ C S LDHGVL VGYG G + K +WI+KNSWGE WG+
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKMG---KKFWIVKNSWGEKWGDK 312
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 313 GYIYMAKDRKNHCGIATAAS 332
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 176/320 (55%), Gaps = 23/320 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
++H++L+K K+YA +EE R +++ NLR H H G+ QF D+T
Sbjct: 26 DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + ++ AP + P DWR+KG V PVKDQG CGSCW+FS
Sbjct: 85 NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG ++ TGK++SLSEQ LVDC + GCNGGLM+ AF+Y GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDCSR-------AQGNQGCNGGLMDQAFQYVKDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPYT D C +D + +A+ F V S E + + GP++VA++A
Sbjct: 196 IDSEDSYPYTAKDD-QECHYDPNYNSANDTGFVDVTSGSEKDLMNAVASVGPVSVAVDAG 254
Query: 292 Y--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ P S LDHGVL+VGYG G K YWI+KNSW E WG +
Sbjct: 255 HQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE---DGKKYWIVKNSWSEKWGND 311
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY I + R N CG+ + S
Sbjct: 312 GYIYIAKDRHNHCGIATAAS 331
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 169/320 (52%), Gaps = 27/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N A+H K S GI QF+DL P
Sbjct: 26 EWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF + G + K + + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 145
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
+TG+LEG +FL TGKLVSLSEQ LVDC + GCNGGLM+++F Y GG
Sbjct: 146 STGSLEGQHFLKTGKLVSLSEQNLVDC-------SSAYGNQGCNGGLMDNSFNYIKANGG 198
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C++ K + A+ F + E + + GP++VAI+A
Sbjct: 199 IDTEDSYPYEAED--GDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDAS 256
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Q Y GV P S LDHGVL VGYG K YW++KNSW E+WG++
Sbjct: 257 QQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGVK-------NGKKYWLVKNSWAETWGQD 309
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + R + N CG+ S S
Sbjct: 310 GYILMSRDKNNQCGIASSAS 329
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 180/332 (54%), Gaps = 31/332 (9%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHG 104
T+ +L+GAE +S FK K YAS E +R I+ N + ARH K S
Sbjct: 18 THQELVGAE--WSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLA 75
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN----DLPADFDWREKGAVGPVK 160
+ +F DL EF T G +R R + P LP DWR+KGAV PVK
Sbjct: 76 MNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVK 135
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTG+LEG +F T KLVSLSEQ LVDC ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFG-------NNGCEGGLM 188
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLV 279
++AF+Y G+ E YPY TD C F++S + A+ F + DE+++ +
Sbjct: 189 DNAFKYIKSNKGIDTEWSYPYNATD--GVCHFNRSDVGATDTGFVDIPEGDENKLKKAVA 246
Query: 280 KNGPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV P S +LDHGVL+VGYG+ + YW+
Sbjct: 247 AVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGTK-------DGQDYWL 299
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ GY + R + N CG+ S S
Sbjct: 300 VKNSWGTTWGDEGYIYMTRNKDNQCGIASSAS 331
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSLGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/290 (41%), Positives = 167/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T+F+DLT E+R YLG R R++
Sbjct: 69 DQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAK 128
Query: 130 PKDADQ--APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ +Q + + ++P DWR KGAV P+KDQG+CGSCW+FST A+EG N + TG+
Sbjct: 129 AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGE 188
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
L+SLSEQ+LVDCD+ S + GCNGGLM+ AF++ +K GGL E+DYPY G G
Sbjct: 189 LISLSEQELVDCDN--------SYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFG-G 239
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
F K+ S+ + V ++ + P++VAI A Q Y G+
Sbjct: 240 KCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGN- 298
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C LDH V+ VGYGS YWI++NSWG WGE GY ++ R
Sbjct: 299 CGTNLDHAVVAVGYGSENGV-------DYWIVRNSWGPRWGEEGYIRMER 341
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK+QG CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY +I + R N CG+ + S
Sbjct: 308 MEGYIEIAKDRDNHCGLATAAS 329
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 176/321 (54%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E ++ + A ++ + + Y + E + R+ +F+ NLR H +
Sbjct: 29 IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 85
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG R R R K + DLP DWR KGAV
Sbjct: 86 VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAV 145
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
VKDQGS GSCW+FST A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 146 AEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCN 197
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AFE+ + GG+ E+DYPY GTD G K+ ++ ++ V ++++
Sbjct: 198 GGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCDVNRKNAKVVTIDSYEDVPANDEKSLQ 256
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 257 KAVANQPVSVAIEAAGTQFQLYSSGIFTG-SCGTALDHGVTAVGYGTE-------NGKDY 308
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI+KNSWG SWGE+GY ++ R
Sbjct: 309 WIVKNSWGSSWGESGYVRMER 329
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 184/331 (55%), Gaps = 32/331 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQ 107
DL+ E H +K + K YA++ E R IF N + A+H +L S G+ +
Sbjct: 22 DLIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNK 79
Query: 108 FSDLTPAEFRRTYLG----LRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKD 161
++D+ EF+ T G LR+ +R A +P + P DWRE GAV VKD
Sbjct: 80 YADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKD 139
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG CGSCW+FS+TGALEG +F G LVSLSEQ LVDC + ++GCNGGLM+
Sbjct: 140 QGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYG-------NNGCNGGLMD 192
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVK 280
+AF Y GG+ E+ YPY G D +C F+K+ I A+ F + DE+++ +
Sbjct: 193 NAFRYIKDNGGIDTEKSYPYEGIDD--SCHFNKATIGATDTGFVDIPEGDEEKMKKAVAT 250
Query: 281 NGPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
GP++VAI+A + Q Y GV + P + LDHGVL+VGYG+ YW++
Sbjct: 251 MGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESG------MDYWLV 304
Query: 338 KNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
KNSWG +WGE GY K+ R + N CG+ + S
Sbjct: 305 KNSWGTTWGEQGYIKMARNQNNQCGIATASS 335
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 138/364 (37%), Positives = 198/364 (54%), Gaps = 43/364 (11%)
Query: 4 KTVVLF--LVSLVVFSAVSSGTLI--DDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH 59
+ +VLF L S ++ S+ S ++I D+ L D++LS +ES
Sbjct: 14 QCLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYES----------- 62
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
+ K +K Y + E + RF IFK N+ RH + + S G+ +F+DLT E+R
Sbjct: 63 ---WLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRS 119
Query: 119 TYLGLRRKLRLPKD-----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
YL + R K+ +D+ + LP DWR++GAV PVKDQG CGSCW+FST
Sbjct: 120 LYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFST 179
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
GA+EG N + TG+L+SLSEQ+LVDCD+ + GCNGGLM+ AFE+ +K GG+
Sbjct: 180 VGAVEGINKIVTGELISLSEQELVDCDN--------GYNQGCNGGLMDYAFEFIVKNGGI 231
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--V 291
E+DYPY G D G + K+ ++ + V ++++ V + P++VAI A
Sbjct: 232 DTEDDYPYKGVD-GLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGR 290
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV C LDHGV+ VGYGS K YWI++NSWG WGE+GY
Sbjct: 291 AFQLYESGVFTGQ-CGTELDHGVVAVGYGSE-------NGKDYWIVRNSWGPDWGESGYI 342
Query: 352 KICR 355
++ R
Sbjct: 343 RLER 346
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/328 (36%), Positives = 170/328 (51%), Gaps = 25/328 (7%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S + DL +E F F K+ K YA+ E RF +FKANL + SA
Sbjct: 19 IVSSMNNLQYDLSNSEVLFDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEESA 78
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKL-----RLPKDADQAPIL--PTNDLPADFDWREKG 154
T GI +SDL+ E R G + L + K + I T LP F+WR+
Sbjct: 79 TFGINFYSDLSSNELLRKQTGFKTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSD 138
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
AV VK Q CGSCW+FS +E ++ + V LSEQQ+VDCD ++G
Sbjct: 139 AVTSVKQQRDCGSCWAFSAVANIESQYYIKNKQYVDLSEQQIVDCD---------PINNG 189
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQI 274
CNGGLM+ A EY +++GG+ EEDY Y G + CK + + + S +E+++
Sbjct: 190 CNGGLMSWAMEYVMRSGGVQLEEDYQYVGNE--GVCKNNSANVVQISGCVSYDLRNEERL 247
Query: 275 AANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
LV NGP++VAI+ + + Y G++ + L+H VLLVGYG PY
Sbjct: 248 RELLVSNGPISVAIDVMDVTNYQSGIAKHCSVAHGLNHAVLLVGYGVQ-------NNTPY 300
Query: 335 WIIKNSWGESWGENGYYKICRGRNVCGV 362
W+ KNSWG WGENGY+++ R N CG+
Sbjct: 301 WVFKNSWGSDWGENGYFRVLRDVNSCGM 328
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 175/319 (54%), Gaps = 22/319 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K +K Y +EE R +++ NL+ H H G+ QF D+T
Sbjct: 43 HWQLWKSWHSKDYHEREE-SWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAE 101
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EFR+ G + K K + P+ + P DWREKG V PVKDQG CGSCW+FST
Sbjct: 102 EFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 161
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TGALEG +F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y GG+
Sbjct: 162 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 214
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY 292
EE YPYT D C++ AA+ F + ++ V + GP++VAI+A +
Sbjct: 215 DSEESYPYTAKD-DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGH 273
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ CS LDHGVL+VGYG G + K YWI+KNSWGE WG+ G
Sbjct: 274 SSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGED---VDGKKYWIVKNSWGEKWGDKG 330
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y + + R N CG+ + S
Sbjct: 331 YIYMAKDRKNHCGIATAAS 349
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 172/322 (53%), Gaps = 25/322 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FS 109
AE H +K + Y + EE + R I++ N+R H + HG + F
Sbjct: 25 FSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D+T EFR+ G R + Q P++ +P DWREKG V PVK++G CGSCW
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNKGQCGSCW 139
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS +G LEG FL TGKL+SLSEQ LVDC H + GCNGGLM+ AF+Y +
Sbjct: 140 AFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKE 192
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GGL EE YPY D +CK+ A+ F + E + + GP++VA++
Sbjct: 193 NGGLDSEESYPYEAKDG--SCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + +Q Y G+ P S+ LDHGVLLVGYG G + K YW++KNSWG WG
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNK---YWLVKNSWGSEWG 307
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
GY KI + R N CG+ + S
Sbjct: 308 MEGYIKIAKDRDNHCGLATAAS 329
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 142/375 (37%), Positives = 195/375 (52%), Gaps = 35/375 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH- 59
M ++++F V + +G+ DD + IR +D ++L D++G H
Sbjct: 1 MARLSLLIFAFCAVAVAVAVAGSSFDDANP-IRLASDLESQVL--------DVIGQSRHA 51
Query: 60 --FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F ++ K Y S +E +RF IF NL+ + + T G+ F+D T EF
Sbjct: 52 LSFARFARRHGKRYRSVDEIRNRFRIFSDNLKLIRSTNRRSLTYTLGVNHFADWTWEEFT 111
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R LG + + L LP + DWR++G V VKDQG+CGSCW+FSTTGAL
Sbjct: 112 RHKLGAPQNCSATLKGNHR--LTDAVLPDEKDWRKEGIVSQVKDQGNCGSCWTFSTTGAL 169
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE
Sbjct: 170 EAAYAQAFGKNISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKYNGGLDTEE 222
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLAVAIN-AVYMQT 295
YPYTG D CKF +A V + ++L ED++ + P++VA A +
Sbjct: 223 AYPYTGKD--GVCKFTAKNVAVRVIDSINITLGAEDELKQAVAFVRPVSVAFEVAKDFRF 280
Query: 296 YIGGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GV IC ++H VL VGYG PYWIIKNSWG +WG+NGY+K
Sbjct: 281 YNNGVYTSTICGSTPMDVNHAVLAVGYGVE-------DGVPYWIIKNSWGSNWGDNGYFK 333
Query: 353 ICRGRNVCGVDSMVS 367
+ G+N+CGV + S
Sbjct: 334 MELGKNMCGVATCAS 348
>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/296 (42%), Positives = 163/296 (55%), Gaps = 20/296 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ + K+ K YA+ EE +R +F NL + H + T G+ +F+D++ EF
Sbjct: 21 EFTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNAKNLPWTLGVNKFADVSAEEFAY 80
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+ G + PK D+PA DWRE+GAV PVK+QG CGSCW+FSTTG E
Sbjct: 81 KFCGCAKD---PKTRGTRQTTLVGDVPARVDWREQGAVTPVKNQGMCGSCWAFSTTGTTE 137
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
GA FL TG LVSLSEQQLVDC DPE + GC+GG SA +Y K GL EED
Sbjct: 138 GAYFLKTGNLVSLSEQQLVDCAR--DPEYE---NFGCSGGWPWSAVDYVTKH-GLCTEED 191
Query: 239 YPYTGTDRGHACKFDKSKIAA-SVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
YPY G D CK K+A SV + DED +A + K P+++ ++A MQ Y
Sbjct: 192 YPYKGVD--AECKESSCKVAVQSVDKVQLPVGDEDSLAVAVSKT-PVSIVLDATAMQLYD 248
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
G+ CS ++H VL VGY ++ YWIIKNSWG WGE GY +I
Sbjct: 249 KGIITR--CSESINHAVLAVGYDKDAETGLK-----YWIIKNSWGADWGEEGYCRI 297
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 179/331 (54%), Gaps = 33/331 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
+L +E F F +K+NK+Y+S+EE +F FK N+R L SA + I +SD+
Sbjct: 17 NLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSNSAVYDINFYSDM 76
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPILPTND----------LPADFDWREKGAVGPVKD 161
E R G + L+ + D + + N LP FDWR++ + VK+
Sbjct: 77 NKNELLRKQTGFKINLK-KNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKN 135
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
Q CGSCW+FST +E + KL+ LSEQQLV+CD + ++GCNGGLM+
Sbjct: 136 QRDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQ---------NNGCNGGLMH 186
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
A E ++ GG+ E D+PYT +D CK + + + N ++S +ED++ L+ N
Sbjct: 187 WAMEEIIRQGGVSNETDFPYTASD--GFCKRKQGFVNINGCNQFILS-NEDRLRELLIFN 243
Query: 282 GPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
GP+++AI+ + + Y G+S L+H VLLVGYG PYWI+KNSW
Sbjct: 244 GPISIAIDVIDVIDYSQGISSTCRNDNGLNHAVLLVGYGVKN-------NIPYWILKNSW 296
Query: 342 GESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
G WGENGY+++ R N CG M++ AA+
Sbjct: 297 GSQWGENGYFRVQRNINSCG---MINDYAAS 324
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/340 (37%), Positives = 186/340 (54%), Gaps = 48/340 (14%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ + KF + Y S E ++ FK+N+ + + +D +P E+++
Sbjct: 27 FTDWTHKFQRLY-SNNEFLKKYHTFKSNMDYVHSWNAKNSDTVLELNHLADHSPEEYKKF 85
Query: 120 YLGLRRKLRLPKDADQAPI---LPT--NDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YLG R K + + I L T D A DWR+KGAV P+KDQG CGSCWSFSTT
Sbjct: 86 YLGTRVK-HIHFNVQGTHINTQLSTVFEDSGATVDWRKKGAVSPIKDQGQCGSCWSFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G++EGA+ + TG +V LSEQ LVDC + GCNGGLMN+AF+Y + G+
Sbjct: 145 GSVEGAHQIKTGNMVELSEQNLVDC-------SSAEGNMGCNGGLMNNAFDYIISNHGID 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY- 292
E+ YPYT + G CKF+K+ + A+++++ ++ + AN VK GP++VAI+A +
Sbjct: 198 TEQSYPYTA-NTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVSVAIDASHR 256
Query: 293 -MQTYIGGVSCPYICSR-RLDHGVLLVGYGSA----------------------GYAPIR 328
Q Y G+ ++CS RLDHGVL+VGYGS G ++
Sbjct: 257 SFQLYSHGIYYEWLCSSTRLDHGVLVVGYGSGNPPNSDMDHMILKKTAKTDHYHGKKSLK 316
Query: 329 LKE------KPYWIIKNSWGESWGENGYYKICRGR-NVCG 361
+++ K YWI+KNSW ++WG+ GY + + R N CG
Sbjct: 317 VEKVDTTSSKNYWIVKNSWSDTWGDKGYIYMSKDRKNNCG 356
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/321 (38%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K+Y + E + R+ F+ NLR H +
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQG CGSCW+FS A+E N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AF++ + GG+ E+DYPY G D +K+ ++ ++ V+ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKV-VTIDSYEDVTPNSETSLQ 252
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V+N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 253 KAVRNQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDY 304
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI++NSWG+SWGE+GY ++ R
Sbjct: 305 WIVRNSWGKSWGESGYVRMER 325
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 171/316 (54%), Gaps = 23/316 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
++ +K K Y ++ E R I++ NL++ H + S + D+T E +
Sbjct: 28 NWKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQ 87
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPA--DFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
T LGL+ K A LP ++ DWR KG V PVK+QG CGSCW+FSTTGA
Sbjct: 88 TLLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG +F TGKLVSLSEQ LVDC + ++GC GGLM++AF+Y + GG+ E
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYG-------NNGCEGGLMDNAFQYIKENGGIDTE 200
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA---VY 292
+ YPY D C ++KS I A F + + DE+ + L GP+++AI+A +
Sbjct: 201 KSYPYLAKDG--VCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTF 258
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+ G P S RLDHGVL VGYG+ K YW++KNSWG SWGE GY K
Sbjct: 259 HFYHQGVYDDPDCSSTRLDHGVLAVGYGTD-------DGKDYWLVKNSWGPSWGEEGYIK 311
Query: 353 ICRG-RNVCGVDSMVS 367
I R + CGV S S
Sbjct: 312 IARNDHDKCGVASKAS 327
>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 165/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + Y + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L +LSEQQLV CD + DSGCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY + D + A + + + E +AA L K+GP+++A++A
Sbjct: 209 LTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYESGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 174/318 (54%), Gaps = 34/318 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI-----TQFSDLTPAEFRRTY 120
K +AYA E R +F+ N+ A + ++ +A+ QF+DLT AEFR T
Sbjct: 11 KHGRAYADDAEKVRRLEVFRDNV---AFIESVNAAASQHKFWLEENQFADLTNAEFRATR 67
Query: 121 LGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
GLR R R P A + T DLPA DWR KGAV PVKDQG CG CW+FS A
Sbjct: 68 TGLRPSSSRGNRAPTSFRYANV-STGDLPASVDWRGKGAVNPVKDQGDCGCCWAFSAVAA 126
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EGA LATGKLVSLSEQQLV CD + + D GC GGLM+ AF++ +K GGL E
Sbjct: 127 MEGAVKLATGKLVSLSEQQLVSCDVKGE-------DQGCEGGLMDDAFDFIIKNGGLAAE 179
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQ 294
DYPYT +D AA++ + V +++ V N P++VAI+ + Q
Sbjct: 180 SDYPYTASDD-KCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 238
Query: 295 TYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GGV S C+ LDH + VGYG A YW++KNSWG SWGE+GY ++
Sbjct: 239 FYKGGVLSGAAGCATELDHAITAVGYGVAS------DGTKYWLMKNSWGTSWGEDGYVRM 292
Query: 354 CRG----RNVCGVDSMVS 367
RG VCG+ M S
Sbjct: 293 ERGVADKEGVCGLAMMAS 310
>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 167/312 (53%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA L +LSEQQLV CD + D+GC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHGLTALSEQQLVSCDDK---------DNGCGGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSS-GYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGEN
Sbjct: 268 SSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 171/316 (54%), Gaps = 23/316 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E + ++K NKAY+ + E + R+ I+K N+ R + + + F D+T EF
Sbjct: 24 ESSWYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEF 83
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
R GL L ++ + P DWR +G V PVK+QG CGSCW+FS+TGA
Sbjct: 84 RAKMNGLL--LHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG +F TG+LVSLSEQ LVDC + ++GCNGGLM++AF Y GG+ E
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYG-------NNGCNGGLMDNAFSYIKANGGIDTE 194
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVYM-- 293
YPY G D C++ KS I A F + DED + + GP++VAI+A +M
Sbjct: 195 TGYPYEGQDG--TCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSF 252
Query: 294 QTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GV CS LDHGVL+VGYG+ K YW++KNSWG WG GY
Sbjct: 253 QFYHSGVYDEPQCSPSALDHGVLVVGYGTD-------NGKDYWLVKNSWGTGWGTEGYIY 305
Query: 353 ICR-GRNVCGVDSMVS 367
+ R +N CG+ S S
Sbjct: 306 MSRNNQNQCGIASKAS 321
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 175/320 (54%), Gaps = 26/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ +K+ NK Y +EE R +++ NL++ H H + F D+
Sbjct: 28 HWQQWKEWHNKDYHEKEE-GWRRMVWEKNLKKIELHNLEHSLGKHSYRLAMNHFGDMPHE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + K+R + + + N L P+ DWREKG V PVKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGYKHKVRKIRGS---LFMEPNFLEAPSKLDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y GG
Sbjct: 144 TTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNGG 196
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L E+ YPY GTD C +D S AA+ F + S E + + GP++VAI+A
Sbjct: 197 LDTEKFYPYLGTDD-QPCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVSVAIDAG 255
Query: 292 Y--MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ CS LDHGVL+VGY GY + K YWI+KNSW E WG
Sbjct: 256 HESFQFYQSGIYYEADCSSEDLDHGVLVVGY---GYEGENVDGKKYWIVKNSWSEQWGNK 312
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 313 GYIYMAKDRHNHCGIATAAS 332
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/332 (40%), Positives = 182/332 (54%), Gaps = 29/332 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
+ DL + LF+K K KAYAS EE HRF +FK NL+ + S G+
Sbjct: 30 SEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINREVTSYWLGL 89
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDAD--QAPILPTNDLPADFDWREKGAVGPVKDQG 163
+F+DLT EF+ TYLGL + + + +DLP DWR+KGAV VK+QG
Sbjct: 90 NEFADLTHDEFKTTYLGLSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQG 149
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FST A+EG N + TG L +LSEQ+L+DC + +SGCNGG+M+ A
Sbjct: 150 QCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVD--------GNSGCNGGMMDYA 201
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNG 282
F Y +GGL EE YPY + G KS+ A S++ + V ++Q + +
Sbjct: 202 FSYIASSGGLHTEEAYPYL-MEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQ 260
Query: 283 PLAVAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI A + Q Y GGV P C +LDHGV VGYGS + K Y I+KN
Sbjct: 261 PVSVAIEASGRHFQFYSGGVFDGP--CGAQLDHGVAAVGYGSD-----KGKGHDYIIVKN 313
Query: 340 SWGESWGENGYYKICRG----RNVCGVDSMVS 367
SWG WGE GY ++ RG +CG++ M S
Sbjct: 314 SWGGKWGEKGYIRMKRGTGKSEGLCGINKMAS 345
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
++H+ +K K Y +EE R +++ NL++ H TH G+ +F D+T
Sbjct: 26 DNHWEQWKNWHGKKYHEKEE-GWRRMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMT 84
Query: 113 PAEFRRTYLGLRRK--LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G + K R P ++P DWREKG V PVKDQG CGSCW+
Sbjct: 85 HEEFRQVMNGYKHKKERRFRGSLFMEPNFL--EVPNSLDWREKGYVTPVKDQGECGSCWA 142
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 143 FSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDQ 195
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C +D AA+ F + S E + + GP++VAI+
Sbjct: 196 NGLDSEESYPYVGTD-DQPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAID 254
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL VGYG G + K YWI+KNSW E+WG
Sbjct: 255 AGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEG---EDVDGKKYWIVKNSWSENWG 311
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 312 DKGYVYMAKDRHNHCGIATAAS 333
>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
Length = 462
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 179/315 (56%), Gaps = 26/315 (8%)
Query: 61 SLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEF 116
SLFKK +N+ Y S+EE R ++F N+ A + Q LD +A +G+T+FSDLT EF
Sbjct: 163 SLFKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLTEEEF 222
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
R YL + K QA I+ + P ++DWR+KGAV VK+QG CGSCW+FS TG
Sbjct: 223 RTIYLNPLLREHPSKTMRQAKIV-HDSAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGN 281
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG FL G L+SLSEQ+L+DCD D C GGL +A+ GGL E
Sbjct: 282 VEGQWFLKKGTLLSLSEQELLDCD---------KVDKACMGGLPINAYSAIKSLGGLETE 332
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
+DY Y G AC F K + + +S +E +AA L GP+++AINA MQ Y
Sbjct: 333 DDYSYQG--HMEACNFSAKKAKVYINDSVELSKNEQYLAAWLAVKGPISIAINAFGMQFY 390
Query: 297 IGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
G++ P +CS +DH +L+VGYG + P+W IKNSWG WGE GYY +
Sbjct: 391 RHGIAHPLQPLCSPWFIDHAMLIVGYG-------KRSGVPFWAIKNSWGTDWGEEGYYYL 443
Query: 354 CRGRNVCGVDSMVST 368
RG CGV+ M S+
Sbjct: 444 HRGSRSCGVNVMASS 458
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 181/332 (54%), Gaps = 31/332 (9%)
Query: 49 TNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHG 104
T+ +L+GAE +S FK K Y S E +R I+ N + ARH K S
Sbjct: 14 THEELVGAE--WSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLA 71
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLR-LPKDAD---QAPILPTNDLPADFDWREKGAVGPVK 160
+ +F D+ EF T G +R R P++ + L LP DWR+KGAV PVK
Sbjct: 72 MNEFGDMLHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVK 131
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCWSFSTTG+LEG +F KLVSLSEQ L+DC ++GC GGLM
Sbjct: 132 NQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFG-------NNGCEGGLM 184
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLV 279
+ AF+Y G+ E+ YPY TD C F+KS + A+ F + DE+++ +
Sbjct: 185 DYAFKYIKANKGIDTEQSYPYNATDG--VCHFNKSAVGATDTGFVDIPEGDENKLKKAVA 242
Query: 280 KNGPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV P S +LDHGVL+VGYG+ + YW+
Sbjct: 243 TVGPVSVAIDASHESFQFYSEGVYDEPECDSEQLDHGVLVVGYGTK-------DGQDYWL 295
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ GY + R + N CG+ S S
Sbjct: 296 VKNSWGTTWGDGGYIYMSRNKDNQCGIASAAS 327
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 137/346 (39%), Positives = 180/346 (52%), Gaps = 33/346 (9%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR V+ G L E++ ++G H F+ F +++ K Y S EE RF F NL
Sbjct: 34 IRLVSSDG---LRDFETSVLQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNL 90
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
S G+ +F+D + EF+R LG + + L + LP
Sbjct: 91 DLIRSTNCKGLSYRLGLNKFADWSWEEFQRHRLGAAQNCSATTKGNHK--LTADVLPETK 148
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVKDQG CGSCW+FSTTG+LE A A GK +SLSEQQLVDC +
Sbjct: 149 DWRESGIVSPVKDQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFN---- 204
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANFS 265
+ GCNGGL + AFEY GGL EE YPYTG D CKF + V N +
Sbjct: 205 ---NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKD--GVCKFSSENVGVQVLDSVNIT 259
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAV-YMQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
+ + DE Q A LV+ P++VA V + Y GV C ++H V+ VGYG
Sbjct: 260 LGAEDELQHAVGLVR--PVSVAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGV 317
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWGE+WG++GY+KI G+N+CG+ + S
Sbjct: 318 E-------DGVPYWLIKNSWGENWGDHGYFKIKMGKNMCGIATCAS 356
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 168/311 (54%), Gaps = 31/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
F+ F K+++KAY S E RF FKAN+ H L + S T G+ +F+DL+ EF+
Sbjct: 42 FTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKG 100
Query: 119 TYLGLR---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y G + R+ + Q P DWR AV P+KDQG CGSCW+FS TG
Sbjct: 101 KYFGYKHVEREFARSNNLHQ----EVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATG 156
Query: 176 ALEGANFLATGK--LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
++EGA ++ GK L SLSEQQLVDC D+GCNGGLM+ AFEY + G+
Sbjct: 157 SIEGA-WVLQGKHTLTSLSEQQLVDCSTSYG-------DAGCNGGLMDYAFEYIIANKGI 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--V 291
E YPY G G C+ +K+ V S DE + + GP++VAI A
Sbjct: 209 CAESAYPYKGV--GGLCQKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQA 266
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV C LDHGVL VGYG+ G + YWI+KNSWG SWGE+GY
Sbjct: 267 GFQFYSSGVFSG-TCGHNLDHGVLAVGYGTTG-------SQDYWIVKNSWGTSWGESGYI 318
Query: 352 KICRGRNVCGV 362
++ R +N CG+
Sbjct: 319 RMIRNKNQCGI 329
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ + + K+ + Y S+EE + RFTI++AN++ ++ S T F+DLT EF
Sbjct: 16 QDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEF 75
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+ TYLG + + +P + + +LP + DWR++GAV P+K+QG CGSCW+FS A
Sbjct: 76 KATYLGYK-TVSIPDTCFRYGNMV--NLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + GKL+SLSEQ+LVDCD S + GCNGG M AFE+ +K GL E
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCD-------VTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 237 EDYPYTGTDRGHACKFDKSKIA-ASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YM 293
+YPY G + AC K K S++ + V +++++ V N P++VAI+A
Sbjct: 185 IEYPYQGAE--SACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNF 242
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GG+ C +L+HGV +VGYG + YW++KNSWG WGE+GY ++
Sbjct: 243 QFYSGGIFSG-NCGNQLNHGVAIVGYG-------ETSNQAYWLVKNSWGTDWGESGYIRM 294
Query: 354 CRG----RNVCGVDSMVS 367
R + CG+ M S
Sbjct: 295 KRDSTDRQGTCGIAMMAS 312
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 175/320 (54%), Gaps = 28/320 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H++LFK F K Y++ EE R ++AN+ +H H G+ ++DLT
Sbjct: 25 DSHWALFKTTFGKQYSTAEEITRRLA-WEANVAIIRQHNLEHDLGLHTYTLGLNNYADLT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAP-ILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWS 170
AEF + GLR K A++ + P +LP DWR KG V P+KDQG CGSCW+
Sbjct: 84 NAEFNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS+TG+LEG +F TG+LVSLSEQ L DC + + GCNGGLM+ AF Y +
Sbjct: 144 FSSTGSLEGQHFAKTGQLVSLSEQNLTDCSQK-------QGNMGCNGGLMDQAFTYIKEN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAIN 289
G+ E YPY D C F + + A+ ++ + DE+ + + + GP++VAI+
Sbjct: 197 NGIDTESSYPYKAVDE--KCHFKAADVGATDTGYTDIAQQDENALQSAIATVGPISVAID 254
Query: 290 AVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G CS +LDHGVL VGY S K Y+I+KNSWG SWG
Sbjct: 255 ASHSSFQLYRSGAYNERACSATQLDHGVLAVGYDSE-------DGKDYYIVKNSWGTSWG 307
Query: 347 ENGYYKICRGR-NVCGVDSM 365
+ GY + R + N CG+ +M
Sbjct: 308 QKGYIWMTRNKNNQCGIATM 327
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 174/323 (53%), Gaps = 31/323 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLT 112
E F FK F + Y S E HR +IF+ANL+ RH D + + + F+DL+
Sbjct: 30 EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCW 169
EFR T+ G RR L AD + ND LPA DW KG V P+K+Q CGSCW
Sbjct: 90 NEEFRATFNGYRR-LAAVSLADS--VHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS ++EG + L TGKLVSLSEQ LVDC D GC+GG M+ AF+Y ++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAI 288
G+ E YPY D +C+F ++ I A++ +F V + DE + + GP++VAI
Sbjct: 200 NRGIDTEASYPYKAIDE--SCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAI 257
Query: 289 NAVY--MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+A Q Y GV CS LDHGV VGYG+ L PYW +KNSWG SW
Sbjct: 258 DASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGT-------LNGVPYWKVKNSWGTSW 310
Query: 346 GENGYYKICRGR-NVCGVDSMVS 367
G+ GY + R + N CG+ + S
Sbjct: 311 GQKGYIFMSRNKQNQCGIATKAS 333
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 175/316 (55%), Gaps = 25/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + K Y + EE RF +FK NL+ K+ + G+ +F+DL+ EF+
Sbjct: 47 FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YLGL+ L +++ D LP DWR+KGAV PVK+QG CGSCW+FST A+
Sbjct: 107 YLGLKVNLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAV 166
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + ++ GGL +E+
Sbjct: 167 EGINQIVTGNLTSLSEQELIDCD--------TTYNNGCNGGLMDYAFSFIVQNGGLHKED 218
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
DYPY + K +++++ ++ + V + +Q + N PL+VAI A Q
Sbjct: 219 DYPYIMEESTCEMKKEETQV-VTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GGV + C LDHGV VGYG++ K Y I+KNSWG WGE G+ ++ R
Sbjct: 278 YSGGVFDGH-CGSDLDHGVSAVGYGTS-------KNLDYIIVKNSWGAKWGEKGFIRMKR 329
Query: 356 G----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 330 NIGKPEGICGLYKMAS 345
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 180/318 (56%), Gaps = 28/318 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ + + K+ + Y S+EE + RFTI++AN++ ++ S T F+DLT EF
Sbjct: 16 QDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEF 75
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+ TYLG + + +P + + +LP + DWR++GAV P+K+QG CGSCW+FS A
Sbjct: 76 KATYLGYK-TVSIPDTCFRYGNMV--NLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + GKL+SLSEQ+LVDCD S + GCNGG M AFE+ +K GL E
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCD-------VTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 237 EDYPYTGTDRGHACKFDKSKIA-ASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YM 293
+YPY G + AC K K S++ + V +++++ V N P++VAI+A
Sbjct: 185 IEYPYQGAE--SACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNF 242
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GG+ C +L+HGV +VGYG + YW++KNSWG WGE+GY ++
Sbjct: 243 QFYSGGIFSG-NCGNQLNHGVAIVGYG-------ETSNQAYWLVKNSWGTDWGESGYIRM 294
Query: 354 CR----GRNVCGVDSMVS 367
R + CG+ M S
Sbjct: 295 KRDSTDKQGTCGIAMMAS 312
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/334 (38%), Positives = 184/334 (55%), Gaps = 34/334 (10%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEE-HDHRFTIFKANLRRAARHQKLDP-SATHGI 105
S+++DL G ++ + KF K AS D RF FK N R H + S G+
Sbjct: 4 SSDSDLSG---EYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGL 60
Query: 106 TQFSDLTPAEFRRTYLGLRRKL------RLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
QFSDLT EFR+ +LGLR L ++P+D+D DLPA DWR+ GAV
Sbjct: 61 NQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAP 120
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
KDQGSCG CW+F+TTGA+EG N + TG+L+SLSEQ+L+DCD + D GC+GGL
Sbjct: 121 KDQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKK--------ADKGCDGGL 172
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
M +A+++ ++ GGL E DYPY ++ K S++ A + + + ++Q V
Sbjct: 173 MENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRVVA-IDGYEAIPDGDEQALLRAV 231
Query: 280 KNGPLAVAINAVY--MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWII 337
P++VAI Q Y GV + C ++HGVL+VGYG+ YWI+
Sbjct: 232 AKQPVSVAIEGASKDFQHYASGVFTGH-CGEEINHGVLIVGYGTE-------DGLDYWIV 283
Query: 338 KNSWGESWGENGYYKICRGR----NVCGVDSMVS 367
KNSW +WG+ G+ K+ R +C ++++ S
Sbjct: 284 KNSWAATWGDGGFVKMQRNTGKRGGLCSINTLAS 317
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 167/321 (52%), Gaps = 29/321 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLTPA 114
H+ FK K YA+ E +R +FK N R A+H L S G +Q++D+
Sbjct: 27 HWESFKATHAKTYANTVEEAYRAKVFKENAIRIAKHNDLFASGEVTFKVGYSQYADMHTH 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSF 171
E G R L K A +ND DWR KGAV P+KDQG CGSCWSF
Sbjct: 87 EVTEKLNGYRSGL---KQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSF 143
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG+LEG FL LVSLSEQ LVDC + E GCNGGLM+SAFEY G
Sbjct: 144 SATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNGGLMDSAFEYVESNG 196
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA 290
G+ EE YPYT D G +C + + A + V + E + + K GP++VAI+A
Sbjct: 197 GIDTEESYPYTAVD-GDSCLYKAANNAGVNTGYKDVQAKSESALRDAVEKAGPVSVAIDA 255
Query: 291 V--YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
Q Y G+ CS LDHGVL VGYGS K +WI+KNSWG SWGE
Sbjct: 256 SNWSFQMYSSGIYYESACSSDYLDHGVLAVGYGS------EWPNKEFWIVKNSWGTSWGE 309
Query: 348 NGYYKICRG-RNVCGVDSMVS 367
GY K+ R +N CG+ + S
Sbjct: 310 EGYIKMARNKKNNCGIATEAS 330
>gi|348505824|ref|XP_003440460.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 324
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 182/320 (56%), Gaps = 31/320 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E HF + ++NK Y + +E+ R IF N +R +H + + S T G+ +FSD+T +EF
Sbjct: 23 EFHFKSWMAQYNKEY-NLKEYYQRLQIFTENKKRIDKHNEGNHSFTMGLNEFSDMTFSEF 81
Query: 117 RRTYLGLRRKLRLPKD--ADQAPILPTNDL-PADFDWREKGA-VGPVKDQGSCGSCWSFS 172
R+++L + P++ A + +N L P DWR+KG V PVK+QG CGSCW+FS
Sbjct: 82 RKSFL-----MSEPQNCSATKGNYFSSNGLLPDSIDWRKKGNYVTPVKNQGGCGSCWTFS 136
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG LE + GKLV LSEQQLVDC + + + GCNGGL + AFEY + G
Sbjct: 137 TTGCLESVTAINKGKLVPLSEQQLVDCAQDFN-------NHGCNGGLPSQAFEYIMYNKG 189
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
LM E+DYPYT + C + K AA V + ++ + +E ++ + + P++ A
Sbjct: 190 LMTEQDYPYTAFEG--KCVYKPGKAAAFVNSVVNITAYNELEMVDAVGTHNPVSFAFEVT 247
Query: 292 Y-MQTYIGGVSCPYIC---SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+Y GV C + +++H VL VGYG + PYWI+KNSWG SWG
Sbjct: 248 SDFMSYHQGVYTSTECHNTTDKVNHAVLAVGYG-------QENGTPYWIVKNSWGSSWGM 300
Query: 348 NGYYKICRGRNVCGVDSMVS 367
NGY+ I RG+N+CG+ + S
Sbjct: 301 NGYFLIERGKNMCGLAACAS 320
>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
Length = 336
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 172/318 (54%), Gaps = 30/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+ +EE+ HR F +N R+ H + + + QF+D++ AE +R
Sbjct: 34 HFKSWMAKHHKTYSREEEYHHRLQTFASNWRKINAHNNGNHTFKMAVNQFADMSFAEIKR 93
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 94 KYL-----WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 149 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNKGIM 201
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D CKF K V + + +++ DED + + P++ A
Sbjct: 202 GEDTYPYQGKDSD--CKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQD 259
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 260 FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEE-------NGIPYWIVKNSWGPQWGMNG 312
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 313 YFLIERGKNMCGLAACAS 330
>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
Length = 1095
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/285 (38%), Positives = 161/285 (56%), Gaps = 26/285 (9%)
Query: 93 RHQ--KLDPSATHGITQFSDLTPAEFRRTYLGLRRK--LRLPKDAD------QAPILPTN 142
+HQ ++ SA G T+FSDL+P +F + +L L +K L++ K+ Q I
Sbjct: 822 QHQSFQVKNSAVFGHTKFSDLSPQQFAQKHLKLNQKKLLQVKKETKKLTTPIQQDITVEE 881
Query: 143 DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE 202
++P FDWR++ V K Q +CGSCW+FSTTG +E + KLV SEQQLVDCD
Sbjct: 882 NVPEQFDWRDRNVVTEPKYQNTCGSCWTFSTTGVIESQYAIKHQKLVPFSEQQLVDCD-- 939
Query: 203 CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
+ GC+GGLM A++Y ++GGL EDY ++ CKFD +K+ A +
Sbjct: 940 -------DINDGCHGGLMTDAYKYLQQSGGLEFAEDYG-DYKNKKEKCKFDLNKVQAKIK 991
Query: 263 NFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSA 322
+ + DE+ I L +NGP+A +NA +Q Y G+ P C ++H +L+VGYG
Sbjct: 992 EWQQIDEDEEIIKKQLYQNGPIAAGVNARLLQFYKSGIFDPKECDSDINHAILIVGYG-- 1049
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
+ + YWIIKN WG+ WG +GY+K+ RG+ CG+ + S
Sbjct: 1050 ----VEKDGQKYWIIKNQWGKDWGMDGYFKLARGKKQCGIHTYAS 1090
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 169/306 (55%), Gaps = 29/306 (9%)
Query: 75 EEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR----RKLRLP 130
+EH RF IFK N++ K D G+ +F+DL+ EF+ ++ + + LR
Sbjct: 61 DEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKFADLSNEEFKAMHMTTKMEKHKSLRGD 120
Query: 131 KDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKL 188
+ + + N LPA DWR+KGAV PVK+QG CGSCW+FST ++EG N++ TGKL
Sbjct: 121 RGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAFSTIASVEGINYIKTGKL 180
Query: 189 VSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG-TDRG 247
VSLSEQQLVDC E ++GCNGGLM++AF+Y + GG++ E++YPYT
Sbjct: 181 VSLSEQQLVDCSKE---------NAGCNGGLMDNAFQYIIDNGGIVTEDEYPYTAEAGEC 231
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGVSCPYI 305
K + IA + F V + + V + P+++AI A Q Y GV
Sbjct: 232 STTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYSTGVFTGK- 290
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCG 361
C LDHGV++VGYG +P + YWI++NSWG WGE GY ++ RG CG
Sbjct: 291 CGTELDHGVVVVGYGK---SPEGIN---YWIVRNSWGPEWGEQGYIRMQRGIEATEGKCG 344
Query: 362 VDSMVS 367
+ S
Sbjct: 345 ISMQAS 350
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
E H+ L+K +K Y EE R +++ NL++ H H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKHYHESEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+T G ++ + + + N L P DWREKG V PVKDQGSCGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C + AA+ F + S E + + GP++VAI+
Sbjct: 197 AGLDTEESYPYVGTDED-PCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAID 255
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 256 AGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 312
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 313 DKGYIYMAKDRKNHCGIATASS 334
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 169/314 (53%), Gaps = 32/314 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR--RTY 120
+K NKAY+ E R+TI+K N RR H + QF D+T EF+ Y
Sbjct: 30 WKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFKDFNGY 89
Query: 121 LGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
L K + L N P DWR +G V PVKDQG CGSCW+FSTTG+LE
Sbjct: 90 LS-------HKHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLE 142
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G NF TGKLVSLSEQ LVDC ++GCNGGLM++AF Y + G+ E
Sbjct: 143 GQNFKKTGKLVSLSEQNLVDC-------STAYGNNGCNGGLMDNAFTYIKENNGIDSEAS 195
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPYT D C F K +AA+ F + S DE+++ + GP++VAI+A + Q
Sbjct: 196 YPYTAKD--GKCAFTKPNVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQF 253
Query: 296 YIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV CS LDHGVL+VGYG+ K YW++KNSW SWG+ GY K+
Sbjct: 254 YRKGVYNERKCSSTELDHGVLVVGYGTE-------SGKDYWLVKNSWNTSWGDKGYIKMS 306
Query: 355 R-GRNVCGVDSMVS 367
R +N CG+ + S
Sbjct: 307 RNAKNQCGIATNAS 320
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 175/317 (55%), Gaps = 28/317 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K Y S EE HRF +F+ NL K S G+ +F+DL+ EF+
Sbjct: 404 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEEFKSK 463
Query: 120 YLGLRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YLGLR + +D + + DLP DWR+KGAV VK+QG+CGSCW+FST A+E
Sbjct: 464 YLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVAAVE 523
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G N + TG L +LSEQ+L+DCD + +SGCNGGLM+ AF + GGL +E+D
Sbjct: 524 GINQIVTGNLTTLSEQELIDCD--------TTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 239 YPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPY + G C+ K + +++ + V +++ + + PL+VAI A Q
Sbjct: 576 YPYL-MEEG-TCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQF 633
Query: 296 YIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV + P C LDHGV VGYGS+ K Y I+KNSWG WGE GY ++
Sbjct: 634 YSGGVFNGP--CGTELDHGVAAVGYGSS-------KGLDYIIVKNSWGPKWGEKGYIRMK 684
Query: 355 RG----RNVCGVDSMVS 367
R +CG++ M S
Sbjct: 685 RNTGKTEGLCGINKMAS 701
>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 361
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 165/310 (53%), Gaps = 21/310 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F+ FK+K++++Y E RF +FK N+ RA +P AT G+T+FSD++P EF
Sbjct: 38 QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97
Query: 117 RRTYL-GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
R TY G K + + T P DWR+KGAV PVKDQG C S W+FS TG
Sbjct: 98 RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSATG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG +A +L SLSEQ LV CD + D GC G + AF + + + G +
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTD---------DLGCRDGFPDIAFNWIVSSNKGNV 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY +G C + A + + ++ DED IA L + GP A+ ++A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDEDMIAEWLARKGPAAITVDATS 268
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y GGV I S+ ++ LLVGY + PYWIIKNSWG+ WGE GY +
Sbjct: 269 FQRYTGGVLTSCI-SKEMNSAALLVGYDDT-------SKPPYWIIKNSWGKGWGEEGYIR 320
Query: 353 ICRGRNVCGV 362
I +G N C V
Sbjct: 321 IEKGTNQCLV 330
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
E H+ L+K +K+Y EE R +++ NL++ H H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+T G ++ + + + N L P DWREKG V PVKDQGSCGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C + A+ F + S E + + GP++VAI+
Sbjct: 197 AGLDTEESYPYVGTDED-PCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSVAID 255
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 256 AGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 312
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 313 DKGYIYMAKDRKNHCGIATASS 334
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 178/328 (54%), Gaps = 32/328 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDL 111
+ ++ FK + K Y S+ E R IF N + A+H KL + + ++ DL
Sbjct: 23 VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82
Query: 112 TPAEFRRTYLGLRR-KLRLPKDADQAPIL---PTN-DLPADFDWREKGAVGPVKDQGSCG 166
EF G R K L + Q I P + D+P DWR++GAV PVKDQG CG
Sbjct: 83 LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCWSFS TGALEG +F T KLVSLSEQ LVDC ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFG-------NNGCNGGLMDNAFRY 195
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFD---KSKIAASVANFSVVSLDEDQIAANLVKNGP 283
GG+ E YPY G D KF K++ A + S DED++ A + GP
Sbjct: 196 IKNNGGIDTEAAYPYMGEDE----KFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGP 251
Query: 284 LAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
+++AI+A + Q Y GV S P S LDHGVL+VGYG+ + YW++KNS
Sbjct: 252 ISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGM-----DYWLVKNS 306
Query: 341 WGESWGENGYYKICRGR-NVCGVDSMVS 367
WG++WG +GY K+ R + N CGV + S
Sbjct: 307 WGDTWGLDGYIKMARNQDNQCGVATQAS 334
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 175/320 (54%), Gaps = 25/320 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K +K Y +EE R +++ NL++ H TH G+ F D+T
Sbjct: 27 HWELWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHE 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G +RK A + L N L P DWR+ G V PVKDQG CGSCW+FS
Sbjct: 86 EFRQLMNGYKRKAET--KARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFS 143
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG +F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y G
Sbjct: 144 TTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 196
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L E+ YPY GTD C +D + + + F + S E + + GP++VAI+A
Sbjct: 197 LDSEDSYPYLGTDD-QPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAIDAG 255
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG+
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQG---EDVDGKKYWIVKNSWSEKWGDK 312
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 313 GYIYMAKDRKNHCGIATAAS 332
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 179/338 (52%), Gaps = 35/338 (10%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
+ DL E LF+K K+ KAY+S EE RF +FK NL K G+
Sbjct: 38 SEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGYWLGL 97
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQA---PILPTNDLPADFDWREKGAVGPVKDQ 162
+F+DLT EF+ YLGL DQ + LP + DWR+KGAV VK+Q
Sbjct: 98 NEFADLTHDEFKAAYLGLTLTPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQ 157
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
G CGSCW+FST A+EG N + TG L LSEQ+L+DCD + ++GC+GGLM+
Sbjct: 158 GQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTD--------GNNGCSGGLMDY 209
Query: 223 AFEYTLKAGGLMREEDYPY---TGTDRGHACKFD---KSKIAASVANFSVVSLDEDQIAA 276
AF Y GGL EE YPY GT R + + D ++ A +++ + V + +Q
Sbjct: 210 AFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALL 269
Query: 277 NLVKNGPLAVAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
+ + P++VAI A Q Y GGV P C RLDHGV VGYG+A K
Sbjct: 270 KALAHQPVSVAIEASGRNFQFYSGGVFDGP--CGTRLDHGVTAVGYGTAS------KGHD 321
Query: 334 YWIIKNSWGESWGENGYYKICRGR----NVCGVDSMVS 367
Y I+KNSWG WGE GY ++ RG +CG++ M S
Sbjct: 322 YIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGINKMAS 359
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 173/323 (53%), Gaps = 31/323 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLT 112
E F FK F + Y S E HR +IF+ANL+ RH D + + + F+DL+
Sbjct: 30 EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCW 169
EFR T+ G RR L AD + ND LPA DW KG V P+K+Q CGSCW
Sbjct: 90 NEEFRATFNGYRR-LAAVSLADS--VHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS ++EG + L TGKLVSLSEQ LVDC D GC+GG M+ AF+Y ++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAI 288
G+ E YPY D +C+F ++ + A++ +F V + DE + + GP++VAI
Sbjct: 200 NRGIDTEASYPYKAIDE--SCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAI 257
Query: 289 NAVY--MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+A Q Y GV CS LDHGV VGYG+ L PYW +KNSWG SW
Sbjct: 258 DAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGT-------LNGAPYWKVKNSWGTSW 310
Query: 346 GENGYYKICRGR-NVCGVDSMVS 367
G GY + R + N CG+ + S
Sbjct: 311 GRKGYIFMSRNKQNQCGIATKAS 333
>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 165/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKBQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+E +A +L LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLXXLSEQQLVSCDDK---------DSGCXGGLMTQAFEWLLRXMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY + D + A + + ++ +E +AA L K+GP+++ ++A
Sbjct: 209 FTEDSYPYVSSTGDVPECTNSSELVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC + L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYESGVLTSC---AGKHLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 179/318 (56%), Gaps = 30/318 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAE 115
+ FK + K Y S+ E R IF N + A+H +L S G+ +++D+ E
Sbjct: 27 WQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHE 86
Query: 116 FRRTYLG----LRRKLRLPKDADQAP-ILPTN-DLPADFDWREKGAVGPVKDQGSCGSCW 169
F+ T G +R++LR + + I P N +P DWR+ GAV VKDQG CGSCW
Sbjct: 87 FKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCW 146
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
SFS+TG+LEG +F G LVSLSEQ LVDC + ++GCNGGLM++AF Y
Sbjct: 147 SFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYG-------NNGCNGGLMDNAFRYIKD 199
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E+ YPY G D +C F+K+ + A+ F + DE+ + + GP+AVAI
Sbjct: 200 NGGVDTEKSYPYEGIDD--SCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAI 257
Query: 289 NAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
+A Q Y GV + P S LDHGVL+VGYG+ + YW++KNSWG +W
Sbjct: 258 DASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGT------DKDGQDYWLVKNSWGTTW 311
Query: 346 GENGYYKICRGR-NVCGV 362
G+ GY K+ R + N CG+
Sbjct: 312 GDQGYIKMARNQDNQCGI 329
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 178/347 (51%), Gaps = 32/347 (9%)
Query: 31 LIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKAN 87
LIR VT+ L EST LG H F+ F ++ K+Y S E RF IF +
Sbjct: 33 LIRPVTERAATAL---ESTIVAALGRSRHALRFARFAVRYGKSYESAAEVQRRFRIFSES 89
Query: 88 LRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPAD 147
L + S GI ++SD++ EF+ + LG + + + N LP
Sbjct: 90 LEEVRSTNQKGLSYRLGINRYSDMSWEEFQASRLGAAQTCSATLRGNHR-MQDANALPET 148
Query: 148 FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 207
DWRE G V PVKDQ CGSCW+FSTTGALE A ATGK +SLSEQQLVDC +
Sbjct: 149 KDWREDGIVSPVKDQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYN--- 205
Query: 208 PGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANF 264
+ GCNGGL + AFEY GGL EE YPY G + C + A V N
Sbjct: 206 ----NFGCNGGLPSQAFEYIKYNGGLDTEESYPYKGVN--GVCHYKPENAAVQVLDSVNI 259
Query: 265 SVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRRLD---HGVLLVGYG 320
++ + DE Q A LV+ P++VA + + Y GV C D H VL VGYG
Sbjct: 260 TLNAEDELQNAVGLVR--PVSVAFEVINGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYG 317
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWGESWG+ GY+K+ RG+N+C V + S
Sbjct: 318 VE-------NGTPYWLIKNSWGESWGDKGYFKMERGKNMCAVATCAS 357
>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
Length = 307
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 178/312 (57%), Gaps = 17/312 (5%)
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR-RK 126
++ Y +QE RF IFK N+ + S G+ +D++ E++R YLG
Sbjct: 5 DRQYTAQE-FGTRFNIFKKNMDFVHKWNAKGSSTVLGLNSMADISNEEYQRVYLGTHIDA 63
Query: 127 LRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+ + A + T + A+ DWR KGAV P+K+QG CGSCWSFSTTG+ EGA+F+ T
Sbjct: 64 SQFRQQAASHKLGRTFKVQAANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKT 123
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
G LVSLSEQ L+DC PE + GCNGGLM +AFEY +K G+ E YPY D
Sbjct: 124 GNLVSLSEQNLMDCS---KPEG----NQGCNGGLMTAAFEYIIKNNGIDTESSYPYKAED 176
Query: 246 RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSCP 303
G C ++ + AA+++++ V+ + A GP++VAI+A + Q Y GV
Sbjct: 177 -GKKCLYNPANSAATLSSYVNVTTGSESDLAVKSGLGPVSVAIDASHNSFQLYSSGVYYE 235
Query: 304 YICSR-RLDHGVLLVGYGSAGY--APIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NV 359
CS+ +LDHGVL+VGYGS A + +WI+KNSWG +WG GY + R R N
Sbjct: 236 PKCSQTQLDHGVLVVGYGSDALPSAGVSAGSGDWWIVKNSWGTTWGVEGYIYMSRNRNNN 295
Query: 360 CGVDSMVSTVAA 371
CG+ +M S +A
Sbjct: 296 CGIATMASLPSA 307
>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 164/311 (52%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + Y + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L LSEQQLV CD + DSGCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVADHRLXXLSEQQLVSCDDK---------DSGCNGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY + D + A + + + E +AA L K+GP+++A++A
Sbjct: 209 LTEDSYPYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYESGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 161/303 (53%), Gaps = 30/303 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLR 128
+ Y + E HRF IF+AN+ R + G+ QF+DLT EF+ R L+
Sbjct: 50 RVYKNAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTNEEFK-----TRNTLK 104
Query: 129 LPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATG 186
K A N +PA DWR KGAV P+KDQG CGSCW+FS A EG L+TG
Sbjct: 105 PSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTG 164
Query: 187 KLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 246
KL+SLSEQ++VDCD D D GCNGG M+ AFEY +K G+ E +YPY D
Sbjct: 165 KLISLSEQEVVDCDVTSD-------DQGCNGGEMDDAFEYIIKNKGITTEANYPYKAAD- 216
Query: 247 GHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCP 303
C K+ AAS+ + V+++ + N P+AVAI+A Q Y GV
Sbjct: 217 -GTCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTG 275
Query: 304 YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR----GRNV 359
C LDHGV LVGYG+ YW++KNSWG SWGE+GY ++ R +
Sbjct: 276 -DCGTDLDHGVTLVGYGATSDGT------KYWLVKNSWGTSWGEDGYIRMERDVDAKEGL 328
Query: 360 CGV 362
CG+
Sbjct: 329 CGI 331
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 122/320 (38%), Positives = 169/320 (52%), Gaps = 25/320 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ + +FK + NK Y +E +R +F + +H H GI +++D+
Sbjct: 19 DREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMP 78
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R G + + + PK P DLPA DWR KG V VK+QG CGSCW+FS
Sbjct: 79 NEEFVRVMNGYKMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFS 138
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
+TG+LEG F KL+SLSEQ LVDC E + GC GGLM+ AF Y G
Sbjct: 139 STGSLEGQTFKKYNKLISLSEQNLVDCSTE-------QGNMGCGGGLMDQAFTYIKVNDG 191
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
+ E YPY C+F+K+ + A+ ++ + S E + + + GP+AVAI+A
Sbjct: 192 IDTETSYPYEAAS--GKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDAS 249
Query: 292 YM--QTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+M Q Y GV CS+ RLDHGVL VGYG+ K YW++KNSWG +WG+
Sbjct: 250 HMSFQLYKSGVYHYIFCSQTRLDHGVLAVGYGTD-------SGKDYWLVKNSWGATWGQQ 302
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + R R N CG+ + S
Sbjct: 303 GYIMMSRNRDNNCGIATQAS 322
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 175/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
E H+ L+K +K Y + EE R +++ NL++ H H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKNYHASEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+T G ++ + + + N L P DWREKG V PVKDQGSCGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 144 FSTTGAMEGQPFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C + AA+ F + S E + + GP++VAI+
Sbjct: 197 AGLDTEESYPYVGTDED-PCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAID 255
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 256 AGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 312
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 313 DKGYIYMAKDRKNHCGIATASS 334
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 121/297 (40%), Positives = 166/297 (55%), Gaps = 21/297 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K Y S EE RF IFK NL+ K+ + G+ +F+DL+ EF+
Sbjct: 47 FESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLGL+ +++ + +LP DWR+KGAV PVK+QGSCGSCW+FST A+EG
Sbjct: 107 YLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEG 166
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TG L SLSEQ+L+DCD + +GCNGGLM+ AF + ++ GGL +EEDY
Sbjct: 167 INQIVTGNLTSLSEQELIDCDR--------TYSNGCNGGLMDYAFSFIVENGGLHKEEDY 218
Query: 240 PYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTY 296
PY + C+ K + +++ + V + +Q + N L+VAI A Q Y
Sbjct: 219 PYIMEE--GTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFY 276
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
GGV + C LDHGV VGYG+A K Y I+KNSWG WGE GY ++
Sbjct: 277 SGGVFDGH-CGSDLDHGVAAVGYGTA-------KGVDYIIVKNSWGSKWGEKGYIRM 325
>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
Length = 471
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 167/314 (53%), Gaps = 23/314 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S ++F+ NL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYESAARR-LPLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 95
Query: 119 TYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y ++ + P+ + PA DWR +GAV VKDQG CGSCW+FS G +
Sbjct: 96 RYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 155
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E FLA L +LSEQ LV CD D GC+GGLMN+AFE+ ++ G +
Sbjct: 156 ECQWFLAGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQENNGAVYT 206
Query: 236 EEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
E+ YPY +G C + A++ + DE QIAA + NGP+AVA++A
Sbjct: 207 EDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAACVAVNGPVAVAVDASSWM 266
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
TY GGV + S +LDHGVLLVGY + PYWIIKNSW + GE GY +I
Sbjct: 267 TYTGGVMTSCV-SEQLDHGVLLVGYNDSAAV-------PYWIIKNSW-TTQGEEGYIRIA 317
Query: 355 RGRNVCGVDSMVST 368
+G N C V S+
Sbjct: 318 KGSNQCLVKEEASS 331
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 167/306 (54%), Gaps = 29/306 (9%)
Query: 79 HRFTIFKANLRRAARHQK-LDPSATHGITQFSDLTPAEFRRTYLGLRRKLRL----PKDA 133
HRF +F N +R H K S T G ++S LT EF++ GLR K A
Sbjct: 46 HRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYA 105
Query: 134 DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
AP + D+P + DW E+G V PVK+QG CGSCW+FSTTGA+EGA F+++ +LVS+SE
Sbjct: 106 LMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSE 165
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
Q+LVDCDH + D GCNGGLM++AF++ GL +EEDYPY + C
Sbjct: 166 QELVDCDH--------NGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAKEG--TCALK 215
Query: 254 KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLD 311
K K V F V +++Q V P++VAI A Q Y GV C +LD
Sbjct: 216 KCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGV-FDKSCGTKLD 274
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR----GRNVCGVDSMVS 367
HGVL+VGYG G K YW +KNSWG WG+ GY K+ R CGV + S
Sbjct: 275 HGVLVVGYGEEG-------GKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMVPS 327
Query: 368 TVAAAV 373
A++
Sbjct: 328 YPTASI 333
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 168/311 (54%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSEQQLV CD + D+GC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEQQLVSCDDK---------DNGCRGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPY-TGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY + T C + A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSTGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC + L+HGVLLV Y G E PYW+IKNSWGE+WGENG
Sbjct: 269 SFMSYQSGVLTSCAGM---PLNHGVLLVWYNRTG-------EVPYWVIKNSWGENWGENG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 175/319 (54%), Gaps = 23/319 (7%)
Query: 54 LGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTP 113
L + + +K K Y +EE D R I+ NL +H + S + F+DLT
Sbjct: 21 LSQDRQWHAWKDFHGKTYTGEEE-DLRRAIWNDNLEIVKKHNAENHSYKLDMNHFADLTV 79
Query: 114 AEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EF++ ++G R + P L LPA+ DWR+KG V VK+QG CGSCW+FS+
Sbjct: 80 TEFKQRFMGYRAASNSTGGSTFLP-LSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSS 138
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG+LEG +F TGKLVSLSEQ LVDC + ++GC GGLM+ AF+Y G+
Sbjct: 139 TGSLEGQHFRKTGKLVSLSEQNLVDCSKKYG-------NNGCEGGLMDYAFKYIKNNDGI 191
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY 292
E+ YPYT D C F + A+V ++ V E + + + GP++VAI+A +
Sbjct: 192 DTEQSYPYTARDG--QCHFKPGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGH 249
Query: 293 --MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GV S P S +LDHGVL VGYG+ K YW++KNSWGE WG NG
Sbjct: 250 SSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGAE-------DGKDYWLVKNSWGEGWGMNG 302
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y K+ R + N CG+ + S
Sbjct: 303 YIKMSRNKDNQCGIATQAS 321
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 176/309 (56%), Gaps = 28/309 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR--RK 126
KAY + E + RF IFK NLR H + + G+T+F+DLT E+R +LG R RK
Sbjct: 71 KAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEEYRARFLGGRFSRK 130
Query: 127 LRL--PKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLA 184
RL K A L +DLP D DWR+KGAV VKDQG CGSCW+FS+ A+EG N +
Sbjct: 131 PRLSAAKSGRYAAAL-GDDLPDDVDWRKKGAVATVKDQGQCGSCWAFSSVAAVEGINQIV 189
Query: 185 TGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGT 244
TG+L+ LSEQ+LVDCD S + GCNGGLM+ AF++ + GG+ EEDYPY G
Sbjct: 190 TGELIPLSEQELVDCDK--------SFNMGCNGGLMDYAFQFIIGNGGIDTEEDYPYKGR 241
Query: 245 DRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVS 301
D AC + K+ ++ + V +++ V N P++VAI A Q Y GV
Sbjct: 242 DA--ACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 299
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
C LDHGV+ VGYG+ YWI++NSWG+ WGE+GY ++ RNV
Sbjct: 300 TGR-CGTDLDHGVVAVGYGTD-------NGTDYWIVRNSWGKDWGESGYIRL--ERNVAN 349
Query: 362 VDSMVSTVA 370
+ + +A
Sbjct: 350 ITTGKCGIA 358
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 158/294 (53%), Gaps = 27/294 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLRRK 126
+ YA E ++R+ +FK N+ R ++ T + QF+DLT EFR Y G +
Sbjct: 46 RVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN 105
Query: 127 LRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L + ++ LP DWR+KGAV P+KDQGSCGSCW+FS A+EG
Sbjct: 106 SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQ 165
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ GKL+SLSEQ+LVDCD + D GC GG MNSAF YT+ GGL E +YPY
Sbjct: 166 IKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYK 216
Query: 243 GTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVYMQTYIGG 299
TD C +K+K IA S+ F V ++++ V + P+++ I Q Y G
Sbjct: 217 STD--GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSG 274
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
V CS LDHGV +VGYG + YWI+KNSWG WGE GY +I
Sbjct: 275 VFSGE-CSTHLDHGVAVVGYGKSSNGS------KYWILKNSWGPKWGERGYMRI 321
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 177/329 (53%), Gaps = 31/329 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQ 107
++LGAE +S FK K K+Y S+ E R I+ N + A+H + + + + +
Sbjct: 21 EVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNE 78
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN----DLPADFDWREKGAVGPVKDQG 163
F D+ EF T G +R + + P N LP DWR KGAV PVK+QG
Sbjct: 79 FGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQG 138
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCW+FS TG+LEG +F +G +VSLSEQ LV C + ++GC GGLM+ A
Sbjct: 139 QCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFG-------NNGCEGGLMDDA 191
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNG 282
F+Y G+ E+ YPY GTD C F KS + A+ + F + E Q+ + G
Sbjct: 192 FKYIRANKGIDTEKSYPYNGTDG--TCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVG 249
Query: 283 PLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI+A + Q Y GV P S LDHGVL+VGYG+ L YW +KN
Sbjct: 250 PISVAIDASHESFQFYSDGVYDEPECDSESLDHGVLVVGYGT-------LNGTDYWFVKN 302
Query: 340 SWGESWGENGYYKICRG-RNVCGVDSMVS 367
SWG +WG+ GY ++ R +N CG+ S S
Sbjct: 303 SWGTTWGDEGYIRMSRNKKNQCGIASSAS 331
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 179/335 (53%), Gaps = 35/335 (10%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
+ DL + LF+K K+ KAYAS EE RF +FK NL K S G+
Sbjct: 37 SEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGL 96
Query: 106 TQFSDLTPAEFRRTYLGL------RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPV 159
+F+DLT EF+ TYLGL + + + ++P + DWR+K AV V
Sbjct: 97 NEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEV 156
Query: 160 KDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
K+QG CGSCW+FST A+EG N + TG L SLSEQ+L+DC + ++GCNGGL
Sbjct: 157 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTD--------GNNGCNGGL 208
Query: 220 MNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLV 279
M+ AF Y GGL EE YPY + G C K +++ + V +++Q +
Sbjct: 209 MDYAFSYIASTGGLRTEEAYPYA-MEEGD-CDEGKGAAVVTISGYEDVPANDEQALVKAL 266
Query: 280 KNGPLAVAINAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
+ P++VAI A + Q Y GGV P C +LDHGV VGYG++ K + Y I
Sbjct: 267 AHQPVSVAIEASGRHFQFYSGGVFDGP--CGEQLDHGVTAVGYGTS-------KGQDYII 317
Query: 337 IKNSWGESWGENGYYKICR----GRNVCGVDSMVS 367
+KNSWG WGE GY ++ R G +CG++ M S
Sbjct: 318 VKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMAS 352
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 177/312 (56%), Gaps = 28/312 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRR 118
+ + ++ KAY S E+ RF IFK N+ H + + S + G+ +F+DLT +EFR
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97
Query: 119 TYLG-LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
Y+G L+R + D A + D DWR+KG V +KDQG CGSCW+FS A+
Sbjct: 98 LYVGRLQRPAPFHEVGDIALVA---DTATSVDWRKKGGVTEIKDQGDCGSCWAFSAVAAV 154
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL+TG LVSLSEQ+LVDCD + + GC+GG+M+ AF+Y ++ GG+ +
Sbjct: 155 EGLTFLSTGTLVSLSEQELVDCDT--------TVNQGCDGGIMDYAFQYMIRNGGITSQS 206
Query: 238 DYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQ 294
+YPY RG AC DK K AA++ F + +++ V N P++VAI A Q
Sbjct: 207 NYPYRAL-RG-ACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQ 264
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV C LDHGV +VGYG+ + YW++KNSWG WGE+GY ++
Sbjct: 265 LYSSGVFTGE-CGSNLDHGVAIVGYGTDAGG------RQYWLVKNSWGSGWGESGYVRME 317
Query: 355 R---GRNVCGVD 363
R G VCG++
Sbjct: 318 RQGPGAGVCGIN 329
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 184/332 (55%), Gaps = 34/332 (10%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR--AARHQKLD-PSATHGITQF 108
+L A +F F + +NK Y S E + R++IFK NL A D P+AT+GI +F
Sbjct: 27 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKF 86
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDAD---QAPIL--PTNDLPADFDWREKGAVGPVKDQG 163
SDL+ +E + GL +P+ A + +L P + P FDWRE+ V +K+QG
Sbjct: 87 SDLSKSELIAKFTGLS----IPQRASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQG 142
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
+CG+CW+F+T ++E + +LV LSEQQL+DCD S D GCNGGL+++A
Sbjct: 143 ACGACWAFATLASVESQFAMRHNRLVDLSEQQLIDCD---------SVDMGCNGGLLHTA 193
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGP 283
FE ++ GG+ E DYP+ G DR + + + V + V ++E+++ L GP
Sbjct: 194 FEEIIRMGGVQAELDYPFVGRDRRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGP 253
Query: 284 LAVAINAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+ +AI+A + Y GV SC + L+H VLLVGYG PYW KN+W
Sbjct: 254 IPMAIDAADIVNYYRGVISSCE---NNGLNHAVLLVGYGVENGV-------PYWAFKNTW 303
Query: 342 GESWGENGYYKICRGRNVCG-VDSMVSTVAAA 372
G+ WGENGY+++ + N CG V+ + ST A
Sbjct: 304 GDDWGENGYFRVRQNINACGMVNDLASTAVLA 335
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 158/294 (53%), Gaps = 27/294 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLRRK 126
+ YA E ++R+ +FK N+ R ++ T + QF+DLT EFR Y G +
Sbjct: 40 RVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN 99
Query: 127 LRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L + ++ LP DWR+KGAV P+KDQGSCGSCW+FS A+EG
Sbjct: 100 SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQ 159
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ GKL+SLSEQ+LVDCD + D GC GG MNSAF YT+ GGL E +YPY
Sbjct: 160 IKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYK 210
Query: 243 GTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NAVYMQTYIGG 299
TD C +K+K IA S+ F V ++++ V + P+++ I Q Y G
Sbjct: 211 STD--GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSG 268
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
V CS LDHGV +VGYG + YWI+KNSWG WGE GY +I
Sbjct: 269 VFSGE-CSTHLDHGVAVVGYGKSSNGS------KYWILKNSWGPKWGERGYMRI 315
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 176/315 (55%), Gaps = 21/315 (6%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + +AYAS EE++ RF ++ NLR + S + ++DL+ E+R
Sbjct: 40 FDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSK 99
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPA-DFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
LG L + AP L +P + DW KGAV PVK+Q CGSCW+FSTTGA+E
Sbjct: 100 ALGYNADLHEERPLRAAPFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVE 159
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
GA+ +ATGKL SLSEQ LVDCD E D+GC+GGLM+ AFE+ +K GG+ E+D
Sbjct: 160 GASAIATGKLASLSEQMLVDCDRE--------RDNGCHGGLMDFAFEFIMKNGGIDTEDD 211
Query: 239 YPYTGTDRGHACKFDK-SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
YPYT + C+ +K + ++ ++ V +++ V N P++VAI A Q
Sbjct: 212 YPYTAEE--GMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQL 269
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GGV C LDHGVL+VGYG+A L PYW++KNSWG WG+ GY ++ R
Sbjct: 270 YGGGVF-DAECGTALDHGVLVVGYGTASNGTHHL---PYWLVKNSWGAEWGDKGYIRLLR 325
Query: 356 G---RNVCGVDSMVS 367
CGV S
Sbjct: 326 NLGEEGQCGVAMQAS 340
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 141/365 (38%), Positives = 188/365 (51%), Gaps = 35/365 (9%)
Query: 13 LVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNK 69
L + V++G+ D+ + I+ V+D L ES+ +LG F+ F + K
Sbjct: 13 LFLLCCVAAGSSFDESNP-IKLVSDR----LHDFESSFVKVLGQSRRALSFARFAHRHGK 67
Query: 70 AYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRL 129
Y ++ E RF IF +L K T G+ QF+D T EF++ LG +
Sbjct: 68 RYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQKYRLGAAQNCSA 127
Query: 130 PKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLV 189
+ L LP DWRE+G V PVK+QG CGSCW+FSTTGALE A A GK +
Sbjct: 128 TTRGNHK--LTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEAAYHQAFGKGI 185
Query: 190 SLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHA 249
SLSEQQLVDC + + GCNGGL + AFEY GGL EE YPYTG D A
Sbjct: 186 SLSEQQLVDCARAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKD--DA 236
Query: 250 CKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAV-YMQTYIGGVSCPYI 305
CKF + V N ++ + DE + A V+ P++VA V + Y GV
Sbjct: 237 CKFSSENVGVRVVESVNITLGAEDELKHAVAFVR--PVSVAFEVVGSFRLYKEGVYTTST 294
Query: 306 CSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGV 362
C ++H VL VGYG PYW+IKNSWGE WG+NGY+K+ G+N+CG+
Sbjct: 295 CGSTPMDVNHAVLAVGYGVE-------NGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGI 347
Query: 363 DSMVS 367
+ S
Sbjct: 348 ATCAS 352
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/312 (37%), Positives = 172/312 (55%), Gaps = 19/312 (6%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F KFNK Y+S+ E RF IF+ NL + D +A + I +FSDL
Sbjct: 20 DLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+ E Y GL L+ + + P + P +FDWR V VK+QG CG+CW+
Sbjct: 80 SKDETISKYTGLALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
F+T +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E ++
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAVMQM 190
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GG+ E DYPY G+D G+ + + +++ E+++ L GP+ VAI+A
Sbjct: 191 GGVQAENDYPYEGSD-GNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDA 249
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y G+ Y + L+H VLLVGYG PYWI+KN+WGE WGE GY
Sbjct: 250 SDIVNYRRGIM-RYCSNYGLNHAVLLVGYGVEN-------NVPYWILKNTWGEDWGEQGY 301
Query: 351 YKICRGRNVCGV 362
+++ + N CG+
Sbjct: 302 FRVQQNINACGI 313
>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
Length = 336
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 171/318 (53%), Gaps = 30/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+ +EE+ R F +N R+ H + + + QFSD++ AE +R
Sbjct: 34 HFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKR 93
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 94 KYL-----WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 149 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNNGIM 201
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D CKF K V + + +++ DED + + P++ A
Sbjct: 202 GEDTYPYQGKDSD--CKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQD 259
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 260 FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEE-------NGIPYWIVKNSWGPQWGMNG 312
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 313 YFLIERGKNMCGLAACAS 330
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
E H+ L+K +K+Y EE R +++ NL++ H H G+ F D+T
Sbjct: 27 EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+T G ++ + + + N L P DWREKG V PVKDQGSCGSCW+
Sbjct: 86 NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C + A+ F + S E + + GP++VAI+
Sbjct: 197 AGLDTEESYPYVGTDED-PCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSVAID 255
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 256 AGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 312
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 313 DKGYIYMAKDRKNHCGIATASS 334
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 144/373 (38%), Positives = 189/373 (50%), Gaps = 36/373 (9%)
Query: 8 LFLVSLVVFS---AVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH---HFS 61
LF++++VV + AV + D IR VTD L EST LG F+
Sbjct: 6 LFVLAVVVLADTAAVVNSGFADS--NPIRPVTDRAASAL---ESTVFAALGRTRDALRFA 60
Query: 62 LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYL 121
F ++ K+Y S E RF IF +L+ + S GI +F+D++ EFR T L
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRL 120
Query: 122 GLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
G + + LP DWRE G V PVK+QG CGSCW+FSTTGALE A
Sbjct: 121 GAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAY 180
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
ATGK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE YPY
Sbjct: 181 TQATGKPISLSEQQLVDCGFAFN-------NFGCNGGLPSQAFEYIKYNGGLDTEESYPY 233
Query: 242 TGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYI 297
G + CKF + V N ++ + DE + A LV+ P++VA + + Y
Sbjct: 234 QGVN--GICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVR--PVSVAFEVITGFRLYK 289
Query: 298 GGVSCPYICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
GV C ++H VL VGYG PYW+IKNSWG WG+ GY+K+
Sbjct: 290 SGVYTSDHCGTTPMDVNHAVLAVGYGVE-------DGVPYWLIKNSWGADWGDEGYFKME 342
Query: 355 RGRNVCGVDSMVS 367
G+N+CGV + S
Sbjct: 343 MGKNMCGVATCAS 355
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/290 (41%), Positives = 166/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T F++LT E+R YLG R R++
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 130 PKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ + ND+ P DWR+KGAV +KDQG+CGSCW+FST A+EG N + TG+
Sbjct: 84 AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQ+LVDCD S + GCNGGLM+ AF++ +K GGL E+DYPY GT+ G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTN-G 194
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
K+ ++ + V ++ V P++VAI+A Q Y G+
Sbjct: 195 KCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGK- 253
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C +DH V+ VGYGS YWI++NSWG WGE+GY ++ R
Sbjct: 254 CGTNMDHAVVAVGYGSENGV-------DYWIVRNSWGTRWGEDGYIRMER 296
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 145/367 (39%), Positives = 191/367 (52%), Gaps = 42/367 (11%)
Query: 13 LVVFSAVSSGTLID------DVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLF 63
L++ AV+SG D + IR V+D ++ ES+ L+G H F+ F
Sbjct: 11 LILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDL----ESSVLRLIGDTRHAHSFASF 66
Query: 64 KKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL 123
++ K+Y + +E RF IF NL+ + T + QF+D T EFRR LG
Sbjct: 67 AHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLAVNQFADWTWEEFRRHRLGA 126
Query: 124 RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+ + L LP DWRE G V P+KDQG CGSCW+FSTTGALE A
Sbjct: 127 AQNCSATLKGNHK--LTDVILPETKDWREDGIVSPIKDQGHCGSCWTFSTTGALEAAYAQ 184
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
A GK +SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG
Sbjct: 185 AFGKGISLSEQQLVDCAGAFN-------NFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 237
Query: 244 TDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGG 299
D CKF I V N ++ + DE + A V+ P++VA V+ + Y G
Sbjct: 238 LDG--TCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVR--PVSVAFEVVHDFRFYKKG 293
Query: 300 VSCPYICSRR---LDHGVLLVGYG-SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
V C ++H VL VGYG G A YW+IKNSWGE+WG+NGY+K+
Sbjct: 294 VYTSGTCGSTPMDVNHAVLAVGYGVEDGVA--------YWLIKNSWGENWGDNGYFKMEL 345
Query: 356 GRNVCGV 362
G+N+CGV
Sbjct: 346 GKNMCGV 352
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 165/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYWRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKBQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A LV LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAXHGLVRLSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY + D + A + + ++ E +AA L K+GP+++A++A
Sbjct: 209 FTEDSYPYVSSTGDVPECTNSSELVPGARIDGYVMIESXETVMAAWLAKSGPISIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC + L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 PFMSYESGVLTSC---VGKXLNHGVLLVGYNMTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVTMGVNAC 329
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/297 (42%), Positives = 170/297 (57%), Gaps = 27/297 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDP--SATHGITQFSDLTPAEFRRTYLGLRRK 126
KAY E + RF IF NLR H + + S T G+T+F+DLT E+R TYLG++
Sbjct: 47 KAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYRSTYLGVKPG 106
Query: 127 LRLPKDADQAP----ILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
P+ A++AP L N DLP DWREKGAV P+KDQG CGSCW+FST A+EG
Sbjct: 107 QVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWAFSTVAAVEGI 166
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG L+ LSEQ+LVDCD + + GCNGGLM+ AF++ + GG+ EEDYP
Sbjct: 167 NQIVTGDLIVLSEQELVDCDT--------AYNEGCNGGLMDYAFQFIISNGGIDTEEDYP 218
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIG 298
Y D G K+ S+ ++ V +++ V + P++VAI Q Y
Sbjct: 219 YKERD-GLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQLYKS 277
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ C LDHGV+ VGYG+ K YWI++NSWG+SWGE GY ++ R
Sbjct: 278 GIF-DGRCGIDLDHGVVAVGYGTE-------SGKDYWIVRNSWGKSWGEAGYIRMER 326
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 137/374 (36%), Positives = 193/374 (51%), Gaps = 45/374 (12%)
Query: 7 VLFLVSLVVFSAVSSGTLIDDVDQLIRQVTD--GGDEILSHHESTNNDLLGAEHHFSLFK 64
+L L +++ SA++ D + D G D I+ +E L+
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYE--------------LWL 48
Query: 65 KKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGL 123
+ KAY +E +F++FK N +H +PS G+ QF+DL+ EF+ YLG
Sbjct: 49 AQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGT 108
Query: 124 R-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+ R R P Q + DLP DWREKGAV VK+QGSCGSCW+FST A+E
Sbjct: 109 KLDAKKRLSRSPSPRYQYSV--GEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVE 166
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G N + TG L SLSEQ+LVDCD S + GCNGGLM+ AF++ + GGL E+D
Sbjct: 167 GINQIVTGNLTSLSEQELVDCDT--------SYNQGCNGGLMDYAFQFIISNGGLDSEDD 218
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTY 296
YPY + G + K+ ++ ++ V ++++ N P++VAI A Q Y
Sbjct: 219 YPYK-ANNGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFY 277
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GV C +LDHGV LVGYGS YW++KNSWG SWGE G+ K+
Sbjct: 278 ESGVFTSN-CGTQLDHGVTLVGYGSE-------SGIDYWLVKNSWGNSWGEKGFIKL--Q 327
Query: 357 RNVCGVDSMVSTVA 370
RN+ G + + +A
Sbjct: 328 RNLEGASTGMCGIA 341
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 184/332 (55%), Gaps = 34/332 (10%)
Query: 43 LSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSAT 102
L E T ++ H+ + K K Y + E + RF IFK NLR + P T
Sbjct: 38 LQSTERTEAHMMKMYEHWLV---KHGKNYNAIGEKERRFEIFKDNLRFVDEQNSV-PGRT 93
Query: 103 H--GITQFSDLTPAEFRRTYLG--LRRKLRLPKDADQAPILPT---NDLPADFDWREKGA 155
+ G+T+F+DLT E+R YLG + +K +L + Q + +DLP+ DWREKGA
Sbjct: 94 YKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGA 153
Query: 156 VGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGC 215
V VKDQG CGSCW+FST G++EG N + TG L+SLSEQ+LVDCD + + GC
Sbjct: 154 VTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDK--------AYNQGC 205
Query: 216 NGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD-KSKIAASVANFSVVSLDEDQI 274
NGGLM+ AFE+ +K GG+ E DYPY +D + C + K+ ++ + V ++++
Sbjct: 206 NGGLMDYAFEFIIKNGGIDSEADYPYRASD--NMCDSNRKNAHVVTIDGYEDVPENDEES 263
Query: 275 AANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK 332
V N P++VAI A Q Y GV C LDHGV+ VGYG+
Sbjct: 264 LKKAVANQPVSVAIEAGGREFQLYQSGVFTGR-CGTNLDHGVVAVGYGTENGI------- 315
Query: 333 PYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
YWI++NSWG WGE+GY ++ RNV D+
Sbjct: 316 DYWIVRNSWGPKWGESGYIRM--ERNVASTDT 345
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 176/317 (55%), Gaps = 28/317 (8%)
Query: 55 GAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT----QFSD 110
G + + L+K + K+Y + EE +R ++ N H S HG T F D
Sbjct: 22 GTSNEWELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNT--DSDKHGYTLEMNSFGD 79
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
LT AEF Y G R+ L + + N +P+ DWR+K V VK+QG CGSCW+
Sbjct: 80 LTSAEFSSLYNGYRQNLETSGSVFSSSL--RNAMPSSLDWRDKKVVTDVKNQGKCGSCWA 137
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTG+LEG + L TG LVSLSEQQL+DC + ++GC+GG M SAF+Y A
Sbjct: 138 FSTTGSLEGLHALKTGHLVSLSEQQLMDCSVKYG-------NNGCDGGNMRSAFQYIKDA 190
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GG EE YPYT + +C+FD K+ A+ + + S DE + L + GP++VA++
Sbjct: 191 GGDDTEESYPYTA--KNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMD 248
Query: 290 A--VYMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A Q Y G+ Y+CS L+HGV L+GYG + PYW++KNSWG+ WG
Sbjct: 249 AGLKTFQFYKKGIYSDYLCSNTHLNHGVTLIGYGESS------DGSPYWLVKNSWGKDWG 302
Query: 347 ENGYYKICRG-RNVCGV 362
+GY+ + R N+CGV
Sbjct: 303 IDGYFMLARYVGNMCGV 319
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 176/324 (54%), Gaps = 31/324 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAE 115
+ FK + K Y + E R IF N + A+H +L + S G+ +++D+ E
Sbjct: 28 WQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLHHE 87
Query: 116 FRRTYLGLRRKLRLPKDADQAP------ILPTN-DLPADFDWREKGAVGPVKDQGSCGSC 168
F T G L A A I P + LP DWR KGAV VKDQG CGSC
Sbjct: 88 FHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSC 147
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS+TGALEG +F TG L+SLSEQ LVDC + ++GCNGGLM++AF Y
Sbjct: 148 WAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYG-------NNGCNGGLMDNAFRYIK 200
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVA 287
GG+ E+ YPY G D +C F+K I A+ F+ + DE ++A + GP++VA
Sbjct: 201 DNGGIDTEKSYPYEGIDD--SCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVA 258
Query: 288 INAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A + Q Y GV C + LDHGVL+VGYG+ K YW++KNSWG +
Sbjct: 259 IDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENG------KDYWLVKNSWGTT 312
Query: 345 WGENGYYKICRG-RNVCGVDSMVS 367
WG+ G+ K+ R N CG+ + S
Sbjct: 313 WGDKGFIKMARNDDNQCGIATASS 336
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 169/301 (56%), Gaps = 29/301 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
FS F+ + K+YA++EE R+ IFK NL H + S + + F DL+ EFRR
Sbjct: 117 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRK 176
Query: 120 YLGLRRKLRLPKD-----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YLG ++ L + +LP+ +LPA DWR +G V PVKDQ CGSCW+FSTT
Sbjct: 177 YLGFKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 235
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEGA+ TGKLVSLSEQ+L+DC + C+GG MN AF+Y L +GG+
Sbjct: 236 GALEGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMNDAFQYVLDSGGIC 288
Query: 235 REEDYPYTGTD---RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY D R +C+ K+ + V E + A L K+ P+++AI A
Sbjct: 289 SEDAYPYLARDEECRAQSCE----KVVKILGFKDVPRRSEAAMKAALAKS-PVSIAIEAD 343
Query: 292 YM--QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
M Q Y GV C LDHGVLLVGYG+ + +K +WI+KNSWG WG +G
Sbjct: 344 QMPFQFYHEGV-FDASCGTDLDHGVLLVGYGTD-----KESKKDFWIMKNSWGTGWGRDG 397
Query: 350 Y 350
Y
Sbjct: 398 Y 398
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 179/344 (52%), Gaps = 34/344 (9%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR V+D +++L ++G H F+ F K+ K Y S EE HRF IF NL
Sbjct: 30 IRLVSDLEEQVL--------QVIGQTRHAVSFARFASKYGKRYDSVEEIQHRFRIFSENL 81
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
K S G+ F+DL+ EFR LG + + L LPA+
Sbjct: 82 ELIKSTNKKRLSYKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHK--LTDAVLPAEK 139
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR++ V VKDQ CGSCW+FSTTGALE A A GK +SLSEQQLVDC +
Sbjct: 140 DWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFN---- 195
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
+ GCNGGL + AFEY GG+ E++YPYT D ACKF +A V + ++
Sbjct: 196 ---NFGCNGGLPSQAFEYIKYNGGIALEKEYPYTAKDE--ACKFTAENVAVRVLDSVNIT 250
Query: 269 LD-EDQIAANLVKNGPLAVAINAV-YMQTYIGGVSCPYICSRR---LDHGVLLVGYGSAG 323
L ED++ + P++VA V + Y GV C ++H VL VGYG
Sbjct: 251 LGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVE- 309
Query: 324 YAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYWIIKNSWG +WG++GY+K+ G+N+CGV + S
Sbjct: 310 ------NNVPYWIIKNSWGSTWGDHGYFKMELGKNMCGVATCAS 347
>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
Length = 354
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 130/366 (35%), Positives = 182/366 (49%), Gaps = 42/366 (11%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M + LF + + + V G+ L+ Q G D + A H+
Sbjct: 1 MARRNPFLFAIVVTILFVVCYGS------ALVAQTPLGVDNFI------------ASAHY 42
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSDLTPAEFRRT 119
FK++ K++ + HRF FK N++ A +P A + ++ +F+DLTP EF +
Sbjct: 43 GRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFLNTHNPHAHYDVSGKFADLTPQEFAKL 102
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPA--DFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL KD + + + L DWREKGAV PVK+QG CGSCW+FS G +
Sbjct: 103 YLNPDYYAHRGKDYKEHVHVDDSVLSGAMSVDWREKGAVTPVKNQGMCGSCWAFSAIGNI 162
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGLMR 235
E L LVSLSEQ LV CD D GCNGGLM+ A E+ ++ G +
Sbjct: 163 ESQWALKNHSLVSLSEQMLVSCD---------DIDDGCNGGLMDQAMEWIIQHHNGTVPT 213
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
E+ YPY DK + A ++ + + DE IAA + K GP+AVA++A Q
Sbjct: 214 EKSYPYASAGGTSPPCHDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQL 273
Query: 296 YIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GGV +C L+HGVL+VG+ + + PYWI+KNSWG SWGE GY ++
Sbjct: 274 YFGGVVT--LCFGLSLNHGVLVVGFN-------KRAKPPYWIVKNSWGTSWGEKGYIRLA 324
Query: 355 RGRNVC 360
G N C
Sbjct: 325 MGSNQC 330
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 172/327 (52%), Gaps = 31/327 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLT 112
+ + FK + K Y S E R IF N + A+ KL S I +++D+
Sbjct: 24 QEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADML 83
Query: 113 PAEFRRTYLGLRRKLRLP-----KDADQAP-ILPTN-DLPADFDWREKGAVGPVKDQGSC 165
EF T G R P +D A I P N P + DWRE GAV VKDQG C
Sbjct: 84 HHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGHC 143
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCWSFS TGALEG +F T KLVSLSEQ LVDC + + GCNGGLM++AF+
Sbjct: 144 GSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFG-------NDGCNGGLMDNAFK 196
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPL 284
Y G+ E YPY D C ++ A+ F + + DE+++ A + GP+
Sbjct: 197 YVKYNHGIDTEASYPYHADDE--KCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPV 254
Query: 285 AVAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+VAI+A + Q Y GV P S LDHGVL+VGYG+ + YWI+KNSW
Sbjct: 255 SVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDENG------QDYWIVKNSW 308
Query: 342 GESWGENGYYKICRGR-NVCGVDSMVS 367
GESWGE GY K+ R R N CG+ + S
Sbjct: 309 GESWGEQGYIKMARNRDNNCGIATQAS 335
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 168/311 (54%), Gaps = 31/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
F+ F K+++KAY S E RF FKAN+ H L + S T G+ +F+DL+ EF+
Sbjct: 42 FTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKG 100
Query: 119 TYLGLR---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y G + R+ + Q P DWR AV P+KDQG CGSCW+FS TG
Sbjct: 101 KYFGYKHVEREFARSNNLHQ----EVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATG 156
Query: 176 ALEGANFLATGK--LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
++EGA ++ GK L SLSEQQLVDC ++GCNGGLM+ AFEY + G+
Sbjct: 157 SIEGA-WVLQGKHTLTSLSEQQLVDCSTSYG-------NAGCNGGLMDYAFEYIIANKGI 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--V 291
E YPY G G C+ +K+ V S DE + + GP++VAI A
Sbjct: 209 CAESAYPYKGV--GGLCQKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQA 266
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV C LDHGVL VGYG+ G + YWI+KNSWG SWGE+GY
Sbjct: 267 GFQFYSSGVFSG-TCGHNLDHGVLAVGYGTTG-------SQDYWIVKNSWGTSWGESGYI 318
Query: 352 KICRGRNVCGV 362
++ R +N CG+
Sbjct: 319 RMIRNKNQCGI 329
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 177/334 (52%), Gaps = 41/334 (12%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K Y + E + R+ F+ NLR H +
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQG CGSCW+FS A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACK-------------FDKSKIAASVAN 263
GGLM+ AF++ + GG+ E+DYPY G D C F K+ ++ +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD--ERCDVNRVSFVFFAPLVFQKNAKVVTIDS 251
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGS 321
+ V+ + + V N P++VAI A Q Y G+ C LDHGV VGYG+
Sbjct: 252 YEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGT 310
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
K YWI++NSWG+SWGE+GY ++ R
Sbjct: 311 E-------NGKDYWIVRNSWGKSWGESGYVRMER 337
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 165/301 (54%), Gaps = 26/301 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIF----KANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
+K K+ K Y S E R TI+ + + AR ++ S G+ F+D+ EFR+
Sbjct: 30 YKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFRK 89
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
G RR P+++ + LPA DWR KGAV P+K+QG CGSCW+FSTTG+LE
Sbjct: 90 MMNGYRRGT--PRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLE 147
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G + L GKLVSLSEQ+LVDC + GC+GGLM+ AF Y K G+ E+
Sbjct: 148 GQHALKKGKLVSLSEQELVDC-------SAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQS 200
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPYTG D C F KS +AA+V F V S E + GP++VAI+A Q
Sbjct: 201 YPYTGED--GTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQL 258
Query: 296 YIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV CS LDHGVL+VGYG+ YW++KNSWG WG +GY ++
Sbjct: 259 YESGVYDVSDCSTTELDHGVLVVGYGTD-------DGTAYWLVKNSWGTDWGHHGYIQMS 311
Query: 355 R 355
R
Sbjct: 312 R 312
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 171/317 (53%), Gaps = 23/317 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
+ L+K K Y EE R ++K N++ H + H + F DLT
Sbjct: 28 QWELWKAVHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDLTSE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR+ G +R+ I + +P DWREKG V PVK+QG CGSCW+FSTT
Sbjct: 87 EFRQMMNGFQRQENKKGKVFHETIFAS--IPPSVDWREKGYVTPVKNQGKCGSCWAFSTT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG F TGKLVSLSEQ LVDC PE + GC+GGLM++AF+Y L GGL
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSQ---PE----GNRGCHGGLMDNAFQYVLDVGGLD 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--Y 292
EE YPYTG C ++ AA+ F + E+ + + GP++VA++A
Sbjct: 198 SEESYPYTGLVG--TCNYNPKNSAANETGFVDLPKQENALMKAVATLGPISVAVDASNPS 255
Query: 293 MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y G+ C S +DHGVL+VGYG G + YW++KNSWG+ WG NGY
Sbjct: 256 FQFYKSGIYYEPKCKSESVDHGVLVVGYGFEG---ADSDDNKYWLVKNSWGKHWGINGYI 312
Query: 352 KICRGRNV-CGVDSMVS 367
K+ + +N CG+ +M S
Sbjct: 313 KMAKDQNNHCGIATMAS 329
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 169/301 (56%), Gaps = 29/301 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
FS F+ + K+YA++EE R+ IFK NL H + S + + F DL+ EFRR
Sbjct: 116 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRK 175
Query: 120 YLGLRRKLRLPKD-----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
YLG ++ L + +LP+ +LPA DWR +G V PVKDQ CGSCW+FSTT
Sbjct: 176 YLGFKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 234
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEGA+ TGKLVSLSEQ+L+DC + C+GG MN AF+Y L +GG+
Sbjct: 235 GALEGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMNDAFQYVLDSGGIC 287
Query: 235 REEDYPYTGTD---RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY D R +C+ K+ + V E + A L K+ P+++AI A
Sbjct: 288 SEDAYPYLARDEECRAQSCE----KVVKILGFKDVPRRSEAAMKAALAKS-PVSIAIEAD 342
Query: 292 YM--QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
M Q Y GV C LDHGVLLVGYG+ + +K +WI+KNSWG WG +G
Sbjct: 343 QMPFQFYHEGV-FDASCGTDLDHGVLLVGYGTD-----KESKKDFWIMKNSWGTGWGRDG 396
Query: 350 Y 350
Y
Sbjct: 397 Y 397
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 172/321 (53%), Gaps = 23/321 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H+ L+K +K Y ++E R +++ NL++ H H G+ F D+T
Sbjct: 26 DEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EFR+ G KL+ K + P N + P DWRE+G V PVKDQG CGSCW+F
Sbjct: 86 NEEFRQVMNGY--KLQQRKFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQCGSCWAF 143
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
STTGA+EG F T KLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 144 STTGAMEGQMFRKTQKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDNS 196
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA 290
GL EE YPY GTD C + AA+ F + S E + + GP++VAI+A
Sbjct: 197 GLDSEEAYPYLGTDD-QPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDA 255
Query: 291 VY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+ Q Y G+ C S LDHGVL VGYG G + K YWI+KNSW E WG+
Sbjct: 256 GHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGE---DVDGKKYWIVKNSWSEKWGD 312
Query: 348 NGYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 313 KGYILMAKDRKNHCGIATAAS 333
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 174/316 (55%), Gaps = 25/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + K Y + EE RF +FK NL+ K+ + G+ +F+DL+ EF+
Sbjct: 47 FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNK 106
Query: 120 YLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YLGL+ L +++ D LP DWR+KGAV PVK+QG CGSCW+FST A+
Sbjct: 107 YLGLKVDLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAV 166
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF + + GGL +EE
Sbjct: 167 EGINQIVTGNLTSLSEQELIDCDT--------TYNNGCNGGLMDYAFSFIGQNGGLHKEE 218
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
DYPY + K +++++ ++ + V + +Q + N PL+VAI A Q
Sbjct: 219 DYPYIMEESTCEMKKEETQV-VTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GGV + C LDHGV VGYG++ K Y I+KNSWG WGE G+ ++ R
Sbjct: 278 YSGGVFDGH-CGSDLDHGVSAVGYGTS-------KNLDYIIVKNSWGAKWGEKGFIRMKR 329
Query: 356 G----RNVCGVDSMVS 367
+CG+ M S
Sbjct: 330 DIGKPEGICGLYKMAS 345
>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 170/315 (53%), Gaps = 24/315 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATHGITQFSDLTPAEFRR 118
F ++ +F+K Y + EE R F N H Q+ D + T G+ +DLT +EF+
Sbjct: 42 FRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSEFQS 101
Query: 119 TYLGLRRKLRLPKDAD-QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL + + D IL LP +FDWRE G V PVK+QG CGSCW+FSTTG L
Sbjct: 102 RYLMVSQDCSATSTRDLDIDILS---LPENFDWREHGGVSPVKNQGHCGSCWTFSTTGCL 158
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A+ + K +LSEQQLVDC + D + GCNGGL + AFEY GGL E+
Sbjct: 159 ESAHLIHHKKAYNLSEQQLVDCAQDFD-------NHGCNGGLPSHAFEYIHYVGGLEEEQ 211
Query: 238 DYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAVY-MQT 295
DY Y + C+FD +K A +V F++ DEDQ+ L P++VA V +
Sbjct: 212 DYSYHAEEG--LCEFDPTKTAGTVREVFNITETDEDQLTIALAYFNPVSVAFEVVDGFRF 269
Query: 296 YIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Y GV C ++H VL VGYG + E PY+I+KNSWG WG+ G++K
Sbjct: 270 YKEGVYQSDTCKSGPEDVNHAVLAVGYGMC-----KKCETPYFIVKNSWGAEWGDEGFFK 324
Query: 353 ICRGRNVCGVDSMVS 367
I RG N+CG+ + S
Sbjct: 325 IKRGENMCGIATCAS 339
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 147/377 (38%), Positives = 193/377 (51%), Gaps = 36/377 (9%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVT---DGGDEILSHHESTNNDLLGAE 57
M ++VL LV+ + +A++ D + IRQV + + IL T + L
Sbjct: 1 MSRLSLVLILVAGLFATALAGPATFADKNP-IRQVVFPDELENGILQVVGQTRSAL---- 55
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
F+ F + K Y S EE RF IF NL+ H + S GI +F+DLT EFR
Sbjct: 56 -SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFR 114
Query: 118 RTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
+ LG + + L LP DWR+ G V PVK QG CGSCW+FSTTGAL
Sbjct: 115 KHKLGASQNCSATTKGNLK--LTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
E A A GK +SLSEQQLVDC + + GCNGGL + AFEY GGL EE
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225
Query: 238 DYPYTGTDRGHACKFDKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-M 293
YPYTG + CKF ++ I V N ++ + E + A LV+ P++VA V
Sbjct: 226 AYPYTG--KNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVR--PVSVAFEVVKGF 281
Query: 294 QTYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y GV C ++H VL VGYG PYW+IKNSWG WGE+GY
Sbjct: 282 KQYKSGVYASTECGDTPMDVNHAVLAVGYGVE-------NGTPYWLIKNSWGADWGEDGY 334
Query: 351 YKICRGRNVCGVDSMVS 367
+K+ G+N+CGV + S
Sbjct: 335 FKMEMGKNMCGVATCAS 351
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 167/313 (53%), Gaps = 23/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y++ EE R +++ N++ H HG T F D+T EFR+
Sbjct: 32 WKATHRRLYSTNEEGWRR-AVWEKNMKMIELHNGEYSRGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
+ R + + P+L DLP DWR+KG V PVK+Q CGSCW+FS TGALE
Sbjct: 91 VMVCFRNQKHKNGKVFRGPLLL--DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TGKLVSLSEQ LVDC P+ + GCNGG MN AF Y + GGL E
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGFMNYAFRYVKENGGLDSEAS 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY D CK+ A+ F V+ E ++ + GP++VA++A + Q Y
Sbjct: 202 YPYEAKD--GICKYKPENSVANDTGFVVIPTHEKELMKAVATVGPISVAVDASHSSFQFY 259
Query: 297 IGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ C S+ LDHGVL+VGYG G K+ YW+IKNSWG WG NGY KI +
Sbjct: 260 KSGIYFEKKCSSKNLDHGVLVVGYGFEG---ANSKDNKYWLIKNSWGPEWGLNGYIKIAK 316
Query: 356 GRNV-CGVDSMVS 367
+N CG+ + S
Sbjct: 317 DQNNHCGIATAAS 329
>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
Length = 377
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 130/333 (39%), Positives = 179/333 (53%), Gaps = 22/333 (6%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI-TQFS 109
D+ F F KF K Y + EE HR T+F N + H G+ QF+
Sbjct: 56 TDVEAVHEAFMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFA 115
Query: 110 DLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
D T EF +Y L + + P A + P DWR +G V +K+QGSCGSCW
Sbjct: 116 DWTAEEFA-SYQKLHSRPK-PSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCW 173
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FST ++EGA TGKLV+LSEQ LVDC + + C GC+GGLM++AF+Y +K
Sbjct: 174 TFSTVVSIEGAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIK 233
Query: 230 A--GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAV 286
GG+ E Y YTG D C FDK+ + A+++N++ V++ DE +A L GP+++
Sbjct: 234 NQDGGIDTEASYGYTGKDG--TCAFDKANVGATISNWTDVAVGDEVALADALANAGPVSI 291
Query: 287 AINAV-YMQTYIGGVSCPYI---CSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
A++A Q Y GG+ P CS DHGV +VGYG+ YW I+N
Sbjct: 292 ALDASKQWQLYSGGILKPRSILGCSSDPTHADHGVAIVGYGTD-------DGVDYWWIRN 344
Query: 340 SWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
SWG +WGE+GY ++ RG N CGV + S AA
Sbjct: 345 SWGTTWGESGYMRLERGVNACGVANFASYPIAA 377
>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
Length = 293
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 175/311 (56%), Gaps = 35/311 (11%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
+ ++NK Y E+ HR +F ++R S T G+ QF+DLT EF YLG
Sbjct: 9 LEGEYNKTYGGAEDK-HRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLG 67
Query: 123 LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L + ++ A ++ +L D + DWR+KGAV PVKDQ SCGSCW+FS TGA+EGA
Sbjct: 68 LVLENKV--QASESVVLQDGDSEENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALV 125
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+TGKL++LSEQQLVDC +C+ GCNGGLM +AF+Y L G E+DYPY
Sbjct: 126 KSTGKLINLSEQQLVDCVTKCN---------GCNGGLMTAAFDYVL-GRGRATEKDYPYK 175
Query: 243 GTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV-YMQTYIGGVS 301
G D CK ++ + ++ V + + V + PL+VA+NA +Q Y GV
Sbjct: 176 GVD--GRCK--QTATDNKIKGYNNVPQNNYKALKAAVAS-PLSVAVNAAGTIQRYKSGV- 229
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN--- 358
C RLDHGVL VGY + + YWI+KNSWG +GENGY+++ G
Sbjct: 230 IDANCGTRLDHGVLAVGY----------QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGG 279
Query: 359 --VCGVDSMVS 367
VCG++ M +
Sbjct: 280 AGVCGINMMAA 290
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 172/320 (53%), Gaps = 25/320 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ +K K Y +EE R I++ NLR+ H H G+ F D+
Sbjct: 28 HWEQWKTWHGKNYHEKEE-GWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + K + + + N ++P+ DWREKG V PVKDQG CGSCW+FS
Sbjct: 87 EFRQVMNGYKHKTE--RKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGA+EG F GKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y G
Sbjct: 145 TTGAMEGQMFRKQGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNNG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L EE YPY GTD C +D AA+ F + S E + + GP++VAI+A
Sbjct: 198 LDSEEAYPYLGTDD-QPCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAIDAG 256
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW ESWG+
Sbjct: 257 HESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSESWGDK 313
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 314 GYIYMAKDRKNHCGIATAAS 333
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 173/325 (53%), Gaps = 26/325 (8%)
Query: 52 DLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
DL + LF+ +F + Y S EE RF IFK NL K + G+ +F
Sbjct: 36 DLTSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEF 95
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
+DL+ EF+ YLGL+ L + +P DWR+KGAV PVK+QGSCGSC
Sbjct: 96 ADLSHEEFKNKYLGLKPDLSKRAQCPEEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSC 155
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FST A+EG N + TG L SLSEQ+L+DCD + ++GCNGGLM+ AF Y +
Sbjct: 156 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCDT--------TYNNGCNGGLMDYAFAYIV 207
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
GGL +EEDYPY + G + A +++ + V + ++ + N PL++AI
Sbjct: 208 ANGGLHKEEDYPYI-MEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAI 266
Query: 289 NAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A Q Y GGV + C LDHGV VGYG++ K Y I+KNSWG WG
Sbjct: 267 EASGRDFQFYSGGVFDGH-CGTELDHGVAAVGYGTS-------KGLDYIIVKNSWGPKWG 318
Query: 347 ENGYYKICRG----RNVCGVDSMVS 367
E GY ++ R +CG+ M S
Sbjct: 319 EKGYIRMKRKTSKPEGICGIYKMAS 343
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 174/318 (54%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF+ + K+ K Y+S+E + HR +F N R+ H + + + G+ QFSD++ AE +
Sbjct: 32 HFTSWMKQHQKTYSSRE-YSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEIKH 90
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFSTT 174
YL P++ + T P+ DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 91 KYL-----WSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +A+GK+++L+EQQLVDC + + GC GGL + AFEY L G+M
Sbjct: 146 GALESAVAIASGKMMTLAEQQLVDCAQNFN-------NHGCQGGLPSQAFEYILYNKGIM 198
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G + CKF+ K A V N ++L DE + + P++ A
Sbjct: 199 GEDSYPYIG--KNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GV C + +++H VL VGYG YWI+KNSWG +WG NG
Sbjct: 257 FMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQ-------NGLLYWIVKNSWGSNWGNNG 309
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 310 YFLIERGKNMCGLAACAS 327
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 177/321 (55%), Gaps = 32/321 (9%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA--THGITQFSDLTPAE 115
F +K K+NK Y ++E R I+++N + H T + +F+DL E
Sbjct: 22 QEFQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGE 81
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
F R + GL L P + I + +P DW+EKGAV P+K+QG CGSCWSFS+
Sbjct: 82 FGRIFNGL---LPRPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSS 138
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TG+LEG +F+ TG LVSLSEQQL+DC + + GCNGGLM+++F Y G
Sbjct: 139 TGSLEGQHFINTGTLVSLSEQQLMDCSTKYG-------NHGCNGGLMDNSFRYLKSVAGD 191
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL---DEDQIAANLVKNGPLAVAINA 290
E++YPYT + C++D S A V + S V + DED + + GP++VAI+A
Sbjct: 192 ETEDNYPYTAEN--GVCRYDSS--LAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDA 247
Query: 291 VY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+ Q Y GV CS +LDHGVL +GYG+ K YW++KNSWG SWG
Sbjct: 248 SHSSFQLYNSGVYYASTCSSTQLDHGVLAIGYGTE-------DGKDYWLVKNSWGTSWGM 300
Query: 348 NGYYKICRGR-NVCGVDSMVS 367
GY K+ R R N CG+ + S
Sbjct: 301 EGYIKMSRNRNNNCGIATQAS 321
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 168/290 (57%), Gaps = 27/290 (9%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLR----RKLRL 129
+ D RF IFK NLR H + + +AT+ G+T F++LT E+R YLG R R++
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83
Query: 130 PKDADQ--APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
K+ + + + +++P DWR+KGAV +KDQG+CGSCW+FST A+EG N + TG+
Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQ+LVDCD S + GCNGGLM+ AF++ +K GGL E+DYPY GT+ G
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTN-G 194
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYI 305
K+ ++ + V ++ V P++VAI+A Q Y G+
Sbjct: 195 KCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGK- 253
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C +DH V+ VGYGS YWI++NSWG WGE+GY ++ R
Sbjct: 254 CGTNMDHAVVAVGYGSENGV-------DYWIVRNSWGTRWGEDGYIRMER 296
>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
Length = 355
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 175/319 (54%), Gaps = 33/319 (10%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + + K Y+S+E H HR +F +NLR H + + G+ QFSD++ AE +R
Sbjct: 54 HFQSWMVQHQKKYSSEEYH-HRLQVFASNLREINAHNARNHTFKMGLNQFSDMSFAELKR 112
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWREKG V PVK+QGSCGSCW+FSTT
Sbjct: 113 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWREKGNFVTPVKNQGSCGSCWTFSTT 167
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGKL L+EQQLVDC + + GC GGL + AFEY G+M
Sbjct: 168 GALESAVAIATGKLPFLAEQQLVDCAQNFN-------NHGCQGGLPSQAFEYIRYNKGIM 220
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVA--INAV 291
E+ YPY G D CK+ SK A V + + ++L DE+ + + P++ A + A
Sbjct: 221 GEDTYPYRGEDGD--CKYQPSKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTAD 278
Query: 292 YMQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+M Y G+ C + +++H VL VGYG K PYWI+KNSWG WG
Sbjct: 279 FM-MYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE-------KGIPYWIVKNSWGPHWGMK 330
Query: 349 GYYKICRGRNVCGVDSMVS 367
GY+ I RG+N+CG+ + S
Sbjct: 331 GYFLIERGKNMCGLAACAS 349
>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 167/312 (53%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSE LV C + +SGC GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEHHLVSCHDK---------NSGCTGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSS-GYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC I L+HGVLLVGY G E PYW+IKNSWGE+WGEN
Sbjct: 268 SSFMSYQSGVLTSCAGI---SLNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 177/316 (56%), Gaps = 26/316 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K Y S EE HRF IFK NL K + G+ +F+DL+ EF+
Sbjct: 33 FESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNK 92
Query: 120 YLGLRRKLRLPKD-ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YLGL L ++ +++ + +P DWR+KGAV VK+QGSCGSCW+FST A+E
Sbjct: 93 YLGLNVDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVE 152
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G N + TG L SLSEQ+LVDCD + ++GCNGGLM+ AF Y + GGL +EED
Sbjct: 153 GINQIVTGNLTSLSEQELVDCDT--------TYNNGCNGGLMDYAFAYIISNGGLHKEED 204
Query: 239 YPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQT 295
YPY + G C+ K++ +++ + V + ++ + N PL+VAI+A Q
Sbjct: 205 YPYI-MEEG-TCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQF 262
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GGV + C LDHGV VGYGSA K + ++KNSWG WGE G+ ++ R
Sbjct: 263 YSGGVFDGH-CGTELDHGVAAVGYGSA-------KGLDFIVVKNSWGSKWGEKGFIRMKR 314
Query: 356 GR----NVCGVDSMVS 367
+CG++ M S
Sbjct: 315 NTGKPAGLCGINKMAS 330
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 176/332 (53%), Gaps = 34/332 (10%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
D+ A ++ F K+ NK Y + ++ D F FK NL + A +GI +FSD+
Sbjct: 25 DIDSASVYYENFIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSNQAVYGINKFSDI 84
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL---------PTNDLPADFDWREKGAVGPVKDQ 162
F + GL L D++ P P+ P FDWR+ V VK+Q
Sbjct: 85 DKITFVNEHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQ 144
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
G CGSCW+F+ G +E + L+ LSEQQL+DCD D GC+GGLM+
Sbjct: 145 GVCGSCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDR---------VDQGCDGGLMHL 195
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKN 281
AF+ ++ GG+ E DYPY G + +AC+ SK+A +++ L DE ++ L KN
Sbjct: 196 AFQEIIRIGGVEHEIDYPYQGIE--YACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKN 253
Query: 282 GPLAVAINAVYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNS 340
GP+AVAI+ V + Y G++ +C+ L+H VLLVGYG + PYWI KNS
Sbjct: 254 GPIAVAIDCVDIIDYRSGIAT--VCNDNGLNHAVLLVGYGIE-------NDTPYWIFKNS 304
Query: 341 WGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
WG +WGENGY++ R N CG M++ AA+
Sbjct: 305 WGSNWGENGYFRARRNINACG---MLNEFAAS 333
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/350 (40%), Positives = 184/350 (52%), Gaps = 27/350 (7%)
Query: 29 DQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
+Q+I + E L + +L G H+ L+K K Y +EE R +++ NL
Sbjct: 106 NQVIPVTKENSTETLHCRWQVDPELDG---HWQLWKSWHRKDYHEREE-GWRRVVWEKNL 161
Query: 89 RRAARHQKLDPSATH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDL 144
+ H H G+ QF D+T EFR+ G K + + + L N L
Sbjct: 162 KMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGYVHK-KSERKYRGSQFLEPNFL 220
Query: 145 --PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHE 202
P DWREKG V PVKDQG CGSCW+FSTTGALEG +F TGKLVSLSEQ LVDC
Sbjct: 221 EAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSR- 279
Query: 203 CDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
PE + GCNGGLM+ AF+Y GG+ EE YPYT D C++ AA+
Sbjct: 280 --PE----GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKD-DEDCRYKAEYNAANDT 332
Query: 263 NF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSCPYICSRR-LDHGVLLVG 318
F + E + + GP++VAI+A + Q Y G+ CS LDHGVL+VG
Sbjct: 333 GFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVG 392
Query: 319 YGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
YG G + K YWI+KNSWGE WG+ GY + + R N CG+ + S
Sbjct: 393 YGFEGED---VDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAAS 439
>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
Length = 443
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 166/311 (53%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK G+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E A LVSLSEQQLV CD + D+GCNGGLM AFE L+ G +
Sbjct: 158 NIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEXLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+A++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYQSGVLTSCA---GDALNHGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVVMGXNAC 329
>gi|260819200|ref|XP_002604925.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
gi|229290254|gb|EEN60935.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
Length = 520
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 181/348 (52%), Gaps = 43/348 (12%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
H S +K + N+ Y + +E RF F+ NL + + QF+D++ EFR
Sbjct: 173 HFASQWKHEHNRRYKTADEEKARFATFQDNLLKIEKLNAEYSGTEFATNQFADMSEEEFR 232
Query: 118 RTYLGLRRKLRLPKDA----DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
L R D + NDLP ++W + GAV P+KDQGS GSCW+FST
Sbjct: 233 SKILMRPRPPPQHPRERYLRDYGEV---NDLPEAYNWVDHGAVTPIKDQGSAGSCWAFST 289
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
LEG FL L +LS +Q+VDCD DP+ G+ D G GG AF+Y + GG+
Sbjct: 290 IENLEGQWFLTKHPLTNLSVEQVVDCDDNTDPKT-GNADCGVFGGWPYLAFQYIKRVGGI 348
Query: 234 MREEDYPYT---GTDRG-------------------------HACKF--DKSKI--AASV 261
+EEDYPY G ++G +C F DKSK V
Sbjct: 349 EKEEDYPYCSGLGGEKGTCFPCPAPAYNTSMCGPAVSYCNETESCGFRLDKSKFIPGLQV 408
Query: 262 ANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYIC-SRRLDHGVLLVGYG 320
+++ + +E IA L+K GPL+VA+NAV +Q Y GV P+ C + LDH VLL G+G
Sbjct: 409 TDWAAIDTNETTIAVQLMKIGPLSVALNAVLLQFYHRGVFEPHFCDPKSLDHAVLLTGWG 468
Query: 321 SAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
I ++KPYWI+KNSWG+ WG +GY+ I RG CG+++ V+T
Sbjct: 469 VE--KTIFGEKKPYWIVKNSWGKKWGMDGYFYIKRGVGQCGINTQVAT 514
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 66/137 (48%), Gaps = 33/137 (24%)
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT---GTDRG------------------ 247
G+ D G GG AF+Y + GG+ +EEDYPY G ++G
Sbjct: 20 GNADCGVFGGWPYLAFQYIKRVGGIEKEEDYPYCSGLGGEKGTCFPCPAPAYNASMCGPA 79
Query: 248 -------HACKF--DKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
+C F DKSK V +++ + +E IA L+K GPL+VA+NAV +Q Y
Sbjct: 80 VSYCNETESCGFRLDKSKFIPGLQVTDWAAIDTNETTIAVQLMKIGPLSVALNAVLLQFY 139
Query: 297 IGGVSCPYIC-SRRLDH 312
GV P+ C + LDH
Sbjct: 140 HRGVFEPHFCDPKSLDH 156
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 183/314 (58%), Gaps = 24/314 (7%)
Query: 46 HESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHG 104
+ TN++++ F + ++ K+Y + E + RF IFK NLR H ++ S G
Sbjct: 37 EQRTNDEVIAM---FESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVG 93
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGS 164
+ QFSDLT AE+ YLG + +R+ +D+ + LP DWR+KGAV VK+QG+
Sbjct: 94 LNQFSDLTDAEYSSIYLGTKFNIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGN 153
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+F++ A+EG N + TG L+SLSEQ++VDC + ++GCNGG ++ A+
Sbjct: 154 CGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYP-------NNGCNGGTLSGAY 206
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
++ + GG+ E +YPYTG D G + K+K ++ + V + ++ V P+
Sbjct: 207 QFIINNGGINTEANYPYTGRD-GVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPV 265
Query: 285 AVAI--NAVYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+V I N+ ++Y G+ + P C R+DHGV +VGYG+ G K YWI++NSW
Sbjct: 266 SVVIASNSTAFKSYKSGIFNGP--CGPRIDHGVTIVGYGTEG-------GKDYWIVRNSW 316
Query: 342 GESWGENGYYKICR 355
G +WGE+GY ++ R
Sbjct: 317 GPNWGESGYVRMQR 330
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 168/314 (53%), Gaps = 26/314 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA--THGITQFSDLTPAEFRRTY 120
+KK+ K Y S E R I++AN + H T G+ QF+DL +EF R Y
Sbjct: 25 WKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFGRLY 84
Query: 121 LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
G K + K + DLP DWR KG V +K+QG CGSCW+FS LEG
Sbjct: 85 NGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAVAGLEGQ 144
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
+F ATG LVSLSEQ LVDC + GCNGGLM++AF+Y +K GG+ E YP
Sbjct: 145 HFNATGTLVSLSEQNLVDCS-------TAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYP 197
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVV--SLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
Y D+ CKF+ + + ++ + FS + E + + GP++VAI+A + Q Y
Sbjct: 198 YKAVDQ--KCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLY 255
Query: 297 IGGVSCPYICSR-RLDHGVLLVGY-GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
GV CS+ LDHGV VGY S+G A YWI+KNSWG +WG+ GY +
Sbjct: 256 KSGVYSESACSQTSLDHGVTAVGYDSSSGVA--------YWIVKNSWGTTWGQAGYIWMS 307
Query: 355 RGR-NVCGVDSMVS 367
R + N CG+ + S
Sbjct: 308 RNKNNQCGIATAAS 321
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/315 (40%), Positives = 172/315 (54%), Gaps = 26/315 (8%)
Query: 46 HESTNNDLLGAEHHFS----LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
H++ N E HF F+ + K+YA++EE R+ IFK NL H + S
Sbjct: 101 HKTPVNIWEWKEEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSY 160
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLRLPKD----ADQAPILPTNDLPADFDWREKGAVG 157
+ + F DL+ EFRR YLG + L + A + + +D+P+ DWREKG V
Sbjct: 161 SLKMNHFGDLSREEFRRKYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVT 220
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
PVKDQ CGSCW+FS TGALEGA+ TG+L+SLSEQ+LVDC + GC+G
Sbjct: 221 PVKDQRDCGSCWAFSATGALEGAHCAKTGELLSLSEQELVDCS-------LAEGNQGCSG 273
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAAN 277
G MN AF+Y + +GGL EE YPY D CK K+ +++ F V +
Sbjct: 274 GEMNDAFQYVVDSGGLCSEEGYPYLARD--GECKRACKKV-VTISGFKDVPRKSETAMKA 330
Query: 278 LVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYW 335
+ + P+++AI A + Q Y GV C LDHGVLLVGYG+ + +K +W
Sbjct: 331 ALAHSPVSIAIEADQLPFQFYHEGV-FDASCGTDLDHGVLLVGYGTD-----KETKKDFW 384
Query: 336 IIKNSWGESWGENGY 350
I+KNSWG WG +GY
Sbjct: 385 IMKNSWGSGWGRDGY 399
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/330 (36%), Positives = 183/330 (55%), Gaps = 30/330 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRR--AARHQKLD-PSATHGITQF 108
+L A +F F + +NK Y S E + R++IFK NL A D P+AT+ I +F
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGS 167
SDL+ +E + GL R+ + P + P FDWRE+ V +K+QG+CG+
Sbjct: 108 SDLSKSELIAKFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGA 167
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+F+T ++E + +L+ LSEQQL+DCD S D GCNGGL+++AFE
Sbjct: 168 CWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEEI 218
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSK--IAASVANFSVVSLDEDQIAANLVKNGPLA 285
++ GG+ E DYP+ G +R C D+ + + + V + V ++E+++ L GP+
Sbjct: 219 MRMGGVQTELDYPFVGRNR--RCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIP 276
Query: 286 VAINAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
+AI+A + Y GV SC + L+H VLLVGYG PYW+ KN+WG+
Sbjct: 277 MAIDAADIVNYYRGVISSCE---NNGLNHAVLLVGYGVENGV-------PYWVFKNTWGD 326
Query: 344 SWGENGYYKICRGRNVCG-VDSMVSTVAAA 372
WGENGY+++ + N CG V+ + ST A
Sbjct: 327 DWGENGYFRVRQNVNACGMVNDLASTAVLA 356
>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
Length = 467
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 166/310 (53%), Gaps = 23/310 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK++ K Y S E R +FK NL A H +P A+ +T FSDLT EFR
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFAVTPFSDLTREEFRS 96
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLP---ADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
Y + + P+ ++ A DWR +GAV +KDQG+C SCW+FST G
Sbjct: 97 RYHNAAAHFAAAQKRVRVPVEVEVEVGGPPAAVDWRARGAVTAIKDQGNCSSCWAFSTIG 156
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA--GGL 233
+EG LA L LSEQ LV CD+ D+GC+GGLM+SAF++ ++ G +
Sbjct: 157 NIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVEQNNGSV 207
Query: 234 MREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
E Y Y +G C + A ++ + DED++AA L NGPLA+A++A
Sbjct: 208 YTEASYSYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATS 267
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+Y GGV + S +LDHGV+LVGY + PYWIIKNSWG WGE GY +
Sbjct: 268 FMSYTGGVLTNCV-SDQLDHGVVLVGYNDS-------SNPPYWIIKNSWGADWGEEGYIR 319
Query: 353 ICRGRNVCGV 362
I +G N C V
Sbjct: 320 IQKGTNQCLV 329
>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
Length = 333
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 173/318 (54%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K+ K Y+S E++HR +F N R+ H + + + + QFSD++ AE +
Sbjct: 32 HFKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKH 90
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFSTT 174
+L P++ + T P+ DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 91 KFLWSE-----PQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTT 145
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +A+GK++SL+EQQLVDC + + GC GGL + AFEY L G+M
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFN-------NHGCKGGLPSQAFEYILYNKGIM 198
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D +C+F+ K A V N ++L DE + + P++ A
Sbjct: 199 EEDSYPYIGKDS--SCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GV C + +++H VL VGYG YWI+KNSWG WGENG
Sbjct: 257 FLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQN-------GLLYWIVKNSWGSQWGENG 309
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 310 YFLIERGKNMCGLAACAS 327
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 161/294 (54%), Gaps = 27/294 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLRRK 126
+ YA E ++R+ +FK N+ R R + T + QF+DLT EFR Y G +
Sbjct: 41 RVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN 100
Query: 127 LRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L + ++ LP DWR+KGAV P+KDQG CGSCW+FS A+EG
Sbjct: 101 SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQ 160
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ GKL+SLSEQ+LVDCD + D GC GGLM++AF YT+ GGL E +YPY
Sbjct: 161 IKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSESNYPYK 211
Query: 243 GTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGG 299
T+ C F+K+K IA S+ F V ++++ V + P+++ I + Q Y G
Sbjct: 212 STN--GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSG 269
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
V C+ LDHGV VGYG + LK YWI+KNSWG WGE GY +I
Sbjct: 270 VFSGE-CTTHLDHGVTAVGYGRSKNG---LK---YWILKNSWGPKWGERGYMRI 316
>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
Length = 376
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/335 (37%), Positives = 171/335 (51%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ GA+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S Q+L+DC G C GC+GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVQELLDC---------GRCGDGCHGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
+Q Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 170/316 (53%), Gaps = 23/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAE 115
+S +K K Y EE R +++ NL+ +H + H T F DLT E
Sbjct: 29 WSQWKAAHGKLYDENEE-GWRRAVWEKNLKVIKQHNQEYSQGKHSFTMAMNAFGDLTNEE 87
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
F++ GL+ + R + QAP P + P+ DWR+KG V PVK+QG CGSCW+FS TG
Sbjct: 88 FKQVMNGLKSQKRKEGNVFQAP--PFAETPSSVDWRKKGYVTPVKNQGPCGSCWAFSATG 145
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALEG F T +LVSLSEQ LVDC + GC+GGLM+ AF+Y GGL
Sbjct: 146 ALEGQMFRKTKRLVSLSEQNLVDCSQ-------AEGNEGCSGGLMDYAFQYVKDNGGLDS 198
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
EE YPY D +CK+ + AA+ F + +E+ + + GP++ AI+A
Sbjct: 199 EESYPYRAQDE--SCKYKPEQSAANDTGFMDIHPEEESLKLAVATVGPISAAIDASLSTF 256
Query: 294 QTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ P S LDHG+L+VGYGS G + K YWI+KNSWG WG GY
Sbjct: 257 QFYHKGIYYDPDCSSENLDHGILVVGYGSQGEDSEKQK---YWIVKNSWGTDWGTQGYIL 313
Query: 353 ICRGR-NVCGVDSMVS 367
+ + R N CG+ + S
Sbjct: 314 MAKDRDNHCGIATAAS 329
>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
Length = 330
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 168/316 (53%), Gaps = 33/316 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E+HF + ++NK Y EE+ HR IF + RR H + + G+ QFSD++ AEF
Sbjct: 27 EYHFKQWMLQYNKVY-DLEEYYHRLDIFTRHKRRIDYHNAGKHTFSMGLNQFSDMSFAEF 85
Query: 117 RRTYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFS 172
R+T+L L P++ I P DWREKG V PVK QG CGSCW+FS
Sbjct: 86 RKTFL-----LTEPQNCSATKGSHISSHGPYPGSVDWREKGNYVSPVKYQGHCGSCWTFS 140
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG LE +ATGKL LSEQQLVDC + + + GC GGL + AFEY G
Sbjct: 141 TTGCLESVTAIATGKLPLLSEQQLVDCAQDFN-------NHGCMGGLPSQAFEYVKYNNG 193
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
LM E+DYPYTG D +C F AA V + ++ S DE + + + P++
Sbjct: 194 LMTEDDYPYTGHDG--SCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVT 251
Query: 292 --YMQTYIGGVSCPYICSRRLD---HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
++ Y GV C D H VL VGYG PYWI+KNSWG +WG
Sbjct: 252 DDFLH-YKDGVYSSTTCKNTTDNVNHAVLAVGYGEK-------NSTPYWIVKNSWGTNWG 303
Query: 347 ENGYYKICRGRNVCGV 362
+GY+ I RGRN+CG+
Sbjct: 304 MDGYFLIERGRNMCGL 319
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 161/294 (54%), Gaps = 27/294 (9%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRRTYLGLRRK 126
+ YA E ++R+ +FK N+ R R + T + QF+DLT EFR Y G +
Sbjct: 47 RVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN 106
Query: 127 LRLPKDADQAPI----LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
L + ++ LP DWR+KGAV P+KDQG CGSCW+FS A+EG
Sbjct: 107 SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQ 166
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
+ GKL+SLSEQ+LVDCD + D GC GGLM++AF YT+ GGL E +YPY
Sbjct: 167 IKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSESNYPYK 217
Query: 243 GTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGG 299
T+ C F+K+K IA S+ F V ++++ V + P+++ I + Q Y G
Sbjct: 218 STN--GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSG 275
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
V C+ LDHGV VGYG + LK YWI+KNSWG WGE GY +I
Sbjct: 276 VFSGE-CTTHLDHGVTAVGYGRSKNG---LK---YWILKNSWGPKWGERGYMRI 322
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 171/315 (54%), Gaps = 27/315 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRR 118
+K K+YAS EE + I++ NLR +H H +T+F+DL EF
Sbjct: 26 WKNTHGKSYASHEELKRQL-IWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDEFAA 84
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
YL RK P+ + P DWR +G V PVK+Q CGSCW+FSTTG+LE
Sbjct: 85 MYLPRMRKDSRNGFCSAQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLE 144
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G +F T LVSLSEQQL+DC + D GC GG+M+ AF+Y AGG+ E D
Sbjct: 145 GQHFAKTKNLVSLSEQQLMDCSFK-------EGDEGCGGGIMDYAFDYIFLAGGVESEAD 197
Query: 239 YPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAVYM--QT 295
YPY R C+FD S IAA++ V S E Q+ + GP++VAI+A ++ Q
Sbjct: 198 YPYEA--RNDHCRFDNSSIAATLTGCVDVTSGSETQLEKAVGSIGPVSVAIDASHISFQL 255
Query: 296 YIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE-NGYYKI 353
Y GV+ +CS LDHGVL VGYG+ YWI+KNSWGE WG NGY K+
Sbjct: 256 YGSGVNYEPMCSTTTLDHGVLAVGYGAD-------NGNEYWIVKNSWGEGWGHLNGYIKM 308
Query: 354 CRGR-NVCGVDSMVS 367
+ R N CG+ + S
Sbjct: 309 SKNRNNNCGIATQAS 323
>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
Length = 327
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 167/315 (53%), Gaps = 24/315 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGIT-QFSD 110
D A H+ FK++ K++ + HRF FK N++ A +P A + ++ +F+D
Sbjct: 7 DNFIASAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFLNTHNPHAHYDVSGKFAD 66
Query: 111 LTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPA--DFDWREKGAVGPVKDQGSCGSC 168
LTP EF + YL R KD + + + L DWREK AV PVK+QG CGSC
Sbjct: 67 LTPQEFAKLYLNPDYYARRGKDYKEHVHVDDSVLSGAMSVDWREKVAVTPVKNQGMCGSC 126
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS G +E L LVSLSEQ LV CD D GCNGGLM+ A E+ +
Sbjct: 127 WAFSAIGNIESQWALKNHSLVSLSEQMLVSCD---------DIDDGCNGGLMDQAMEWII 177
Query: 229 K--AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
+ G + EE YPY DK + A ++ + + DE IAA + K GP+AV
Sbjct: 178 QHHNGTVPTEESYPYASAGGTSPPCHDKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAV 237
Query: 287 AINAVYMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
A++A Q Y GGV +C L+HGVL+VG+ + + PYWI+KNSWG SW
Sbjct: 238 AVDATTWQLYFGGVVT--LCFGWSLNHGVLVVGFN-------KRAKPPYWIVKNSWGTSW 288
Query: 346 GENGYYKICRGRNVC 360
GE GY ++ G N C
Sbjct: 289 GEKGYIRLAMGSNQC 303
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/295 (40%), Positives = 165/295 (55%), Gaps = 26/295 (8%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRRTYLGLR 124
+ Y + E + RF +F+ NLR +H + H G+ +F+DLT E+R TYLG+R
Sbjct: 51 RTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRLGLNRFADLTNEEYRDTYLGVR 110
Query: 125 RK-LRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
K +R + + + +LP DWREKGAV VKDQG CGSCW+FS A+EG N +
Sbjct: 111 TKPVRERRLSGRYQAADNEELPESVDWREKGAVAKVKDQGGCGSCWAFSAIAAVEGINQI 170
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG +++LSEQ+LVDCD S + GCNGGLM+ AFE+ + GG+ EEDYPY
Sbjct: 171 VTGDMIALSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY-- 220
Query: 244 TDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGV 300
+R + C +K ++ + V ++ + V N P++VAI A Q Y G+
Sbjct: 221 KERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPISVAIEAGGRAFQLYKSGI 280
Query: 301 SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C LDHGV VGYGS K YWI+KNSWG WGE+GY ++ R
Sbjct: 281 FTGR-CGTALDHGVTAVGYGSE-------NGKDYWIVKNSWGTVWGEDGYVRLER 327
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 174/319 (54%), Gaps = 22/319 (6%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K +K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 27 HWDLWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHPYRLGMNHFGDMTHE 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EFR+ G +++ K + P + P DWR+KG V PVKDQG CGSCW+FST
Sbjct: 86 EFRQIMNGYKQRKTERKFKGSLFMEPNFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFST 145
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TGALEG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y GL
Sbjct: 146 TGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQGL 198
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY 292
E+ YPY GTD C +D + +A+ F V S E + + GP++VAI+A +
Sbjct: 199 DSEDSYPYLGTD-DQPCHYDPNYNSANDTGFVDVPSGKERALMKAVAAVGPVSVAIDAGH 257
Query: 293 --MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ C S LDHGVL+VGY GY + K YWI+KNSW E WG+ G
Sbjct: 258 ESFQFYQSGIYYEKDCSSEELDHGVLVVGY---GYEGEDVDGKKYWIVKNSWSEKWGDKG 314
Query: 350 YYKICRGR-NVCGVDSMVS 367
Y + + R N CG+ + S
Sbjct: 315 YIYMAKDRKNHCGIATAAS 333
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/297 (40%), Positives = 167/297 (56%), Gaps = 35/297 (11%)
Query: 80 RFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI 138
R+ +F+ NL++A Q + +A +G+T+F DLT EFRR YL + P
Sbjct: 1 RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFRRYYL--TPVWKAPAKPLPPAT 58
Query: 139 LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVD 198
+P D P FDWR+ GAV VKDQG CGSCW+FSTTG +EG + G L LSEQ
Sbjct: 59 IPKKDAPTAFDWRDHGAVTEVKDQGQCGSCWAFSTTGNIEGQWAIKKGNLPDLSEQHTSK 118
Query: 199 CDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA----GGLMREEDYPYTGTDRGHACKFDK 254
+ SC +N + T ++ GL E+ YPY D C D
Sbjct: 119 IE---------SCH-------INPIVKRTKRSIDGKSGLESEKAYPYEAKDE--QCHMDY 160
Query: 255 SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICS-RRLD 311
SK+ + + +S DE+ +A+ L +NGP+++ INA MQ Y+GG+S P+ C+ LD
Sbjct: 161 SKVQVYINSSVNISKDENDMASWLAENGPISIGINAFPMQFYMGGISHPWRIFCNPEELD 220
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
HGVL+VGYG+ E PYWIIKNSWG++WGE GYY + RG VCG+++M ++
Sbjct: 221 HGVLIVGYGTK-------DETPYWIIKNSWGKNWGEEGYYLVYRGGGVCGLNTMCTS 270
>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
Length = 255
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/268 (45%), Positives = 158/268 (58%), Gaps = 23/268 (8%)
Query: 106 TQFSDLTPAEFRRTYLG-LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGS 164
T+FSDLT EF YL L + L ++ AP + + +DWR+ GAV PVK+QG
Sbjct: 4 TKFSDLTEEEFHSAYLNPLLSQWTLHREMKPAPPAKSPAPDS-WDWRDHGAVSPVKNQGM 62
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FS TG +EG FL G L+SLSEQ+LVDCD D C GGL ++A+
Sbjct: 63 CGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAY 113
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
E K GGL E DY YTG + C F K+AA + + + DE +IAA L +NGP+
Sbjct: 114 EAIEKLGGLETETDYSYTG--KKQRCDFTNRKVAAYINSSVELPKDEKEIAAWLAENGPI 171
Query: 285 AVAINAVYMQTYIGGVSCPY--ICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+VA+NA MQ Y GVS P+ C+ +DH VLLVGYG P+W IKNSW
Sbjct: 172 SVALNAFAMQFYKKGVSHPWKIFCNPWMIDHAVLLVGYG-------ERNGIPFWAIKNSW 224
Query: 342 GESWGENGYYKICRGRNVCGVDSMVSTV 369
GE +GE GYY + RG N CG++ M S+
Sbjct: 225 GEDYGEQGYYYLHRGSNACGINKMGSSA 252
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 165/321 (51%), Gaps = 29/321 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLTPA 114
H+ FK K YA+ E +R +FK N R A+H L S G Q++D+
Sbjct: 27 HWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDLFASGEVTFKVGYNQYADMHTH 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSF 171
E G R L K A +ND DWR KGA P+KDQG CGSCWSF
Sbjct: 87 EVTEKLNGYRSGL---KQASAFVHTASNDSWPWSKKVDWRSKGAATPIKDQGQCGSCWSF 143
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG+LEG FL LVSLSEQ LVDC + E GCNGGLM+SAFEY G
Sbjct: 144 SATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNGGLMDSAFEYVKSNG 196
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA 290
G+ EE YPYT D G +C + + A + V + E + + K GP++VAI+A
Sbjct: 197 GIDTEESYPYTAVD-GDSCLYRAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDA 255
Query: 291 V--YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
Q Y G+ CS LDHGVL VGYGS K +WI+KNSWG SWGE
Sbjct: 256 SNWSFQMYSSGIYYESACSSDYLDHGVLAVGYGS------EWPNKEFWIVKNSWGTSWGE 309
Query: 348 NGYYKICRG-RNVCGVDSMVS 367
GY K+ R +N CG+ + S
Sbjct: 310 EGYIKMARNKKNNCGIATEAS 330
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/293 (45%), Positives = 164/293 (55%), Gaps = 31/293 (10%)
Query: 81 FTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILP 140
F ANLR H + S T GITQF+DLT AEF + P++ P
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFSAYVKRFPMNVTRPRNEVWITEAP 107
Query: 141 TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCD 200
++ DWR+K AV +K+QG CGSCWSFSTTG++EGA+ +ATGKLVSLSEQQL+DC
Sbjct: 108 LQEV----DWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCS 163
Query: 201 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAAS 260
+ GCNGGLM+ AFEY + GGL EEDYPYT D G + K AA
Sbjct: 164 TRYG-------NHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAED-GKCNTEKEKKHAAE 215
Query: 261 VANFSVVSLD-EDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLV 317
+ F V + EDQ+AA V GP++VAI A Q Y GV C LDHGVL+V
Sbjct: 216 IHGFRNVPKEHEDQLAA-AVSIGPVSVAIEADQAGFQHYTSGVF-DGKCGTSLDHGVLVV 273
Query: 318 GYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG---RNVCGVDSMVS 367
GY YWI+KNSWG+SWGE GY ++ RG + +CG+ S
Sbjct: 274 GYSD-----------DYWIVKNSWGKSWGEEGYIRLKRGVDKKGMCGITMQAS 315
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 168/312 (53%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA +L +LSEQQLV CD + DSGC LM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHRLTALSEQQLVSCDDK---------DSGCRARLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKS---KIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E +AA L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSST-GYVPECSNSIQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC + L+HGVLLVGY G E PYW+IKNSWGE+WGEN
Sbjct: 268 SSFMSYQRGVVTSCAGM---PLNHGVLLVGYNRTG-------EVPYWVIKNSWGENWGEN 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 198/365 (54%), Gaps = 47/365 (12%)
Query: 3 SKTVVLFLVSLVVFSAVSSGT---LIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH 59
S T+ + ++ +++FS +SS + +I + I + TD DE+ + +ES
Sbjct: 5 SSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTD--DEVSALYES----------- 51
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRR 118
+ + K+Y + E D RF IFK NLR + + S G+T+F+DLT E+R
Sbjct: 52 ---WLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108
Query: 119 TYLGL-----RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
YLG R+KL K P + + LP DWREKG + VKDQGSCGSCW+FS
Sbjct: 109 IYLGTKSSGDRKKLSKNKSDRYLPKV-GDSLPESIDWREKGVLVGVKDQGSCGSCWAFSA 167
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
A+E N + TG L+SLSEQ+LVDCD S + GC+GGLM+ AFE+ +K GG+
Sbjct: 168 VAAMESINAIVTGNLISLSEQELVDCDR--------SYNEGCDGGLMDYAFEFVIKNGGI 219
Query: 234 MREEDYPYTGTDRGHAC-KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA-- 290
EEDYPY +R C ++ K+ + ++ V ++ ++ V + P+++A+ A
Sbjct: 220 DTEEDYPY--KERNGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGG 277
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y G+ C +DHGV++ GYG+ YWI++NSWG +WGENGY
Sbjct: 278 RDFQHYKSGIFTGK-CGTAVDHGVVIAGYGTE-------NGMDYWIVRNSWGANWGENGY 329
Query: 351 YKICR 355
++ R
Sbjct: 330 LRVQR 334
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 170/311 (54%), Gaps = 32/311 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH-----GITQFSDLTPAEFRRTY 120
K +AY + E + RF IFK N+ H A H G+ +F+D+T E+R Y
Sbjct: 56 KHGRAYNALGEKERRFEIFKDNVLFIDAHNAA-ADAGHRSFRLGLNRFADMTNEEYRAVY 114
Query: 121 LGLR---RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
LG R + R +D+ DLP DWR KGAV VKDQGSCGSCW+FST A+
Sbjct: 115 LGTRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAV 174
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L+SLSEQ+LVDCD+ + GCNGGLM+ FE+ + GG+ EE
Sbjct: 175 EGINKIVTGDLISLSEQELVDCDN--------GYNQGCNGGLMDYGFEFIINNGGIDTEE 226
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQT 295
DYPYT D G ++ K+ S+ + V +++++ V N P++VAI A Q
Sbjct: 227 DYPYTARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQL 285
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y G+ C LDHGV+ VGYG+ K YWI++NSWG WGE+GY ++ R
Sbjct: 286 YHSGIFTGR-CGTDLDHGVVAVGYGTE-------NGKDYWIVRNSWGGDWGESGYIRMER 337
Query: 356 GRNV----CGV 362
N CG+
Sbjct: 338 NVNTSTGKCGI 348
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 171/312 (54%), Gaps = 19/312 (6%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F KFNK Y+S+ E RF IF+ NL + D +A + I +FSDL
Sbjct: 20 DLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+ E Y GL L+ + + P + P +FDWR V VK+QG CG+CW+
Sbjct: 80 SKDETISKYTGLALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
F+T +LE + +L++LSEQQL+DCD+ D+GCNGGL+++A+E ++
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAVMQM 190
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GG+ E DYPY G+D G+ + + +++ E+++ L GP+ VAI+A
Sbjct: 191 GGVQAENDYPYEGSD-GNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDA 249
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+ Y G+ Y + +H VLLVGYG PYWI+KN+WGE WGE GY
Sbjct: 250 SDIVNYRRGIM-RYCSNYGFNHAVLLVGYGVEN-------NVPYWILKNTWGEDWGEQGY 301
Query: 351 YKICRGRNVCGV 362
+++ + N CG+
Sbjct: 302 FRVQQNINACGI 313
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 190/369 (51%), Gaps = 59/369 (15%)
Query: 1 MGSKTVV--LFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH 58
M S T++ L +S + A+ + T+I+ D +E+++ +E
Sbjct: 1 MASMTMIYTLLFLSFTLSYAIKTSTIINYTD----------NEVMAMYEE---------- 40
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK-LDPSATHGITQFSDLTPAEFR 117
+ + K Y + D RF +FK NL H L+ + G+ +F+D+T E+R
Sbjct: 41 ----WLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYR 96
Query: 118 RTYLGL-----RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YLG RR ++ + + LP DWR KGAV P+KDQGSCGSCW+FS
Sbjct: 97 AMYLGTKSNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFS 156
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
T +E N + TGK VSLSEQ+LVDCD + + GCNGGLM+ AFE+ ++ GG
Sbjct: 157 TVATVEAINKIVTGKFVSLSEQELVDCDR--------AYNEGCNGGLMDYAFEFIIQNGG 208
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF----SVVSLDEDQIAANLVKNGPLAVAI 288
+ ++DYPY G D C D +K A V N V DE+ + V + P++VAI
Sbjct: 209 IDTDKDYPYRGFD--GIC--DPTKKNAKVVNIDGYEDVPPYDENAL-KKAVAHQPVSVAI 263
Query: 289 NAV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A +Q Y GV C LDHGV++VGYGS YW+++NSWG WG
Sbjct: 264 EASGRALQLYQSGVFTG-KCGTSLDHGVVVVGYGSENGV-------DYWLVRNSWGTGWG 315
Query: 347 ENGYYKICR 355
E+GY+K+ R
Sbjct: 316 EDGYFKMQR 324
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/322 (36%), Positives = 175/322 (54%), Gaps = 23/322 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F FNK Y+S+ E HRF IF+ NL D SA + I +FSDL
Sbjct: 20 DLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ E Y GL L+ ++ + +L P + P +FDWR V VK+QG+CG+CW
Sbjct: 80 SKDETISKYTGLSLPLQ-NQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+F+T G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAVMN 189
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E DYPY + C+ + +K V + V + E+++ L GPL VAI
Sbjct: 190 MGGIQAENDYPYEANNGD--CRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAI 247
Query: 289 NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+A + Y GV Y + L+H VLLVGY P+WI+KN+WG WGE
Sbjct: 248 DASDIVNYKRGV-IRYCANHGLNHAVLLVGYAVENGV-------PFWILKNTWGTDWGEQ 299
Query: 349 GYYKICRGRNVCGVDSMVSTVA 370
GY+++ + N CG+ + + + A
Sbjct: 300 GYFRVQQNINACGIQNELPSSA 321
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 166/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNGYHGS-RKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 TTGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + ED + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 182/334 (54%), Gaps = 35/334 (10%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-----QKLDPSAT 102
ST ++ + +K F K Y S EE HR F NL RH Q+L+ A
Sbjct: 20 STESETYVRRELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAV 79
Query: 103 HGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGP 158
+ FSDLTP EF YL LR KLR K+A P+ +LP +WRE+GAV
Sbjct: 80 R-LNDFSDLTPGEFAERYLCLRGIVLTKLR-RKEAVSVPL--KENLPDSVNWRERGAVTS 135
Query: 159 VKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGG 218
VK+QG CGSCWSFS GA+EGA + TG L SLSEQQL+DC + + GCNGG
Sbjct: 136 VKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGG 188
Query: 219 LMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAAN 277
LM AF+Y + G+ E DY Y T+R C++ + + A+V ++ + DE +
Sbjct: 189 LMPQAFQYAQRY-GVEAEVDYRY--TERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRA 245
Query: 278 LVKNGPLAVAINAV--YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPY 334
+ GP++V I+A +Y GV CS +DHGVL+VGYG+ Y
Sbjct: 246 VATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYAIDHGVLVVGYGAE-------NGDAY 298
Query: 335 WIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
W++KNSWG SWGE+GY K+ R R N+CG+ SM S
Sbjct: 299 WLVKNSWGSSWGEDGYLKMARNRNNMCGIASMAS 332
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/346 (39%), Positives = 175/346 (50%), Gaps = 31/346 (8%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEH---HFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR VTD L EST LG F+ F ++ K+Y S E RF IF +L
Sbjct: 34 IRSVTDRAASAL---ESTVFGALGRTRDALRFARFAVRYGKSYESAAEVQKRFRIFSESL 90
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+ + S GI +FSD++ EFR T LG + + LP
Sbjct: 91 QLVRSTNRKGLSYRLGINRFSDMSWEEFRATRLGAAQNCSATLAGNHRMRAAAVALPKTK 150
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVK+QG CGSCW+FSTTGALE A ATGK +SLSEQQLVDC +
Sbjct: 151 DWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFN---- 206
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANFS 265
+ GCNGGL + AFEY GGL EE YPY G + C F + V N +
Sbjct: 207 ---NFGCNGGLPSQAFEYIKYNGGLDTEESYPYKGVN--GICDFKAENVGVKVLDSVNIT 261
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
+ + DE + A LV+ P++VA V + Y GV C ++H VL VGYG
Sbjct: 262 LGAEDELKDAVALVR--PVSVAFQVVNGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGV 319
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWG WG+ GY+K+ G+N+CGV + S
Sbjct: 320 ENGV-------PYWLIKNSWGADWGDKGYFKMEMGKNMCGVATCAS 358
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 182/334 (54%), Gaps = 35/334 (10%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-----QKLDPSAT 102
ST ++ + +K F K Y S EE HR F NL RH Q+L+ A
Sbjct: 20 STGSETYVRRELWKAWKLAFKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAV 79
Query: 103 HGITQFSDLTPAEFRRTYLGLR----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGP 158
+ FSDLTP EF YL LR KLR K+A P+ +LP +WRE+GAV
Sbjct: 80 R-LNDFSDLTPGEFAERYLCLRGIVLTKLR-RKEAVSVPL--KENLPDSVNWRERGAVTS 135
Query: 159 VKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGG 218
VK+QG CGSCWSFS GA+EGA + TG L SLSEQQL+DC + + GCNGG
Sbjct: 136 VKNQGQCGSCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGG 188
Query: 219 LMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAAN 277
LM AF+Y + G+ E DY Y T+R C++ + + A+V ++ + DE +
Sbjct: 189 LMPQAFQYAQRY-GVEAEVDYRY--TERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRA 245
Query: 278 LVKNGPLAVAINAV--YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPY 334
+ GP++V I+A +Y GV CS +DHGVL+VGYG+ + Y
Sbjct: 246 VATIGPISVGIDAADPGFMSYSHGVFVSKTCSPYAIDHGVLVVGYGAE-------NGEAY 298
Query: 335 WIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
W++KNSWG SWGE GY K+ R R N+CG+ SM S
Sbjct: 299 WLVKNSWGSSWGEGGYVKMARNRNNMCGIASMAS 332
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 174/320 (54%), Gaps = 34/320 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAEFRR 118
FK + Y EE R +F+ NL++ H L S GI QF+D+ EF
Sbjct: 47 FKTVHERNYGETEEM-QRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFAS 105
Query: 119 TYLGLRRKLRLPKDADQ------APILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
G R R K D +P +P + LPA+ DWR++G V P+KDQG CGSCWSFS
Sbjct: 106 VVNGFRMNNRT-KVRDHLHSHYISPAIPVS-LPAEVDWRKEGYVTPIKDQGHCGSCWSFS 163
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG +F TGKLVSLSEQ L+DC ++GCNGG+M+ AF+Y G
Sbjct: 164 TTGALEGQHFRKTGKLVSLSEQNLIDC-------STSYGNNGCNGGVMDYAFQYIKDNDG 216
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY D C+F K + A+ ++ + DE+++ + GP++VAI+A
Sbjct: 217 DDTEDSYPYEAADG--PCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDAS 274
Query: 292 Y--MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV C LDHGVL+VGYG+ + YW++KNSWG WG+
Sbjct: 275 HTSFQMYQSGVYDEVECDPEGLDHGVLVVGYGTE-------LGQDYWLVKNSWGTKWGDE 327
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY K+ R + N CG+ SM S
Sbjct: 328 GYIKMSRNKNNQCGISSMAS 347
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 177/346 (51%), Gaps = 32/346 (9%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR VT+ S EST LG H F+ F + K+Y S E RF IF +L
Sbjct: 30 IRPVTE---RAASAVESTVLGALGRTRHALRFARFAVRHGKSYGSAAEVQRRFRIFSESL 86
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+ S GI +FSD+T EF+ T LG + + + N LP
Sbjct: 87 DEVRSTNRKGLSYKLGINRFSDMTWEEFQATKLGAAQTCSATLAGNHL-MRDANALPETK 145
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVKDQ SCGSCW+FSTTGALE A ATGK +SLSEQQLVDC +
Sbjct: 146 DWRETGIVSPVKDQASCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYN---- 201
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA---NFS 265
+ GCNGGL + AFEY GG+ EE YPY G + CK+ A VA N +
Sbjct: 202 ---NFGCNGGLPSQAFEYIKYNGGIDTEESYPYKGVN--GVCKYRPENAAVQVADSVNIT 256
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAV-YMQTYIGGVSCPYICSRRLD---HGVLLVGYGS 321
+ + DE + A LV+ P++VA + + Y GV C D H VL VGYG
Sbjct: 257 LNAEDELKNAVGLVR--PVSVAFEVIDGFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYGV 314
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWG WGE+GY+K+ G+N+C V + S
Sbjct: 315 E-------NGVPYWLIKNSWGADWGEDGYFKMEMGKNMCAVATCAS 353
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 169/317 (53%), Gaps = 23/317 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
H+ +K K K Y +EE R +++ N++ H + HG T F D+T
Sbjct: 28 HWYRWKAKHRKLYGMREE-GWRRAVWEKNMKMIEVHNQEYSQGKHGFTMAMNAFGDMTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR+ G R + Q P ++P DWREKG V PVK+QG CGSCW+FS T
Sbjct: 87 EFRQVMNGFRNQKHKKGKVFQEPSFL--EVPKSVDWREKGYVTPVKNQGQCGSCWAFSAT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG F TGKL+SLSEQ LVDC P+ + GC+GGLM+ AF+Y + GGL
Sbjct: 145 GALEGQMFRKTGKLISLSEQNLVDCSR---PQ----GNEGCDGGLMDYAFQYIKENGGLD 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY-- 292
EE YPY D +CK+ A+ F + +E + + GP++VAI+A +
Sbjct: 198 SEESYPYDAMDE--SCKYRPEYSVANDTGFVDIPKEEKALMKAVATVGPISVAIDAGHES 255
Query: 293 MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV P S +DHGVL+VGY GY +W++KNSWGE WG GY
Sbjct: 256 FQFYKEGVYFEPECSSDNVDHGVLVVGY---GYEETESDNNKFWLVKNSWGEEWGLGGYI 312
Query: 352 KICRG-RNVCGVDSMVS 367
K+ + +N CG+ + S
Sbjct: 313 KMTKDQKNHCGIATAAS 329
>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
Length = 376
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 172/335 (51%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ GA+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFTCDWRKVAGAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S Q+L+DC G C GC+GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVQELLDC---------GRCGDGCHGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H+C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHSCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
++ Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLRLYRKGVIKATPITCDPQLVDHSVLLVGFGSIKSEEGILAETVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/327 (40%), Positives = 180/327 (55%), Gaps = 31/327 (9%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATH---GITQFSDLT 112
+ ++ FK + K Y S+ E R I+ N + A+H Q+ D + +++DL
Sbjct: 24 KEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLL 83
Query: 113 PAEFRRTYLGLRR---KLRLPKDADQAP---ILPTN-DLPADFDWREKGAVGPVKDQGSC 165
EF +T G R K L + P I P N ++P DWR+KGAV PVKDQG C
Sbjct: 84 HEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHC 143
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCWSFS TGALEG +F TGKLVSLSEQ LVDC + ++GCNGG+M+ AF+
Sbjct: 144 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYG-------NNGCNGGMMDYAFQ 196
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPL 284
Y GG+ E+ YPY D C F+ + A+ + + DE+ + L GP+
Sbjct: 197 YIKDNGGIDTEKSYPYEAID--DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPV 254
Query: 285 AVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
++AI+A + Q Y GV C S LDHGVL VGYG++ + + YW++KNSW
Sbjct: 255 SIAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSE------EGEDYWLVKNSW 308
Query: 342 GESWGENGYYKICRGR-NVCGVDSMVS 367
G +WG+ GY K+ R R N CGV + S
Sbjct: 309 GTTWGDQGYVKMARNRDNHCGVATCAS 335
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/322 (36%), Positives = 177/322 (54%), Gaps = 23/322 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F KFNK Y+S+ E RF IF+ NL + D SA + I +FSDL
Sbjct: 20 DLLKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEIINKNQNDTSAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ E Y GL L+ ++ + +L P + P +FDWR V VK+QG CG+CW
Sbjct: 80 SKDETISKYTGLSLPLQ-KQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACW 138
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+F+T G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDVGCDGGLLHTAYEAVMN 189
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E DYPY + C+ + +K V + V+L E+++ L GP+ VAI
Sbjct: 190 MGGIQAENDYPYEANN--GPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAI 247
Query: 289 NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+A + Y G+ Y + L+H VLLVGYG P+WI+KN+WG WGE
Sbjct: 248 DASDIVGYKRGI-IRYCENHGLNHAVLLVGYGVENGI-------PFWILKNTWGADWGEQ 299
Query: 349 GYYKICRGRNVCGVDSMVSTVA 370
GY+++ + N CG+ + + + A
Sbjct: 300 GYFRVQQNINACGIKNELPSSA 321
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 166/308 (53%), Gaps = 24/308 (7%)
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
NKAY + +E R+ FK N+ G+ Q +DL+ E+R YLG R +
Sbjct: 42 NKAY-THKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHI 100
Query: 128 RL----PKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
+L ++ P P + DWREK AV PVKDQG CGSC+SFSTTG++EG +
Sbjct: 101 KLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAI 160
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TGKLVSLSEQ ++DC E GCNGGLM +AFEY +K GL EE YPY
Sbjct: 161 KTGKLVSLSEQNILDCSSSFGNE-------GCNGGLMTNAFEYIIKNNGLNSEEQYPYE- 212
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVS 301
CKF + +AA + ++ + ++ N + P++VAI+A + Q Y GV
Sbjct: 213 MKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVY 272
Query: 302 CPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NV 359
CS LDHGVL VG G+ + Y+I+KNSWG SWG NGY + R + N
Sbjct: 273 YEPACSSEDLDHGVLAVGMGTD-------NGEDYYIVKNSWGPSWGLNGYIHMARNKDNN 325
Query: 360 CGVDSMVS 367
CG+ +M S
Sbjct: 326 CGISTMAS 333
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 169/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ +H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIEQHNQEYREGKHSFTMAMNAFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + Q P+ + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQYVQDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VA++A +
Sbjct: 196 LDSEESYPYEATEE--SCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAVDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 QSFQFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNNKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 166/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + ED + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 120/307 (39%), Positives = 174/307 (56%), Gaps = 22/307 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPAEFRR 118
FK+ K Y S E RF IF++NLR+ A++ K + S G+T F+DLT EF+
Sbjct: 26 FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85
Query: 119 TYLGLRRKLRLPKDADQA-PILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
LRR+++ + + + P ++P DW +KGAV VK QG CGSCW+FS TGA
Sbjct: 86 K---LRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGA 142
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG N + + LSEQQL+DC +P D +GGLM+ AF+Y L G+ +
Sbjct: 143 LEGQNAIVNNVKIPLSEQQLLDC------SKPYGNDDCEHGGLMSFAFDYVLDK-GIEAD 195
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
YPY G D C++D K + + VS+ E+++ + GP++VAI+A +Q Y
Sbjct: 196 SSYPYKGIDT--PCQYDAKKTVLKIKGYRNVSISEEELKKAVGTVGPVSVAIDADPIQLY 253
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR- 355
GG+ C+ L+HGVL VGYG + +K +W +KNSWG+ WGE GY++I R
Sbjct: 254 SGGILDGLFCTHNLNHGVLAVGYGEEDHL---FGKKKFWKVKNSWGKDWGEQGYFRIKRD 310
Query: 356 GRNVCGV 362
N+CG+
Sbjct: 311 ANNLCGI 317
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/320 (38%), Positives = 175/320 (54%), Gaps = 26/320 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ ++++K F K Y + EE R +++ N+ +H + H G +++D+T
Sbjct: 25 DQEWAIYKDMFAKNYVADEERMRRL-VWEDNIDYIEKHNRRADRGEHKFWLGTNEYADMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF+ G + D +P DLP DWR+KG V PVK+QG CGSCWSFS
Sbjct: 84 IDEFKAIMNGFIMQNGTKGDTYMSPS-NIGDLPDKVDWRDKGYVTPVKNQGHCGSCWSFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +F +TGKLVSLSEQ L+DC + + GC GGLM+ AFEY K G
Sbjct: 143 ATGSLEGQHFKSTGKLVSLSEQNLIDCSKK-------EGNHGCKGGLMDFAFEYIQKNDG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAAS-VANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPYT D G C+F K+ + A+ + E + + GP++VA++A
Sbjct: 196 IDTEQSYPYTAKD-GIECRFKKADVGATDKGKVDLPRQSEKALQEAVATVGPISVAMDAG 254
Query: 292 Y--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ +CS +LDHGVL VGYGS G E YW++KNSWG +WG
Sbjct: 255 HRSFQLYKRGIYTEPMCSSTKLDHGVLAVGYGSEG-------EGDYWLVKNSWGATWGME 307
Query: 349 GYYKICRG-RNVCGVDSMVS 367
G++ + R RN CG+ + S
Sbjct: 308 GFFMLARNHRNECGIATQAS 327
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 167/320 (52%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y + G
Sbjct: 145 ATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKENDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + ED + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 29/321 (9%)
Query: 53 LLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQF 108
L A + FK KF + Y EE +R +F NL+ +++ + + I QF
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQF 72
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP--ADFDWREKGAVGPVKDQGSCG 166
SDLT EF G + LR PK A T+ P + DWR KG V VKDQG CG
Sbjct: 73 SDLTNDEFNSMMKGYKTSLR-PKPV--AVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCG 129
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS TG+LEG +FL G+LVSL+EQQLVDC + GCNGG +N AF+Y
Sbjct: 130 SCWAFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGI------YYNQGCNGGWVNQAFKY 183
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLA 285
GG+ E YPY D + C+F+ + +AA+ + F S+ E GP++
Sbjct: 184 IKANGGIDTESSYPYEARD--NTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPIS 241
Query: 286 VAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
VAI+A + Q+Y GV P S +LDH VL VGYGS G + +W++KNSWG
Sbjct: 242 VAIDAAHRSFQSYSSGVYYEPSCSSSQLDHAVLAVGYGSEG-------GQDFWLVKNSWG 294
Query: 343 ESWGENGYYKICRGR-NVCGV 362
SWG GY + R R N CG+
Sbjct: 295 TSWGSAGYINMARNRNNNCGI 315
>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
Length = 333
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 178/322 (55%), Gaps = 39/322 (12%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + + K Y+S E +++R F N R+ H + + + G+ QFSD+T AE +R
Sbjct: 32 HFKSWMTQHQKTYSSVE-YNYRLKTFANNWRKIHAHNQRNHTFKMGLNQFSDMTFAEIKR 90
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T LP DWR+KG V VK+QGSCGSCW+FSTT
Sbjct: 91 KYL-----WSEPQNCSATKGNYLRGTGPLPPSMDWRKKGNFVSAVKNQGSCGSCWTFSTT 145
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +A+GK++SL+EQQLVDC + + GC GGL + AFEY L G+M
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQNFN-------NHGCEGGLPSQAFEYILYNKGIM 198
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINA--- 290
E+ YPY G D GH CKFD K A V + + ++L DE + + P++ A
Sbjct: 199 GEDTYPYRGKD-GH-CKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTDD 256
Query: 291 --VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK---PYWIIKNSWGESW 345
+Y + SC + +++H VL VGYG EK PYWI+KNSWG +W
Sbjct: 257 FMLYQKGIYSSTSC-HKTPDKVNHAVLAVGYG----------EKDGIPYWIVKNSWGTNW 305
Query: 346 GENGYYKICRGRNVCGVDSMVS 367
G+ GY+ I RG+N+CG+ + S
Sbjct: 306 GDKGYFLIERGKNMCGLAACAS 327
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 178/322 (55%), Gaps = 29/322 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAE 115
+ FK + K + S+ E R IF N + A+H +L S G+ ++SD+ E
Sbjct: 27 WQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHE 86
Query: 116 FRRTYLG----LRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWS 170
F+ T G +R+ LR + I P N +P DWR+ GAV VKDQG CGSCW+
Sbjct: 87 FKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWA 146
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS+T ALEG +F G LVSLSEQ LVDC + ++GCNGGLM++AF Y
Sbjct: 147 FSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYG-------NNGCNGGLMDNAFRYIKDN 199
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E+ YPY G D +C F KS + A+ F + DE+ + + GP++VAI+
Sbjct: 200 GGIDTEKSYPYEGIDD--SCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAID 257
Query: 290 AVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y GV + P ++ LDHGVL+VGYG+ YW++KNSWG +WG
Sbjct: 258 ASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGL------DYWLVKNSWGTTWG 311
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY K+ R + N CG+ + S
Sbjct: 312 DQGYIKMARNQDNQCGIATASS 333
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 174/336 (51%), Gaps = 41/336 (12%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD----PSATHGITQFSDLT 112
+ + FK + K Y S+ E+++R ++F NL + H KL S + DLT
Sbjct: 25 QEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLT 84
Query: 113 PAEFRRTYLGLRRKL-------------RLPKDADQ--APILPTN----DLPADFDWREK 153
EF R Y +L LP+D LPTN DLP D DWR+K
Sbjct: 85 KDEFMRIYTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQK 144
Query: 154 GAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDS 213
GAV PVK+Q +CGSCWSFS TGALE F T KL+SLSEQQLVDC +
Sbjct: 145 GAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYG-------NH 197
Query: 214 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQ 273
GC+GG M+ AF Y + GG+ E+ YPYT D C + AA+V+ +V E+Q
Sbjct: 198 GCHGGWMHWAFGYIKENGGIDTEQSYPYTAKDG--RCAYKPGNKAATVSQVIMVPRGENQ 255
Query: 274 IAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEK 332
+AA + GP+++A + Q Y GV C L+H +L VGYGS G K
Sbjct: 256 LAAKVSSVGPISIAAEVSHKFQFYHSGVYDEPQCGHSLNHAMLAVGYGSMG-------GK 308
Query: 333 PYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+W++KNSWG WG+ GY ++ + + N CG+ M S
Sbjct: 309 NFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMAS 344
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 175/322 (54%), Gaps = 29/322 (9%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + ++ ++ + + Y + E + RF F+ NLR +H +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKL-RLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG R K R K + + ++LP DWR+KGAV
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV 144
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
G VKDQG CGSCW+FS A+EG N + TG ++ LSEQ+LVDCD S + GCN
Sbjct: 145 GAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCN 196
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIA 275
GGLM+ AFE+ + GG+ EEDYPY +R + C +K ++ + V ++ ++
Sbjct: 197 GGLMDYAFEFIINNGGIDSEEDYPY--KERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 276 ANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
V N P++VAI A Q Y G+ C LDHGV VGYG+ K
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGKD 306
Query: 334 YWIIKNSWGESWGENGYYKICR 355
YW+++NSWG WGE+GY ++ R
Sbjct: 307 YWLVRNSWGSVWGEDGYIRMER 328
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 172/334 (51%), Gaps = 33/334 (9%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I S +++ +L + HH+ K + Y EE R +++ N++ H +
Sbjct: 138 IASATPNSDQNLDTSWHHW---KSTHRRLYGKNEE-GWRRAVWEKNMKMIEMHNHEYSNG 193
Query: 102 THGITQ----FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVG 157
HG T F D+T EFR+ G R + + AP+L P DWREKG V
Sbjct: 194 KHGFTMGMNAFGDMTNEEFRQVMNGFRNQKQKSGKVFHAPLLL--QAPKSVDWREKGFVT 251
Query: 158 PVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNG 217
PVK+QG CGSCW+FS TGALEG F TGKL+SLSEQ LVDC + GC G
Sbjct: 252 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRR-------QGNLGCQG 304
Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAAN 277
GLM++AF+Y GGL EE YPY G D C++ A+ F E +
Sbjct: 305 GLMDNAFQYIKDNGGLDSEESYPYKGMD--GTCQYKAEWAVANDTGF------EKALMKA 356
Query: 278 LVKNGPLAVAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
+ GP++VAI+A + Q Y G+ P S LDHGVL+VGYG R Y
Sbjct: 357 VASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVGYG----VEKRNSNDKY 412
Query: 335 WIIKNSWGESWGENGYYKICRGRNV-CGVDSMVS 367
W+IKNSWGE WG NGY KI + RN CGV S S
Sbjct: 413 WLIKNSWGEQWGANGYVKIAKDRNNHCGVASAAS 446
>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
Length = 1157
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/277 (41%), Positives = 161/277 (58%), Gaps = 22/277 (7%)
Query: 87 NLRRAARHQKLDP-SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP 145
N+++A +Q L+ +A +G+TQFSDLT EF+ T+LGLR + K + +P
Sbjct: 654 NIKQAEFYQTLERGTALYGVTQFSDLTGEEFQETFLGLRLDEQYSKSQSYVKKKHSVSIP 713
Query: 146 ADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 205
++DWR GAVGPV DQG CGSCW+FS G +EG F TG+LVSLS+QQLVDCD
Sbjct: 714 ENYDWRPYGAVGPVLDQGHCGSCWAFSVIGNIEGQWFRKTGQLVSLSKQQLVDCDRS--- 770
Query: 206 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS 265
GC GG + ++ + GGL E DY YTG D C + K A V +
Sbjct: 771 ------SRGCGGGYPPATYDSIRRIGGLEIELDYRYTGRD--GVCHQNPRKFVAYVNSSV 822
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCP---YICSRRLDHGVLLVGYGSA 322
++ DE+ IA L +GP+++A+NA +Q Y+ G+ P Y + + H VL VG+G+
Sbjct: 823 ALTKDENTIAEWLSYHGPISMALNARLLQFYVSGIMHPPAAYCPVKDISHAVLSVGFGTK 882
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV 359
G P+WI+KNSWG WGE GY++I RG ++
Sbjct: 883 G-------NVPFWIVKNSWGTLWGEEGYFRIYRGDDM 912
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/261 (38%), Positives = 129/261 (49%), Gaps = 34/261 (13%)
Query: 115 EFRRTYLGL---RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
EF+ YL RKL K + + D FDWR+ GAVGPV DQ CG+ W+F
Sbjct: 434 EFKALYLTAMYDHRKLNQSKTTEPETVGEPQD---SFDWRDYGAVGPVLDQDRCGASWAF 490
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S G +EG F+ +L+SLSEQQLVDCD D GC GG AFE + G
Sbjct: 491 SAIGNIEGQYFMRVHRLLSLSEQQLVDCDR---------IDQGCAGGTPYGAFEGIQQLG 541
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E DYPY G C+ + + S+ + DEDQIA L +GPL+V IN
Sbjct: 542 GLELEADYPYLGHQDN--CQSNPLRFVVSINGSVQLPKDEDQIAQYLFDHGPLSVGINGA 599
Query: 292 YMQTYIGGVSCPYI--CS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Q Y G+ P C+ ++H L VG+G ++ PYW IKNSWG WGE
Sbjct: 600 LLQYYSSGIMQPLWDNCNPAEMNHAGLAVGFGFE-------QDVPYWTIKNSWGMLWGEE 652
Query: 349 G-------YYKICRGRNVCGV 362
Y + RG + GV
Sbjct: 653 DNIKQAEFYQTLERGTALYGV 673
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 84/182 (46%), Positives = 102/182 (56%), Gaps = 12/182 (6%)
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
F YLG R R P A + ++P FDWRE GAVGP++DQG CGSCW+FST G
Sbjct: 972 FYFLYLGARFD-REPSRAGSMVVDDLGEIPERFDWRELGAVGPIQDQGDCGSCWAFSTIG 1030
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+EG F TG+L++LSEQQL+DCD S D GC GG + +K GGL
Sbjct: 1031 NIEGQWFKKTGQLLTLSEQQLIDCD---------SVDDGCGGGYPPDTYGDIVKMGGLEL 1081
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
DYPY D CK ++SK A V V+ EDQ A L KNGPL+ INA Y+Q
Sbjct: 1082 NADYPYIAAD--GVCKMERSKFRAYVNKSLVLPTKEDQQAVWLSKNGPLSAGINADYLQV 1139
Query: 296 YI 297
I
Sbjct: 1140 VI 1141
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 87/242 (35%), Positives = 115/242 (47%), Gaps = 43/242 (17%)
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFRR YL + + D+ + LP+ FDWRE GAVGPV++QG CGSCW+ S
Sbjct: 190 EFRRLYLTYKSPDE-HEPIDRIHVQEVGQLPSYFDWREYGAVGPVRNQGQCGSCWAISA- 247
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
++VDCDH D GC+GG A+E + GGL
Sbjct: 248 --------------------EVVDCDH---------ADHGCSGGFPIHAYECVQRLGGLE 278
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQ 294
YPY G + C+ D A + + D +QIA L GPL+V ++A +Q
Sbjct: 279 LAVRYPYVGYQQ--YCQADPRYFVAYINGSVALPKDSEQIAKFLATFGPLSVVLDARLLQ 336
Query: 295 TYIGGVSCP---YICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y G+ P Y L+H VL VG+G+ + PYWIIKNSWGE WGE
Sbjct: 337 YYRSGILNPSVAYCNPEELNHAVLSVGFGTE-------QGIPYWIIKNSWGEQWGEQHLT 389
Query: 352 KI 353
K+
Sbjct: 390 KL 391
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 81/158 (51%), Gaps = 21/158 (13%)
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
QQLVDCDH D GC GG AF + GGL DYPY + + AC+F+
Sbjct: 23 QQLVDCDH---------VDRGCEGGFPLDAFMAVQRLGGLQLSIDYPYIASRQ--ACQFN 71
Query: 254 KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV---SCPYICSRRL 310
+ A V F+ + +E IA L +NGPL+V +N+ ++ Y G+ + L
Sbjct: 72 PKQAVAFVTGFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCDPEAL 131
Query: 311 DHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+H L VG+G+ + P+WIIKN++G+ WGE
Sbjct: 132 NHAALAVGFGTD-------ESTPFWIIKNTFGKDWGEQ 162
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H+ L+K K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G +RK + + + N L P DWR+ G V PVKDQG CGSCW+
Sbjct: 84 HEEFRQIMYGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG +F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+ YPY GTD C +D +A+ F + S E + + GP++VAI+
Sbjct: 195 QGLDSEDSYPYLGTD-DQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSVAID 253
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 254 AGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 310
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 311 DKGYIYMAKDRKNHCGIATAAS 332
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 172/315 (54%), Gaps = 26/315 (8%)
Query: 48 STNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATH 103
S ++L AE + FK + NK Y EE R TIF N + H L + S T
Sbjct: 29 SNFQEVLDAEVAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTV 88
Query: 104 GITQFSDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQ 162
G+ +F+D+T EF + GL+ R+ +P + LP + DWR KG V VK+Q
Sbjct: 89 GVNEFADMTVHEFAQMMNGLKPDSTRVSGSTYLSPNIDA-PLPVEVDWRTKGLVSEVKNQ 147
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
GSCGSCW+FSTTG+LEG + TG +V LSEQ LVDC + GCNGGLM +
Sbjct: 148 GSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDC-------STSYGNDGCNGGLMTN 200
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKN 281
AF+Y G+ EE YPY G D CKF K+K+ A+V F + + +E ++ L
Sbjct: 201 AFKYIKDNKGIDTEEAYPYAGRDGD--CKFKKNKVGATVTGFVEIPAGNEKKLQEALATV 258
Query: 282 GPLAVAINA---VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GP++VAI+A +M G P S +LDHGVL VGYGS + K Y+I+K
Sbjct: 259 GPVSVAIDANHQSFMLYKSGVYDEPECDSAQLDHGVLAVGYGS-------IHGKDYYIVK 311
Query: 339 NSWGESWGENGYYKI 353
NSWG +WGE GY +
Sbjct: 312 NSWGTTWGEQGYIRF 326
>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
Length = 394
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 168/311 (54%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVK+QG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E A LVSLSEQQLV CD + D+GCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+ ++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY + G PY +IKNSWGE WGE G
Sbjct: 269 SFMSYQSGVLTSC---AGDALNHGVLLVGYNTTGGV-------PYCVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVAMGLNAC 329
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 177/348 (50%), Gaps = 37/348 (10%)
Query: 32 IRQVTDGGDEILSHHESTNNDL--LGAEHH---FSLFKKKFNKAYASQEEHDHRFTIFKA 86
IRQV S HE + L +G H F+ F +++ K Y S EE RF IF
Sbjct: 31 IRQVVSD-----SFHELESGILHVVGQTRHALSFARFARRYGKRYDSVEEIKQRFDIFLD 85
Query: 87 NLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPA 146
NL H S G+ +FSDLT EFRR LG + + L LP
Sbjct: 86 NLEMINSHNDKGLSYKLGVNEFSDLTWDEFRRDRLGAAQNCSATTKGNLK--LRDAVLPE 143
Query: 147 DFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPE 206
DWRE G V PVK+QG CGSCW+FSTTGALE A GK +SLSEQQLVDC +
Sbjct: 144 TKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDCAGAFN-- 201
Query: 207 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---AN 263
+ GCNGGL + AFEY GGL EE YPYTG + CKF + V N
Sbjct: 202 -----NFGCNGGLPSQAFEYIKSNGGLETEEAYPYTG--KNGLCKFSSQNVGVKVTDSVN 254
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLVGY 319
++ + DE + A LV+ P++VA V + Y GV C ++H VL VGY
Sbjct: 255 ITLGAEDELKYAVALVR--PVSVAFEVVKGFKQYKSGVYTSTECGTTPMDVNHAVLAVGY 312
Query: 320 GSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
G Y P+W+IKNSWG WG+N Y+K+ G ++CG+ + S
Sbjct: 313 G-VEYGV------PFWLIKNSWGADWGDNAYFKMEMGNDMCGIATCAS 353
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/377 (34%), Positives = 189/377 (50%), Gaps = 54/377 (14%)
Query: 1 MGSKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHF 60
M SK + +FL+ ++ S S TL +D +N+L+ + H
Sbjct: 1 MASKQIQIFLIVSLISSFCLSITLSRPLD--------------------DNELIMQKRH- 39
Query: 61 SLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQFSDLTPAEFRR 118
+ K + YA +E ++R+ +FK N+ R R + T + QF+DLT EFR
Sbjct: 40 DEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRS 99
Query: 119 TYLGLRRKLRLPKDADQAPI------LPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
Y G + L + + + LP DWR+KGAV P+K+QG+CG CW+FS
Sbjct: 100 MYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFS 159
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
A+EGA + GKL+SLSEQQLVDCD D GC+GGLM++AFE+ + GG
Sbjct: 160 AVAAIEGATKIKKGKLISLSEQQLVDCDTN---------DFGCSGGLMDTAFEHIMATGG 210
Query: 233 LMREEDYPYTGTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAIN-- 289
L E +YPY G D CK +K A S+ + V +++++ V + P+++ I
Sbjct: 211 LTTESNYPYKGKD--ATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGG 268
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GV C+ LDH V VGYG + YWIIKNSWG WGE+G
Sbjct: 269 GFDFQFYGSGVFTGE-CTTYLDHAVTAVGYGQSSNGS------KYWIIKNSWGTKWGESG 321
Query: 350 YYKICR----GRNVCGV 362
Y +I + + +CG+
Sbjct: 322 YMRIKKDVKDKKGLCGL 338
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 172/324 (53%), Gaps = 28/324 (8%)
Query: 53 LLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA-ARHQKLDPSATH---GITQF 108
L A + FK ++ + Y +E +R +F+ N + A ++K + + QF
Sbjct: 5 LATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQF 64
Query: 109 SDLTPAEFRRTYLGLRRKLR-LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
D+T EF G ++ R P A P + AD DWR KGAV PVKDQG CGS
Sbjct: 65 GDMTNEEFNAVMKGYKKGSRGEPTTVFTAEGRP---MAADVDWRTKGAVTPVKDQGQCGS 121
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FS TG+LEG +FL +LVSLSEQ+LVDC E + GC GG M SAF+Y
Sbjct: 122 CWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG-------NDGCGGGWMTSAFDYI 174
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
GG+ E YPY DR +C+FD + I A+ F V E+ + + GP++VA
Sbjct: 175 KDNGGIDTESSYPYEAQDR--SCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPISVA 232
Query: 288 INAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A + Q Y GV CS LDHGVL VGYG+ + YW++KNSWG
Sbjct: 233 IDASHFSFQFYSSGVYYEKKCSPTNLDHGVLAVGYGTE-------STEDYWLVKNSWGSG 285
Query: 345 WGENGYYKICRGR-NVCGVDSMVS 367
WG+ GY K+ R R N CG+ S S
Sbjct: 286 WGDAGYIKMSRNRDNNCGIASEPS 309
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 166/313 (53%), Gaps = 22/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y EE R +++ N++ H + HG T F D+T EFR+
Sbjct: 32 WKATHRRLYGMNEEGWRR-AVWEKNMKMIDLHNREYSQGQHGFTMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
G R + Q P+ ++P DW KG V PVK+QG CGSCW+FS TGALE
Sbjct: 91 VMNGFRNQKPRKGKVFQEPLFA--EIPKSVDWTLKGYVTPVKNQGQCGSCWAFSATGALE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TGKLVSLSEQ LVDC E GCNGGLM++AF+Y + GGL EE
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSRSQGNE-------GCNGGLMDNAFQYVKENGGLDSEES 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY GTD +CK+ AA+ F + E + + GP++VAI+A + Q Y
Sbjct: 202 YPYLGTDT-DSCKYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHQSFQFY 260
Query: 297 IGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ P S+ LDHGVL+VGYG G +WI+KNSWG WG NGY K+ +
Sbjct: 261 KSGIYYDPDCSSKDLDHGVLVVGYGFEG---TDSNNNKFWIVKNSWGPEWGTNGYVKMAK 317
Query: 356 GRNV-CGVDSMVS 367
+N CG+ + S
Sbjct: 318 DQNNHCGIATAAS 330
>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
Length = 324
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/325 (35%), Positives = 178/325 (54%), Gaps = 23/325 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A ++F F +FNK Y S+ E RF IF+ NL + D +A + I +FSDL
Sbjct: 20 DLLKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQNDSAAKYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ E Y GL ++ ++ + +L P P +FDWR V VK+QG CG+CW
Sbjct: 80 SKDETIAKYTGLSLPIQ-TQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGACW 138
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+F+ +LE + +L+ LSEQQ++DCD S D+GCNGGL+++AFE +K
Sbjct: 139 AFAALASLESQFAMKHNQLIDLSEQQMIDCD---------SVDAGCNGGLLHTAFEAVIK 189
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E+DYPY + C+ + +K V + + + + E+++ L GP+ +AI
Sbjct: 190 MGGVQLEKDYPYEAANNN--CRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPIPMAI 247
Query: 289 NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+A + Y G+ Y + L+H VLLVGYG PYW KN+WG WGE+
Sbjct: 248 DAADIVNYKQGI-IKYCLNSGLNHAVLLVGYGVEN-------NIPYWTFKNTWGTDWGES 299
Query: 349 GYYKICRGRNVCGVDSMVSTVAAAV 373
GY+++ + N CG+ + +++ A V
Sbjct: 300 GYFRLQQNINACGMRNELASTAVIV 324
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 172/320 (53%), Gaps = 25/320 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H+ L+K +K Y +EE R I++ NL + H H G+ F D+T
Sbjct: 27 HWELWKNWHSKKYHEKEE-GWRRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHE 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G +RK + A + + N + P+ DWREKG V PVKDQG CGSCW+FS
Sbjct: 86 EFRQIMNGYQRKTE--RKAIGSLFMEPNFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFS 143
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALZG NF GKLVSLSEQ LVDC PE + GC GGLM+ AF+Y G
Sbjct: 144 TTGALZGQNFRKMGKLVSLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVKDNQG 196
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
L E+ YPY GTD C +D + + F + S E + + GP++VAI+A
Sbjct: 197 LDSEDSYPYLGTDD-QPCHYDPKYNSVNDTGFVDIPSGKEHALMKAVASVGPVSVAIDAG 255
Query: 292 Y--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ C S LDHGVL VGYG G + K YWI+KNSW E WG+
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGE---DVDGKKYWIVKNSWSEKWGDK 312
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + + R N CG+ + S
Sbjct: 313 GYIYMAKDRKNHCGIATAAS 332
>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 162/311 (52%), Gaps = 29/311 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + + Y + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E +A +L +LSEQQLV CD + DSGC GGLM AFE+ L+ G +
Sbjct: 158 NIESQWAVAGHRLTALSEQQLVSCDDK---------DSGCGGGLMTQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGT--DRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPY + D + A + + + E +AA L K+GP+++ ++A
Sbjct: 209 FTEDSYPYVSSXGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIGVDAS 268
Query: 292 YMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGE G
Sbjct: 269 SFMSYESGVLTSC---AGBXLNHGVLLVGYNXTG-------EVPYWVIKNSWGEDWGEKG 318
Query: 350 YYKICRGRNVC 360
Y ++ G N C
Sbjct: 319 YVRVAMGVNAC 329
>gi|403367386|gb|EJY83513.1| Cathepsin L [Oxytricha trifallax]
Length = 339
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 173/310 (55%), Gaps = 29/310 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA-THGITQFSDLTPAEFRR 118
F+ + K+ K+Y ++EE RF ++ N+ A H + + T +F+D TPAE+++
Sbjct: 43 FANYLAKYGKSYGTKEEFQFRFQQYQQNMALIAHHNSNNENTFTLASNKFADYTPAEYKK 102
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
LG +R +PK Q +P DWR KGAV PVKDQG CGSCW+FSTTG+LE
Sbjct: 103 L-LGYKR---MPKANAQYAEFDLTAVPDSIDWRTKGAVTPVKDQGQCGSCWAFSTTGSLE 158
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G + +ATG L S SEQQLVDCD+ D + GCNGG M A +Y+ K L E D
Sbjct: 159 GRDAIATGTLQSYSEQQLVDCDYSTDGNQ------GCNGGDMGLAMDYSAK-NPLELESD 211
Query: 239 YPYTGTDRGHACKFDK--SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYM--Q 294
YPY D + K DK SK N SL + + A + GP++VAI A M Q
Sbjct: 212 YPYKAIDGKCSYKADKGHSKNKGHT-NVKQNSLPDLKAA---IAQGPVSVAIEADTMVFQ 267
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GG+ C LDHGVL VGYGS KPY+I+KNSWG SWGE GY +I
Sbjct: 268 FYNGGILNSKSCGTNLDHGVLAVGYGSE-------NNKPYYIVKNSWGPSWGEQGYLRIA 320
Query: 355 R--GRNVCGV 362
+ G +CG+
Sbjct: 321 QVDGAGICGI 330
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 177/324 (54%), Gaps = 30/324 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDL 111
+ +S FK + +K Y S+ E R IF N + A+H KL G+ +++D+
Sbjct: 23 VQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADM 82
Query: 112 TPAEFRRTYLGLRR-KLRLPKDADQAP----ILPTN-DLPADFDWREKGAVGPVKDQGSC 165
EF T G + K + K +D I P N LP DWR+KGAV VKDQG C
Sbjct: 83 LHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHC 142
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCWSFS +G+LEG +F TGKLVSLSEQ LVDC ++GCNGGLM++AF
Sbjct: 143 GSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYG-------NNGCNGGLMDNAFR 195
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPL 284
Y GG+ E+ YPY D C + A+ F + +ED + A + GP+
Sbjct: 196 YIKDNGGIDTEQSYPYLAEDE--KCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPI 253
Query: 285 AVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
++AI+A Y Q Y GV S P S+ LDHGVL+VGYG++ + YW++KNSW
Sbjct: 254 SIAIDASYETFQLYSDGVYSDPECISQELDHGVLVVGYGTSDDG------QDYWLVKNSW 307
Query: 342 GESWGENGYYKICRGR-NVCGVDS 364
S G NGY K+ R + N+CGV S
Sbjct: 308 RPSCGLNGYIKMARNQDNMCGVAS 331
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 174/321 (54%), Gaps = 27/321 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E + +K N+ Y EE + R +++ N++ H H T F D+T
Sbjct: 26 EAQWIKWKAMHNRLYGKNEE-EWRRAVWEKNMKTIELHNHEYNQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDAD--QAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G + R P++ Q P+L ++ P DWREKG V PVK+QG CGSCW+
Sbjct: 85 NEEFRQVMNGFQN--RKPRNGKVFQEPLL--HEAPRSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y +
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNQGCNGGLMDYAFQYVQEN 193
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GGL EE YPY T+ +CK++ A+ F + E + + GP++VAI+A
Sbjct: 194 GGLDSEESYPYEATEE--SCKYNPKYSVANDTGFVDIPKLEKALMKAVATVGPISVAIDA 251
Query: 291 VY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
+ Q Y G+ P S +DHGVL+VGY G+ YW++KNSWGE WG
Sbjct: 252 GHESFQFYKEGIYFEPECSSEDMDHGVLVVGY---GFERTGSDNSKYWLVKNSWGEEWGM 308
Query: 348 NGYYKICRGR-NVCGVDSMVS 367
+GY K+ + R N CG+ S S
Sbjct: 309 DGYIKMAKDRKNHCGIASAAS 329
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/354 (36%), Positives = 190/354 (53%), Gaps = 31/354 (8%)
Query: 10 LVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNK 69
+++ ++F+ SS + D+ + D + + + +D ++ + ++ + +
Sbjct: 5 IITTLLFALFSSLSYAIDM-----SIIDYKNNHYARKWTLQSDEDQVKNRYEMWLAEHGR 59
Query: 70 AYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYLGL----R 124
AY + E + RF IFK NLR H + + G+ QF+DLT E+R YLG R
Sbjct: 60 AYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDAR 119
Query: 125 RKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFL 183
R+ K+ Q N+L P DWR++GAV P+K+QGSCGSCW+FST A+EG N +
Sbjct: 120 RRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVEGINQI 179
Query: 184 ATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG 243
TG++++LSEQ+LVDCD +SGCNGGLM+ AFE+ + GG+ E+ YPY G
Sbjct: 180 VTGEMITLSEQELVDCDR--------VQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRG 231
Query: 244 TDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGVS 301
+ G K+ S+ + V +E + V + P+ VAI A Q Y GV
Sbjct: 232 VE-GRCDPVRKNYKVVSIDGYEDVPRNERAL-QKAVAHQPVCVAIEASGRAFQLYSSGVF 289
Query: 302 CPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
C +DHGV++VGYGS YWI++NSWG WGENGY K+ R
Sbjct: 290 TGE-CGEEVDHGVVVVGYGSEDGV-------DYWIVRNSWGTKWGENGYVKMER 335
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 141/362 (38%), Positives = 187/362 (51%), Gaps = 35/362 (9%)
Query: 16 FSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKKKFNKAYA 72
F SG+ D+ + IR V+D L E++ ++G FS F + K Y
Sbjct: 17 FCVAVSGSNFDESNP-IRLVSD----RLRDFEASVTKVVGHSRRALSFSRFVYRHGKRYQ 71
Query: 73 SQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKD 132
S++E RF IF NL + S T + F+DLT EF++ LG +
Sbjct: 72 SEDEMKMRFAIFSENLDFIRSTNRKGLSYTLAVNDFADLTWQEFQKHRLGAAQNCSATTK 131
Query: 133 ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
+ L LP DWRE G V PVK+QG CGSCW+FSTTGALE A A GK +SLS
Sbjct: 132 GNHK--LTGVALPDTKDWREVGIVSPVKNQGHCGSCWTFSTTGALEAAYHQAFGKGISLS 189
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKF 252
EQQLVDC + + GC+GGL + AFEY GGL EE YPYTG D ACKF
Sbjct: 190 EQQLVDCAGAFN-------NFGCHGGLPSQAFEYIKYNGGLETEEAYPYTGED--GACKF 240
Query: 253 DKSKIAASV---ANFSVVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSR 308
+ V N ++ + DE + A LV+ P++VA V + Y GV C
Sbjct: 241 SSENVGIQVLDSVNITLGAEDELKEAVGLVR--PVSVAFEVVSGFRFYKSGVYTSDTCGS 298
Query: 309 R---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSM 365
++H VL VGYG PYW++KNSWGE+WG++GY+K+ G+N+CGV +
Sbjct: 299 TPMDVNHAVLAVGYGVE-------DGVPYWLVKNSWGENWGDHGYFKMEMGKNMCGVATC 351
Query: 366 VS 367
S
Sbjct: 352 AS 353
>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
Length = 467
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 124/320 (38%), Positives = 166/320 (51%), Gaps = 33/320 (10%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ FK+K + Y S E R ++F+ANL A H +P AT G+T FSDLT EFR
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96
Query: 119 TY-------LGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
Y + + R+P D + PA DWRE+GAV VK+QG CGSCW+F
Sbjct: 97 RYHNGAAHFAAAQERARVPVDVEFV------GAPAAKDWREEGAVTAVKNQGMCGSCWAF 150
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL--K 229
+ G +E FLA L LSEQ LV CD+ +SGC GG AF++ +
Sbjct: 151 AAIGNIECQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGWPLVAFKWIVDRN 201
Query: 230 AGGLMREEDYPY-TGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI 288
G + EE YPY + C + A++ + + DE+ IAA L NGP+AV +
Sbjct: 202 NGTVYTEESYPYHSCIGISPPCTTSGHTVGATITGYVTIPRDENGIAAWLAVNGPVAVVV 261
Query: 289 NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+A Y GGV + S++L H VLLVGY + P+WIIKNSW WGE+
Sbjct: 262 DASSWIFYTGGVMTSCV-SKQLSHAVLLVGYNDSATV-------PHWIIKNSWTTHWGED 313
Query: 349 GYYKICRGRNVCGVDSMVST 368
GY +I +G N C V VS+
Sbjct: 314 GYIRIAKGSNQCLVKEGVSS 333
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/328 (39%), Positives = 178/328 (54%), Gaps = 31/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFN---KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
DL + LF++ + K Y + EE HRF +FK NL+ K S G+ +F
Sbjct: 34 DLTSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEF 93
Query: 109 SDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
+DLT EF+ YLGL+ R + ++ DLP DWR+KGAV VK+QGSCGS
Sbjct: 94 ADLTHQEFKNMYLGLKVESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGS 153
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FST A+EG N + G L SLSEQ+L+DCD ++GC+GGLM+ AF +
Sbjct: 154 CWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFI 205
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAV 286
+ +GGL +EEDYPY + C K ++ +++ + V + + + + PL+V
Sbjct: 206 VSSGGLHKEEDYPYLEVES--TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSV 263
Query: 287 AINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI A Q Y GGV P C +LDHGV VGYGS+ K Y I+KNSWG
Sbjct: 264 AIEASGRDFQFYSGGVFDGP--CGTQLDHGVTAVGYGSS-------KGVDYIIVKNSWGP 314
Query: 344 SWGENGYYKICRGR----NVCGVDSMVS 367
WGE GY ++ R +CG++ M S
Sbjct: 315 KWGEKGYIRMKRNTGKPAGLCGINKMAS 342
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 171/322 (53%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H++L+K K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 27 DEHWNLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMT 85
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G + K + + L N L P DWR+KG V PVKDQG CGSCW+
Sbjct: 86 NEEFRQLMNGYKHKAE--RKVKGSLFLEPNFLEAPRSLDWRDKGYVTPVKDQGQCGSCWA 143
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS TGALEG F TGK+V LSEQ LV+C PE + GCNGGLM+ AF+Y
Sbjct: 144 FSATGALEGQQFRKTGKMVQLSEQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDN 196
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL EE YPY GTD C +D A + F + S E + + GP++VAI+
Sbjct: 197 QGLDSEESYPYLGTD-DQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPISVAID 255
Query: 290 AVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ P S LDHGVLLVGYG G + K YWI+KNSW E WG
Sbjct: 256 AGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEG---EDVDGKKYWIVKNSWSEKWG 312
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 313 DKGYVYMAKDRQNHCGIATAAS 334
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 171/332 (51%), Gaps = 41/332 (12%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
+ + HE D + F+ FK K+ K Y E RF IFKAN+ + +
Sbjct: 12 VAAGHEVPPPDYM---MMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTF 68
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLR---LPK----DADQAPILPTNDLPADFDWREKG 154
G+ +F+DLT E +Y GL+ LP+ + + AP L + DW +G
Sbjct: 69 ALGVNEFTDLTQEELAASYTGLKPASLWSGLPRLSTHEYNGAP------LASSVDWTTQG 122
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
V PVK+QG CGSCWSFSTTGALEGA L+TG LVSLSEQQ VDCD + DSG
Sbjct: 123 VVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFVDCD---------TTDSG 173
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIA---ASVANFSVVSLDE 271
CNGG M++AF + K + E YPYT TD C ++ V ++ VS D
Sbjct: 174 CNGGWMDNAFSFA-KKNSICTEGSYPYTATD--GTCNLSGCQVGIPQGGVVGYTDVSTDS 230
Query: 272 DQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRL 329
+Q + V P+++AI A Q Y GV C RLDHGVL VGYGS
Sbjct: 231 EQAMMSAVAQQPVSIAIEADQYSFQLYSSGV-LTASCGTRLDHGVLAVGYGSE------- 282
Query: 330 KEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
YW +KNSWG SWGE GY ++ RG+ G
Sbjct: 283 AGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAG 314
>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
Length = 357
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 138/379 (36%), Positives = 192/379 (50%), Gaps = 42/379 (11%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH---H 59
++ + + L +L+ + S + + I VTD + + ES+ +LG
Sbjct: 2 ARILAIVLSTLLALAIAVSAARSFEETEYIDMVTDK----IQNLESSLFKILGTNPKSVQ 57
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ F ++ K Y S + HRF F N+ ++ T I +F+D+T EF
Sbjct: 58 FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHGQ 117
Query: 120 YLGLRRKLRLPKD----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YLG + K D P P DWRE+G V PVK+Q CGSCW+FSTTG
Sbjct: 118 YLGASQNCSATKSNHKFTDAQP-------PTKKDWREEGIVSPVKNQAHCGSCWTFSTTG 170
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALE A ATGK V LSEQQLVDC + + GC+GGL + AFEY GGL
Sbjct: 171 ALEAAYTQATGKTVILSEQQLVDCAGAFN-------NFGCSGGLPSQAFEYIKYNGGLDT 223
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVA---NFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPYT D C +D + + VA N S+ + DE + A LV+ P++VA +
Sbjct: 224 EEAYPYTAKD--GVCNYDVNNVGVKVADSVNISLGAEDELKSAVGLVR--PVSVAFQVIQ 279
Query: 293 -MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Y GV C + ++H VL VGYG + + P+WIIKNSWG+SWG
Sbjct: 280 DFRFYKEGVFTSTTCGQGPMDVNHAVLAVGYG------VSEEGTPHWIIKNSWGKSWGVE 333
Query: 349 GYYKICRGRNVCGVDSMVS 367
GY+K+ G+N+CGV + S
Sbjct: 334 GYFKMEMGKNMCGVATCAS 352
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 166/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF + R A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + ED + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 176/322 (54%), Gaps = 23/322 (7%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A +F F FNK Y+S+ E HRF IF+ NL D SA + I +FSDL
Sbjct: 20 DLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPIL--PTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
+ E Y GL L+ ++ + +L P + P +FDWR V VK+QG+CG+CW
Sbjct: 80 SKDETISKYTGLSLPLQ-NQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+F+T G+LE + +L++LSEQQL+DCD D GC+GGL+++A+E +
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAVMN 189
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E DYPY + C+ + +K V + +++ E+++ L GP+ VAI
Sbjct: 190 MGGIQAENDYPYEANNGD--CRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAI 247
Query: 289 NAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+A + Y G+ Y + L+H VLLVGY P+WI+KN+WG WGE
Sbjct: 248 DASDIVNYKRGI-MKYCANHGLNHAVLLVGYAVQNGV-------PFWILKNTWGADWGEQ 299
Query: 349 GYYKICRGRNVCGVDSMVSTVA 370
GY+++ + N CG+ + + + A
Sbjct: 300 GYFRVQQNINACGIQNELPSSA 321
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 172/304 (56%), Gaps = 28/304 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + ++ ++ Y S E RF IFK NL H K + S G+ +FSDLT EFR
Sbjct: 52 FHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNKFSDLTHDEFRAL 111
Query: 120 YLGLRRKLRLP--KDADQAPILPTNDLPAD--FDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YLG+R R ++ D+ D+ A+ DWR+KGAV VKDQGSCGSCW+FS G
Sbjct: 112 YLGIRPAGRAHGLRNGDR---FIYEDVVAEEMVDWRKKGAVSDVKDQGSCGSCWAFSAIG 168
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
++EG N + TG+L+SLSEQ+LVDCD + GCNGGLM+ AF++ +K GG+
Sbjct: 169 SVEGVNAIVTGELISLSEQELVDCDR--------GQNQGCNGGLMDYAFDFIIKNGGIDT 220
Query: 236 EEDYPYTGTD-RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
EEDYPY TD + + + SK+ + ++ V + V P++VAI A
Sbjct: 221 EEDYPYKATDGQCDEARKETSKVVV-IDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGRD 279
Query: 293 MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GGV + P C LDHGVL VGYG+ YWI+KNSWG SWGE GY
Sbjct: 280 FQHYQGGVFTGP--CGTDLDHGVLAVGYGTDDDGV------NYWIVKNSWGPSWGEKGYI 331
Query: 352 KICR 355
++ R
Sbjct: 332 RMER 335
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 176/346 (50%), Gaps = 31/346 (8%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEH---HFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR VTD L EST LG F+ F ++ K+Y S E RF IF +L
Sbjct: 31 IRPVTDRAASAL---ESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRIFSESL 87
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+ + S GI +F+D++ EFR T LG + + LP
Sbjct: 88 QLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCSATLTGNHRMRAAAVALPETK 147
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVK+QG CGSCW+FSTTGALE A ATGK +SLSEQQL+DC +
Sbjct: 148 DWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFN---- 203
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANFS 265
+ GCNGGL + AFEY GGL EE YPY G + CKF + V N +
Sbjct: 204 ---NFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVN--GICKFKNENVGVKVLDSVNIT 258
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
+ + DE + A LV+ P++VA + + Y GV C ++H VL VGYG
Sbjct: 259 LGAEDELKDAVGLVR--PVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGV 316
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWG WG+ GY+K+ G+N+CGV + S
Sbjct: 317 E-------DGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 172/329 (52%), Gaps = 39/329 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK------------LDP 99
+L +E +F F +++NK+Y +E+ +R+ +FK NL + + L
Sbjct: 47 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 106
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAV 156
SA G+ +FSD TP E + G L + I+ P LP +DWR+ V
Sbjct: 107 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKV 166
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
P+KDQG CGSCW+F G +E + KL+ LSEQQL+DCD D GCN
Sbjct: 167 TPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCN 217
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIA 275
GGLM+ AF+ L GG+ E DYPY G+++ C D KIA + + F DE+++
Sbjct: 218 GGLMHLAFQELLLMGGVETEADYPYQGSEQ--MCTLDNRKIAVKLNSCFKYDIRDENKLK 275
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
+ GP+A+A++A+ + Y G+ C L+H VLL+G+G P
Sbjct: 276 ELVYTTGPVAIAVDAMDIINYRRGILNQCHIY---DLNHAVLLIGWGIEN-------NVP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGV 362
YWIIKNSWGE WGENGY ++ R N CG+
Sbjct: 326 YWIIKNSWGEDWGENGYLRVRRNVNACGL 354
>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 250
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 103/224 (45%), Positives = 138/224 (61%), Gaps = 17/224 (7%)
Query: 144 LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 203
LP+ FDWRE+G + PVK Q +CG CW+F+TTG +E L KLV+ SEQQL+DCD
Sbjct: 39 LPSYFDWREQGIITPVKYQDTCGGCWTFATTGVIESQYALKYNKLVNFSEQQLIDCD--- 95
Query: 204 DPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN 263
S + GC GGLM A++ + GGL EDY +G CK D +K++A V N
Sbjct: 96 ------SINDGCRGGLMTDAYKAIQEMGGLETSEDYGEYLNSKGQ-CKIDSNKVSAKVIN 148
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAG 323
+ +S DE+ I LV+NGP+AV +NA ++Q Y GG+ P +C ++H VL+VGYG
Sbjct: 149 WYQISEDEEAIRRELVQNGPIAVGVNARFLQFYQGGILDPKLCDDSINHAVLIVGYGEE- 207
Query: 324 YAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
K YWIIKN WG+SWG NGY+K+ RG+ CGV + S
Sbjct: 208 ------NGKKYWIIKNQWGKSWGINGYFKLVRGKKQCGVHTYAS 245
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 130/299 (43%), Positives = 159/299 (53%), Gaps = 24/299 (8%)
Query: 80 RFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRRTYLGLRRK--LRLPKDA 133
R +K NL+ H H G+ F D+T EFR+ G + K R
Sbjct: 21 RRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL 80
Query: 134 DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSE 193
P ++P DWREKG V PVKDQG CGSCW+FSTTGALEG F TGKLVSLSE
Sbjct: 81 FMEPXFI--EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSE 138
Query: 194 QQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFD 253
Q LVDC PE + GCNGGLM+ AF+Y GL EE YPY GTD C FD
Sbjct: 139 QNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDD-QPCHFD 190
Query: 254 KSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSCPYIC-SRR 309
AA+ F + S E + + GP++VAI+A + Q Y G+ C S
Sbjct: 191 PKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEE 250
Query: 310 LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
LDHGVL VGYG G + K YWI+KNSW E+WG+ GY + + R N CG+ + S
Sbjct: 251 LDHGVLAVGYGFEGED---VDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAAS 306
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 172/329 (52%), Gaps = 39/329 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK------------LDP 99
+L +E +F F +++NK+Y +E+ +R+ +FK NL + + L
Sbjct: 49 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 108
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAV 156
SA G+ +FSD TP E + G L + I+ P LP +DWR+ V
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKV 168
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
P+KDQG CGSCW+F G +E + KL+ LSEQQL+DCD D GCN
Sbjct: 169 TPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCN 219
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIA 275
GGLM+ AF+ L GG+ E DYPY G+++ C D KIA + + F DE+++
Sbjct: 220 GGLMHLAFQELLLMGGVETEADYPYQGSEQ--MCTLDNRKIAVKLNSCFKYDIRDENKLK 277
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
+ GP+A+A++A+ + Y G+ C L+H VLL+G+G P
Sbjct: 278 ELVYTTGPVAIAVDAMDIINYRRGILNQCHIY---DLNHAVLLIGWGIEN-------NVP 327
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGV 362
YWIIKNSWGE WGENGY ++ R N CG+
Sbjct: 328 YWIIKNSWGEDWGENGYLRVRRNVNACGL 356
>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 174/312 (55%), Gaps = 22/312 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPAEFRR 118
FK+ K Y S E RF IF++NLR+ A++ K + S G+T F+DLT EF+
Sbjct: 26 FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85
Query: 119 TYLGLRRKLRLPKDADQA-PILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
LRR+++ + + + P ++P DW +KGAV VK QG CGSCW+FS TGA
Sbjct: 86 E---LRRQIKTKPNVEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGA 142
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LEG N + + LSEQQL+DC +P D +GGLM+ AF+Y L G+ +
Sbjct: 143 LEGQNAIVNNVKIPLSEQQLLDC------SKPYGNDDCEHGGLMSFAFDYVLDK-GIEAD 195
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTY 296
YPY G D C++D K + + VS E+++ + GP++VAI+A +Q Y
Sbjct: 196 SSYPYKGIDT--PCQYDAKKTVLKIKGYKNVSNSEEELKKAVGTVGPVSVAIDADPIQLY 253
Query: 297 IGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR- 355
GG+ C+ L+HGVL VGYG + +K +W +KNSWG+ WGE GY++I R
Sbjct: 254 FGGILDGLFCTHNLNHGVLAVGYGEEDHL---FGKKKFWKVKNSWGKDWGEQGYFRIKRD 310
Query: 356 GRNVCGVDSMVS 367
N+CG+ S
Sbjct: 311 ANNLCGIADKAS 322
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 128/328 (39%), Positives = 178/328 (54%), Gaps = 31/328 (9%)
Query: 52 DLLGAEHHFSLFKKKFN---KAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQF 108
DL + LF++ + K Y + EE HRF +FK NL+ K S G+ +F
Sbjct: 37 DLTSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEF 96
Query: 109 SDLTPAEFRRTYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGS 167
+DLT EF+ YLGL+ R + ++ DLP DWR+KGAV VK+QGSCGS
Sbjct: 97 ADLTHQEFKNMYLGLKVESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGS 156
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FST A+EG N + G L SLSEQ+L+DCD ++GC+GGLM+ AF +
Sbjct: 157 CWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFI 208
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAV 286
+ +GGL +EEDYPY + C K ++ +++ + V + + + + PL+V
Sbjct: 209 VSSGGLHKEEDYPYLEVES--TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSV 266
Query: 287 AINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI A Q Y GGV P C +LDHGV VGYGS+ K Y I+KNSWG
Sbjct: 267 AIEASGRDFQFYSGGVFDGP--CGTQLDHGVTAVGYGSS-------KGVDYIIVKNSWGP 317
Query: 344 SWGENGYYKICRGR----NVCGVDSMVS 367
WGE GY ++ R +CG++ M S
Sbjct: 318 KWGEKGYIRMKRNTGKPAGLCGINKMAS 345
>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 165/312 (52%), Gaps = 31/312 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWR+KGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
++E LA L +LSEQQLV CD + D+GC+GGLM AFE+ L+ G +
Sbjct: 158 SIESQWALAGHGLTALSEQQLVSCDDK---------DNGCSGGLMLQAFEWLLRNMNGTM 208
Query: 234 MREEDYPYTGTDRGHACKFDKSK---IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
E+ YPY + G+ + S A + + + E A L KNGP+++A++A
Sbjct: 209 FTEDSYPYVSSS-GYVPECSNSSQLVPGARIEGYMTIESSETVKGAWLAKNGPISIAVDA 267
Query: 291 VYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+Y GV SC L+HGVLLVGY G E PYW+IKNSWGE WGE
Sbjct: 268 SSFMSYQSGVLTSC---AGDALNHGVLLVGYNRTG-------EVPYWVIKNSWGEDWGEK 317
Query: 349 GYYKICRGRNVC 360
GY ++ G N C
Sbjct: 318 GYVRVTMGVNAC 329
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 121/342 (35%), Positives = 173/342 (50%), Gaps = 39/342 (11%)
Query: 36 TDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ 95
T GGDE + + + ++ + Y E HRF +FKAN R
Sbjct: 47 TTGGDEAMMMA------------RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSN 94
Query: 96 KLDPSA-THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN-------DLPAD 147
G QF+DLT EF Y GLR+ +P A Q P + D
Sbjct: 95 AGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQ 154
Query: 148 FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 207
DWR++GAV PVK+QG CG CW+FS GA+EG + TG LVSLSEQQ++DCD E D +
Sbjct: 155 VDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCD-ESDGNQ 213
Query: 208 PGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVV 267
GCNGG M++AF+Y + GG+ E+ YPY+ C+ + AA+++ F +
Sbjct: 214 ------GCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQ--GTCQ--NVQPAATISGFQDL 263
Query: 268 SLDEDQIAANLVKNGPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYA 325
++ AN V N P++V ++ + Q Y GG+ C ++H V +GYG+
Sbjct: 264 PSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGA---- 319
Query: 326 PIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
+ YWI+KNSWG WGENG+ ++ G CG+ +M S
Sbjct: 320 --DDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMAS 359
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 169/325 (52%), Gaps = 25/325 (7%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ--- 107
N A+ H +K + + Y + EE + R +++ N++ H HG T
Sbjct: 22 NQTFNAQWH--KWKSTYRRLYGTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMN 78
Query: 108 -FSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCG 166
F D+T EFR+ G + + Q P++ LP DWREKG V PVK+QG CG
Sbjct: 79 AFGDMTNEEFRQLVNGYKHQKHRKGKVFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
SCW+FS GALEG L TG LVSLSEQ LVDC + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQ-------AEGNQGCNGGLMDFAFQY 189
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAV 286
L GL EE YPY D CK+ AA+ + + E + + GP+A+
Sbjct: 190 VLNNKGLDSEESYPYEAKDG--TCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIAI 247
Query: 287 AINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGE 343
AI+A + Q Y G+ P S+ LDHGVL+VGYG G +K YWI+KNSWG
Sbjct: 248 AIDASHPSFQFYSSGIYYEPNCSSKELDHGVLVVGYGFEG---TDSNKKKYWIVKNSWGS 304
Query: 344 SWGENGYYKICRGRNV-CGVDSMVS 367
SWG G++ I + +N CGV + S
Sbjct: 305 SWGMGGFFHIAKDKNNHCGVATAAS 329
>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
Length = 336
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 126/318 (39%), Positives = 171/318 (53%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + ++ K Y S EE+ HR F +N R+ H + + GI FSD+T AEF+R
Sbjct: 35 HFKSWMEQHQKTY-SAEEYRHRLQTFASNQRKIKEHNARNHTFKMGINPFSDMTFAEFKR 93
Query: 119 TYLGLRRKLRLPKD--ADQAPILPTN-DLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ A ++ L + P DWR+KG V PVK+QG CGSCW+FSTT
Sbjct: 94 RYL-----WSEPQNCSATKSNYLRGHGPYPTSVDWRKKGRFVSPVKNQGGCGSCWTFSTT 148
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A + TGK++SLSEQQLVDC + + GC GGL + AFEY G+M
Sbjct: 149 GALESAIAIKTGKMLSLSEQQLVDCAQNFN-------NHGCQGGLPSQAFEYIRYNKGIM 201
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D C+F K A V + + ++L DE + + P++ A
Sbjct: 202 EEDSYPYEGKDSN--CRFQPEKAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTSD 259
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG KPYWI+KNSWG WG NG
Sbjct: 260 FMLYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQ-------NGKPYWIVKNSWGPYWGMNG 312
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG N+CG+ + S
Sbjct: 313 YFLIERGTNMCGLAACAS 330
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 177/324 (54%), Gaps = 30/324 (9%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDL 111
+ +S FK + +K Y S+ E R IF N + A+H KL G+ +++D+
Sbjct: 23 VQEQWSSFKMQHSKNYDSETEERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADM 82
Query: 112 TPAEFRRTYLGLRR-KLRLPKDADQAP----ILPTN-DLPADFDWREKGAVGPVKDQGSC 165
EF T G + K + K +D I P N LP DWR+KGAV VKDQG C
Sbjct: 83 LHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHC 142
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
GSCWSFS +G+LEG +F TGKLVSLSEQ LVDC ++GCNGGLM++AF
Sbjct: 143 GSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYG-------NTGCNGGLMDNAFR 195
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPL 284
Y GG+ E+ YPY D C + A+ F + +ED + A + GP+
Sbjct: 196 YIKDNGGIDTEQSYPYLAEDE--KCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPV 253
Query: 285 AVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
++AI+A Y Q Y GV S P S+ LDHGVL+VGYG++ + YW++KNSW
Sbjct: 254 SIAIDASYETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDG------QDYWLVKNSW 307
Query: 342 GESWGENGYYKICRGR-NVCGVDS 364
S G NGY K+ R + N+CGV S
Sbjct: 308 RPSCGLNGYIKMARNQDNMCGVAS 331
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 166/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HRGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + E + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVD--GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 174/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H+ L+K K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 25 DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G +RK + + + N L P DWR+ G V PVKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGA+EG +F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+ YPY GTD C +D +A+ F + S E + + GP++VAI+
Sbjct: 195 QGLDSEDSYPYLGTD-DQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSVAID 253
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 254 AGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGE---DVDGKKYWIVKNSWSEKWG 310
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 311 DKGYIYMAKDRKNHCGIATAAS 332
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/330 (40%), Positives = 173/330 (52%), Gaps = 40/330 (12%)
Query: 54 LGAEHHFSLFKKKFNKAYASQE------EHDHRFTIFKANLRRAARHQKLDPSATHGITQ 107
L AE H+ + +F Q+ E R++ FK NL R ++ G T
Sbjct: 19 LFAEKHY---QNQFTNWMVVQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATV 75
Query: 108 FSDLTPAEFRRTYLGLRRKLR----LPKDADQA--PILPTNDLPADFDWREKGAVGPVKD 161
F+DLT E+R YLG+ P DQ P+ T DWR GAVG VKD
Sbjct: 76 FADLTNEEYRAVYLGMNVDASNFAAQPATLDQVYQPVRST------LDWRNNGAVGRVKD 129
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG CGSCW+FSTTGA+EGA+ +ATG VSLSEQQL+DC + GC GGLM+
Sbjct: 130 QGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYG-------NHGCQGGLMD 182
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
SA Y +K GG+ EE YPY D + CK++ + A ++ +S + + A +
Sbjct: 183 SAMSYIVKQGGINTEESYPYEMRD-SYTCKYNPANNGAKLSGYSNIKRGSEADLAAKLNI 241
Query: 282 GPLAVAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIK 338
GP+A+A++A + Q Y GV P S L HGVL VGYG+ G YWI+K
Sbjct: 242 GPVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLAVGYGTEG-------SSAYWIVK 294
Query: 339 NSWGESWGENGYYKICRGRNV-CGVDSMVS 367
NSWG WG+ GY I + RN CGV +M S
Sbjct: 295 NSWGTRWGDAGYIWIAKDRNNHCGVATMSS 324
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 193/362 (53%), Gaps = 41/362 (11%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
S T+ + L+ +++FS +SS + + + DE HH S +D + A + L
Sbjct: 5 SSTLTISLLLMLIFSTLSSASDMSIISY---------DETHIHHRS--DDEVSALYESWL 53
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEFRRTYL 121
+ K+Y + E D RF IFK NL+ + + S G+T+F+DLT E+R YL
Sbjct: 54 IE--HGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYL 111
Query: 122 GL-----RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
G RRKL K P + + LP DWR+KG + VKDQGSCGSCW+FS A
Sbjct: 112 GTKSSGDRRKLSKNKSDRYLPKV-GDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAA 170
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+E N + TG L+SLSEQ+LVDCD S + GC+GGLM+ AFE+ + GG+ E
Sbjct: 171 MESINAIVTGNLISLSEQELVDCDK--------SYNEGCDGGLMDYAFEFVINNGGIDTE 222
Query: 237 EDYPYTGTDRGHAC-KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYM 293
EDYPY +R C ++ K+ + ++ V ++ ++ V + P+++AI A +
Sbjct: 223 EDYPY--KERNDVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDL 280
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y G+ C +DHGV+ GYGS YWI++NSWG WGE GY ++
Sbjct: 281 QHYKSGIFTGK-CGTAVDHGVVAAGYGSE-------NGMDYWIVRNSWGAKWGEKGYLRV 332
Query: 354 CR 355
R
Sbjct: 333 QR 334
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 168/317 (52%), Gaps = 23/317 (7%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPA 114
+ L+K K Y EE R ++K N++ H + H + F D+T
Sbjct: 28 QWKLWKAAHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDMTNE 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
EFR T G +R+ I + +P DWREKG V PVK+QG CGSCW+FS T
Sbjct: 87 EFRHTMNGFQRQKNKKGKEFHETIFAS--IPPSVDWREKGYVTPVKNQGKCGSCWAFSAT 144
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALEG F TGKLVSLSEQ LVDC PE + GC+GG +++AF+Y L GGL
Sbjct: 145 GALEGQMFQKTGKLVSLSEQNLVDCSQ---PE----GNRGCHGGFIDNAFQYVLDVGGLD 197
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VY 292
EE YPYTG C ++ + AA+ F + E + + GP++VA++A
Sbjct: 198 SEESYPYTGLVG--TCLYNPNNSAANETGFVDLPKQEKALMKAVANLGPISVAVDAHNPS 255
Query: 293 MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y G+ P S +DH VL+VGYG G + YW++KNSWGE WG NGY
Sbjct: 256 FQFYKSGIYYEPNCSSESVDHAVLVVGYGFEG---ADSDDNKYWLVKNSWGEHWGMNGYI 312
Query: 352 KICRGRNV-CGVDSMVS 367
K+ + RN CG+ +M S
Sbjct: 313 KMAKDRNNHCGIATMAS 329
>gi|454101|gb|AAA82966.1| cathepsin H prepropeptide [Mus musculus]
Length = 333
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K+ K Y+S E++HR +F N R+ H + + + + QFSD++ AE +
Sbjct: 32 HFKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKH 90
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFSTT 174
+L P++ + T P+ DWR+KG V PVK+QG+C SCW+FSTT
Sbjct: 91 KFLWSE-----PQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACASCWTFSTT 145
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +A+GK++SL+EQQLVDC + + GC GGL + AFEY L G+M
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFN-------NHGCKGGLPSQAFEYILYNKGIM 198
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D +C+F+ K A V N ++L DE + + P++ A
Sbjct: 199 EEDSYPYIGKDS--SCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GV C + +++H VL VGYG YWI+KNSWG WGENG
Sbjct: 257 FLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQN-------GLLYWIVKNSWGSQWGENG 309
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 310 YFLIERGKNMCGLAACAS 327
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 141/357 (39%), Positives = 185/357 (51%), Gaps = 38/357 (10%)
Query: 28 VDQLIRQVTDGGDEILSHHESTNNDLL---GAEHHFSLFKKKFNKAYASQEEHDHRFTIF 84
+ QL +Q G D I + S N +LL GA F FK F +AY + E RF IF
Sbjct: 33 LSQLFKQKAVG-DGIFN---SENLELLSNIGAAWKF--FKINFKRAYGNVMEETKRFLIF 86
Query: 85 KANLRRAARHQKL--DPSATH--GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILP 140
N + H + + AT+ G+ F+D T E R+ G R R+ K I
Sbjct: 87 GTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELRKL-RGYRSACRIAKPKGSTFISS 145
Query: 141 TN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
+ LP DWR GAV PVK+QG CGSCW+FS+TGA+EG ++ T +LV+LSEQQL+DC
Sbjct: 146 EHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDC 205
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG--TDRGHACKFDKSKI 257
++GC GGLM+ AF+Y G+ E YPY D C F+ + I
Sbjct: 206 SKSYG-------NNGCEGGLMDLAFQYVRDNEGIDSEISYPYISGDGDENVRCLFNSTNI 258
Query: 258 AASVANFSVVSLDEDQIAANLVKN-GPLAVAINA--VYMQTYIGGVSCPYIC---SRRLD 311
A V + + +++ N V GP++VAINA Y G+ C S LD
Sbjct: 259 MAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLSSFSMYKSGIYSDPECASASEDLD 318
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR-GRNVCGVDSMVS 367
HGVLLVGYG KPYW+IKNSWGE WG+ GY KI + +N+CGV S S
Sbjct: 319 HGVLLVGYGIE-------DGKPYWLIKNSWGEDWGDKGYVKILKDSKNMCGVASAAS 368
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 122/321 (38%), Positives = 175/321 (54%), Gaps = 27/321 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + + A ++ +K + K+Y + E + R+ F+ NLR H +
Sbjct: 25 IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAV 156
H G+ +F+DLT E+R TYLGLR K R + + N+ LP DWR KGAV
Sbjct: 82 VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+KDQ GSCW+FS A+EG N + TG L+SLSEQ+LVDCD S + GCN
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAA 276
GGLM+ AF++ + GG+ E+DYPY G D +K+ ++ ++ V+ + +
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKV-VTIDSYEDVTPNSETSLQ 252
Query: 277 NLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPY 334
V N P++VAI A Q Y G+ C LDHGV VGYG+ K Y
Sbjct: 253 KAVANQPVSVAIEAGGRAFQLYSSGIFTG-KCGTALDHGVAAVGYGTE-------NGKDY 304
Query: 335 WIIKNSWGESWGENGYYKICR 355
WI++NSWG+SWGE+GY ++ R
Sbjct: 305 WIVRNSWGKSWGESGYVRMER 325
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 171/332 (51%), Gaps = 41/332 (12%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
+ + HE D + F+ FK K+ K Y E RF IFKAN+ + +
Sbjct: 12 VAAGHEVPPPDYM---MMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTF 68
Query: 102 THGITQFSDLTPAEFRRTYLGLRRKLR---LPK----DADQAPILPTNDLPADFDWREKG 154
G+ +F+DLT EF +Y GL+ LP+ + + AP L + DW +G
Sbjct: 69 ALGVNEFTDLTQEEFAASYTGLKPASLWSGLPRLSTHEYNGAP------LASSVDWTTQG 122
Query: 155 AVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSG 214
V PVK+QG CGSCWSFSTTGALEGA L+TG LVSLSEQQ DCD + DSG
Sbjct: 123 VVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFEDCD---------TTDSG 173
Query: 215 CNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIA---ASVANFSVVSLDE 271
CNGG M++AF + K + E YPYT TD C ++ V ++ VS D
Sbjct: 174 CNGGWMDNAFSFA-KKNSICTEGSYPYTATD--GTCNLSGCQVGIPQGGVVGYTDVSTDS 230
Query: 272 DQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRL 329
+Q + V P+++AI A Q Y GV C RLDHGVL VGYGS
Sbjct: 231 EQAMMSAVAQQPVSIAIEADQYSFQLYSSGV-LTASCGTRLDHGVLAVGYGSE------- 282
Query: 330 KEKPYWIIKNSWGESWGENGYYKICRGRNVCG 361
YW +KNSWG SWGE GY ++ RG+ G
Sbjct: 283 AGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAG 314
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 177/329 (53%), Gaps = 36/329 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATH---GITQFSDLTPAE 115
++ FK + K Y S+ E R I+ N + A+H Q+ D + +++DL E
Sbjct: 27 WNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEE 86
Query: 116 FRRTYLGLRR------KL--RLPKDADQAPIL---PTN-DLPADFDWREKGAVGPVKDQG 163
F T G R KL R + PI P N D+P DWREKGAV PVKDQG
Sbjct: 87 FVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQG 146
Query: 164 SCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 223
CGSCWSFS TGALEG +F TGKLVSLSEQ LVDC + ++GCNGGLM++A
Sbjct: 147 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYG-------NNGCNGGLMDNA 199
Query: 224 FEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNG 282
F+Y G+ E+ YPY D C ++ I A+ F + DE + L G
Sbjct: 200 FQYVKDNKGIDTEKAYPYEAID--DECHYNPKAIGATDKGFVDIPQGDEKALKKALATVG 257
Query: 283 PLAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++VAI+A + Q Y GV C S +LDHGVL VGYG+ + YW++KN
Sbjct: 258 PVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDG------EDYWLVKN 311
Query: 340 SWGESWGENGYYKICRGR-NVCGVDSMVS 367
SWG +WG+ GY K+ R R N CG+ + S
Sbjct: 312 SWGTTWGDQGYVKMARNRENHCGIATTAS 340
>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
Length = 376
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 170/335 (50%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ GA+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S +L+DC G C GC+GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVHELLDC---------GRCGDGCHGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
+Q Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
Precursor
Length = 376
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 170/335 (50%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ A+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S Q+L+DC G C GC+GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVQELLDC---------GRCGDGCHGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
+Q Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 128/332 (38%), Positives = 180/332 (54%), Gaps = 33/332 (9%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGIT 106
NDL+ E + LFK +F+KAY ++ E R +F N + ARH KL + S +
Sbjct: 24 NDLIAEE--WELFKTQFSKAYNTEIEEKFRMKVFMDNKHKIARHNKLFQNGEVSYELEMN 81
Query: 107 QFSDLTPAEFRRTYLGLRRKLR--LPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQ 162
F DL EF +T G R LR + D +P ++ P DWR +GAV VK+Q
Sbjct: 82 HFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYNVTVPDSVDWRTEGAVTEVKNQ 141
Query: 163 GSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNS 222
G CGSCW+FSTTG+LEG +F T +L SLSEQ L+DC + ++GC+GGLM++
Sbjct: 142 GQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYG-------NNGCSGGLMDN 194
Query: 223 AFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKN 281
AF Y G+ E+ YPY G D C++ + A+ F + DE+++ +
Sbjct: 195 AFAYIKSNKGIDTEQSYPYEGIDD--KCRYKPQESGATDKGFVDIPQGDEEKLKLAVATV 252
Query: 282 GPLAVAINAVY--MQTYIGGVSCPYIC---SRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV C LDHGVL VGYG+ K YW+
Sbjct: 253 GPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGTE-------NGKDYWL 305
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG+ WG +GY K+ R + N CG+ + S
Sbjct: 306 VKNSWGKRWGLDGYIKMARNKHNHCGIATSAS 337
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 138/380 (36%), Positives = 196/380 (51%), Gaps = 43/380 (11%)
Query: 1 MGSKTVVLFLVSLVVF---SAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAE 57
MG L L L++F SAV + D + DE+++ +E+
Sbjct: 1 MGLHRSSLSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEA--------- 51
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
+ K KAY + E + RF IFK NLR H + + G+ +F+DLT E+R
Sbjct: 52 -----WLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYR 106
Query: 118 RTYLGL-----RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
YLG+ R ++ + +D+ + LP DWR++GAV VKDQGSCGSCW+FS
Sbjct: 107 SMYLGVKPGATRVTRKVSRKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFS 166
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
T A+EG N + TG L+SLSEQ+LVDCD S + GCNGGLM+ AFE+ + GG
Sbjct: 167 TIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGG 218
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA-- 290
+ EEDYPY D+ ++ K+ S+ + V +++ V P++VAI A
Sbjct: 219 IDSEEDYPYRAADQ-KCDQYRKNANVVSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGG 277
Query: 291 VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
Q Y GV C LDHGV VGYG+ + YWI+ NSWG++WGE+GY
Sbjct: 278 RAFQLYQSGVFTGK-CGTSLDHGVAAVGYGTE-------NGQDYWIVGNSWGKNWGEDGY 329
Query: 351 YKICRGRNVCGVDSMVSTVA 370
++ RN+ G S +A
Sbjct: 330 IRM--ERNLAGSSSGKCGIA 347
>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
Length = 334
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 168/315 (53%), Gaps = 26/315 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y + EE R +++ N++ H HG T F D+T EFR+
Sbjct: 32 WKATHRRLYGASEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R +KLR K + L DLP DWR+KG V PVK+Q CGSCW+FS TGAL
Sbjct: 91 VMGCFRNQKLRKGKLFREPLFL---DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGAL 147
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG F TGKLVSLSEQ LVDC H P+ + GCNGG MNSAF Y + GGL EE
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSH---PQG----NQGCNGGFMNSAFRYVKENGGLDSEE 200
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY--MQ 294
YPY D CK+ A+ F VV +++ V GP++VA++A + Q
Sbjct: 201 SYPYVAMD--GICKYRSENSVANDTGFKVVPAGKEKALMKAVATVGPISVAMDAGHSSFQ 258
Query: 295 TYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y G+ P S+ LDHGVL+VGYG G YW++KNSWG WG NGY KI
Sbjct: 259 FYKSGIYFEPDCSSKNLDHGVLVVGYGFEG---ANSDNNKYWLVKNSWGPEWGSNGYVKI 315
Query: 354 CRGR-NVCGVDSMVS 367
+ + N CG+ + S
Sbjct: 316 AKDKDNHCGIATAAS 330
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 175/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H+ L+K +K Y +EE R +++ NLR+ H H G+ F D+T
Sbjct: 25 DQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMT 83
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EFR+ G +R R + + + N L P DWR+KG V PVKDQG CGSCW+
Sbjct: 84 HEEFRQIMNGYKR--REQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWA 141
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGALEG F TGKLVSLSEQ LVDC PE + GCNGGLM+ AF+Y
Sbjct: 142 FSTTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDN 194
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+ YPY GTD C+++ A + F + S E + + GP++VAI+
Sbjct: 195 QGLDSEDFYPYKGTD-DQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAID 253
Query: 290 AVY--MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ CS LDHGVL+VGYG G + K YWI+KNSW E WG
Sbjct: 254 AGHESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEG---EDVDGKKYWIVKNSWSEKWG 310
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ G+ + + R N CG+ + S
Sbjct: 311 DKGFIYMAKDRHNHCGIATAAS 332
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 141/357 (39%), Positives = 185/357 (51%), Gaps = 38/357 (10%)
Query: 28 VDQLIRQVTDGGDEILSHHESTNNDLL---GAEHHFSLFKKKFNKAYASQEEHDHRFTIF 84
+ QL +Q G D I + S N +LL GA F FK F +AY + E RF IF
Sbjct: 33 LSQLFKQKAVG-DGIFN---SENLELLSNIGAAWKF--FKINFKRAYGNVMEETKRFLIF 86
Query: 85 KANLRRAARHQKL--DPSATH--GITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILP 140
N + H + + AT+ G+ F+D T E R+ G R R+ K I
Sbjct: 87 GTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELRKL-RGYRSACRIAKPKGSTFISS 145
Query: 141 TN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDC 199
+ LP DWR GAV PVK+QG CGSCW+FS+TGA+EG ++ T +LV+LSEQQL+DC
Sbjct: 146 EHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDC 205
Query: 200 DHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTG--TDRGHACKFDKSKI 257
++GC GGLM+ AF+Y G+ E YPY D C F+ + I
Sbjct: 206 SKSYG-------NNGCEGGLMDLAFQYVRDNKGIDSEISYPYISGDGDENVRCLFNSTNI 258
Query: 258 AASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY--MQTYIGGVSCPYIC---SRRLD 311
A V + + +++ N V GP++VAINA Y G+ C S LD
Sbjct: 259 MAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLPSFSMYKSGIYSDPECASASEDLD 318
Query: 312 HGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR-GRNVCGVDSMVS 367
HGVLLVGYG KPYW+IKNSWGE WG+ GY KI + +N+CGV S S
Sbjct: 319 HGVLLVGYGIE-------DGKPYWLIKNSWGEDWGDKGYVKILKDSKNMCGVASAAS 368
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 166/308 (53%), Gaps = 35/308 (11%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDL 111
AEHH Y E + RF F+ NLR +H + H G+ +F+DL
Sbjct: 47 AEHH---------STYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFADL 97
Query: 112 TPAEFRRTYLGLRRKL-RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
T E+R TYLG R K R K + + ++LP DWR+KGAVG VKDQG CGSCW+
Sbjct: 98 TNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWA 157
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FS A+EG N + TG ++ LSEQ+LVDCD S + GCNGGLM+ AFE+ +
Sbjct: 158 FSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINN 209
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ EEDYPY +R + C +K ++ + V ++ ++ V N P++VAI
Sbjct: 210 GGIDSEEDYPY--KERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIE 267
Query: 290 A--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
A Q Y G+ C LDHGV VGYG+ K YW+++NSWG WGE
Sbjct: 268 AGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGKDYWLVRNSWGSVWGE 319
Query: 348 NGYYKICR 355
NGY ++ R
Sbjct: 320 NGYIRMER 327
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 172/312 (55%), Gaps = 34/312 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAEFRRTYL 121
K +A + E + RF IFK N+R H S G+ +F+D+T E+R YL
Sbjct: 56 KHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEEYRTVYL 115
Query: 122 GLR-----RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
G R R+ RL +D+ +LP DWR+KGAV VKDQGSCGSCW+FST A
Sbjct: 116 GTRPASHRRRARL--GSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSCGSCWAFSTIAA 173
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+EG N + TG L+SLSEQ+LVDCD+ + GCNGGLM+ AFE+ + GG+ E
Sbjct: 174 VEGINKIVTGDLISLSEQELVDCDN--------GQNQGCNGGLMDYAFEFIINNGGIDTE 225
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQ 294
EDYPY D G ++ K+ S+ + V +++++ V N P++VAI A Q
Sbjct: 226 EDYPYKARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y G+ C LDHGV+ VGYG+ K YWI++NSWG WGE+GY ++
Sbjct: 285 LYHSGIFTGR-CGTDLDHGVVAVGYGTE-------NGKDYWIVRNSWGGDWGESGYIRME 336
Query: 355 RGRNV----CGV 362
R N CG+
Sbjct: 337 RNVNASTGKCGI 348
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 124/318 (38%), Positives = 172/318 (54%), Gaps = 27/318 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAAR-HQKLDPSATHGITQFSDLTPAEFRR 118
F + K K YA+ EE++ R ++ AN R ++ P+ + QF+DLT AEF+R
Sbjct: 43 FERWTIKHKKTYATAEEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLTFAEFKR 102
Query: 119 TYLGLRRK-LRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
YL + R Q P+ N + P DWR++ + PV+DQGSCGSCW+FS T
Sbjct: 103 IYLSSSSQHCRATTGNFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWAFSATSC 162
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
L L TG+L+SLS+QQL+DC + + GC GGL + AFEY GG+ E
Sbjct: 163 LSAHLALKTGQLISLSKQQLLDCSRSFN-------NRGCKGGLPSQAFEYIRYNGGIESE 215
Query: 237 EDYPYTGTDRGHACKFDKSKIAAS---VANFSVVSLDEDQIAANLVKNGPLAVAINAVY- 292
DYPY DR C F S +AA+ V NF+ + ED IA L GP+++ I++
Sbjct: 216 RDYPY--KDREEKCHFKPSLVAATVTGVVNFTQGA--EDDIAVALANIGPVSIGIHSTKS 271
Query: 293 MQTYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
TY G+ +CS R+++H VL+VGY + YWI KNSWG +WG NG
Sbjct: 272 FATYKKGIYQGKLCSKNPRKINHAVLIVGYDQTASG------EKYWIGKNSWGTNWGMNG 325
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG N CG+ + S
Sbjct: 326 YFWIRRGHNACGLATCAS 343
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 135/362 (37%), Positives = 187/362 (51%), Gaps = 35/362 (9%)
Query: 9 FLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKK 65
L+ L ++ ++G D + IR V+D +++L ++G H F+ F
Sbjct: 6 LLIVLFCVASAAAGFSFHDSNP-IRMVSDVEEQLL--------QVIGESRHAVSFARFAN 56
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
++ K Y S +E RF IF NL K S G+ F+D T EFR LG +
Sbjct: 57 RYGKRYDSVDEMKLRFKIFSENLELIRSSNKRRLSYKLGVNHFADWTWEEFRSHRLGAAQ 116
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+ + +LP + DWR++G V VKDQGSCGSCW+FSTTGALE A A
Sbjct: 117 NCSATLKGNHK--ITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAF 174
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GK +SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG++
Sbjct: 175 GKNISLSEQQLVDCAGAFN-------NFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN 227
Query: 246 RGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCP 303
CKF +A V ++L ED++ + P++VA V+ + Y GV
Sbjct: 228 --GLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTS 285
Query: 304 YICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
C ++H VL VGYG PYW+IKNSWG WG++GY+K+ G+N+C
Sbjct: 286 TACGSTPMDVNHAVLAVGYGIE-------DGIPYWLIKNSWGGDWGDHGYFKMEMGKNMC 338
Query: 361 GV 362
GV
Sbjct: 339 GV 340
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 172/341 (50%), Gaps = 38/341 (11%)
Query: 36 TDGGDEILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ 95
T GGDE + + + ++ + Y E HRF +FKAN R
Sbjct: 47 TTGGDEAMMMA------------RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSN 94
Query: 96 KLDPSA-THGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPI------LPTNDLPADF 148
G QF+DLT EF Y GLR+ +P A Q P D
Sbjct: 95 AGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQV 154
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWR++GAV PVK+QG CG CW+FS GA+EG + TG LVSLSEQQ++DCD E D +
Sbjct: 155 DWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCD-ESDGNQ- 212
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVS 268
GCNGG M++AF+Y + GG+ E+ YPY+ C+ + AA+++ F +
Sbjct: 213 -----GCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQ--GTCQ--NVQPAATISGFQDLP 263
Query: 269 LDEDQIAANLVKNGPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAP 326
++ AN V N P++V ++ + Q Y GG+ C ++H V +GYG+
Sbjct: 264 SGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADD--- 320
Query: 327 IRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
+ YWI+KNSWG WGENG+ ++ G CG+ +M S
Sbjct: 321 ---QGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMAS 358
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 169/324 (52%), Gaps = 31/324 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPAE 115
+ FK + +K Y S+ E R IF N + A H K H + ++ D+ E
Sbjct: 29 WEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDMLHHE 88
Query: 116 FRRTYLGLRRKLRLPKDADQAP-----ILPTND--LPADFDWREKGAVGPVKDQGSCGSC 168
F T G R ++A I P +D LP + DWR KGAV P+KDQG CGSC
Sbjct: 89 FVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQCGSC 148
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
W+FS TGALEG F TG+LVSLSEQ LVDC + ++GCNGGLM++AFEY
Sbjct: 149 WAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFG-------NNGCNGGLMDNAFEYVK 201
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVA 287
+ GG+ EE YPY D C ++ A F V E + + GP++VA
Sbjct: 202 ENGGIDTEESYPYDAEDE--KCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVA 259
Query: 288 INAVY--MQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A + Q Y GV CS LDHGVL+VGYG I YW++KNSWG +
Sbjct: 260 IDASHESFQFYSHGVYIEPECSPEMLDHGVLVVGYG------IDDDGTDYWLVKNSWGTT 313
Query: 345 WGENGYYKICRGR-NVCGVDSMVS 367
WG+ GY K+ R R N CG+ S S
Sbjct: 314 WGDQGYVKMARNRDNQCGIASSAS 337
>gi|410907221|ref|XP_003967090.1| PREDICTED: pro-cathepsin H-like [Takifugu rubripes]
Length = 324
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 172/316 (54%), Gaps = 34/316 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F + NKAY ++ D R +F N RR +H + + S + Q+SD+T AEF
Sbjct: 24 EREFRSWMALHNKAYV--KDFDQRLQVFTENKRRIDKHNEGNHSFAMRLNQYSDMTFAEF 81
Query: 117 RRTYLGLRRKLRLPKD--ADQAPILPTND-LPADFDWREKGA-VGPVKDQGSCGSCWSFS 172
R+ +L P++ A + + TN P DWR+KG V PVK+QGSCGSCW+FS
Sbjct: 82 RKHFLWAE-----PQNCSATKGSYIQTNSPHPESIDWRKKGNYVTPVKNQGSCGSCWTFS 136
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTG LE + +GKLV LSEQQLVDC + + + GCNGGL + AFEY G
Sbjct: 137 TTGCLESVTAINSGKLVPLSEQQLVDCAQDFN-------NHGCNGGLPSQAFEYIKYNKG 189
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
LM E DYPYT + C + AA V N ++ + DE ++ + P++ A
Sbjct: 190 LMTESDYPYTAFED--KCTYKPELAAAFVKNVVNITAYDEKEMEDAVATRNPVSFAFEVT 247
Query: 292 --YMQTYIGGVSCPYIC---SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
+M Y GV C + +++H VL VGYGS PYWI+KNSWG WG
Sbjct: 248 PDFMH-YSSGVYSSSTCHTTTDKVNHAVLAVGYGSEN-------GTPYWIVKNSWGPGWG 299
Query: 347 ENGYYKICRGRNVCGV 362
++GY+ I RG+N+CG+
Sbjct: 300 QDGYFLIMRGKNMCGL 315
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 177/322 (54%), Gaps = 33/322 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F + K +K Y ++E RF I+++N++ L +F+D+T +EF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 117 RRTYLGLR-RKLRLPKDADQAPIL-PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
+ +LGL LRL K Q P+ P ++P DWR +GAV P+++QG CG CW+FS
Sbjct: 100 KAHFLGLNTSSLRLHKK--QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
A+EG N + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+ GGL
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKTNGGLA 210
Query: 235 REEDYPYTGTDRGHACKFDKSK-IAASVANFSVVSLDED--QIAANLVKNGPLAVAINA- 290
E DYPYTG + C +KSK ++ + V+ +E QIAA P++V I+A
Sbjct: 211 TETDYPYTGIE--GTCDQEKSKNKVVTIQGYQKVAQNEASLQIAA---AQQPVSVGIDAG 265
Query: 291 -VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GV Y C L+HGV +VGYG G ++ YWI+KNSWG WGE G
Sbjct: 266 GFIFQLYSSGVFTNY-CGTNLNHGVTVVGYGVEG-------DQKYWIVKNSWGTGWGEEG 317
Query: 350 YYKICRG----RNVCGVDSMVS 367
Y ++ RG CG+ M S
Sbjct: 318 YIRMERGVSEDTGKCGIAMMAS 339
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 176/346 (50%), Gaps = 31/346 (8%)
Query: 32 IRQVTDGGDEILSHHESTNNDLLGAEH---HFSLFKKKFNKAYASQEEHDHRFTIFKANL 88
IR VTD L EST LG F+ F ++ K+Y S E RF IF +L
Sbjct: 31 IRPVTDRAASAL---ESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRIFSESL 87
Query: 89 RRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADF 148
+ + S GI +F+D++ EFR T LG + + LP
Sbjct: 88 QLVRSTNRKGLSYRLGINRFADMSWEEFRATRLGAAQNCSATLTGNHRMRAAAVALPETK 147
Query: 149 DWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEP 208
DWRE G V PVK+QG CGSCW+FSTTGALE A ATGK +SLSEQQL+DC +
Sbjct: 148 DWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFN---- 203
Query: 209 GSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASV---ANFS 265
+ GCNGGL + AFEY GGL EE YPY G + CKF + V N +
Sbjct: 204 ---NFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVN--GICKFKNENVGFKVLDSVNIT 258
Query: 266 VVSLDEDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
+ + DE + A LV+ P++VA + + Y GV C ++H VL VGYG
Sbjct: 259 LGAEDELKDAVGLVR--PVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGV 316
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVS 367
PYW+IKNSWG WG+ GY+K+ G+N+CGV + S
Sbjct: 317 E-------DGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 170/311 (54%), Gaps = 21/311 (6%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E +F +++K K Y+++EE R +F N+ H K S + +++D+T EF
Sbjct: 32 EDYFKEWQEKHGKVYSTEEESQSRLKVFMKNVIYIDNHNKQGHSYELEVNEYADMTLDEF 91
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
+ YL + P D P DWR KGAV PVK+QG CGSCW+FSTTG
Sbjct: 92 KDQYLMEPQHCSATHSLKSDPP-KYRDPPKAIDWRSKGAVTPVKNQGQCGSCWTFSTTGC 150
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LE +FL TG+LVSLSEQQLVDC + ++GCNGGL + AFEY GGL E
Sbjct: 151 LESHHFLKTGQLVSLSEQQLVDCAQAFN-------NNGCNGGLPSQAFEYIHYNGGLDSE 203
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAIN-AVYMQ 294
E YPY D C F S+++A+V+N ++ S DE Q+ + GP+++A + + +
Sbjct: 204 ESYPYRAHDE--KCHFVPSEVSATVSNVVNITSKDEMQLYNAVGTVGPVSIAYDVSADFR 261
Query: 295 TYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Y GV C ++H VL VGY + + YWI+KNSWG +G NGY+
Sbjct: 262 FYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESG------EDYWIVKNSWGTKFGINGYF 315
Query: 352 KICRGRNVCGV 362
I RG N+CG+
Sbjct: 316 WIARGENMCGL 326
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 175/322 (54%), Gaps = 29/322 (9%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + ++ ++ + + Y + E + RF F+ NLR +H +
Sbjct: 28 IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84
Query: 102 TH----GITQFSDLTPAEFRRTYLGLRRKL-RLPKDADQAPILPTNDLPADFDWREKGAV 156
H G+ +F+DLT E+R TYLG R K R K + + ++LP DWR+KGAV
Sbjct: 85 VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV 144
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
G VKDQG CGSCW+FS A+EG N + TG ++ LSEQ+LVDCD S + GCN
Sbjct: 145 GAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCN 196
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIA 275
GGLM+ AFE+ + GG+ EEDYPY +R + C +K ++ + V ++ ++
Sbjct: 197 GGLMDYAFEFIINNGGIDSEEDYPY--KERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 276 ANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
V N P++VAI A Q Y G+ C LDHGV VGYG+ K
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE-------NGKD 306
Query: 334 YWIIKNSWGESWGENGYYKICR 355
YW+++NSWG WGE+GY ++ R
Sbjct: 307 YWLVRNSWGSVWGEDGYIRMER 328
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 112/321 (34%), Positives = 174/321 (54%), Gaps = 21/321 (6%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
D+L A ++F F KFNK+Y+S+ E RF IF+ NL D +A + I +F+DL
Sbjct: 20 DVLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+ E Y GL L+ + + P + P +FDWR V VK+QG CG+CW+
Sbjct: 80 SKDETISKYTGLSLPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWA 139
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
F+T G+LE + + ++LSEQQL+DCD D+GC+GGL+++AFE +
Sbjct: 140 FATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAVMNM 190
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E DYPY + C+ + +K V + +++ E+++ L GP+ VAI+
Sbjct: 191 GGIQAESDYPYEANNGD--CRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAID 248
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
A + Y G+ Y + L+H VLLVGY P+WI+KN+WG WGE G
Sbjct: 249 ASDIVNYKRGIM-KYCANHGLNHAVLLVGYAVENGV-------PFWILKNTWGADWGEQG 300
Query: 350 YYKICRGRNVCGVDSMVSTVA 370
Y+++ + N CG+ + + + A
Sbjct: 301 YFRVQQNINACGIQNELPSSA 321
>gi|13905172|gb|AAH06878.1| Cathepsin H [Mus musculus]
Length = 333
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 172/318 (54%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K+ K Y+S E++HR +F N R+ H + + + + QFSD++ AE +
Sbjct: 32 HFKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKH 90
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFSTT 174
+L P++ + T P+ DWR+KG V PV +QG+CGSCW+FSTT
Sbjct: 91 KFLWSE-----PQNCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVINQGACGSCWTFSTT 145
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +A+GK++SL+EQQLVDC + + GC GGL + AFEY L G+M
Sbjct: 146 GALESAVAIASGKMLSLAEQQLVDCAQAFN-------NHGCKGGLPSQAFEYILYNKGIM 198
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D +C+F+ K A V N ++L DE + + P++ A
Sbjct: 199 EEDSYPYIGKDS--SCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y GV C + +++H VL VGYG YWI+KNSWG WGENG
Sbjct: 257 FLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQ-------NGLLYWIVKNSWGSQWGENG 309
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 310 YFLIERGKNMCGLAACAS 327
>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 333
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 165/305 (54%), Gaps = 22/305 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F F+ KF K Y S+EE R IF+ANL + D S G+ + +DLT EF
Sbjct: 25 ELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEEF 84
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
LG K+ +D T LP DWR K + PVKDQGSCGSCW+FSTTGA
Sbjct: 85 AALKLG-TLKMSTRRDDKFVIEADTTQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTTGA 143
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
LE +ATGKL+SLSEQQLVDC G ++GC GGLM+ A+EY +K+ GL +E
Sbjct: 144 LEAQYAIATGKLLSLSEQQLVDC-------SSGYGNNGCEGGLMDDAYEY-IKSAGLDQE 195
Query: 237 EDYPYTGTD---RGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV-- 291
Y Y GTD +G K A V F ++ E + L + P++VA+ A
Sbjct: 196 STYSYNGTDDVCQGSLAKRSDGIPAGEVTGFHMLDKTEQSLMKALA-DAPVSVAMYAADP 254
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+ Y GV C+ +LDHGV+ VGYG+ Y+II+NSWG SWG+ GY+
Sbjct: 255 DFRFYKSGVYSSATCNGKLDHGVVAVGYGTE-------NGSDYFIIRNSWGSSWGQAGYF 307
Query: 352 KICRG 356
+ RG
Sbjct: 308 YLKRG 312
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 119/329 (36%), Positives = 179/329 (54%), Gaps = 26/329 (7%)
Query: 46 HESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
H+ ++D+ + F + K+ + Y +E + RF I++AN++ S
Sbjct: 32 HKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTD 91
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSC 165
+F+DLT EF+ TY+GL +LR + DLP DWR++GAV + DQG C
Sbjct: 92 NKFADLTNEEFQSTYMGLSTRLRSHNTGFRYD--EHGDLPESKDWRKEGAVTEIMDQGQC 149
Query: 166 GSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFE 225
G CW+F+ A+EG N + +GKL+SLSEQ+L+DCD + S + GC GGLM +A+
Sbjct: 150 GGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVK-------SGNQGCQGGLMETAYT 202
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDK-SKIAASVANFSVVSLDEDQIAANLVKNGPL 284
+ ++ GGL E+DYPY G D CK +K + AAS++ + V D + + P+
Sbjct: 203 FIIENGGLTTEQDYPYEGVD--GTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPV 260
Query: 285 AVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+VAI+A Q Y GV IC ++L+HGV +VGYG + YWI+KNSWG
Sbjct: 261 SVAIDAGGYSFQFYSEGVFSG-ICGKQLNHGVTVVGYG-------KETINKYWIVKNSWG 312
Query: 343 ESWGENGYYKICR----GRNVCGVDSMVS 367
WGE+GY ++ R +CG+ S
Sbjct: 313 ADWGESGYIRMKRDTLSKEGMCGIAMQAS 341
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 171/318 (53%), Gaps = 35/318 (11%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
F+ + K KAY E+ HRF ++K NL RH + + + + G+T+F+DLT EFRR
Sbjct: 53 QFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYI-RHSETNRTYSLGLTKFADLTNEEFRR 111
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
Y G R ++ P DWR+ GAV VKDQGSCGSCW+FS G++E
Sbjct: 112 MYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVE 171
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G N + G+ VSLSEQ+LVDCD E + GCNGGLM+ AF++ ++ GG+ E+D
Sbjct: 172 GINAIRNGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFIIQNGGIDTEKD 223
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA------VY 292
YPY G D G K+ ++ + V ++++ V P++VAI A +Y
Sbjct: 224 YPYKGFD-GRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLY 282
Query: 293 MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q G C LDHGVL VGYG+ YWI+KNSWGE WGE+GY +
Sbjct: 283 AQGVFSGE-----CGTDLDHGVLAVGYGTEDGV-------DYWIVKNSWGEYWGESGYLR 330
Query: 353 ICR-------GRNVCGVD 363
+ R G +CG++
Sbjct: 331 MKRNMKDSNDGPGLCGIN 348
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 172/314 (54%), Gaps = 32/314 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR--RTY 120
+K NK Y+ E R+TI+K N RR H + QF D+T +EF+ Y
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFKAFNGY 89
Query: 121 LGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
L K + + L N+ P DWR +G V PVKDQG CGSCW+FSTTG+LE
Sbjct: 90 LS-------HKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLE 142
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G +F TGKLVSLSEQ LVDC ++GC+GGLM++AF Y + G+ E
Sbjct: 143 GQHFKKTGKLVSLSEQNLVDC-------STAYGNNGCDGGLMDNAFTYIKENKGIDSEAS 195
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPYT D C F KS +AA+ F + +E+++ + GP++VAI+A + Q
Sbjct: 196 YPYTAEDG--KCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQF 253
Query: 296 YIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV + P S LDHGVL+VGYG+ K YW++KNSW SWG+ GY K+
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGTE-------SGKDYWLVKNSWNTSWGDKGYIKMR 306
Query: 355 R-GRNVCGVDSMVS 367
R +N CG+ + S
Sbjct: 307 RNAKNQCGIATKAS 320
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 164/298 (55%), Gaps = 28/298 (9%)
Query: 76 EHDHRFTIFKANLRRA-ARHQKLDPSATHGITQFSDLTPAEFRRTYLGL----RRKLRLP 130
E D RF IFK NLR ++ + D S G+ +F+DLT E+R TYLG RR++
Sbjct: 66 EKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRRIAKT 125
Query: 131 KDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVS 190
K + LP DWREKGAV VKDQGSCGSCW+FST A+EG N + TG+L+S
Sbjct: 126 KSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELIS 185
Query: 191 LSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHAC 250
LSEQ+LVDCD S + GCNGGLM+ AFE+ +K GG+ E DYPYTG G
Sbjct: 186 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGR-YGRCD 236
Query: 251 KFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSR 308
+ K+ S+ + V+ ++ V P++VAI A Q Y G+ C
Sbjct: 237 QTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTG-SCGT 295
Query: 309 RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCGV 362
LDHGV VGYG+ YWI+KNSW SWGE GY ++ R +CG+
Sbjct: 296 DLDHGVTAVGYGTENGV-------DYWIVKNSWAASWGEKGYLRMQRNVKDKNGLCGI 346
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 173/320 (54%), Gaps = 23/320 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
++H+ +K K YA +EE R +++ NL+ H H G+ QF D+T
Sbjct: 26 DNHWYSWKDWHKKTYAPKEE-GWRRVLWEKNLKMIEFHNLDHSLGKHSYRLGMNQFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF++ G + + + AP + P DWR+KG V PVKDQG CGSCW+FS
Sbjct: 85 NEEFKQLMNGYKNQKMIRGSTFLAP--NNFEAPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TTGALEG ++ T KL+SLSEQ LVDC + GCNGGLM+ AF+Y GG
Sbjct: 143 TTGALEGQHYRKTSKLISLSEQNLVDCSR-------AQGNEGCNGGLMDQAFQYVKDNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPYT D C +D + +A+ F V S E + + GP++VAI+A
Sbjct: 196 IDSEDSYPYTAKDD-QECHYDPNNNSANDTGFVDVQSGCEKDLMKAVASVGPVSVAIDAG 254
Query: 292 Y--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y G+ P S LDHGVL+VGY G+ + K YWI+KNSW E WG+N
Sbjct: 255 HQSFQFYQSGIYYEPECSSEDLDHGVLVVGY---GFESEDVDGKKYWIVKNSWSEKWGDN 311
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY I + R N CG+ + S
Sbjct: 312 GYINIAKDRHNHCGIATAAS 331
>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
Length = 333
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 171/316 (54%), Gaps = 23/316 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAE 115
+S +K K Y +E R +++ N++ +H + H T F D+T E
Sbjct: 29 WSQWKATHGKLYGMNDEVWRR-AVWERNMKMIEQHNREHSQGKHTFTMAMNAFGDMTNEE 87
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
FR+ GL+ + R QAP ++P+ DWREKG V PVKDQG C CW+FS TG
Sbjct: 88 FRQVMNGLKIQKRKKWKVFQAPFFV--EIPSSVDWREKGYVTPVKDQGYCLCCWAFSATG 145
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALEG F TGKLVSLSEQ LVDC + G +GGL++ AF+Y GGL
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSQT-------EGNEGYSGGLIDDAFQYVKDNGGLDS 198
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
EE YPY +G +CK+ A+V ++ + E+++ L GP++ AI+A
Sbjct: 199 EESYPYHA--QGDSCKYRPENSVANVTDYWDIPSKENELMITLAAVGPISAAIDASLDTF 256
Query: 294 QTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
+ Y G+ P S +DHGVL+VGYG+ G + K YWIIKNSWG WG +GY K
Sbjct: 257 RFYKEGIYYDPSCSSEDVDHGVLVVGYGADG---TETENKKYWIIKNSWGTDWGMDGYIK 313
Query: 353 ICRGR-NVCGVDSMVS 367
+ + R N CG+ S+ S
Sbjct: 314 MAKDRDNHCGIASLAS 329
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 167/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + Q P+ + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQYVADNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VAI+A +
Sbjct: 196 LDSEESYPYEATEE--SCKYNPEYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 ESFMFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNSKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 167/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + Q P+ + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQYVADNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VAI+A +
Sbjct: 196 LDSEESYPYEATEE--SCKYNPEYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 ESFMFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNSKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 167/307 (54%), Gaps = 31/307 (10%)
Query: 57 EHHFS----LFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLT 112
+HHF F++ NK YA++EE R+ IFK NL H S + +F DLT
Sbjct: 82 DHHFQSQFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLT 141
Query: 113 PAEFRRTYLGLRR-KLRLP-KDADQA-PILPTNDLPADFDWREKGAVGPVKDQGSCGSCW 169
EFR+ YLG ++ LR P ++ D + ND+P DWR++G V VKDQG CGSCW
Sbjct: 142 LEEFRQRYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCW 201
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FS TGA+EG TGKLV+LS+QQLVDC + GC+GG M AFEY ++
Sbjct: 202 AFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLG-------NQGCDGGRMEEAFEYVVE 254
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAI 288
GG+ E+YPY D CK + A++ + SV E + L P++VAI
Sbjct: 255 NGGICSGENYPYMRKD--GVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAI 312
Query: 289 --NAVYMQTYIGGV-SCPYICSRRLDHGVLLVGYG--SAGYAPIRLKEKPYWIIKNSWGE 343
N Q Y G+ P C LDHGVLLVGY +AG + YWI+KNSWG
Sbjct: 313 QANQAAFQFYYDGIFDAP--CGTNLDHGVLLVGYSAETAG-------QGDYWIMKNSWGA 363
Query: 344 SWGENGY 350
+WG+ GY
Sbjct: 364 AWGKGGY 370
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 184/340 (54%), Gaps = 47/340 (13%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
+F K+Y S + RF +FK N+ + QF+D+T E+RR YLG R
Sbjct: 45 RFEKSYESFD-FLQRFAVFKTNMDYVHEWNSKKLPTVLELNQFADITNQEYRRLYLGTRI 103
Query: 126 KLR----LPKDADQAPIL-------PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
R P + + ++ A DWR KGAV P+K+QG CGSCWSFSTT
Sbjct: 104 NARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATVDWRAKGAVSPIKNQGQCGSCWSFSTT 163
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
G++EGA++++TGK+V LSEQ LVDC + GC GGLMN AF+Y +K G+
Sbjct: 164 GSVEGAHYISTGKMVPLSEQNLVDCSGS-------EGNMGCQGGLMNLAFDYIIKNEGID 216
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY- 292
E+ YPY+ + G C F+K+ + A+++++ ++ ++ A+ VKN GP++VAI+A +
Sbjct: 217 TEDSYPYS-AETGKKCLFNKTNVGATISSYKNITSGDESNLADAVKNAGPVSVAIDASHN 275
Query: 293 -MQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPI-----------------RLKEKP 333
Q Y G+ CS LDHGVL+VGYGS + + R+ + P
Sbjct: 276 SFQLYSHGIYYEKDCSSVNLDHGVLVVGYGSGDPSSLANNVGGRSGPKMVVFNNRMVKTP 335
Query: 334 -----YWIIKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
YWI+KNSWG +WG +G+ + R N CG+ + S
Sbjct: 336 SSNGDYWIVKNSWGSTWGSHGFIFMSMNRDNNCGIATSAS 375
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/336 (38%), Positives = 175/336 (52%), Gaps = 42/336 (12%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGIT 106
+D++ A + FK +F +AY E RF IF AN + H Q+ + G+
Sbjct: 54 DDIIAA---WKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVN 110
Query: 107 QFSDLTPAEFRR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVK 160
+F+D T E ++ T +R K ++ LP+ DWR +GAV VK
Sbjct: 111 EFTDKTDYELKKLRGYKVTSGAIRHKGSTFIRSEHTK------LPSKVDWRREGAVTDVK 164
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTGA+EG ++ T +LV+LSEQQLVDC ++GC+GGLM
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCS-------KSYGNNGCSGGLM 217
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTD--RGHACKFDKSKIAASVANF-SVVSLDEDQIAAN 277
NSAFEY G+ E YPY D + C F+ S I A V + ++ DE +
Sbjct: 218 NSAFEYVRDNEGIDSEISYPYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDA 277
Query: 278 LVKNGPLAVAINA--VYMQTYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEK 332
+ GP++VAINA Y G+ C LDHGVL+VGYG +
Sbjct: 278 VATKGPVSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEEN-------GR 330
Query: 333 PYWIIKNSWGESWGENGYYKICRG-RNVCGVDSMVS 367
YW+IKNSWGE WGE GY KI +G N+CGV S S
Sbjct: 331 SYWLIKNSWGEEWGEKGYIKISKGSHNMCGVASAAS 366
>gi|431897851|gb|ELK06685.1| Cathepsin L1 [Pteropus alecto]
Length = 331
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/300 (39%), Positives = 162/300 (54%), Gaps = 22/300 (7%)
Query: 76 EHDHRFTIFKANLR----RAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPK 131
E R +++ N++ H++ S T I F D+T EFR+ GL+ +
Sbjct: 42 EEGWRRAVWEKNMKMIELHNQEHRQGKHSFTMAINAFGDMTNEEFRKLMNGLQNQKHWKG 101
Query: 132 DADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSL 191
Q P P ++P DWR+KG V PVKDQG CGSCW+FS TGALEG F TGKL+SL
Sbjct: 102 KLFQEPPFP--EIPPSVDWRQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLISL 159
Query: 192 SEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACK 251
SEQ LVDC + GC+GGLM++AF+Y GGL EE YPY D +CK
Sbjct: 160 SEQNLVDCSQ-------SQGNEGCDGGLMDNAFQYVKDNGGLDSEESYPYLARDE--SCK 210
Query: 252 FDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSC-PYICSR 308
+ AA+ + F + E + + GP++V I+A Y Q Y G+ P S
Sbjct: 211 YKPEFSAANDSGFVDIHKQERSLMKAVASVGPISVGIDASYSSFQFYEKGIYYEPECSSE 270
Query: 309 RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNV-CGVDSMVS 367
L+HGVL+VGY G+ + YWI+KNSWG +WG NGY + + +N CG+ + S
Sbjct: 271 DLNHGVLVVGY---GFERAESNKNKYWIVKNSWGTNWGMNGYINMAKDQNNHCGIATAAS 327
>gi|298713906|emb|CBJ33775.1| Cathepsin-like proteinase [Ectocarpus siliculosus]
Length = 462
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 135/350 (38%), Positives = 180/350 (51%), Gaps = 49/350 (14%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F F KF K+Y + +E RF +FK NL+R + +T ++DLT EF
Sbjct: 123 ESLFQEFGIKFEKSYENDDEKAMRFEVFKRNLKRIDERNSKSLGVKYDVTMWTDLTHEEF 182
Query: 117 R--RTYLGL--------RRKLRLPKDAD----------QAPILP---TNDLPADFDWREK 153
+ + Y + R K KDA + P L T DLP +FDWR+
Sbjct: 183 KGYQNYGKISDEAKEVARSKAMSTKDASDMYESCQSCTRFPELEQYITGDLPTEFDWRDY 242
Query: 154 GAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDS 213
GAV PVK+Q CGSCW+FSTTG LEGA +L+ L SLSEQQLV CD S +
Sbjct: 243 GAVTPVKNQAYCGSCWTFSTTGCLEGAWYLSGHPLESLSEQQLVACDT--------SYNQ 294
Query: 214 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGT-DRGH----ACK--FDKSKIAASVA---N 263
GCNGG + + +Y K GG++ E YPY GH C + AA++A
Sbjct: 295 GCNGGWPSISMDYISKNGGIVPESIYPYRKVFMNGHLGDPVCSDVVKEGNYAATLAIEVA 354
Query: 264 FSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICS-RRLDHGVLLVGYGSA 322
+ S+ E+ +A L+ NGPL+VA++A+ M Y G+ C +DH VL+VGYG
Sbjct: 355 LAEDSMTEEAMARWLILNGPLSVALDAMGMDYYSEGIDMGEYCEPLEIDHAVLIVGYGEE 414
Query: 323 GYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVSTVAAA 372
YWIIKNSW WGE GYY++ RG N CG+ V+T+ A
Sbjct: 415 DGV-------KYWIIKNSWKYLWGERGYYRLVRGVNACGIADDVTTIIVA 457
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 176/324 (54%), Gaps = 29/324 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLT 112
+H F +K KF ++Y S E D R I+ N H + + G+T ++DL
Sbjct: 23 DHDFHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLE 82
Query: 113 PAEFRRTYLGL-RRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSC 168
EF++T G+ K + L + +LP DWR+ G V PVK+QGSCGSC
Sbjct: 83 HEEFKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSC 142
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
WSFS+TGALEG NF TG+LVSLSEQ+LVDC + GCNGG M++AF Y +
Sbjct: 143 WSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYG-------NYGCNGGWMDNAFRYIV 195
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVA 287
GG+ E+ YPY G + C+ + +I A+ + + S +E + + GP++VA
Sbjct: 196 NKGGIHTEDSYPYEG--QVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVA 253
Query: 288 INAV--YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A Q Y GV + PY LDH VL+VGYG+ + YW++KNSWG +
Sbjct: 254 IHASDQSFQLYHSGVYNNPYCSGTALDHAVLIVGYGTE-------YGQDYWLVKNSWGPA 306
Query: 345 WGENGYYKICRGR-NVCGVDSMVS 367
WG+ GY K+ R R N CG+ S S
Sbjct: 307 WGDQGYIKMSRNRYNQCGIASAAS 330
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/321 (35%), Positives = 176/321 (54%), Gaps = 22/321 (6%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
D+L A ++F F +++NK Y S+ E R+ IF+ NL + D +A + I +FSDL
Sbjct: 20 DILKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDIITKNRND-TAVYKINKFSDL 78
Query: 112 TPAEFRRTYLGLRRKLRLPKDADQAPI-LPTNDLPADFDWREKGAVGPVKDQGSCGSCWS 170
+ E Y GL L + + P P +FDWR + VK+QG CG+CW+
Sbjct: 79 SKDETIAKYTGLSLPLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGACWA 138
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
F+T +LE +A +L++LSEQQ++DCD S D GC GGL+++AFE +
Sbjct: 139 FATLASLESQFAIAHDRLINLSEQQMIDCD---------SVDVGCEGGLLHTAFEAIISM 189
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAIN 289
GG+ E DYPY ++ + C+ D +K V + +++ E+++ L GP+ VAI+
Sbjct: 190 GGVQIENDYPYESSN--NYCRMDPTKFVVGVKQCNRYITIYEEKLKDVLRLAGPIPVAID 247
Query: 290 AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
A + Y G+ Y + L+H VLLVGYG PYWI+KNSWG WGE G
Sbjct: 248 ASDILNYEQGI-IKYCANNGLNHAVLLVGYGVEN-------NVPYWILKNSWGTDWGEQG 299
Query: 350 YYKICRGRNVCGVDSMVSTVA 370
++KI + N CG+ + +++ A
Sbjct: 300 FFKIQQNVNACGIKNELASTA 320
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 167/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + R P+ L + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQVMNGFQN--RKPRKGKVFQELLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCSW---PQ----GNEGCNGGLMDYAFQYVADNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VAI+A +
Sbjct: 196 LDSEESYPYEATEE--SCKYNPEYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 ESFMFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNSKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 170/314 (54%), Gaps = 32/314 (10%)
Query: 67 FNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL--- 123
+ KAYAS EE RF +FK NL K S G+ +F+DLT EF+ TYLGL
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPP 95
Query: 124 ---RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
+ + + ++P + DWR+K AV VK+QG CGSCW+FST A+EG
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG L SLSEQ+L+DC + ++GCNGGLM+ AF Y GGL EE YP
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTD--------GNNGCNGGLMDYAFSYIASTGGLRTEEAYP 207
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIG 298
Y + G C K +++ + V +++Q + + P++VAI A + Q Y G
Sbjct: 208 YA-MEEGD-CDEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSG 265
Query: 299 GV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR-- 355
GV P C +LDHGV VGYG++ K + Y I+KNSWG WGE GY ++ R
Sbjct: 266 GVFDGP--CGEQLDHGVTAVGYGTS-------KGQDYIIVKNSWGPHWGEKGYIRMKRGT 316
Query: 356 --GRNVCGVDSMVS 367
G +CG++ M S
Sbjct: 317 GKGEGLCGINKMAS 330
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 171/315 (54%), Gaps = 28/315 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQFSDLTPAEFRR 118
FK K Y +Q E R +F N ++ A+++ + S + DL EF+
Sbjct: 16 FKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKA 75
Query: 119 TYLGLRRKLRLPKDADQAPILPTND-LPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
G ++ ++ +P+N+ LP DWR++GAV PVKDQG CGSCWSFS TG+L
Sbjct: 76 LMNGFKKTPNAERNG--KIYVPSNENLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGSL 133
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG FL TG+LVSLSEQ LVDC +SGC GGLMN AF+Y G+ E
Sbjct: 134 EGQLFLKTGRLVSLSEQNLVDCSKTYG-------NSGCEGGLMNQAFQYVRDNKGIDTEA 186
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQ 294
YPY R + C+F + K+ + + ++ E + + + GP++V I+A + Q
Sbjct: 187 SYPYEA--RENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQ 244
Query: 295 TYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GV CS +LDHGVL VGYG+ + YW++KNSWG SWGE+GY KI
Sbjct: 245 FYSEGVYKEQYCSPSQLDHGVLTVGYGTE-------NGQDYWLVKNSWGPSWGESGYIKI 297
Query: 354 CRG-RNVCGVDSMVS 367
R +N CG+ SM S
Sbjct: 298 ARNHKNHCGIASMAS 312
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 172/318 (54%), Gaps = 34/318 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+ ++++ +K + E RF +FK+N+ K+D + +F+D+T EFR
Sbjct: 37 WDMYERWRHKVATNHGEKLRRFNVFKSNVLHVHETNKMDKPYKLKLNKFADMTNHEFRSV 96
Query: 120 YLGLR-----RKLRLPKDADQAPILPT-NDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
Y G + R L+ + + + +P DWR+KGAV PVKDQG CGSCW+FST
Sbjct: 97 YAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQCGSCWAFST 156
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
A+EG N + T +LVSLSEQ+LVDCD + GCNGGLM+ AF++ K GGL
Sbjct: 157 VAAVEGINKIKTNELVSLSEQELVDCD--------TLENQGCNGGLMDLAFDFIKKTGGL 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANF---SVVSLDEDQIAANLVKNGPLAVAINA 290
RE+ YPY D K D +K+ + V + V +++Q V N P+AVAI+A
Sbjct: 209 TREDAYPYAAED----GKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDA 264
Query: 291 --VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Q Y GV C +LDHGV VGYG+ L YWI++NSWG WGE
Sbjct: 265 GSSDFQFYSEGVFTGK-CGTQLDHGVAAVGYGTT------LDGTKYWIVRNSWGSEWGEK 317
Query: 349 GYYKICRG----RNVCGV 362
GY ++ RG R +CG+
Sbjct: 318 GYIRMERGISDKRGLCGI 335
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 174/320 (54%), Gaps = 24/320 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL--DPSATHGITQFSDLTP 113
A + L+ + ++Y + EH+ RF +F NLR A H D G+ +F+DLT
Sbjct: 50 ARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTN 109
Query: 114 AEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
EFR T+LG + R ++ +LP DWREKGAV PVK+QG CGSCW+FS
Sbjct: 110 EEFRATFLGAKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
+E N L TG++++LSEQ+LV+C +SGCNGGLM+ AF++ +K GG+
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVEC-------STNGQNSGCNGGLMDDAFDFIIKNGGI 222
Query: 234 MREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--V 291
E+DYPY D + +K+ S+ F V ++++ V + P++VAI A
Sbjct: 223 DTEDDYPYKAVDGKCDINRENAKV-VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 281
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
Q Y GV C LDHGV+ VGYG+ K YWI++NSWG WGE+GY
Sbjct: 282 EFQLYHSGVFSGR-CGTSLDHGVVAVGYGTD-------NGKDYWIVRNSWGPKWGESGYV 333
Query: 352 KICRGRNV----CGVDSMVS 367
++ R NV CG+ M S
Sbjct: 334 RMERNINVTTGKCGIAMMAS 353
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 178/324 (54%), Gaps = 31/324 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATH---GITQFSDLTPAE 115
++ FK + K Y S+ E R I+ N + A+H Q+ D + +++DL E
Sbjct: 27 WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86
Query: 116 FRRTYLGLRR---KLRLPKDADQAP---ILPTN-DLPADFDWREKGAVGPVKDQGSCGSC 168
F +T G R K L + P I P N ++P DWR+KGAV PVKDQG CGSC
Sbjct: 87 FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
WSFS TGALEG +F TGKLVSLSEQ LVDC + ++GCNGG+M+ AF+Y
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYG-------NNGCNGGMMDYAFQYIK 199
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVA 287
GG+ E+ YPY D C F+ + A+ + + DE+ + L GP+++A
Sbjct: 200 DNGGIDTEKSYPYEAID--DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 288 INAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
I+A + Q Y GV C S LDHGVL VGYG++ + + YW++KNSWG +
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSE------EGEDYWLVKNSWGTT 311
Query: 345 WGENGYYKICRGR-NVCGVDSMVS 367
WG+ GY K+ R N CGV + S
Sbjct: 312 WGDQGYVKMARNHDNHCGVATCAS 335
>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
Length = 357
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 137/379 (36%), Positives = 192/379 (50%), Gaps = 42/379 (11%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEH---H 59
++ + + L +L+ + S + + I VTD + + ES+ +LG
Sbjct: 2 ARILAIVLSTLLALAIAVSAARSFEETEYIDMVTDK----IQNLESSLFKILGTNPKSVQ 57
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F+ F ++ K Y S + HRF F N+ ++ T I +F+D+T EF
Sbjct: 58 FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLAINEFADITWEEFHGQ 117
Query: 120 YLGLRRKLRLPKD----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YLG + K D P P DWRE+G V PVK+Q CGSCW+FSTTG
Sbjct: 118 YLGASQNCSATKSNHKFTDAQP-------PTKKDWREEGIVSPVKNQAHCGSCWTFSTTG 170
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
ALE A ATGK V LSEQQLVDC + + GC+GGL + AFEY GGL
Sbjct: 171 ALEAAYTQATGKTVILSEQQLVDCAGAFN-------NFGCSGGLPSQAFEYIKYNGGLDT 223
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVA---NFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
EE YPYT D C +D + + VA N S+ + D+ + A LV+ P++VA +
Sbjct: 224 EEAYPYTAKD--GVCNYDVNNVGVKVADSVNISLGAEDKLKSAVGLVR--PVSVAFQVIQ 279
Query: 293 -MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Y GV C + ++H VL VGYG + + P+WIIKNSWG+SWG
Sbjct: 280 DFRFYKEGVFTSTTCGQGPMDVNHAVLAVGYG------VSEEGTPHWIIKNSWGKSWGVE 333
Query: 349 GYYKICRGRNVCGVDSMVS 367
GY+K+ G+N+CGV + S
Sbjct: 334 GYFKMEMGKNMCGVATCAS 352
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 187/362 (51%), Gaps = 35/362 (9%)
Query: 9 FLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHH---FSLFKK 65
L+ L ++ ++G D + IR V+D +++L ++G H F+ F
Sbjct: 6 LLIVLFCVASAAAGFSFHDSNP-IRMVSDVEEQLL--------QVIGESRHAVSFARFAN 56
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRR 125
++ K Y S +E RF IF N+ K S G+ F+D T EFR LG +
Sbjct: 57 RYGKRYDSVDEMKLRFKIFSENIELIRSSNKRRLSYKLGVNHFADWTWEEFRSHRLGAAQ 116
Query: 126 KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLAT 185
+ + +LP + DWR++G V VKDQGSCGSCW+FSTTGALE A A
Sbjct: 117 NCSATLKGNHK--ITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAF 174
Query: 186 GKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 245
GK +SLSEQQLVDC + + GC+GGL + AFEY GGL EE YPYTG++
Sbjct: 175 GKNISLSEQQLVDCAGAFN-------NFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN 227
Query: 246 RGHACKFDKSKIAASVANFSVVSLD-EDQIAANLVKNGPLAVAINAVY-MQTYIGGVSCP 303
CKF +A V ++L ED++ + P++VA V+ + Y GV
Sbjct: 228 --GLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYTS 285
Query: 304 YICSRR---LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVC 360
C ++H VL VGYG PYW+IKNSWG WG++GY+K+ G+N+C
Sbjct: 286 TACGSTPMDVNHAVLAVGYGIE-------DGIPYWLIKNSWGGDWGDHGYFKMEMGKNMC 338
Query: 361 GV 362
GV
Sbjct: 339 GV 340
>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
Length = 224
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 106/231 (45%), Positives = 146/231 (63%), Gaps = 20/231 (8%)
Query: 137 PILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQL 196
P D+P +FDWREKGAV VK+QG CGSCW+FSTTG +E F TGKL+SLSEQQL
Sbjct: 3 PRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQL 62
Query: 197 VDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSK 256
VDCD S D GCNGGL ++A+E ++ GGLM E++YPY + C
Sbjct: 63 VDCD---------SLDDGCNGGLPSNAYESIIRMGGLMLEDNYPYDA--KNEKCHLKVGN 111
Query: 257 IAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPY--ICSRR-LDHG 313
+AA + + ++ DE ++A L + ++V +NA+ +Q Y G+S P+ CS+ LDH
Sbjct: 112 VAAYINSSVNLTQDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCSKYLLDHA 171
Query: 314 VLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRNVCGVDS 364
VLLVGYG + K +P+WI+KNSWG WGE GY+++ RG CG+++
Sbjct: 172 VLLVGYG------VSEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINT 216
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 172/312 (55%), Gaps = 25/312 (8%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + K K+Y + +E R+++F+ N+ A+ + + G+ +DLT EF++
Sbjct: 32 FQNWMVKHQKSY-TNDEFGSRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKL 90
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
YLG + + K ++ + LPA DWR GAV VK+QG CG C++FSTTG++EG
Sbjct: 91 YLGTKANVTYKKKT----LVGVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEG 146
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
+ + + +LV LSEQQ++DC ++GC+GGLM ++FEY + GGL E Y
Sbjct: 147 IHEITSQQLVPLSEQQILDCSGS-------EGNNGCDGGLMTNSFEYIIAVGGLDTEASY 199
Query: 240 PYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTYI 297
PYTG CKF+K I A++ + V + V P++VAI+A Q Y
Sbjct: 200 PYTG--EVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQPVSVAIDASQSSFQLYA 257
Query: 298 GGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG 356
GV P S +LDHGVL VGYGS + YWI+KNSWG WGENG+ + R
Sbjct: 258 SGVYYEPECSSTQLDHGVLAVGYGSQ-------SGQDYWIVKNSWGADWGENGFILMARN 310
Query: 357 R-NVCGVDSMVS 367
+ N CG+ +M S
Sbjct: 311 KDNNCGIATMAS 322
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 163/321 (50%), Gaps = 30/321 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLTPA 114
H+ FK K YA+ E +R +FK N R A+H S G Q++D+
Sbjct: 27 HWESFKATHAKTYANAAEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQYADMHTH 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSF 171
E G R L K A +ND DWR KGAV P+KDQG CGSCWSF
Sbjct: 87 EVTEKLNGYRSGL---KQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSF 143
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG+LEG FL LVSLSEQ LVDC + E GCNGGLM+SAFEY G
Sbjct: 144 SATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNGGLMDSAFEYVKSYG 196
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA 290
G+ EE YPYT D C + + A + V + E + + K GP++VAI+A
Sbjct: 197 GIDTEESYPYTAEDG--TCLYKAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDA 254
Query: 291 V--YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
Q Y G+ CS LDHGVL VGYGS K +WI+KNSWG SWGE
Sbjct: 255 SNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGS------EWPNKEFWIVKNSWGTSWGE 308
Query: 348 NGYYKICRG-RNVCGVDSMVS 367
GY K+ R +N CG+ + S
Sbjct: 309 EGYIKMARNKKNNCGIATEAS 329
>gi|440910969|gb|ELR60703.1| Cathepsin H, partial [Bos grunniens mutus]
Length = 329
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 176/319 (55%), Gaps = 33/319 (10%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + + K Y+S EE+ HR +F +NLR H + + G+ QFSD++ E +R
Sbjct: 28 HFQSWMVQHQKKYSS-EEYYHRLQVFASNLREINAHNARNHTFKMGLNQFSDMSFDELKR 86
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QGSCGSCW+FSTT
Sbjct: 87 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTT 141
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGKL L+EQQLVDC + + GC GGL + AFEY G+M
Sbjct: 142 GALESAVAIATGKLPFLAEQQLVDCAQNFN-------NHGCQGGLPSQAFEYIRYNKGIM 194
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVA--INAV 291
E+ YPY G D CK+ SK A V + + ++L DE+ + + + P++ A + A
Sbjct: 195 GEDTYPYRGQDGD--CKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTAD 252
Query: 292 YMQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+M Y G+ C + +++H VL VGYG K PYWI+KNSWG +WG
Sbjct: 253 FM-MYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE-------KGIPYWIVKNSWGPNWGMK 304
Query: 349 GYYKICRGRNVCGVDSMVS 367
GY+ I RG+N+CG+ + S
Sbjct: 305 GYFLIERGKNMCGLAACAS 323
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 176/322 (54%), Gaps = 33/322 (10%)
Query: 53 LLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRA----ARHQKLDPSATHGITQF 108
L A + FK KF + Y EE +R +F NL+ ++++ + + I QF
Sbjct: 13 LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQF 72
Query: 109 SDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTNDLP--ADFDWREKGAVGPVKDQGSCG 166
SD+T +F G ++ P+ A A T+ P + DWR KGAV PVKDQG CG
Sbjct: 73 SDMTNEKFNAVMKGYKKG---PRPA--AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCG 127
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGS-CDSGCNGGLMNSAFE 225
SCW+FSTTG +EG +FL TG+LVSLSEQQLVDC GS + GCNGG + A
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-------AGGSYYNQGCNGGWVERAIM 180
Query: 226 YTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPL 284
Y GG+ E YPY D + C+F+ + I A+ + ++ + ++ GP+
Sbjct: 181 YVRDNGGVDTESSYPYEARD--NTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPI 238
Query: 285 AVAINAVY--MQTYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSW 341
+VAI+A + Q+Y GV P S +LDH VL VGYGS G + +W++KNSW
Sbjct: 239 SVAIDASHRSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEG-------GQDFWLVKNSW 291
Query: 342 GESWGENGYYKICRGR-NVCGV 362
SWGE+GY K+ R R N CG+
Sbjct: 292 ATSWGESGYIKMARNRNNNCGI 313
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 131/332 (39%), Positives = 179/332 (53%), Gaps = 39/332 (11%)
Query: 60 FSLFKKKFN-------KAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATH---GITQF 108
F L K+++N K Y S+ E R I+ N + A+H Q+ + + ++
Sbjct: 20 FELVKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKY 79
Query: 109 SDLTPAEFRRTYLGLRR-KLRLPK------DADQAPILPTN-DLPADFDWREKGAVGPVK 160
+DL EF +T G R + P D I P N ++P DWREKGAV PVK
Sbjct: 80 TDLLHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVK 139
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
DQG CGSCWSFS TGALEG +F TGKLVSLSEQ LVDC + ++GCNGG+M
Sbjct: 140 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYG-------NNGCNGGMM 192
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLV 279
+ AF+Y GG+ E+ YPY D C ++ + A+ F + DE + +
Sbjct: 193 DFAFQYIKDNGGIDTEKAYPYEAID--DTCHYNPKAVGATDKGFVDIPQGDEKALMKAIA 250
Query: 280 KNGPLAVAINAVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP++VAI+A + Q Y GV C S LDHGVL VGYG++ + + YW+
Sbjct: 251 TAGPVSVAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSE------EGEDYWL 304
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ GY K+ R R N CG+ + S
Sbjct: 305 VKNSWGTTWGDQGYVKMARNRDNHCGIATAAS 336
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 167/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + Q P+ + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQLMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQYVADNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VAI+A +
Sbjct: 196 LDSEESYPYEATEE--SCKYNPEYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 ESFMFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNSKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 167/319 (52%), Gaps = 23/319 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLT 112
E ++ +K N+ Y EE R +++ N++ H + H T F D+T
Sbjct: 26 EAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EFR+ G + + Q P+ + P DWREKG V PVK+QG CGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGALEG F TGKLVSLSEQ LVDC P+ + GCNGGLM+ AF+Y GG
Sbjct: 143 ATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQYVADNGG 195
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY 292
L EE YPY T+ +CK++ A+ F + E + + GP++VAI+A +
Sbjct: 196 LDSEEAYPYEATEE--SCKYNPEYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGH 253
Query: 293 --MQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ CS +DHGVL+VGY G+ YW++KNSWGE WG G
Sbjct: 254 ESFMFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNSKYWLVKNSWGEEWGMGG 310
Query: 350 YYKICRG-RNVCGVDSMVS 367
Y K+ + RN CG+ S S
Sbjct: 311 YIKMAKDRRNHCGIASAAS 329
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 168/316 (53%), Gaps = 31/316 (9%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQFSDLTPAEFRR 118
FK +F K+Y + E R ++K N R+ H K + S + F DL EF+
Sbjct: 29 FKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQHEFK- 87
Query: 119 TYLGLRRKLRLPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
L + R K + + LPA DWR+KGAV PVKD G CGSCW+FS+TG+
Sbjct: 88 ---ALNKLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPVKDPGQCGSCWAFSSTGS 144
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
L G FL KLVSLSEQQLVDC + GC+GG+M AF+Y GG+ E
Sbjct: 145 LGGQLFLKNKKLVSLSEQQLVDCSGNYG-------NDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA--VYM 293
YPY D C++ +A + + + DE+ + + + GP++VAI+A +
Sbjct: 198 GSYPYEAED--DKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSF 255
Query: 294 QTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYK 352
Q Y G+ P+ + LDHGVL+VGYG+ + YW++KNSWG SWGENGY K
Sbjct: 256 QFYSEGIYDEPFCSNTELDHGVLVVGYGTE-------NGQDYWLVKNSWGPSWGENGYIK 308
Query: 353 ICRGRNV-CGVDSMVS 367
I R N CG+ SM S
Sbjct: 309 IARNHNNHCGIASMAS 324
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/307 (39%), Positives = 170/307 (55%), Gaps = 26/307 (8%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAE 115
++ + ++ + +AY + E + RF IFK NLR H + + G+ QF+DLT E
Sbjct: 47 KNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEE 106
Query: 116 FRRTYLGL----RRKLRLPKDADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWS 170
+R YLG RR+ K+ Q N+L P DWR++GAV P+K+QGSCGSCW+
Sbjct: 107 YRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWA 166
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FST A+ G N + TG++++LSEQ+LVDCD +SGCNGGLM+ AFE+ +
Sbjct: 167 FSTVAAVGGINQIVTGEMITLSEQELVDCDR--------VQNSGCNGGLMDYAFEFIISN 218
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA 290
GG+ E+ YPY G + G K+ S+ + V +E + V + P+ VAI A
Sbjct: 219 GGMDTEKHYPYRGVE-GRCDPVRKNYKVVSIDGYEDVPRNERALQK-AVAHQPVCVAIEA 276
Query: 291 V--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Q Y GV C +DHGV++VGYGS YWI++NSWG WGEN
Sbjct: 277 SGRAFQLYSSGVFTGE-CGEEVDHGVVVVGYGSEDGV-------DYWIVRNSWGTKWGEN 328
Query: 349 GYYKICR 355
GY K+ R
Sbjct: 329 GYVKMER 335
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/329 (36%), Positives = 172/329 (52%), Gaps = 39/329 (11%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQK------------LDP 99
+L +E +F F +++NK+Y +E+ +R+ +FK NL + + L
Sbjct: 49 NLDQSEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLST 108
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAV 156
SA G+ +FSD TP E + G L + I+ P LP +DWR+ V
Sbjct: 109 SAQFGVNKFSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKV 168
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
P+KDQG CGSCW+F G +E + KL+ LSEQQL+DCD D GCN
Sbjct: 169 TPIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCN 219
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIA 275
GGLM+ AF+ L GG+ E DYPY G+++ C D KIA + + F DE+++
Sbjct: 220 GGLMHLAFQELLLMGGVETEADYPYQGSEQ--MCTLDNRKIAVKLNSCFKYDIRDENKLK 277
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGV--SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKP 333
+ GP+A+A++A+ + Y G+ C L+H VLL+G+G P
Sbjct: 278 ELVYTTGPVAIAVDAMDIINYRRGILNQCHIY---DLNHAVLLIGWGIEN-------NVP 327
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGV 362
YWIIKNSWGE WGENG+ ++ R N CG+
Sbjct: 328 YWIIKNSWGEDWGENGFLRVRRNVNACGL 356
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/301 (40%), Positives = 162/301 (53%), Gaps = 31/301 (10%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQK-LDPSATHGITQFSDLTPAEFRRTYLGL- 123
K K Y E D RF +FK NL H + + G+ QF+D+T E+R Y G
Sbjct: 46 KHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTK 105
Query: 124 ----RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEG 179
RR ++ + + LP DWR KGAV P+KDQGSCGSCW+FST +E
Sbjct: 106 SDAKRRLMKTKSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEA 165
Query: 180 ANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 239
N + TGK VSLSEQ+LVDCD + + GCNGGLM+ AFE+ ++ GG+ ++DY
Sbjct: 166 INKIVTGKFVSLSEQELVDCDR--------AYNEGCNGGLMDYAFEFIIQNGGIDTDKDY 217
Query: 240 PYTGTDRGHACKFDKSKIAASVAN---FSVVSLDEDQIAANLVKNGPLAVAINAV--YMQ 294
PY G D C D +K A V N F V ++ V + P+++AI A +Q
Sbjct: 218 PYRGFD--GIC--DPTKKNAKVVNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQ 273
Query: 295 TYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV C LDHGV++VGYGS YW+++NSWG WGE+GY+K+
Sbjct: 274 LYQSGVFTGK-CGTSLDHGVVVVGYGSENGV-------DYWLVRNSWGTGWGEDGYFKMQ 325
Query: 355 R 355
R
Sbjct: 326 R 326
>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
Length = 381
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 162/309 (52%), Gaps = 38/309 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F FK+ + +AY + E R F+ NL HQ +P A GIT+F DL+ AEF
Sbjct: 38 FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 120 YL-GLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
YL G + A Q DL P DWREKGAV PVKDQG+CGSCW+FS G
Sbjct: 98 YLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGL 233
+E A LVSLSEQQLV CD + D+GCNGGLM AFE+ L+ G +
Sbjct: 158 NIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYGIV 208
Query: 234 MREEDYPYTGTDRGHACKFDKSKI--AASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
E+ YPYT + A + SK+ A + + ++ +E +AA L +NGP+A+A++A
Sbjct: 209 FTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDAS 268
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+Y GVLLVGY G PYW+IKNSWGE WGE GY
Sbjct: 269 SFMSY--------------QSGVLLVGYNKTGGV-------PYWVIKNSWGEDWGEKGYV 307
Query: 352 KICRGRNVC 360
++ G N C
Sbjct: 308 RVAMGLNAC 316
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 164/313 (52%), Gaps = 34/313 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLR----RAARHQKLDPSATHGITQFSDLTPAEFRR 118
FK KF K YA+ EE HR ++F L+ R+ K + + I FSDLT E
Sbjct: 23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82
Query: 119 TYLGLRRKLR----LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
T G+ R+ LPK A PT + AD DWR KGAV PVKDQG CGSCW+FS
Sbjct: 83 TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
ALEGA+FL TG LVSLSEQ LVDC + GCNGG A++Y + G+
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDCSSSYG-------NQGCNGGWPYQAYQYIIANRGID 189
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA--V 291
E YPY D C++D I A+V+++ S DE + + GP++V I+A
Sbjct: 190 TESSYPYKAIDDN--CRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQS 247
Query: 292 YMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGY 350
+Y GGV C S +H V VGYG+ YWI+KNSWG WGE+GY
Sbjct: 248 SFGSYGGGVYYEPNCDSWYANHAVTAVGYGT------DANGGDYWIVKNSWGAWWGESGY 301
Query: 351 YKICRGR-NVCGV 362
K+ R R N C +
Sbjct: 302 IKMARNRDNNCAI 314
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 183/340 (53%), Gaps = 41/340 (12%)
Query: 37 DGGDEILSHHESTNNDLLGAEHHFSL---------FKKKFNKAYASQEEHDHRFTIFKAN 87
+GG+++ + + +L G +H +SL F K +NK Y + E + RF IF N
Sbjct: 788 NGGEKVALQYNVYSREL-GQKHLYSLEEAPTLFEQFIKDYNKEY-DESEKEERFKIFVNN 845
Query: 88 LRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN---DL 144
L+ + +A +GI +FSDL+ EF + Y GL+R+ + + LP +
Sbjct: 846 LKDINAMNERSSNAVYGINKFSDLSKDEFVKFYTGLKREESPSNEDHKKTDLPKSFNVTA 905
Query: 145 PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECD 204
P FDWR+KG V VK QG C SCW+FS G +E N + TGKL+ +SEQQLVDCD
Sbjct: 906 PDQFDWRKKGVVSSVKFQGHCVSCWAFSVAGNVESINAIKTGKLIDVSEQQLVDCDE--- 962
Query: 205 PEEPGSCDSGCNGGLM--NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVA 262
+ GC+GG+ S F Y K G M E YPY G + C+++ SK+ +
Sbjct: 963 ------WNFGCSGGIACSKSHFSYFHKKGA-MSLESYPYVGKE--GQCRYNSSKVVIRLK 1013
Query: 263 NFS-VVSLDEDQIAANLVKNGPLAVAINAVYMQTYIGGV---SCPYICSRRLDHGVLLVG 318
++ ++L ED+I L GPL++ I++ + Y GG+ C + ++ +H VLLVG
Sbjct: 1014 DYQYFIALSEDEIKEYLYNIGPLSIDIDSSQIHHYKGGIVIKECQEV--KKTNHAVLLVG 1071
Query: 319 YGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGRN 358
YG YWI+KNSWG++WGE GY++I RG N
Sbjct: 1072 YGKENGV-------EYWIVKNSWGQNWGEKGYFRIQRGVN 1104
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 110/294 (37%), Positives = 157/294 (53%), Gaps = 27/294 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F K +NK Y + E + RF IF NL+ + +A +GI +FSDL+ EF +
Sbjct: 519 FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 577
Query: 120 YLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y GL+R+ + + LP + P FDWR+KG V +K+Q CGSCW+FS G
Sbjct: 578 YTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVVSSIKNQKHCGSCWAFSAAGN 637
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 236
+E + + TGKLV +SEQQLVDCD S DSGC+GGL +A Y + G +
Sbjct: 638 VESIHAIKTGKLVHVSEQQLVDCD---------SQDSGCSGGLTWNAMRY-FRTNGAVSL 687
Query: 237 EDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVYMQT 295
+ YPY + C++D +K+ + ++ + L EDQI +L G L++ I + +
Sbjct: 688 KSYPYVAQNEN--CRYDSNKVVIRLKDYKHITQLSEDQIKEHLYNIGLLSIDITSTQLTW 745
Query: 296 YIGGVSCPYICSRR--LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
Y GG+ C R +DH VLLV YG YWI+KNSWG++ GE
Sbjct: 746 YEGGILIEE-CRRSDLVDHAVLLVEYGKENSV-------EYWIVKNSWGQNGGE 791
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 148/265 (55%), Gaps = 26/265 (9%)
Query: 100 SATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAV 156
+A +GI +FSDL+ EF + Y GL+R+ + + LP + P FDWR+KG V
Sbjct: 7 NAVYGINKFSDLSKEEFVKYYTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVV 66
Query: 157 GPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCN 216
+K+Q CGSCW+FS +E + + TGKL+ +SEQQL+DCD DSGC+
Sbjct: 67 SSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCD---------KYDSGCS 117
Query: 217 GGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIA 275
GGL A Y + A G M + YPY + C++D SK+ + + L EDQI
Sbjct: 118 GGLPWDALRYFV-ANGAMSLKSYPYVAKE--GKCRYDSSKVEIRLKEYKHKEKLSEDQIK 174
Query: 276 ANLVKNGPLAVAINAVYMQTYIGGVSCPYICSRR--LDHGVLLVGYGSAGYAPIRLKEKP 333
+L GPL++AI + + +Y GG+ C R ++H VLLVGYG
Sbjct: 175 EHLYNIGPLSIAITSSPLASYNGGILIEE-CHRSYLINHAVLLVGYGKENGV-------K 226
Query: 334 YWIIKNSWGESWGENGYYKICRGRN 358
YWI+KNSWG++WGENGY+++ G N
Sbjct: 227 YWIVKNSWGQNWGENGYFRMKMGVN 251
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 91/163 (55%), Gaps = 13/163 (7%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F F K +NK Y + E + RF IF NL+ + +A +GI +FSDL+ EF +
Sbjct: 302 FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 360
Query: 120 YLGLRRKLRLPKDADQAPILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGA 176
Y GL+R + ++ LP + P FDWR+KG V VK+Q CGSCW+FS
Sbjct: 361 YTGLKRDRCTTTEHHKSTDLPKSFNITAPDQFDWRKKGVVSSVKNQRHCGSCWAFSAAAN 420
Query: 177 LEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGL 219
+E + + TGKL+ +SEQQL+DCD DSGC+GGL
Sbjct: 421 VESIHAIKTGKLIDVSEQQLLDCD---------KYDSGCSGGL 454
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/330 (37%), Positives = 174/330 (52%), Gaps = 29/330 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
+L+ E +++ + A H + +++ + Y E RF IFKAN+ +
Sbjct: 21 VLAAREQSDHAAMVARHE--RWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKF 78
Query: 102 THGITQFSDLTPAEFRRTYLG---LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGP 158
G+ QF+DLT EFR T + +R+P + + LPA DWR KGAV P
Sbjct: 79 WLGVNQFADLTNYEFRATKTNKGFIPSTVRVPTTFRYENV-SIDTLPATVDWRTKGAVTP 137
Query: 159 VKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGG 218
+KDQG CG CW+FS A+EG L+TGKL+SLSEQ+LVDCD + D GC GG
Sbjct: 138 IKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGE-------DQGCEGG 190
Query: 219 LMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANL 278
LM+ AF++ +K GGL E YPYT D C S AA++ + V + +
Sbjct: 191 LMDDAFKFIIKNGGLTTESKYPYTAAD--GKCN-GGSNSAATIKGYEEVPANNEAALMKA 247
Query: 279 VKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
V N P++VA++ + Q Y GGV C LDHG++ +GYG G YW+
Sbjct: 248 VANQPVSVAVDGGDMTFQFYSGGVMTGS-CGTDLDHGIVAIGYGKDG------DGTQYWL 300
Query: 337 IKNSWGESWGENGYYK----ICRGRNVCGV 362
+KNSWG +WGENG+ + I R +CG+
Sbjct: 301 LKNSWGTTWGENGFLRMEKDISDKRGMCGL 330
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/297 (41%), Positives = 168/297 (56%), Gaps = 19/297 (6%)
Query: 64 KKKFNKAYASQEE-HDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
K N+AYAS E ++ RF I+ NLR A + S + ++DL+ E+R LG
Sbjct: 54 KPPSNRAYASSAEVYERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLSQDEYRSKALG 113
Query: 123 LRRKLRLPKDADQAPILPTNDLPAD-FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
L + AP L +P + DW GAV PVKDQ CGSCW+FSTTGA+EGAN
Sbjct: 114 YNAHLHKKRPLRAAPFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGAN 173
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
+ATGKLVSLSEQ LVDCD E D+GC GG M+SAF++ + GG+ E+DYPY
Sbjct: 174 AIATGKLVSLSEQMLVDCDRE--------YDTGCRGGFMDSAFDFIVNNGGIDTEDDYPY 225
Query: 242 TGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIG 298
D C+ ++++ ++ + V +++ V + P++VAI A + Q Y G
Sbjct: 226 RAED--GICQDNRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGG 283
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C LDH VL+VGYG+A L PYW++KNSWG WGE GY ++ R
Sbjct: 284 GV-FDAECGTALDHAVLVVGYGTASNGTHNL---PYWLVKNSWGAEWGEKGYIRLLR 336
>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/310 (41%), Positives = 176/310 (56%), Gaps = 38/310 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
+S FK +NK YA + +R +F NL+ + GIT+F DLT EF++T
Sbjct: 43 WSTFKNSYNKKYADPDFEQYRIEVFTENLKIIDSN-----CQNFGITKFMDLTQEEFKQT 97
Query: 120 YLGLRRKLRLPKDADQAPILPTNDLPADF--DWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
YL L+ K K ++ P ND D DW KGAV PVKDQG CGSCWSFSTTGA+
Sbjct: 98 YLTLKTK----KYIEEIPETVFNDSNGDIEIDWTMKGAVTPVKDQGKCGSCWSFSTTGAV 153
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EGA+FL++ +LVSLSEQ L+DC + + GCNGGLM++AF++ + G+ E
Sbjct: 154 EGAHFLSSNELVSLSEQYLIDCSK--------NGNEGCNGGLMDTAFDF-IAQNGIPTEN 204
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVYMQTYI 297
YPY D CK S +++S ++ + + L K P+A+A++A Q Y
Sbjct: 205 AYPYKALD--GTCKMTTGPYKISSYQ-NIISCND--LLSKLQKQ-PIAIAVDANNFQFYT 258
Query: 298 GGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR 357
G+ C + LDHGVLLVGY S K+K +W +KNSWG SWGE+GY ++ G
Sbjct: 259 KGIFSK--CGKNLDHGVLLVGYSS--------KDK-FWKVKNSWGSSWGEDGYIRLSAG- 306
Query: 358 NVCGVDSMVS 367
N CG+ + S
Sbjct: 307 NTCGLCNQAS 316
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/320 (37%), Positives = 177/320 (55%), Gaps = 28/320 (8%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP----SATHGITQFSDLTP 113
+ + FK ++ K Y S +E +R ++++ N H + S T + QF D+T
Sbjct: 20 NEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTT 79
Query: 114 AEFRRTYLG-LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
E G L ++P+ P++ ++LP DWR+KGAV PVKDQ +CGSCW+FS
Sbjct: 80 EEINAAMNGFLSAGKKVPRGTMYQPLV--DELPDTVDWRDKGAVTPVKDQKACGSCWAFS 137
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL+TGKLVSLSEQ LVDC + + GC GGLM++AF Y G
Sbjct: 138 ATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYG-------NFGCGGGLMDNAFRYIKDNNG 190
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINA- 290
+ EE YPY + C+F+ + A+++++ + ED + + + GP++VAI+A
Sbjct: 191 IDTEESYPYEA--KNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDAS 248
Query: 291 -VYMQTYIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Y G+ CS LDHGVL VGYG+ YW++KNSW E+WG++
Sbjct: 249 TSTFHFYSRGIYYDEKCSSSFLDHGVLAVGYGTD-------DSSDYWLVKNSWNETWGDS 301
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY K+ R R N CG+ S S
Sbjct: 302 GYIKMSRNRNNNCGIASQAS 321
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/297 (41%), Positives = 163/297 (54%), Gaps = 26/297 (8%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGL-- 123
K K Y + E D RF IFK NLR + + + G+ +F+DLT E+R YLG
Sbjct: 46 KHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKI 105
Query: 124 ---RRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGA 180
RR R P + + T LP DWR++GAV PVKDQ SCGSCW+FS GA+EG
Sbjct: 106 DPNRRLGRTPSNRYAPRVGET--LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGI 163
Query: 181 NFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
N + TG L+SLSEQ+LVDCD + GCNGGLM+ AFE+ +K GG+ EEDYP
Sbjct: 164 NKIVTGDLISLSEQELVDCDT--------GYNMGCNGGLMDYAFEFIIKNGGIDSEEDYP 215
Query: 241 YTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQTYIG 298
Y G D G ++ K+ S+ + V+ ++ V N P++VA+ Q Y
Sbjct: 216 YKGVD-GRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREFQLYSS 274
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C LDHGV+ VGYG+ +WI++NSWG WGE GY ++ R
Sbjct: 275 GVFTGR-CGTALDHGVVAVGYGTD-------NGHDFWIVRNSWGADWGEEGYIRLER 323
>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
Length = 376
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 169/335 (50%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ GA+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S Q+L+DC C GC GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVQELLDCSR---------CGDGCQGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
++ Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLRLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 183/332 (55%), Gaps = 33/332 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL----DPSATHGITQ 107
D++ E H FK + K Y E R IF N + A+H + S + +
Sbjct: 23 DVVMEEWH--TFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNK 80
Query: 108 FSDLTPAEFRRTYLG----LRRKLRLPKDADQAP--ILPTN-DLPADFDWREKGAVGPVK 160
++DL EFR+ G L ++LR D+ + I P + LP DWR KGAV VK
Sbjct: 81 YADLLHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
DQG CGSCW+FS+TGALEG +F +G LVSLSEQ LVDC + ++GCNGGLM
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYG-------NNGCNGGLM 193
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLV 279
++AF Y GG+ E+ YPY D +C F+K I A+ F+ + DE ++A +
Sbjct: 194 DNAFRYIKDNGGIDTEKSYPYEAIDD--SCHFNKGAIGATDRGFTDIPQGDEKKMAEAVA 251
Query: 280 KNGPLAVAINAVY--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
GP+AVAI+A + Q Y GV + P ++ LDHGVL+VGYG+ YW+
Sbjct: 252 TVGPVAVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGD------DYWL 305
Query: 337 IKNSWGESWGENGYYKICRGR-NVCGVDSMVS 367
+KNSWG +WG+ G+ K+ R + N CG+ S S
Sbjct: 306 VKNSWGTTWGDKGFIKMLRNKDNQCGIASASS 337
>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
Length = 376
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 169/335 (50%), Gaps = 42/335 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLD-PSATHGITQFSDLTPAEFRR 118
F LF+ +FN++Y S EEH HR IF NL +A R Q+ D +A G+T FSDLT EF +
Sbjct: 42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 119 TYLGLRRKLRLPKDADQAPIL--------PTNDLPADFDWRE-KGAVGPVKDQGSCGSCW 169
Y G RR A P + P +P DWR+ GA+ P+KDQ +C CW
Sbjct: 102 LY-GYRRA------AGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCW 154
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+ + G +E ++ V +S Q+L+DC C GC GG + AF L
Sbjct: 155 AMAAAGNIETLWRISFWDFVDVSVQELLDCSR---------CGDGCQGGFVWDAFITVLN 205
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+DYP+ G R H C K + A + +F ++ +E +IA L GP+ V IN
Sbjct: 206 NSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTIN 265
Query: 290 AVYMQTYIGGV--SCPYICSRRL-DHGVLLVGYG-------------SAGYAPIRLKEKP 333
++ Y GV + P C +L DH VLLVG+G S+ P P
Sbjct: 266 MKPLRLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAERVSSQSQPQPPHPTP 325
Query: 334 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 368
YWI+KNSWG WGE GY+++ RG N CG+ T
Sbjct: 326 YWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLT 360
>gi|387015020|gb|AFJ49629.1| Cathepsin H [Crotalus adamanteus]
Length = 337
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 170/323 (52%), Gaps = 36/323 (11%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
E F + + +AY S+EE HR IF N ++ +H + S G+ QFSD+T EF
Sbjct: 33 EQLFKAWASQHRRAYRSEEEFRHRLQIFLDNKQKIDKHNAGNSSFRMGLNQFSDMTFTEF 92
Query: 117 RRTYLGLRRKL------RLPKDADQAPILPTNDLPADFDWREKGA-VGPVKDQGSCGSCW 169
R+ YL + P+ A P DWR+KG V PVK+QGSCGSCW
Sbjct: 93 RKKYLWQEPQNCSATMGNFPRSA--------GPCPKAIDWRKKGKFVSPVKNQGSCGSCW 144
Query: 170 SFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK 229
+FSTTG LE A + TGKL++L+EQQL+DC + + GC+GGL + AFEY L
Sbjct: 145 TFSTTGCLESAIAIKTGKLLNLAEQQLIDCAQNFN-------NFGCSGGLPSQAFEYILY 197
Query: 230 AGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAI 288
GLM EE YPY + CKF K A + + +SL DE + + P+++A
Sbjct: 198 NKGLMDEEAYPYRAQN--GTCKFQPQKAVAFIKDVVNISLYDEQGLVQAVGTYNPVSIAF 255
Query: 289 NAVY-MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGES 344
Y GV C + +++H VL VGYG G P+WI+KNSWG S
Sbjct: 256 EVREDFVHYQEGVYTSTDCDKTPDKVNHAVLAVGYGEEGGV-------PFWIVKNSWGTS 308
Query: 345 WGENGYYKICRGRNVCGVDSMVS 367
WG +GY+ I RG+N+CG+ S
Sbjct: 309 WGLDGYFNIERGKNMCGLADCAS 331
>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
Length = 330
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 169/313 (53%), Gaps = 33/313 (10%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRT 119
F + + NK Y+S EE+ +R F N R+ H S G+ QFSD+T +EF++
Sbjct: 30 FKTWMTQHNKHYSS-EEYSYRLRTFIQNKRKVEEHNSGRHSYRMGLNQFSDMTFSEFKKL 88
Query: 120 YLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKG-AVGPVKDQGSCGSCWSFSTTG 175
YL LR P++ +L P DWR KG V PVK+QG CGSCW+FSTTG
Sbjct: 89 YL-----LREPQNCSATRGNHVLSMGPYPDFVDWRTKGNYVTPVKNQGGCGSCWTFSTTG 143
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
LE A + TGKL+SL+EQQLVDC + GCNGGL + AFEY GGL
Sbjct: 144 CLESAIAIKTGKLLSLAEQQLVDCAGAYK-------NHGCNGGLPSQAFEYIKYNGGLEA 196
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV--Y 292
E+DYPYT D+ C++ +K A V ++ DE+ I + + P+++A +
Sbjct: 197 EKDYPYTAQDQ--HCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDF 254
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GGV C +++H VL VGYG YWI+KNSWG WG NG
Sbjct: 255 FQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQ-------NGTKYWIVKNSWGPEWGLNG 306
Query: 350 YYKICRGRNVCGV 362
Y+ I RG+N+CG+
Sbjct: 307 YFYIIRGKNMCGL 319
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 175/314 (55%), Gaps = 31/314 (9%)
Query: 68 NKAYASQEEHDHRFTIFKANLRRAARHQK-----LDPSATHGITQFSDLTPAEFRRTYLG 122
NK YA+ E R IF N H + L+ +T + F+DLT EF YL
Sbjct: 38 NKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYST-ALNAFADLTLEEFAEKYLT 96
Query: 123 LRRKLR--LPKD-ADQAPILPTNDL-PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
L++ + +D + Q PT L P DWR+KG V P+KDQG CGSCW+FS TGALE
Sbjct: 97 LKQTPMEGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSATGALE 156
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G TGKL+SLSEQQLVDC E GCNGG MN AF Y ++ G E D
Sbjct: 157 GQLKRKTGKLISLSEQQLVDCSTYTGNE-------GCNGGDMNDAFRYWMRNGA-ESESD 208
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPYT D CKF+ SK+ V+ F V EDQ+ ++ + GP++VAI+A
Sbjct: 209 YPYTAMD--GKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDATSSGFML 266
Query: 296 YIGGVSCPYICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y G+ CS++ LDH VL+VGY + + ++K YWI+KNSWGE WG+ GY +
Sbjct: 267 YKKGIYQDNTCSQQYLDHAVLVVGYDAD-----KTRQK-YWIVKNSWGEDWGQRGYIWMA 320
Query: 355 RGR-NVCGVDSMVS 367
R + N+CG+ +M S
Sbjct: 321 RDKGNMCGIATMAS 334
>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
Length = 335
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 174/318 (54%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+++E H HR +F +N R+ H + + + QFSD++ AE +
Sbjct: 34 HFKSWMSKHHKTYSTEEYH-HRLQMFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKH 92
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 93 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTT 147
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 148 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNKGIM 200
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D G+ CKF K V + + +++ DE+ + + P++ A
Sbjct: 201 GEDTYPYQGKD-GY-CKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD 258
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 259 FMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEK-------NGIPYWIVKNSWGPQWGMNG 311
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 312 YFLIERGKNMCGLAACAS 329
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 182/347 (52%), Gaps = 37/347 (10%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKL 97
+LS + + DL+ E + LFK + K Y + E R IF N ++ +H Q+
Sbjct: 11 VLSINAVSFYDLVMEE--WQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRG 68
Query: 98 DPSATHGITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAP---------ILPTN-DLPAD 147
+ G+ ++SD+ EF T+ G + + P I P N LP
Sbjct: 69 EVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKH 128
Query: 148 FDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 207
DW + GAV PVKDQG CGSCW+FS TGALEG +F T LVSLSEQ L+DC E
Sbjct: 129 VDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTE----- 183
Query: 208 PGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVV 267
++GCNGGLM+ AF+Y GG+ E YPY G + C+++ A ++ V
Sbjct: 184 --EGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNND--VCRYEPENSGAIDTGYTDV 239
Query: 268 SL-DEDQIAANLVKNGPLAVAINAVY--MQTYIGGVSCPYICSRR---LDHGVLLVGYGS 321
L DED + + + GP++VAI+A Q Y GV C LDHGVL+VGYG+
Sbjct: 240 PLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGT 299
Query: 322 AGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR-GRNVCGVDSMVS 367
++ YW++KNSWG+SWGENGY K+ R N CG+ + S
Sbjct: 300 D-----EETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPS 341
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 176/323 (54%), Gaps = 33/323 (10%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKL-DPSATHGITQFSDLTPAEF 116
H F + ++ K YASQEE R +F+ N H + S T + F+DLT EF
Sbjct: 28 HLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEF 87
Query: 117 RRTYLGLRR----KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+ + LGL L + + Q P D+PA DWR+ GAV VKDQG+CG+CWSFS
Sbjct: 88 KASRLGLSSAASASLNVDRSNRQIPDFVA-DVPASVDWRKNGAVTQVKDQGNCGACWSFS 146
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TGA+EG N + TG LVSLSEQ+LVDCD S ++GC GG+M+ AF++ + G
Sbjct: 147 ATGAIEGINKIVTGSLVSLSEQELVDCDK--------SYNNGCEGGIMDYAFQFVIDNHG 198
Query: 233 LMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAVAI--N 289
+ EEDYPY G DR +C +K K ++ + V + ++ V N P++V I +
Sbjct: 199 IDTEEDYPYQGRDR--SCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGS 256
Query: 290 AVYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
Q Y G+ + P CS LDH VL+VGYGS YWI+KNSWG WG +
Sbjct: 257 ERAFQLYSKGIFTGP--CSTSLDHAVLIVGYGSENGV-------DYWIVKNSWGSYWGMD 307
Query: 349 GYYKICRG----RNVCGVDSMVS 367
GY + R R +CG++ + S
Sbjct: 308 GYMHMQRNSGSSRGLCGINMLAS 330
>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
Length = 328
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 175/323 (54%), Gaps = 39/323 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ----KLDPSATHGITQFSDLTPAE 115
++ FK K Y+S E R IF+ NL + H K + + T + QF+D+T E
Sbjct: 26 WAEFKLTHKKQYSSPIEELRRKAIFQDNLVKIEEHNAKFAKGEVTYTKAVNQFADMTADE 85
Query: 116 FRR-------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSC 168
F T + KLR+P P A+ DWR K AV VKDQG CGSC
Sbjct: 86 FMAYVNRGLATKPKMNEKLRIPFVKSGKPA------AAEVDWRSK-AVTEVKDQGQCGSC 138
Query: 169 WSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTL 228
WSFSTTGA+EG ++ L SLSEQ LVDC + ++GCNGG M+SAF+Y +
Sbjct: 139 WSFSTTGAVEGQLAISGKGLTSLSEQNLVDCSSQYG-------NAGCNGGWMDSAFDY-I 190
Query: 229 KAGGLMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVA 287
G+M E YPYT D C+FD S+ S+ + + S DE + + NGP+AVA
Sbjct: 191 HDNGIMSESAYPYTAMDGN--CRFDASQSVTSLQGYYDIPSGDESALQDAVANNGPVAVA 248
Query: 288 INAV-YMQTYIGGVSCPYICS-RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
++A +Q Y GGV CS + L+HGVL+VGYGS G + YWI+KNSWG W
Sbjct: 249 LDATEELQLYSGGVLYDTTCSAQALNHGVLVVGYGSEG-------GQDYWIVKNSWGSGW 301
Query: 346 GENGYYKICRGR-NVCGVDSMVS 367
GE GY++ R R N CG+ + S
Sbjct: 302 GEQGYWRQARNRNNNCGIATAAS 324
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 163/321 (50%), Gaps = 30/321 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLTPA 114
H+ FK K YA+ E +R +FK N R A+H S G Q++D+
Sbjct: 27 HWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQYADMHTH 86
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSF 171
E G R L K A +ND DWR KGAV P+KDQG CGSCWSF
Sbjct: 87 EVTEKLNGYRSGL---KQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSF 143
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG+LEG FL LVSLSEQ LVDC + E GCNGGLM+SAFEY G
Sbjct: 144 SATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNGGLMDSAFEYVKSNG 196
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINA 290
G+ EE YPYT D C + + A + V + E + + K GP++VAI+A
Sbjct: 197 GIDTEESYPYTAEDG--TCLYKAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDA 254
Query: 291 V--YMQTYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
Q Y G+ CS LDHGVL VGYGS K +WI+KNSWG SWGE
Sbjct: 255 SNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGS------EWPNKEFWIVKNSWGTSWGE 308
Query: 348 NGYYKICRG-RNVCGVDSMVS 367
GY K+ R +N CG+ + S
Sbjct: 309 EGYIKMARNKKNNCGIATEAS 329
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 162/288 (56%), Gaps = 25/288 (8%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR------RKLRL 129
E D RF IFK NL+ H + + G+ +F+DL+ E+R YLG + R
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMART 130
Query: 130 PKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLV 189
+++ + LP DWR +GAV VKDQGSCGSCW+FST A+EG N + TG+LV
Sbjct: 131 KTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGELV 190
Query: 190 SLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHA 249
SLSEQ+LVDCD + ++GC+GGLM AFE+ + GG+ +EDYPY G D G
Sbjct: 191 SLSEQELVDCDR--------TVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVD-GKC 241
Query: 250 CKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICS 307
++ K+ S+ ++ V ++ V N P++VAI A Q Y+ G+ C
Sbjct: 242 DQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGK-CG 300
Query: 308 RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
LDHGV VGYG+ YWI++NSWG+SWGE+GY ++ R
Sbjct: 301 TALDHGVTAVGYGTENGV-------DYWIVRNSWGKSWGESGYVRMER 341
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 171/314 (54%), Gaps = 32/314 (10%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR--RTY 120
+K NK Y+ E R+TI+K N RR H + QF D+T +EF+ Y
Sbjct: 30 WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFKAFNGY 89
Query: 121 LGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
L K + + L N+ P DWR +G V PVKDQG CGSCW+FSTTG+LE
Sbjct: 90 LS-------HKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLE 142
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G +F TGKLVSLSEQ LVDC ++GCNGGLM++AF Y + G+ E
Sbjct: 143 GQHFKKTGKLVSLSEQNLVDC-------STAYGNNGCNGGLMDNAFTYIKENKGIDSEAS 195
Query: 239 YPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAVY--MQT 295
YPYT D C F K +AA+ F + +E+++ + GP++VAI+A + Q
Sbjct: 196 YPYTAEDG--KCVFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQF 253
Query: 296 YIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKIC 354
Y GV + P S LDHGVL+VGYG+ K YW++KNSW SWG+ GY K+
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGTE-------SGKDYWLVKNSWNTSWGDKGYIKMR 306
Query: 355 R-GRNVCGVDSMVS 367
R +N CG+ + S
Sbjct: 307 RNAKNQCGIATKAS 320
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 163/313 (52%), Gaps = 23/313 (7%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQ----FSDLTPAEFRR 118
+K + Y EE R +++ N++ H + HG T F D+T EFR+
Sbjct: 32 WKATHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALE 178
G + + Q P+ ++P DWREKG V PVK+QG CGSCW+FS TGALE
Sbjct: 91 VMNGFQNQKHKKGKMFQEPLFA--EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALE 148
Query: 179 GANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREED 238
G F TGKLVSLSEQ LVDC + GCNGGLM++AF Y GGL EE
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSR-------AQGNEGCNGGLMDNAFRYVKDNGGLDSEES 201
Query: 239 YPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--MQTY 296
YPY G D C + AA+ F + E + + GP++VAI+A + Q Y
Sbjct: 202 YPYLGRDT-ETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPISVAIDAGHQSFQFY 260
Query: 297 IGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
G+ P S+ LDHGVL+VGYG G +WI+KNSWG WG NGY K+ +
Sbjct: 261 KSGIYFDPDCSSKDLDHGVLVVGYGFEGTD----SNNKFWIVKNSWGPEWGWNGYVKMAK 316
Query: 356 GRNV-CGVDSMVS 367
+N CG+ + S
Sbjct: 317 DQNNHCGIATAAS 329
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 168/315 (53%), Gaps = 26/315 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHG----ITQFSDLTPAEFRR 118
+K + Y + EE R +++ N++ H HG + F D+T EFR+
Sbjct: 32 WKATHRRLYGASEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQ 90
Query: 119 TYLGLR-RKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R +KLR K + L DLP DWR+KG V PVK+Q CGSCW+FS TGAL
Sbjct: 91 VMGCFRNQKLRKGKLFREPLFL---DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGAL 147
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG F TGKLVSLSEQ LVDC H P+ + GCNGG MNSAF Y + GGL EE
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSH---PQG----NQGCNGGFMNSAFRYVKENGGLDSEE 200
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINAVY--MQ 294
YPY D CK+ A+ F VV +++ V GP++VA++A + Q
Sbjct: 201 SYPYVAMD--GICKYRPENSVANDTGFEVVPAGKEKALMKAVATVGPISVAMDAGHSSFQ 258
Query: 295 TYIGGVSC-PYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y G+ P S+ LDHGVL+VGYG G YW++KNSWG WG NGY KI
Sbjct: 259 FYKSGIYFEPDCSSKNLDHGVLVVGYGFEG---ANSDNNKYWLVKNSWGPEWGSNGYVKI 315
Query: 354 CRGR-NVCGVDSMVS 367
+ + N CG+ + S
Sbjct: 316 AKDKDNHCGIATAAS 330
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/330 (37%), Positives = 174/330 (52%), Gaps = 29/330 (8%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
+L+ E +++ + A H + +++ + Y E RF IFKAN+ +
Sbjct: 21 VLAAREQSDHAAMVARHE--RWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKF 78
Query: 102 THGITQFSDLTPAEFRRTYLG---LRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGP 158
G+ QF+DLT EFR T + +R+P + + LPA DWR KGAV P
Sbjct: 79 WLGVNQFADLTNYEFRATKTNKGFIPSTVRVPTTFRYENV-SIDTLPATVDWRTKGAVTP 137
Query: 159 VKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGG 218
+KDQG CG CW+FS A+EG L+TGKL+SLSEQ+LVDCD + D GC GG
Sbjct: 138 IKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGE-------DQGCEGG 190
Query: 219 LMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANL 278
LM+ AF++ +K GGL E YPYT D C S AA++ + V + +
Sbjct: 191 LMDDAFKFIIKNGGLTTESKYPYTAAD--GKCN-GGSNSAATIKGYEDVPANNEAALMKA 247
Query: 279 VKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
V N P++VA++ + Q Y GGV C LDHG++ +GYG G YW+
Sbjct: 248 VANQPVSVAVDGGDMTFQFYSGGVMTGS-CGTDLDHGIVAIGYGKDG------DGTQYWL 300
Query: 337 IKNSWGESWGENGYYK----ICRGRNVCGV 362
+KNSWG +WGENG+ + I R +CG+
Sbjct: 301 LKNSWGTTWGENGFLRMEKDISDKRGMCGL 330
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/327 (37%), Positives = 175/327 (53%), Gaps = 39/327 (11%)
Query: 42 ILSHHESTNNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA 101
I+S+ E + ++ ++ + + + Y + E + RF +F+ NLR +H +
Sbjct: 26 IVSYGERSEEEV---RRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAG 82
Query: 102 TH----GITQFSDLTPAEFRRTYLGLR------RKLRLPKDADQAPILPTNDLPADFDWR 151
H G+ +F+DLT E+R TYLG R RKL AD +LP DWR
Sbjct: 83 LHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQADD-----NEELPETVDWR 137
Query: 152 EKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSC 211
+KGAV +KDQG CGSCW+FS A+EG N + TG ++ LSEQ+LVDCD S
Sbjct: 138 KKGAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SY 189
Query: 212 DSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI-AASVANFSVVSLD 270
+ GCNGGLM+ AFE+ + GG+ EEDYPY +R + C +K ++ + V ++
Sbjct: 190 NEGCNGGLMDYAFEFIINNGGIDSEEDYPY--KERDNRCDANKKNAKVVTIDGYEDVPVN 247
Query: 271 EDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIR 328
++ V N P++VAI A Q Y G+ C LDHGV VGYG+
Sbjct: 248 SEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTG-TCGTALDHGVAAVGYGTE------ 300
Query: 329 LKEKPYWIIKNSWGESWGENGYYKICR 355
K YW+++NSWG WGE+GY ++ R
Sbjct: 301 -NGKDYWLVRNSWGTVWGEDGYIRMER 326
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/336 (38%), Positives = 175/336 (52%), Gaps = 42/336 (12%)
Query: 51 NDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARH----QKLDPSATHGIT 106
+D++ A + FK +F +AY E RF IF AN + H Q+ + G+
Sbjct: 54 DDIIAA---WKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVN 110
Query: 107 QFSDLTPAEFRR------TYLGLRRKLRLPKDADQAPILPTNDLPADFDWREKGAVGPVK 160
+F+D T E ++ T +R K ++ LP+ DWR +GAV VK
Sbjct: 111 EFTDKTDYELKKLRGYKVTSGAIRHKGSTFIRSEHTK------LPSKVDWRREGAVTDVK 164
Query: 161 DQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 220
+QG CGSCW+FSTTGA+EG ++ T +LV+LSEQQLVDC ++GC+GGLM
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYG-------NNGCSGGLM 217
Query: 221 NSAFEYTLKAGGLMREEDYPYTGTD--RGHACKFDKSKIAASVANF-SVVSLDEDQIAAN 277
NSAFEY G+ E YPY D + C F+ S I A V + ++ DE +
Sbjct: 218 NSAFEYVRDNEGIDSEISYPYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDA 277
Query: 278 LVKNGPLAVAINAVY--MQTYIGGVSCPYICS---RRLDHGVLLVGYGSAGYAPIRLKEK 332
+ GP++VAINA Y G+ C LDHGVL+VGYG +
Sbjct: 278 VATKGPVSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEE-------NGR 330
Query: 333 PYWIIKNSWGESWGENGYYKICRG-RNVCGVDSMVS 367
YW+IKNSWGE WGE GY KI +G N+CGV S S
Sbjct: 331 SYWLIKNSWGEEWGEKGYIKISKGSHNMCGVASAAS 366
>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
Length = 482
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 181/320 (56%), Gaps = 37/320 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDP-SATHGITQFSDLTPAEFRR 118
F F +N+ Y S++E R ++F N+ A R Q LD +A +G+T+FSDLT EFR
Sbjct: 185 FKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYGVTKFSDLTEEEFRT 244
Query: 119 TYLG--LR----RKLRLPKDA-DQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
YL LR +K+ L K D AP+ ++DWR+KGAV VK+QG CGSCW+F
Sbjct: 245 IYLNPLLREEPGKKMHLAKAVRDPAPL--------EWDWRKKGAVTEVKNQGMCGSCWAF 296
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S TG +EG FL G L+SLSEQ+L+DCD D C GG ++A+ G
Sbjct: 297 SVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGFPSNAYLAIKSLG 347
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV 291
GL E+DY Y G + AC F K + + +S +E ++AA L GP++VAINA
Sbjct: 348 GLETEDDYSYQGHMK--ACNFSAKKAKVYINDSVELSKNEQKLAAWLAVKGPISVAINAF 405
Query: 292 YMQTYIGGVSCPY--ICSRR-LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
MQ Y G++ P +CS +DH +L+VGYG+ P+W IKNSWG WGE
Sbjct: 406 GMQFYRHGIAHPLRPLCSPWFIDHAMLVVGYGNR-------SNVPFWAIKNSWGTDWGEE 458
Query: 349 GYYKICRGRNVCGVDSMVST 368
GYY + RG CGV+ M S+
Sbjct: 459 GYYYLHRGSGACGVNIMASS 478
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 166/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K+Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + E + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 165/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + E + + GP++VAI+A
Sbjct: 198 IDTEKSYPYKAVDG--ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 167/307 (54%), Gaps = 30/307 (9%)
Query: 69 KAYA-SQEEH-DHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLRRK 126
+ YA QE+H + RF +FK N+ R + I QF+DLT EFR +Y G +
Sbjct: 46 RVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADLTNEEFRASYNGFKGP 104
Query: 127 LRLPKDADQ-APILPTN---DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANF 182
+ L + P N LP DWR+KGAV PVK+QG CG CW+FS A+EG
Sbjct: 105 MVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQ 164
Query: 183 LATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYT 242
++TGKL+SLSEQ+LVDCD + D GC GGLM++AFE+ + GGL E +YPY
Sbjct: 165 ISTGKLISLSEQELVDCDTK-------GIDHGCEGGLMDTAFEFIINNGGLTTESNYPYK 217
Query: 243 GTDRGHACKFDKSK-IAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGG 299
G D C F+K+ IA S+ + V +++Q V + P++VAI A Q Y G
Sbjct: 218 GED--GTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSG 275
Query: 300 VSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR---- 355
V C LDH V VGYG + YWI+KNSWG WGE+GY ++ +
Sbjct: 276 VFTGE-CGTELDHAVTAVGYGESEDG------SKYWIVKNSWGTKWGESGYIEMQKDIKV 328
Query: 356 GRNVCGV 362
+ +CG+
Sbjct: 329 KQGLCGI 335
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/322 (38%), Positives = 177/322 (54%), Gaps = 33/322 (10%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEF 116
+ F + K +K Y ++E RF I+++N++ L +F+D+T +EF
Sbjct: 40 KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99
Query: 117 RRTYLGLR-RKLRLPKDADQAPIL-PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTT 174
+ +LGL LRL K Q P+ P ++P DWR +GAV P+++QG CG CW+FS
Sbjct: 100 KAHFLGLNTSSLRLHKK--QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
A+EG N + TG LVSLSEQQL+DCD G+ + GC+GGLM +AFE+ GGL
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKSNGGLT 210
Query: 235 REEDYPYTGTDRGHACKFDKSK-IAASVANFSVVSLDED--QIAANLVKNGPLAVAINA- 290
E DYPYTG + C +K+K ++ + V+ +E QIAA P++V I+A
Sbjct: 211 TETDYPYTGIE--GTCDQEKAKNKVVTIQGYQKVAQNEASLQIAA---AQQPVSVGIDAG 265
Query: 291 -VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y GV Y C L+HGV +VGYG G ++ YWI+KNSWG WGE G
Sbjct: 266 GFIFQLYSSGVFTSY-CGTNLNHGVTVVGYGVEG-------DQKYWIVKNSWGTGWGEEG 317
Query: 350 YYKICRG----RNVCGVDSMVS 367
Y ++ RG CG+ + S
Sbjct: 318 YIRMERGISEDTGKCGIAMLAS 339
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/329 (39%), Positives = 175/329 (53%), Gaps = 27/329 (8%)
Query: 49 TNNDLLGAEHHFSLFKK---KFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGI 105
T DL + LF+ K K Y S EE RF IFK NL K + G+
Sbjct: 19 TPEDLTSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVVNYWLGL 78
Query: 106 TQFSDLTPAEFRRTYLGLRRKLRLPKDADQA-PILPTNDLPADFDWREKGAVGPVKDQGS 164
+FSDL+ EF+ YLGL+ + ++ Q +P DWR+KGAV VK+QGS
Sbjct: 79 NEFSDLSHEEFKNKYLGLKVDMSERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGS 138
Query: 165 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAF 224
CGSCW+FST A+EG N + TG L SLSEQ+LVDCD + + GCNGGLM+ AF
Sbjct: 139 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDT--------TNNYGCNGGLMDYAF 190
Query: 225 EYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPL 284
Y + GGL +E DYPY + + ++S++ +++ + V + ++ + N PL
Sbjct: 191 SYIISNGGLHKEVDYPYIMEEGTCEMRKEESEV-VTISGYHDVPQNSEESLLKALANQPL 249
Query: 285 AVAINAVY--MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWG 342
+VAI A Q Y GGV + C +LDHGV VGYGS Y I+KNSWG
Sbjct: 250 SVAIEASGRDFQFYSGGVFDGH-CGTQLDHGVAAVGYGSTNGL-------DYIIVKNSWG 301
Query: 343 ESWGENGYYKICRGR----NVCGVDSMVS 367
WGE GY ++ R +CG++ M S
Sbjct: 302 SKWGEKGYIRMKRNTGKPAGLCGINKMAS 330
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 165/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + E + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 117/316 (37%), Positives = 165/316 (52%), Gaps = 22/316 (6%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANL-RRAARHQKLDPSATHGITQFSDLTPAEF 116
F+ F +++ K+YAS EE + RF IF NL AA + K + GIT+F+D++ EF
Sbjct: 32 EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEF 91
Query: 117 RRTYLGLRRKLRLPKDADQAPILPTNDLPADFDWREK-GAVGPVKDQGSCGSCWSFSTTG 175
+ L + + P P+ FDWR K G V PV DQG CGSCW+FS T
Sbjct: 92 QSRVLMSNPPPPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFSATE 151
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
+E LA KL LS QQ+VDC D GC GG + A++Y + A GL
Sbjct: 152 NIESQWALAGHKLTGLSMQQIVDCSW---------WDDGCGGGFPSYAYDYVIDAPGLDA 202
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLD--EDQIAANLVKNGPLAVAINAVYM 293
+YPYT G +C F +S++ A +++++ + D E Q+A L ++GP++V ++A
Sbjct: 203 LANYPYTAV--GGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESW 260
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
+Y GGV C +DH VL VGY PYWII+NSWG SWG GY +
Sbjct: 261 PSYTGGVYRASACGTSIDHCVLAVGYN-------LTANPPYWIIRNSWGTSWGLEGYMHL 313
Query: 354 CRGRNVCGVDSMVSTV 369
G + C V M ++
Sbjct: 314 EFGTDACAVAEMTTSA 329
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 175/318 (55%), Gaps = 26/318 (8%)
Query: 58 HHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFR 117
H F + K +K Y S +E HRF IF NL+ K + G+ +F+DLT EF+
Sbjct: 47 HLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFK 106
Query: 118 RTYLGLRRKLRLPKDADQAPILPTN--DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTG 175
+LG + +L KD + DLP DWR+KGAV PVK+QG CGSCW+FST
Sbjct: 107 HKFLGFKGELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVA 166
Query: 176 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 235
A+EG N + TG L LSEQ+L+DCD + ++GCNGGLM+ AF Y +++ GL +
Sbjct: 167 AVEGINQIVTGNLTMLSEQELIDCD--------TTFNNGCNGGLMDYAFAYVMRS-GLHK 217
Query: 236 EEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAVY--M 293
EE+YPY ++ K D S+ +++ + V +++ + N P++VAI A
Sbjct: 218 EEEYPYIMSEGTCDEKKDVSE-KVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDF 276
Query: 294 QTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Q Y GGV + C LDHGV VGYG+ K Y I++NSWG WGE GY ++
Sbjct: 277 QFYSGGVFDGH-CGTELDHGVAAVGYGTT-------KGLDYVIVRNSWGPKWGEKGYIRM 328
Query: 354 CRG----RNVCGVDSMVS 367
RG +CG+ M S
Sbjct: 329 KRGSGKPHGMCGLYMMAS 346
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 165/320 (51%), Gaps = 28/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKAN----LRRAARHQKLDPSATHGITQFSDLTPA 114
+ FK K Y S E RF IF N + A++ K S G+ QF DL
Sbjct: 26 QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTND--LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
EF R + G R + P ND LP DWR+KGAV PVKDQG CGSCW+FS
Sbjct: 86 EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG +FL G+LVSLSEQ LVDC ++GC GGLM AF+Y G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAINAV 291
+ E+ YPY D C+F K + A+ + + + E + + GP++VAI+A
Sbjct: 198 IDTEKSYPYEAVDG--ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 292 Y--MQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ Q Y GV P S LDHGVL+VGYG G K YW++KNSW ESWG+
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKG-------GKKYWLVKNSWAESWGDQ 308
Query: 349 GYYKICR-GRNVCGVDSMVS 367
GY + R N CG+ S S
Sbjct: 309 GYILMSRDNNNQCGIASQAS 328
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 166/304 (54%), Gaps = 32/304 (10%)
Query: 76 EHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR---RKLRLPKD 132
E R+ IFK NLR + + G+ F+DLT EFR G R + R +
Sbjct: 81 EKATRYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHE 140
Query: 133 ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGKLVSLS 192
+ + DLP DWREKGAV VKDQGSCGSCW+FS A+EG N LATG+LVSLS
Sbjct: 141 EFRYGSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLS 200
Query: 193 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGHACKF 252
EQ+LVDCD D GCNGGLM+ AF + +K GGL E DYPY +G+ +
Sbjct: 201 EQELVDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPY----KGYGTRC 248
Query: 253 DKSKIAASVAN---FSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIGGVSCPYICS 307
D+SK+ A V + V ++++ V + P++VAI+A MQ Y G+ C
Sbjct: 249 DRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGR-CG 307
Query: 308 RRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRGR----NVCGVD 363
LDHGV VGYG + K YWIIKNSWG +WGE GY K+ R +CG++
Sbjct: 308 TDLDHGVTNVGYG-------KEDGKAYWIIKNSWGSNWGEKGYVKMARNTGLAAGLCGIN 360
Query: 364 SMVS 367
S
Sbjct: 361 MEAS 364
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/306 (39%), Positives = 171/306 (55%), Gaps = 20/306 (6%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
+ Y E + RF IF+ N H ++++ + G+ F+D+T EF+ Y G + L
Sbjct: 43 RVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPL 102
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
+ TN LP D DWR KGAV VK+QG+CGSCW+FST A+EG N + TG+
Sbjct: 103 SNTIKSGFRYEDATN-LPLDTDWRSKGAVATVKNQGACGSCWAFSTVAAVEGVNQIVTGE 161
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQ+LVDCD + + GCNGGLM+SAFE+ ++ GGL E DYPY G
Sbjct: 162 LVSLSEQELVDCDKQ--------KNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKAVS-G 212
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGVSCPYI 305
+ ++ ++ F V + + V N P++VAI A Q Y GGV +
Sbjct: 213 SCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYTGH- 271
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCG 361
C LDHGV+ VGYG++ P + YWI++NSWG++WGE+GY ++ R R CG
Sbjct: 272 CGYELDHGVVAVGYGTSK-TPDGVA-TDYWIVRNSWGDAWGESGYIRLQRNVASSRGKCG 329
Query: 362 VDSMVS 367
+ M S
Sbjct: 330 IAMMAS 335
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/320 (38%), Positives = 170/320 (53%), Gaps = 26/320 (8%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLTPA 114
H++ +K + K+Y E R I++ NLR+ +H H G+ QF D+T
Sbjct: 27 HWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNE 85
Query: 115 EFRRTYLGLRRKLRLPKDADQAPILPTNDL---PADFDWREKGAVGPVKDQGSCGSCWSF 171
EFR+ G + P Q P+ P DWR++G V PVKDQ CGSCWSF
Sbjct: 86 EFRQAMNGYKHD---PNRTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSF 142
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
S+TGALEG F TGKL+S+SEQ LVDC P+ + GCNGG+M+ AF+Y +
Sbjct: 143 SSTGALEGQLFRKTGKLISMSEQNLVDCSR---PQG----NQGCNGGIMDQAFQYVKENK 195
Query: 232 GLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN-GPLAVAINA 290
GL E+ YPY D C++D A + F + + N V GP++VAI+A
Sbjct: 196 GLDSEQSYPYLARD-DLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDA 254
Query: 291 VY--MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ +Q Y G+ C+ RLDH VL+VGY GY + YWI+KNSW + WG+
Sbjct: 255 SHQSLQFYQSGIYYERACTSRLDHAVLVVGY---GYQGADVAGNRYWIVKNSWSDKWGDK 311
Query: 349 GYYKICRGRNV-CGVDSMVS 367
GY + + +N CG+ +M S
Sbjct: 312 GYIYMAKDKNNHCGIATMAS 331
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 176/325 (54%), Gaps = 36/325 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDPSATHGITQFSDLTPAEFRR 118
+ L+ + K Y + E + RF IF NL+ H + S G+ QF+DLT E+R
Sbjct: 36 YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTNEEYRS 95
Query: 119 TYLG-----LRR--KLRLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSF 171
YLG RR K++ + + + + PA DWRE+GAV PVK+QG CGSCW+F
Sbjct: 96 MYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSPVKNQGGCGSCWAF 155
Query: 172 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAG 231
ST ++EG N + TG L+SLSEQ+LVDCD++ +SGCNGG M+ AF++ + G
Sbjct: 156 STVASVEGINKIVTGDLISLSEQELVDCDNK--------YNSGCNGGSMDYAFQFIVSNG 207
Query: 232 GLMREEDYPYTGTDRGHACK--FDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN 289
G+ E DYPY G G C +K+KI S+ + V ++ V + P++V I
Sbjct: 208 GIDSESDYPYKGV--GAVCDPVRNKAKI-VSIDGYEDVPPMNEKALMKAVAHQPVSVGIE 264
Query: 290 AV--YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGE 347
A Q Y GV C LDHGV++VGYGS K YWI++NSWG WGE
Sbjct: 265 ASGRAFQLYTSGVLTGS-CGTNLDHGVVVVGYGSE-------NGKDYWIVRNSWGPEWGE 316
Query: 348 NGYYKICRGR-----NVCGVDSMVS 367
+GY ++ R +CG+ M S
Sbjct: 317 DGYIRMERNMVDTPVGMCGITLMAS 341
>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
Length = 335
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 171/318 (53%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+++E H HR F +N R+ H + + + QFSD++ AE +
Sbjct: 34 HFKSWMSKHHKTYSTEEYH-HRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKH 92
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 93 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTT 147
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 148 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNKGIM 200
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D CKF K V + + +++ DE+ + + P++ A
Sbjct: 201 GEDTYPYQGKDGD--CKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD 258
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 259 FMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEE-------NGIPYWIVKNSWGPQWGMNG 311
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 312 YFLIERGKNMCGLAACAS 329
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/326 (38%), Positives = 176/326 (53%), Gaps = 40/326 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHG----ITQFSDLTPAE 115
+ L+ + +AY + E D RF +F NLR H + +A HG + QF+DLT E
Sbjct: 52 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHN--ERAAEHGFRLGMNQFADLTNDE 109
Query: 116 FRRTYLGLRRKLRLPKDADQAPILP--------TNDLPADFDWREKGAVGPVKDQGSCGS 167
FR YLG R +P + + +LP DWREKGAV PVK+QG CGS
Sbjct: 110 FRAAYLGAR----IPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGS 165
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FS ++E N + TG++V+LSEQ+LV+C + +SGCNGGLM++AF++
Sbjct: 166 CWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFI 218
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
+K GG+ E DYPY D + +K+ S+ F V ++++ V + P++VA
Sbjct: 219 IKNGGIDTEGDYPYKAVDGKCDINRENAKV-VSIDGFEDVPENDEKSLQKAVAHQPVSVA 277
Query: 288 INA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
I A Q Y GV C+ LDHGV+ VGYG+ K YWI++NSWG W
Sbjct: 278 IEAGGREFQLYKAGVF-TGTCTTNLDHGVVAVGYGTE-------NGKDYWIVRNSWGAKW 329
Query: 346 GENGYYKICRGRNV----CGVDSMVS 367
GE+GY ++ R N CG+ M S
Sbjct: 330 GEDGYIRMERNVNATTGKCGIAMMAS 355
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 178/322 (55%), Gaps = 38/322 (11%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA-THGITQFSDLTPAEFRR 118
F ++ + K+Y+S EE +R +F N H LD S+ T + ++DLT EF+
Sbjct: 29 FEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFKV 88
Query: 119 TYLGLRRKLR-----LPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFST 173
+ LG LR LP Q P LP D+P DWR+KGAV VKDQGSCG+CWSFS
Sbjct: 89 SRLGFSPALRNFRPVLP----QEPSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACWSFSA 143
Query: 174 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 233
TGA+EG N + TG L+SLSEQ+L+DCD S +SGC GGLM+ A+++ + G+
Sbjct: 144 TGAMEGINQIMTGSLISLSEQELIDCDR--------SYNSGCGGGLMDYAYQFVISNHGI 195
Query: 234 MREEDYPYTGTDRGHACKFDK-SKIAASVANFSVVSLDEDQIAANLVKNGPLAVAI--NA 290
E DYPY D +C+ DK + ++ ++ + +++ V P++V I +
Sbjct: 196 DTENDYPYQARD--GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSE 253
Query: 291 VYMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Q Y G+ S P CS LDH VL+VGYGS YWI+KNSWG+SWG +G
Sbjct: 254 RAFQLYSKGIFSGP--CSTSLDHAVLIVGYGSENGV-------DYWIVKNSWGKSWGMDG 304
Query: 350 YYKICRG----RNVCGVDSMVS 367
Y + R VCG++ + S
Sbjct: 305 YMHMQRNSGNSEGVCGINKLAS 326
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/306 (39%), Positives = 171/306 (55%), Gaps = 20/306 (6%)
Query: 69 KAYASQEEHDHRFTIFKANLRRAARH-QKLDPSATHGITQFSDLTPAEFRRTYLGLRRKL 127
+ Y E + RF IF+ N H ++++ + G+ F+D+T EF+ Y G + L
Sbjct: 43 RVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPL 102
Query: 128 RLPKDADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGANFLATGK 187
+ TN LP D DWR KGAV VK+QG+CGSCW+FST A+EG N + TG+
Sbjct: 103 SNTIKSGFRYKDATN-LPLDTDWRSKGAVATVKNQGACGSCWAFSTVAAVEGVNQIVTGE 161
Query: 188 LVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRG 247
LVSLSEQ+LVDCD + + GCNGGLM+SAFE+ ++ GGL E DYPY G
Sbjct: 162 LVSLSEQELVDCDKQ--------KNQGCNGGLMDSAFEFIIQNGGLDSEADYPYKAVS-G 212
Query: 248 HACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINAV--YMQTYIGGVSCPYI 305
+ ++ ++ F V + + V N P++VAI A Q Y GGV +
Sbjct: 213 SCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVYTGH- 271
Query: 306 CSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICRG----RNVCG 361
C LDHGV+ VGYG++ P + YWI++NSWG++WGE+GY ++ R R CG
Sbjct: 272 CGYELDHGVVAVGYGTSK-TPDGVA-TDYWIVRNSWGDAWGESGYIRLQRNVASPRGKCG 329
Query: 362 VDSMVS 367
+ M S
Sbjct: 330 IAMMAS 335
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 178/322 (55%), Gaps = 24/322 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAE 115
A +F F ++NK Y S++E +R+ IF+ N+ + + SA + I +F+D+T E
Sbjct: 39 APLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNE 98
Query: 116 FRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+ GL L + + ++ PA+FDWR V VKDQG CG+CW+F+
Sbjct: 99 IVIRHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFA 157
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
GALE + +L+ L+EQQLVDCD D GC+GGL+++A+E ++ GG
Sbjct: 158 GLGALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGG 208
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ +E DYPY + C K AA V N + V ++E+++ L GP+A+A++AV
Sbjct: 209 VEQEFDYPYKAERQ--PCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAV 266
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+ Y GG+ + + L+H VLLVGYG PYWIIKNSWG +GE+GY
Sbjct: 267 DLTDYYGGI-VSFCKNNGLNHAVLLVGYGVE-------NNVPYWIIKNSWGSDYGEDGYV 318
Query: 352 KICRGRNVCGVDSMVSTVAAAV 373
++ RG N CG+ + +++ A +
Sbjct: 319 RVRRGVNSCGMINELASSAQVI 340
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 178/322 (55%), Gaps = 24/322 (7%)
Query: 56 AEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAE 115
A +F F ++NK Y S++E +R+ IF+ N+ + + SA + I +F+D+T E
Sbjct: 38 APLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNE 97
Query: 116 FRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAVGPVKDQGSCGSCWSFS 172
+ GL L + + ++ PA+FDWR V VKDQG CG+CW+F+
Sbjct: 98 IVIRHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFA 156
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
GALE + +L+ L+EQQLVDCD D GC+GGL+++A+E ++ GG
Sbjct: 157 GLGALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGG 207
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVAN-FSVVSLDEDQIAANLVKNGPLAVAINAV 291
+ +E DYPY + C K AA V N + V ++E+++ L GP+A+A++AV
Sbjct: 208 VEQEFDYPYKAERQ--PCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAV 265
Query: 292 YMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYY 351
+ Y GG+ + + L+H VLLVGYG PYWIIKNSWG +GE+GY
Sbjct: 266 DLTDYYGGI-VSFCKNNGLNHAVLLVGYGVE-------NNVPYWIIKNSWGSDYGEDGYV 317
Query: 352 KICRGRNVCGVDSMVSTVAAAV 373
++ RG N CG+ + +++ A +
Sbjct: 318 RVRRGVNSCGMINELASSAQVI 339
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 179/324 (55%), Gaps = 30/324 (9%)
Query: 52 DLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDL 111
DLL A ++F F KFNK Y+S+ E HRF IF+ NL + D +A + I +FSDL
Sbjct: 20 DLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDL 79
Query: 112 TPAEFRRTYLGLRRKLRLPKDAD---QAPIL--PTNDLPADFDWREKGAVGPVKDQGSCG 166
+ E Y GL LP + IL P + P +FDWR+ V VK+QG CG
Sbjct: 80 SKEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCG 135
Query: 167 SCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 226
+CW+F+T G+LE + +L++LSEQQ +DCD ++GC+GGL+++AFE
Sbjct: 136 ACWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFES 186
Query: 227 TLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLA 285
++ GG+ E DYPY T G C+ + ++ V + + + E+++ L GP+
Sbjct: 187 AMEMGGVQMESDYPYE-TANGQ-CRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIP 244
Query: 286 VAINAVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
VAI+A + Y G+ + L+H VLLVGY PYWI+KN+WG W
Sbjct: 245 VAIDASDIVNYRRGIM-RQCANHGLNHAVLLVGYAVEN-------NIPYWILKNTWGTDW 296
Query: 346 GENGYYKICRGRNVCGV-DSMVST 368
GE+GY+++ + N CG+ + +VS+
Sbjct: 297 GEDGYFRVQQNINACGIRNELVSS 320
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/326 (38%), Positives = 176/326 (53%), Gaps = 40/326 (12%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHG----ITQFSDLTPAE 115
+ L+ + +AY + E D RF +F NLR H + +A HG + QF+DLT E
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHN--ERAAEHGFRLGMNQFADLTNDE 166
Query: 116 FRRTYLGLRRKLRLPKDADQAPILP--------TNDLPADFDWREKGAVGPVKDQGSCGS 167
FR YLG R +P + + +LP DWREKGAV PVK+QG CGS
Sbjct: 167 FRAAYLGAR----IPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGS 222
Query: 168 CWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYT 227
CW+FS ++E N + TG++V+LSEQ+LV+C + +SGCNGGLM++AF++
Sbjct: 223 CWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFI 275
Query: 228 LKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVA 287
+K GG+ E DYPY D + +K+ S+ F V ++++ V + P++VA
Sbjct: 276 IKNGGIDTEGDYPYKAVDGKCDINRENAKV-VSIDGFEDVPENDEKSLQKAVAHQPVSVA 334
Query: 288 INA--VYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESW 345
I A Q Y GV C+ LDHGV+ VGYG+ K YWI++NSWG W
Sbjct: 335 IEAGGREFQLYKAGVF-TGTCTTNLDHGVVAVGYGTE-------NGKDYWIVRNSWGAKW 386
Query: 346 GENGYYKICRGRNV----CGVDSMVS 367
GE+GY ++ R N CG+ M S
Sbjct: 387 GEDGYIRMERNVNATTGKCGIAMMAS 412
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 172/320 (53%), Gaps = 30/320 (9%)
Query: 60 FSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQ-KLDP---SATHGITQFSDLTPAE 115
++ +K + K Y S EE R I++ NL +H K D + G+ QF+DL E
Sbjct: 28 WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87
Query: 116 FRRTYLGLRRKLRLPKDADQAPILPTND---LPADFDWREKGAVGPVKDQGSCGSCWSFS 172
F G R K A + LP+N+ LP DWR KG V PVKDQG CGSCW+FS
Sbjct: 88 FVAMMTGFRVN-GTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFS 146
Query: 173 TTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGG 232
TG+LEG F TGKLVSLSEQ LVDC + + GC+GG M+ AF+Y + AGG
Sbjct: 147 ATGSLEGQQFKKTGKLVSLSEQNLVDCSYR---------NYGCHGGFMDRAFQYIIDAGG 197
Query: 233 LMREEDYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAV 291
+ E Y Y D C F K+ + A+V ++ V S E + + GP++VAI+A
Sbjct: 198 IDTEATYSYRAVDGN--CHFKKANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDAS 255
Query: 292 --YMQTYIGGV-SCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGEN 348
+ + Y GV + P + RL H VL+VGYG+ YWI+KNSW ++WG N
Sbjct: 256 HKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTS------DGTDYWIVKNSWAKTWGMN 309
Query: 349 GYYKICRGR-NVCGVDSMVS 367
GY + R + N CG+ S S
Sbjct: 310 GYLWMSRNKDNQCGIASEAS 329
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/297 (41%), Positives = 161/297 (54%), Gaps = 27/297 (9%)
Query: 66 KFNKAYASQEEHDHRFTIFKANLRR-AARHQKLDPSATHGITQFSDLTPAEFRRTYLGLR 124
++ K Y E + R IFK N++R A + + S GI QF+DLT EF+ R
Sbjct: 45 QYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKARN---R 101
Query: 125 RKLRLPKDADQAPILP---TNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGALEGAN 181
K + ++ + P +PA DWR+KGAV P+KDQG CG CW+FS A EG
Sbjct: 102 FKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIT 161
Query: 182 FLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPY 241
L+TGKL+SLSEQ+LVDCD + D GC GGLM+ AF++ ++ GL E YPY
Sbjct: 162 KLSTGKLISLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPY 214
Query: 242 TGTDRGHACKFD-KSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAINA--VYMQTYIG 298
G D C + ++K AAS+ F V + + V N P++VAI+A Q Y
Sbjct: 215 QGVDA--TCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSS 272
Query: 299 GVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
GV C LDHGV VGYGS G YW++KNSWGE WGE GY ++ R
Sbjct: 273 GVFTGS-CGTELDHGVTAVGYGSDG-------GTKYWLVKNSWGEQWGEQGYIRMQR 321
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 177/331 (53%), Gaps = 34/331 (10%)
Query: 47 ESTNNDLLGAEHHFSLFKKKFNKAYASQE--EHDHRFTIFKANLRRAARHQKLDPSATHG 104
E T DL E + L+++ + S++ E RF +FKAN+ + + D
Sbjct: 24 EITERDLASEESLWDLYERWRSHHTVSRDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLK 83
Query: 105 ITQFSDLTPAEFRRTYLGLRRKLRLPKDADQAPIL---PTNDLPADFDWREKGAVGPVKD 161
+ F+D+T EFR Y + R+ + T LPA DWR++GAV VK+
Sbjct: 84 LNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESLPASVDWRKQGAVTGVKN 143
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QG CGSCW+FST +EG N + TG+LVSLSEQ+LVDC E D E GCNGGLM
Sbjct: 144 QGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDC--ETDNE-------GCNGGLME 194
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKI---AASVANFSVVSLDEDQIAANL 278
+A+E+ K+GG+ E YPY D +C D SK+ A ++ +V +++
Sbjct: 195 NAYEFIKKSGGITTERLYPYKARDG--SC--DSSKMNAPAVTIDGHEMVPANDENALMKA 250
Query: 279 VKNGPLAVAINAVY--MQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWI 336
V N P++VAI+A MQ Y GV C LDHGV +VGYG+A L YWI
Sbjct: 251 VANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTA------LDGTKYWI 304
Query: 337 IKNSWGESWGENGYYKICRGRN-----VCGV 362
+KNSWG WGE GY ++ RG + VCG+
Sbjct: 305 VKNSWGTGWGEQGYIRMQRGVDAAEGGVCGI 335
>gi|297297049|ref|XP_002804951.1| PREDICTED: cathepsin H [Macaca mulatta]
Length = 323
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 171/318 (53%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+++E H HR F +N R+ H + + + QFSD++ AE +
Sbjct: 22 HFKSWMSKHHKTYSTEEYH-HRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKH 80
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 81 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTT 135
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 136 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNKGIM 188
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D CKF K V + + +++ DE+ + + P++ A
Sbjct: 189 GEDTYPYQGKDGD--CKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD 246
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 247 FMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEE-------NGIPYWIVKNSWGPQWGMNG 299
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 300 YFLIERGKNMCGLAACAS 317
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 191/371 (51%), Gaps = 46/371 (12%)
Query: 3 SKTVVLFLVSLVVFSAVSSGTLIDDVDQLIRQVTDGGDEILSHHESTNNDLLGAEHHFSL 62
SK+ + L S++ VSS L D+ + R DEI S +E+
Sbjct: 4 SKSTIFLLFSIIFI--VSSSAL--DLSIIDRAFNRPDDEIASLYET-------------- 45
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRRTYLG 122
+ K K Y E RF IFK NLR + S G+ +F+DLT E+R YLG
Sbjct: 46 WLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSVYLG 105
Query: 123 LR-RKLRLPKD----ADQAPILPTNDLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
R R + + + +D+ + LP DWR+KGAV +KDQGSCGSCW+FS A+
Sbjct: 106 TRPRSVAVARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAV 165
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG N + TG L+SLSEQ+LV+CD S + GC+GGLM+ AFE+ +K G+ +E
Sbjct: 166 EGVNQIVTGDLISLSEQELVECDT--------SYNDGCDGGLMDYAFEFIIKNEGIDSDE 217
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAVAIN--AVYMQT 295
DYPYTG D G K+ ++ ++ + +++ V N P++VAI Q
Sbjct: 218 DYPYTGRD-GRCDTNRKNAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQL 276
Query: 296 YIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKICR 355
Y GV C LDHGV +VGYG+ YWI++NSWG++WGE GY ++ R
Sbjct: 277 YDSGVFTGK-CGTALDHGVAVVGYGTE-------DGLDYWIVRNSWGDTWGEGGYIRMQR 328
Query: 356 GRN----VCGV 362
+CG+
Sbjct: 329 NTKLPSGICGI 339
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 175/322 (54%), Gaps = 25/322 (7%)
Query: 57 EHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH----GITQFSDLT 112
+ H++L+K +K Y +EE R +++ NL++ H H G+ F D+T
Sbjct: 26 DQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKKIELHNLEHSMGKHSYRLGMNHFGDMT 84
Query: 113 PAEFRRTYLGLRRKLRLPKDADQAPILPTNDL--PADFDWREKGAVGPVKDQGSCGSCWS 170
EF++ G + K + + L N L P DWREKG V PVKDQG CGSCW+
Sbjct: 85 HEEFKQIMNGYKHKAE--RKFKGSLFLEPNFLEAPRSVDWREKGYVTPVKDQGECGSCWA 142
Query: 171 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKA 230
FSTTGALEG F TGKLVSLS Q LV+C PE + GCNGGLM+ AF+Y
Sbjct: 143 FSTTGALEGQEFTRTGKLVSLSGQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDN 195
Query: 231 GGLMREEDYPYTGTDRGHACKFDKSKIAASVANF-SVVSLDEDQIAANLVKNGPLAVAIN 289
GL E+ YPY GTD C +D AA+ F + S +E + + GP++VAI+
Sbjct: 196 QGLDSEDSYPYLGTDD-QPCHYDPKFSAANDTGFVDIPSGNERALMKAVASVGPVSVAID 254
Query: 290 AVY--MQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWG 346
A + Q Y G+ C S LDHGVL VGYG G + K +WI+KNSW E+WG
Sbjct: 255 AGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGE---DVDGKKFWIVKNSWSENWG 311
Query: 347 ENGYYKICRGR-NVCGVDSMVS 367
+ GY + + R N CG+ + S
Sbjct: 312 DKGYIYMAKDRKNHCGIATAAS 333
>gi|355778231|gb|EHH63267.1| Cathepsin H, partial [Macaca fascicularis]
Length = 305
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 171/318 (53%), Gaps = 31/318 (9%)
Query: 59 HFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATHGITQFSDLTPAEFRR 118
HF + K +K Y+++E H HR F +N R+ H + + + QFSD++ AE +
Sbjct: 4 HFKSWMSKHHKTYSTEEYH-HRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKH 62
Query: 119 TYLGLRRKLRLPKDADQAP---ILPTNDLPADFDWREKGA-VGPVKDQGSCGSCWSFSTT 174
YL P++ + T P DWR+KG V PVK+QG+CGSCW+FSTT
Sbjct: 63 KYL-----WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTT 117
Query: 175 GALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLM 234
GALE A +ATGK++SL+EQQLVDC + + + GC GGL + AFEY L G+M
Sbjct: 118 GALESAIAIATGKMLSLAEQQLVDCAQDFN-------NHGCQGGLPSQAFEYILYNKGIM 170
Query: 235 REEDYPYTGTDRGHACKFDKSKIAASVANFSVVSL-DEDQIAANLVKNGPLAVAINAVY- 292
E+ YPY G D CKF K V + + +++ DE+ + + P++ A
Sbjct: 171 GEDTYPYQGKDGD--CKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD 228
Query: 293 MQTYIGGVSCPYICSR---RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENG 349
Y G+ C + +++H VL VGYG PYWI+KNSWG WG NG
Sbjct: 229 FMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEE-------NGIPYWIVKNSWGPQWGMNG 281
Query: 350 YYKICRGRNVCGVDSMVS 367
Y+ I RG+N+CG+ + S
Sbjct: 282 YFLIERGKNMCGLAACAS 299
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/315 (39%), Positives = 173/315 (54%), Gaps = 27/315 (8%)
Query: 63 FKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSA----THGITQFSDLTPAEFRR 118
+K+ + K Y +Q+E R I+ NL+ H + S T + QF DLT E+R
Sbjct: 25 WKRTYGKEY-TQKEEALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRE 83
Query: 119 TYLGLRRKLRLPKDADQAPILPTN-DLPADFDWREKGAVGPVKDQGSCGSCWSFSTTGAL 177
G ++ + +LP+N PA DWR +G V VKDQG+CGSCW+FS+TG+L
Sbjct: 84 LMCGYKKSNKTVISKPSTFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSL 143
Query: 178 EGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 237
EG F TGKLV LSEQQLVDC + + GC GG M+ AF Y +K G E+
Sbjct: 144 EGQTFKKTGKLVPLSEQQLVDCSGDYG-------NMGCGGGWMDQAFSY-IKDKGEESED 195
Query: 238 DYPYTGTDRGHACKFDKSKIAASVANFS-VVSLDEDQIAANLVKNGPLAVAINAVY--MQ 294
YPYTGTD C +D SK+ A+ ++ + +DE+ + + GP++VAI+A + Q
Sbjct: 196 GYPYTGTD--DTCVYDASKVVATDTGYTDIPEMDENALQQAVATVGPISVAIDATHSSFQ 253
Query: 295 TYIGGVSCPYICSR-RLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGESWGENGYYKI 353
Y GV CS+ LDH VL VGYG++ + YWI+KNSW WG GY ++
Sbjct: 254 FYESGVYDEPECSQTNLDHAVLAVGYGTSE------EGLDYWIVKNSWSTGWGMQGYIEM 307
Query: 354 CRGR-NVCGVDSMVS 367
R + N CG+ S S
Sbjct: 308 SRNKDNQCGIASKAS 322
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/327 (36%), Positives = 169/327 (51%), Gaps = 32/327 (9%)
Query: 50 NNDLLGAEHHFSLFKKKFNKAYASQEEHDHRFTIFKANLRRAARHQKLDPSATH--GITQ 107
+N+L+ + H + K + YA +E +R+ +FK+N+ R + T + Q
Sbjct: 29 DNELIMQKRHIE-WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQ 87
Query: 108 FSDLTPAEFRRTYLGLRRKLRLPKDADQAPI------LPTNDLPADFDWREKGAVGPVKD 161
F+DLT EFR Y G + L + + + LP DWR KGAV P+K+
Sbjct: 88 FADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKN 147
Query: 162 QGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMN 221
QGSCG CW+FS A+EGA + GKL+SLSEQQLVDCD + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198
Query: 222 SAFEYTLKAGGLMREEDYPYTGTDRGHACKFDKSKIAASVANFSVVSLDEDQIAANLVKN 281
+AFE+ + GGL E +YPY G D K K A S+ + V ++++Q V +
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPK-ATSITGYEDVPVNDEQALMKAVAH 257
Query: 282 GPLAVAIN--AVYMQTYIGGVSCPYICSRRLDHGVLLVGYGSAGYAPIRLKEKPYWIIKN 339
P++V I Q Y GV C+ LDH V +GYG + YWIIKN
Sbjct: 258 QPVSVGIEGGGFDFQFYSSGVFTGE-CTTYLDHAVTAIGYGQS------TNGSKYWIIKN 310
Query: 340 SWGESWGENGYYKICR----GRNVCGV 362
SWG WGE+GY +I + + +CG+
Sbjct: 311 SWGTKWGESGYMRIQKDIKDKQGLCGL 337
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,184,219,943
Number of Sequences: 23463169
Number of extensions: 271272428
Number of successful extensions: 569318
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6703
Number of HSP's successfully gapped in prelim test: 742
Number of HSP's that attempted gapping in prelim test: 540087
Number of HSP's gapped (non-prelim): 9053
length of query: 373
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 229
effective length of database: 8,980,499,031
effective search space: 2056534278099
effective search space used: 2056534278099
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)