BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3960
(351 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|350425511|ref|XP_003494144.1| PREDICTED: counting factor associated protein D-like [Bombus
impatiens]
Length = 549
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 159/202 (78%), Positives = 175/202 (86%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTGAVEGAYYMK+ KL LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 407
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LPT+DDYG YLGQD YCHI N T TA +TG+VNVT +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDDYGGYLGQDGYCHINNATVTAKITGYVNVTSGDANALKVAIAKHGPISVA 467
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +GVYYDE C N+ + LDHAVLAVGYG L+GK YW VKNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDG 527
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QLS G+ G S+K+AP+TTE+ LN+ +C G + ++PQ+++PD ++ +
Sbjct: 65 GMVKTFQLSHKGSYGKSIKIAPITTESVLNEETCLEVNGTKEIKVQPQAIIPDTTEMEC- 123
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE++NG++ EKWRL K++KY+ W+RY K+
Sbjct: 124 -----------------------IGEEVINGLLCEKWRLEQEIDGKLNKYTFWIRYKKSP 160
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPGLKEPIPVRYEMKGFNSLL 182
>gi|340728972|ref|XP_003402785.1| PREDICTED: counting factor associated protein D-like [Bombus
terrestris]
Length = 549
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 157/202 (77%), Positives = 175/202 (86%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTGAVEGAYYMK+ KL LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 407
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LPT+D+YG YLGQD YCH+ N T TA +TG+VNVT +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDEYGGYLGQDGYCHVNNVTLTAKITGYVNVTSGDANALKVAIAKHGPISVA 467
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +GVYYDE C N+ + LDHAVLAVGYG L+GK YW VKNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDG 527
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QLS G+ G S+K+AP+TTE+ LN+ +C G + ++PQ+++PD ++ +
Sbjct: 65 GMVKTFQLSHKGSYGKSIKIAPITTESVLNEETCLEVNGTKEIKVQPQAIIPDTTEMEC- 123
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE++NG++ EKWRL K++KY+ W+RY K+
Sbjct: 124 -----------------------IGEEVINGLLCEKWRLEQEIDGKLNKYTFWIRYKKSP 160
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPGFKEPIPVRYEMKGFNSLL 182
>gi|322787487|gb|EFZ13575.1| hypothetical protein SINV_11743 [Solenopsis invicta]
Length = 552
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 156/202 (77%), Positives = 173/202 (85%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTGAVEGAYYMKH KL LSQQALIDCSWG+GNNGCDGGEDFRSYQW+
Sbjct: 351 DQSVCGSCWSFGTTGAVEGAYYMKHNKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWM 410
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LPT++DYG YLGQD YCHI N T TA +TGFVNV P S ALK+A+AKHGP+SVA
Sbjct: 411 MKHGGLPTEEDYGGYLGQDGYCHINNVTLTANITGFVNVPPRSVQALKIAIAKHGPISVA 470
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +GVYYD C N+ D LDHAVLAVGYG ++GK YW VKNSWS YWGN G
Sbjct: 471 IDASHKTFSFYSHGVYYDPACGNTEDKLDHAVLAVGYGTMNGKGYWLVKNSWSNYWGNDG 530
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LM+ KD+NCGV+ PTYVTM
Sbjct: 531 YILMAQKDDNCGVLLTPTYVTM 552
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 84/142 (59%), Gaps = 31/142 (21%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QL+ G G S+K+AP+TTET N+ +C G D I+PQS+LPD S+ +
Sbjct: 70 GMVKTFQLANKGLYGASIKLAPITTETVSNEETCLQVNGTDDVKIQPQSILPDASEMEA- 128
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
+E++NG++ EKWRLV GEK +KY+LW+RY K+
Sbjct: 129 -------------------------KEMINGLLCEKWRLVQEIGEKTNKYTLWIRYKKSP 163
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IP+RYEMKG+NSLL
Sbjct: 164 YLPKVKEPIPMRYEMKGFNSLL 185
>gi|383860620|ref|XP_003705787.1| PREDICTED: counting factor associated protein D-like [Megachile
rotundata]
Length = 549
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 155/202 (76%), Positives = 174/202 (86%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTGAVEGAYYMK+ KL LSQQALIDCSWG+GNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWI 407
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LP +D+YG YLGQD YCH N T A +TGFVNVTP +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPAEDEYGGYLGQDGYCHANNVTKVAKITGFVNVTPGDPNALKVAIAKHGPISVA 467
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ K+FSFY +GVYYDE C N+ + LDHAVLAVGYG+L+GK YW VKNSWS YWGN G
Sbjct: 468 IDAAHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGKLNGKDYWLVKNSWSNYWGNDG 527
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 85/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQLS G+ G S+K+AP+TTE LN +C G D IEPQ+++PD + +
Sbjct: 65 GMVKTYQLSHKGSYGKSIKIAPITTENILNAETCLEVNGTRDLKIEPQTIIPDTTGMEC- 123
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE+VNG++ EKWRLV + EK +KY+LW+RY K+
Sbjct: 124 -----------------------IGEEVVNGLLCEKWRLVETVNEKTNKYTLWIRYKKSP 160
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPQFKEPIPVRYEMKGFNSLL 182
>gi|307202924|gb|EFN82144.1| Digestive cysteine proteinase 1 [Harpegnathos saltator]
Length = 484
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 155/202 (76%), Positives = 174/202 (86%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTG +EGAYYMK+ KL LSQQALIDCSWGYGNNGCDGGEDFR+YQWI
Sbjct: 283 DQSVCGSCWSFGTTGTIEGAYYMKYNKLVRLSQQALIDCSWGYGNNGCDGGEDFRAYQWI 342
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKH GLPT++DYG YLGQD YCHI N T TA +TG+VNVTP S ALK+A+AKHGP+SVA
Sbjct: 343 MKHDGLPTEEDYGGYLGQDGYCHINNVTKTAKITGYVNVTPLSVAALKIAIAKHGPISVA 402
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +GVYYD C N+ D LDHAVLAVGYG ++GK YW VKNSWS YWGN G
Sbjct: 403 IDASHKTFSFYSHGVYYDPACGNTEDKLDHAVLAVGYGNMNGKDYWLVKNSWSNYWGNDG 462
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LM+ KD+NCGV+ APTYVTM
Sbjct: 463 YILMAQKDDNCGVLLAPTYVTM 484
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 85/141 (60%), Gaps = 29/141 (20%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
MVKTYQLS G G S+K+AP+TTET NK +C G D I+PQ++LPD S+ +
Sbjct: 1 MVKTYQLSHKGQYGASIKIAPVTTETASNKDTCLQVNGTSDLKIQPQTILPDTSEMEC-- 58
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS- 133
IGEE++NG++ EKWRLV EK +KY+LW+RY K+
Sbjct: 59 ----------------------IGEEMINGLMCEKWRLVQEIEEKTNKYTLWIRYKKSPY 96
Query: 134 ----KDAIPVRYEMKGYNSLL 150
K+AIPVRYEM+G+NSLL
Sbjct: 97 VPQLKEAIPVRYEMRGFNSLL 117
>gi|307180251|gb|EFN68284.1| Cathepsin L [Camponotus floridanus]
Length = 555
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 152/202 (75%), Positives = 174/202 (86%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTG +EGAYY+K+ KL LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 354 DQSVCGSCWSFGTTGTIEGAYYLKYNKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 413
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LPT+DDYG YLGQD +CH+ N T TA + GFVNVTP + DALK+A+AKHGP+SVA
Sbjct: 414 MKHGGLPTEDDYGSYLGQDGFCHVNNVTLTAKIKGFVNVTPRNVDALKIAIAKHGPISVA 473
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +GVYYD C N+ D LDHAVLAVGYG ++GK YW +KNSWS YWGN G
Sbjct: 474 IDASHKTFSFYSHGVYYDPTCGNTEDTLDHAVLAVGYGTMNGKGYWLIKNSWSNYWGNDG 533
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LM+ +D+NCGV+ APTYVTM
Sbjct: 534 YILMAQQDDNCGVLLAPTYVTM 555
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 84/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QLS G G S+K+AP+TTET NK +C G + I+PQS++PD S+ +
Sbjct: 71 GMVKTFQLSHKGEFGASIKIAPVTTETVNNKETCLQVNGTINLKIQPQSIVPDASEMEC- 129
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE++NG+ EKWRLV G K +KY+LW+RY K+
Sbjct: 130 -----------------------IGEEMINGLSCEKWRLVQEIGGKTNKYTLWIRYKKSP 166
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IPVRYEM+G+NSLL
Sbjct: 167 SLPQVKEPIPVRYEMRGFNSLL 188
>gi|156554010|ref|XP_001605879.1| PREDICTED: counting factor associated protein D-like [Nasonia
vitripennis]
Length = 553
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 152/223 (68%), Positives = 181/223 (81%), Gaps = 4/223 (1%)
Query: 131 KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+ K +P ++ + Y ++ DQSVCGSCWSFGTTGAVEGAY+MK+KKL LSQQALI
Sbjct: 328 EKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALI 387
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCSWG+GNNGCDGGEDFRSYQWI+KHG LPT+++YG YLGQD YCHI N T A + GFV
Sbjct: 388 DCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTEEEYGGYLGQDGYCHIKNVTQIAKLKGFV 447
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
NV N+ DA+KLAL KHGP+SVAIDAS K+FSFY NGVYY+ C N+ + LDHAVLAVGY
Sbjct: 448 NVDTNNVDAMKLALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLAVGY 507
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
G ++GK +W +KNSWS YWGN GY+LM+ K+NNCGVMTAPTY
Sbjct: 508 GTINGKGFWLIKNSWSNYWGNDGYILMAQKNNNCGVMTAPTYA 550
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 85/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQLSK G G S+K+AP+T E NK +C G DA I+ Q+++PD + +
Sbjct: 67 GMVKTYQLSKEGPYGSSIKIAPVTDEDNFNKETCLQVNGTSDARIDLQTIIPDTTGMEC- 125
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE++NG+ EKWRL+ S GEK +KY+LW+RY K+
Sbjct: 126 -----------------------IGEEMINGLACEKWRLIDSFGEKTNKYTLWIRYKKSP 162
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+AIPVRYEM+G+N+LL
Sbjct: 163 STPQMKEAIPVRYEMRGFNTLL 184
>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
Length = 552
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 154/224 (68%), Positives = 178/224 (79%), Gaps = 4/224 (1%)
Query: 132 ASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
A D IP + + Y ++ DQSVCGSCWSFGTTG +EGAY++K+ L LSQQALID
Sbjct: 329 AIMDQIPDDLDWRIYGAVTPVKDQSVCGSCWSFGTTGTIEGAYFLKYGHLVRLSQQALID 388
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
CSWGYGNNGCDGGEDFRSY+W+MKHG +P +D+YG YLGQD YCH+ N T TA +TG+VN
Sbjct: 389 CSWGYGNNGCDGGEDFRSYEWMMKHGGIPLEDEYGGYLGQDGYCHVENVTLTAKITGYVN 448
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
VT DALK+ALAKHGP+SVAIDAS K+FSFY NG+YYD +C N D LDHAVL VGYG
Sbjct: 449 VTSGDIDALKVALAKHGPISVAIDASHKTFSFYSNGIYYDPECGNKLDQLDHAVLLVGYG 508
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G PYW VKNSWS YWGN GY+LMS KDNNCGV T PTYVTM
Sbjct: 509 IINGNPYWLVKNSWSNYWGNDGYILMSPKDNNCGVATDPTYVTM 552
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 88/142 (61%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQ+SK G G SLKVAP+TTE NK +C G D IEPQS+LPD++ FK+
Sbjct: 67 GMVKTYQISKYGNFGTSLKVAPVTTEKVENKETCLQVNGTKDNKIEPQSILPDLTGFKL- 125
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IG E +NG+ EKWRLV ++G KV+KY++W+RY K++
Sbjct: 126 -----------------------IGTEPINGLNCEKWRLVDTKGAKVNKYTMWIRYKKSA 162
Query: 134 KD-----AIPVRYEMKGYNSLL 150
D AIPVRYEMKGYNSLL
Sbjct: 163 NDPGVKIAIPVRYEMKGYNSLL 184
>gi|380013206|ref|XP_003690657.1| PREDICTED: counting factor associated protein D-like [Apis florea]
Length = 549
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 147/202 (72%), Positives = 172/202 (85%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGTTGAVEGAY+MK+ KL LSQQALIDCSWG+GNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYFMKYGKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWI 407
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG LPT+D+YG YLGQD YCH+ N + A +TG+VNVT +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDEYGGYLGQDGYCHVNNISMIAKITGYVNVTSGDANALKIAIAKHGPISVA 467
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY +G+YY+ C N + LDHAVLAVGYG+++GK YW +KNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGIYYESTCGNIEESLDHAVLAVGYGKINGKDYWLIKNSWSNYWGNDG 527
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS + NNCGV+T PTYVTM
Sbjct: 528 YILMSQEKNNCGVLTTPTYVTM 549
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/142 (42%), Positives = 83/142 (58%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QLS G G S+K+AP+TTE+ N+ +C G + I+PQ +LPD + +
Sbjct: 65 GMVKTFQLSHKGEYGKSIKIAPVTTESVTNEETCLEVNGTKEIKIQPQVILPDTTSMQC- 123
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE+VNG++ EKWRL EK++KY+LW+RY K+
Sbjct: 124 -----------------------IGEEIVNGLLCEKWRLEQQIDEKINKYTLWIRYKKSP 160
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IPVRYEMKG+NSLL
Sbjct: 161 HLPNFKEPIPVRYEMKGFNSLL 182
>gi|195128649|ref|XP_002008774.1| GI11630 [Drosophila mojavensis]
gi|193920383|gb|EDW19250.1| GI11630 [Drosophila mojavensis]
Length = 547
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 149/223 (66%), Positives = 178/223 (79%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSL---LDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + + ++ DQSVCGSCWSFGT G +EGAY++K+ L LSQQALIDC
Sbjct: 325 KDEIPSQYDWRLFGAVSPVKDQSVCGSCWSFGTVGHLEGAYFLKNGGNLVRLSQQALIDC 384
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SWGYGNNGCDGGEDFR YQW+M+ G+PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 385 SWGYGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 444
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLA+ KHGP+SVAIDAS K+FSFY NGVYY+ +C N DGLDHAVLAVGYG
Sbjct: 445 TSNDPNAFKLAMLKHGPLSVAIDASPKTFSFYSNGVYYEPECKNDVDGLDHAVLAVGYGT 504
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 505 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 547
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G +LK+AP++T+ ELNK+SC G D +E QS+LPD F +
Sbjct: 64 GMVKTYQLAGEGQYGTALKLAPISTKEELNKLSCLQVNGTADQVVEIQSILPDAKPFAL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
IG E G +K+RL + G+K + Y+LWVRY K+
Sbjct: 123 -----------------------IGTESFLGYTCDKFRLEQTIGQKKNVYTLWVRYKKSP 159
Query: 133 ----SKDAIPVRYEMKGYNSLL 150
S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPGSRMPIPVRYEMRGYNTLL 181
>gi|195020395|ref|XP_001985188.1| GH14645 [Drosophila grimshawi]
gi|193898670|gb|EDV97536.1| GH14645 [Drosophila grimshawi]
Length = 549
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 149/223 (66%), Positives = 178/223 (79%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSL---LDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 KDEIPSQYDWRLYGAVSPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SWGYGNNGCDGGEDFR YQW+M+ G+PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWGYGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N DGLDHAVLAVGYG
Sbjct: 447 TANDPNAFKLALLKHGPISVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G +LK+ P+TT+ ELNK+SC G + ++ QS+LPD F++
Sbjct: 64 GMVKTYQLAGEGQFGTALKLTPITTKDELNKLSCLLVNGTAEQPVQIQSILPDAKPFEL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
IG E G +K+RL + G+K + Y+LWVRY K
Sbjct: 123 -----------------------IGTETFLGYTCDKFRLESVIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181
>gi|282158089|ref|NP_001164088.1| cathepsin L precursor [Tribolium castaneum]
Length = 552
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 148/228 (64%), Positives = 179/228 (78%), Gaps = 5/228 (2%)
Query: 129 YNKASKDAIPVRYE---MKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQ 184
Y +K+ +P +++ + + DQSVCGSCWSFGT G VEGA ++ + +L LSQQ
Sbjct: 325 YENINKEDLPDQWDWRLLGAVTPVKDQSVCGSCWSFGTVGTVEGALFLHNGGRLFRLSQQ 384
Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMT 243
AL+DCSWGYGNNGCDGGEDFR+YQW++KHG +PT++ YGPYLGQD YCH A +T
Sbjct: 385 ALVDCSWGYGNNGCDGGEDFRAYQWMLKHGGIPTEEAYGPYLGQDGYCHADKVQKVAKIT 444
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
G+VNVT N E+AL+LAL KHGP+SVAIDASQ++FSFY NGVYY+ KC N D LDHAVLA
Sbjct: 445 GYVNVTTNDENALRLALFKHGPISVAIDASQRTFSFYSNGVYYEPKCGNKIDELDHAVLA 504
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
VGYG ++G+ YW VKNSWS YWGN GY+LMS KDNNCGVMT PTYVTM
Sbjct: 505 VGYGTINGENYWLVKNSWSNYWGNDGYILMSSKDNNCGVMTTPTYVTM 552
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 83/141 (58%), Gaps = 29/141 (20%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
MVKTYQLS G G SLK+AP+TTETE N +C G + TIEPQS+LPD++ F+
Sbjct: 68 MVKTYQLSTEGPYGSSLKIAPVTTETETNSRTCLQVNGTEENTIEPQSILPDLTGFEC-- 125
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRY----- 129
GEE ++G++ EKW L + G+KV+KY++W+ Y
Sbjct: 126 ----------------------TGEETISGVLTEKWALSQTIGQKVNKYAMWLYYKDDPD 163
Query: 130 NKASKDAIPVRYEMKGYNSLL 150
N K A+PV+YEM+G+NSLL
Sbjct: 164 NTGVKIAVPVKYEMRGFNSLL 184
>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
Length = 549
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 147/222 (66%), Positives = 178/222 (80%), Gaps = 5/222 (2%)
Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCS 190
D IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDCS
Sbjct: 328 DDIPSQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCS 387
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
WG+GNNGCDGGEDFR YQW++K G +PT+++YGPYLGQD YCH+ N T A +TGFVNVT
Sbjct: 388 WGFGNNGCDGGEDFRVYQWMLKSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPITGFVNVT 447
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
N +A K+AL KHGP+SVAIDAS K+FSFY +GVYY+ +C N DGLDHAVLAVGYG +
Sbjct: 448 SNDPNAFKIALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGTI 507
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 508 NGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 77/142 (54%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT +LNK++C G D +E QS+LPD F
Sbjct: 64 GMVKTYQLAGEGDFGTMLKLAPITTSDDLNKLTCLQVNGTTDQKVEIQSILPDAKPF--- 120
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
T IG E G +K+RL S G+K + Y+LWVRY K
Sbjct: 121 ---------------------TLIGTETFLGYTCDKFRLEQSIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|209693437|ref|NP_001129411.1| cathepsin K-1 precursor [Acyrthosiphon pisum]
Length = 558
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 143/228 (62%), Positives = 175/228 (76%), Gaps = 5/228 (2%)
Query: 129 YNKASKDAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
YNK +P + + + DQS+CGSCWSFGTTG +EGAY+MKH++ S+QA
Sbjct: 331 YNKTDFKNLPSEIDWRIAGAVTPVKDQSICGSCWSFGTTGTIEGAYFMKHRERKSFSEQA 390
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTT--ATATMT 243
L+DCSWG+GNNGCDGGEDFR+Y WI KHGLPT+++YGPYL QD +CHIAN + +T
Sbjct: 391 LVDCSWGFGNNGCDGGEDFRAYSWIEKHGLPTEEEYGPYLSQDGFCHIANISNGDLTKIT 450
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
GFVNVT E+ALK+AL GP+S+AI AS K+FSFY NGVYYD+ C NSP L+HAVL
Sbjct: 451 GFVNVTTYDEEALKIALVNEGPISIAIYASLKTFSFYSNGVYYDKDCRNSPLELNHAVLL 510
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
VGYG LDG PYW VKNSWST+WGN GYVL+S KDNNCGV+++PTYV +
Sbjct: 511 VGYGTLDGHPYWLVKNSWSTHWGNAGYVLISRKDNNCGVLSSPTYVIL 558
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 72/137 (52%), Gaps = 26/137 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GM KTYQLS G+ G SLK+ P+TTETE N + C G + I PQS LPD+SDFK+
Sbjct: 79 GMAKTYQLSNKGSFGSSLKIVPVTTETETNTLKCLEVNGTDEFHISPQSTLPDLSDFKL- 137
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
+ E L NGI++EKW + + V KY +W+ +
Sbjct: 138 -----------------------VSESLENGIVVEKWVYIKRVDKTVGKYIMWLY--RKD 172
Query: 134 KDAIPVRYEMKGYNSLL 150
A P+R++M G+NS+
Sbjct: 173 DIAYPMRFQMNGFNSVF 189
>gi|332374780|gb|AEE62531.1| unknown [Dendroctonus ponderosae]
Length = 544
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 145/220 (65%), Positives = 178/220 (80%), Gaps = 5/220 (2%)
Query: 137 IPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWG 192
+P +++ + Y ++ DQSVCGSCWSFGT G VEGA+++K+ L LSQQALIDCSWG
Sbjct: 324 LPDQWDWRLYGAVTPVKDQSVCGSCWSFGTIGTVEGAFFLKNGGHLVRLSQQALIDCSWG 383
Query: 193 YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
+GNNGCDGGEDFR+YQWI KHG +PT++DYGPYLGQD YCH A ++ + NVT N
Sbjct: 384 FGNNGCDGGEDFRAYQWIKKHGGIPTEEDYGPYLGQDGYCHADKLPKVAKLSSWTNVTTN 443
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
E+AL+LA+ KHGP+SVAIDASQ++FSFY NGVY++EKC+N D LDHAVLAVGYG ++G
Sbjct: 444 DENALRLAIFKHGPISVAIDASQRTFSFYSNGVYFEEKCHNKVDELDHAVLAVGYGSING 503
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
YW VKNSWS YWGN GYVLMS K+NNCGVM+APTYVT+
Sbjct: 504 NHYWLVKNSWSNYWGNDGYVLMSSKNNNCGVMSAPTYVTL 543
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 83/138 (60%), Gaps = 27/138 (19%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
MVKTYQL K+G+ G SLK+AP+TTE E+N +C G + I PQ+VLP+++ F+
Sbjct: 63 MVKTYQLGKDGS-GTSLKIAPVTTEEEVNIRTCLQVNGSQEQPIRPQTVLPELTGFE--- 118
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--A 132
Y+G+E++ GI +KW L T+ GEK +KY LWV + +
Sbjct: 119 ---------------------YLGDEVIEGIQTQKWNLTTTVGEKENKYFLWVTFKEDGD 157
Query: 133 SKDAIPVRYEMKGYNSLL 150
K A+PVRYEM+G+NSLL
Sbjct: 158 QKIAVPVRYEMRGFNSLL 175
>gi|195379510|ref|XP_002048521.1| GJ11312 [Drosophila virilis]
gi|194155679|gb|EDW70863.1| GJ11312 [Drosophila virilis]
Length = 549
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + + ++ DQSVCGSCWSFGT G +EGAY++K+ L LS QALIDC
Sbjct: 327 KDEIPPQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAYFLKNGGNLVRLSTQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SWGYGNNGCDGGEDFR YQW+M+ G+PT++ YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWGYGNNGCDGGEDFRVYQWVMQVGGVPTEEQYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G +LK+AP++T ELNK+SC G + +E QS+LPD F++
Sbjct: 64 GMVKTYQLAGEGQYGTALKLAPVSTRDELNKLSCLQVNGTAEQPVEIQSILPDAKPFEL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
IG E G +K+RL + G+K + Y+LWVRY K
Sbjct: 123 -----------------------IGTETFMGYTCDKFRLEQTIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|389611850|dbj|BAM19484.1| cathepsin L [Papilio xuthus]
Length = 342
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 161/289 (55%), Positives = 199/289 (68%), Gaps = 10/289 (3%)
Query: 71 KVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYN 130
++NI+R LR H N + R +T L + E L S L Y
Sbjct: 56 RLNIFRQN-LR-YIHSNNRARRGFTLAVNHLADWAEDELAALRGRRYSGPSPLGLQFPYG 113
Query: 131 KASKDA----IPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLS 182
+A + +P ++ + + ++ DQSVCGSCWSFGT GAVEGA ++ + L LS
Sbjct: 114 EAELEEMSVKLPPEFDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLS 173
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATM 242
QQ LIDCSWG+GNNGCDGGEDFR+YQWIMKHGLPT++DYG YLGQD YCH+ N T ++
Sbjct: 174 QQTLIDCSWGFGNNGCDGGEDFRAYQWIMKHGLPTEEDYGDYLGQDGYCHVDNVTLVTSI 233
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
G+VNVT N+E+AL+LA+ KHGP+SVAIDAS KSFSFY +GVYY+ KC N D LDHAVL
Sbjct: 234 KGWVNVTTNNENALRLAIFKHGPISVAIDASHKSFSFYSHGVYYEPKCKNKIDELDHAVL 293
Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
AVGYG L+G+ YW VKNSWS WGN GYVLMS +DNNCGV +APTYV +
Sbjct: 294 AVGYGVLNGQKYWLVKNSWSNLWGNDGYVLMSSRDNNCGVESAPTYVLI 342
>gi|195590156|ref|XP_002084812.1| GD14469 [Drosophila simulans]
gi|194196821|gb|EDX10397.1| GD14469 [Drosophila simulans]
Length = 549
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+TE NK++C G D T++ QS+LPD F +
Sbjct: 64 GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTADQTVDIQSILPDAKPFSL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL ++ G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|195327474|ref|XP_002030443.1| GM25442 [Drosophila sechellia]
gi|194119386|gb|EDW41429.1| GM25442 [Drosophila sechellia]
Length = 549
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+TE NK++C G + TI+ QS+LPD F +
Sbjct: 64 GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTANQTIDIQSILPDAKPFSL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL ++ G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|20428641|ref|NP_620470.1| 26-29kD-proteinase [Drosophila melanogaster]
gi|6448467|dbj|BAA86910.1| homologue of Sarcophaga 26,29kDa proteinase [Drosophila
melanogaster]
gi|7294432|gb|AAF49777.1| 26-29kD-proteinase [Drosophila melanogaster]
gi|21483518|gb|AAM52734.1| RE18380p [Drosophila melanogaster]
Length = 549
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+TE NK++C G D ++ QS+LPD F +
Sbjct: 64 GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTADQAVDIQSILPDAKPFSL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL ++ G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|194749983|ref|XP_001957411.1| GF24054 [Drosophila ananassae]
gi|190624693|gb|EDV40217.1| GF24054 [Drosophila ananassae]
Length = 549
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 146/223 (65%), Positives = 177/223 (79%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + + ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 KDEIPTQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G+PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+TE NK++C G + ++ QS+LPD F++
Sbjct: 64 GMVKTYQLAGEGDFGTMLKLAPITTKTETNKLTCLQVNGTKNEAVQIQSILPDAKPFQL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL + G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTETFLGYTCDKFRLESVIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181
>gi|83944666|gb|ABC48937.1| cathepsin L like protease [Glossina morsitans morsitans]
gi|289739541|gb|ADD18518.1| cathepsin L-like protease [Glossina morsitans morsitans]
Length = 550
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
KD IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGAY++K+ L LSQQALIDC
Sbjct: 328 KDQIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHIEGAYFLKNGGNLVRLSQQALIDC 387
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFRSYQW+M+ G +PT+++YGPYLGQD YCH N T A + GFVNV
Sbjct: 388 SWQYGNNGCDGGEDFRSYQWMMQMGGVPTEEEYGPYLGQDGYCHAQNATLVAPIKGFVNV 447
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N DA K+AL HGP+SVAIDAS K+FSFY +GVYY+ C NS D LDHAVLAVGYG+
Sbjct: 448 TSNDPDAFKIALLNHGPLSVAIDASPKTFSFYSHGVYYEPSCKNSMDSLDHAVLAVGYGK 507
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++ + YW +KNSWSTYWGN GYVLMS ++NNCGVMT PT+V M
Sbjct: 508 MNDEHYWLIKNSWSTYWGNDGYVLMSARENNCGVMTTPTFVEM 550
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G SLK+AP+T + +LNK++C G D I+ QSVLP+ SDF++
Sbjct: 64 GMVKTYQLAGEDQYGTSLKLAPVTNKNQLNKVTCLQLNGTADNRIQIQSVLPNASDFQL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
+G E G++ +K+RL + +K + Y+LWVRY K+
Sbjct: 123 -----------------------VGTETFLGLVCDKFRLEETIEKKKNVYTLWVRYKKSP 159
Query: 133 ----SKDAIPVRYEMKGYNSLL 150
S+ +PVRYEM+GYN+LL
Sbjct: 160 HYPGSRMPMPVRYEMRGYNTLL 181
>gi|194870649|ref|XP_001972693.1| GG15663 [Drosophila erecta]
gi|190654476|gb|EDV51719.1| GG15663 [Drosophila erecta]
Length = 549
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
+D IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 QDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G LK+AP+TT+TE NK++C G D +E QS+LPD F +
Sbjct: 64 GMVKTYQLAGENQYGTLLKLAPITTKTENNKLTCLQVNGTADQAVEIQSILPDAKPFSL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL ++ G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|195494228|ref|XP_002094747.1| GE21992 [Drosophila yakuba]
gi|194180848|gb|EDW94459.1| GE21992 [Drosophila yakuba]
Length = 549
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
+D IP +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 QDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ C N DGLDHAVLAVGYG
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+TE NK++C G D T++ QS+LPD F +
Sbjct: 64 GMVKTYQLAGEGQFGTLLKLAPITTKTENNKLTCLQVNGTADQTVDIQSILPDAKPFSL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+RL ++ G+K + Y+LWVRY K
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
+S+ IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181
>gi|321478980|gb|EFX89936.1| hypothetical protein DAPPUDRAFT_309603 [Daphnia pulex]
Length = 584
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 142/228 (62%), Positives = 178/228 (78%), Gaps = 4/228 (1%)
Query: 128 RYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
Y K+ + +P + + + Y ++ DQSVCGSCWSFGT G +EGA ++K KL LSQQ
Sbjct: 357 HYTKSDVENLPEQMDWRLYGAVTPVKDQSVCGSCWSFGTVGTLEGALFLKTGKLTPLSQQ 416
Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMT 243
AL+DCSWG+GNNGCDGGEDFR YQW+MKHG +P+++ YGPYLG D YCH+ N T A++
Sbjct: 417 ALVDCSWGFGNNGCDGGEDFRVYQWMMKHGGIPSEESYGPYLGADGYCHVDNATLVASIK 476
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
G+VNVT DAL++A+ K+GP+SVAIDA+ ++FSFY NGVYY+ +C + D LDHAVLA
Sbjct: 477 GYVNVTSGDVDALRVAIFKYGPISVAIDAAHRAFSFYANGVYYNPECGSGEDSLDHAVLA 536
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
VGYG L G+PYW VKNSWSTYWGN GYVLMS K+NNCGV T+PTYV M
Sbjct: 537 VGYGILKGEPYWLVKNSWSTYWGNSGYVLMSQKENNCGVATSPTYVIM 584
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 82/140 (58%), Gaps = 27/140 (19%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GM KTYQL+ G G + K+APMTTET N+++CF G D + Q VLPD+S+FK+
Sbjct: 102 GMEKTYQLANEGPYGTAKKIAPMTTETVRNQLNCFQTSGSADEPVTVQGVLPDISNFKL- 160
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
IG E++NG+ EKW + S G+K ++Y++W+++ A
Sbjct: 161 -----------------------IGTEIMNGVSCEKWEYINSVGDKTNRYTMWLQHQFAG 197
Query: 133 --SKDAIPVRYEMKGYNSLL 150
+ A+PVRYEMKG+NSLL
Sbjct: 198 TRQQIAVPVRYEMKGFNSLL 217
>gi|357624871|gb|EHJ75484.1| putative 26,29kDa proteinase [Danaus plexippus]
Length = 553
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 142/202 (70%), Positives = 165/202 (81%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
DQSVCGSCWSFGT GAVEGA ++ + L LSQQAL+DCSWG+GNNGCDGGED+R+YQW
Sbjct: 352 DQSVCGSCWSFGTVGAVEGALFLSNGGHLVRLSQQALVDCSWGFGNNGCDGGEDYRAYQW 411
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
IM+HGLPT+DDYG YLGQD YCH+ N T M G+VNVT +E+ALKLA+ KHGPVSVA
Sbjct: 412 IMRHGLPTEDDYGGYLGQDGYCHMENVTVATKMKGWVNVTAKNENALKLAIFKHGPVSVA 471
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY NGVY++ KC NS + LDHAVLAVG+G L+G YW VKNSWS WGN G
Sbjct: 472 IDASHKTFSFYSNGVYFEPKCKNSVEELDHAVLAVGFGVLNGHKYWLVKNSWSNMWGNDG 531
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
YVLMS +D+NCGV APTYV +
Sbjct: 532 YVLMSARDDNCGVQAAPTYVII 553
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 81/141 (57%), Gaps = 29/141 (20%)
Query: 15 MVKTYQLSKN--GTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
MVKTYQLS + G S+K+AP+TTE LN+ +C G I Q+VLPD++DFK
Sbjct: 67 MVKTYQLSASVYPQYGTSIKIAPVTTEHVLNQDTCLQVNGTEGENINIQTVLPDMTDFK- 125
Query: 73 NIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA 132
++G E + KWR+VTS G+KV+KY++WV+Y K+
Sbjct: 126 -----------------------FVGTETMKDSDTFKWRMVTSVGDKVNKYTMWVKYRKS 162
Query: 133 SKD---AIPVRYEMKGYNSLL 150
+ AIPVRYEMKG+NSLL
Sbjct: 163 LRGDNIAIPVRYEMKGFNSLL 183
>gi|4521167|dbj|BAA76272.1| 26,29kDa proteinase [Sarcophaga peregrina]
Length = 550
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 144/223 (64%), Positives = 176/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
+D +P +Y+ + Y ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 328 RDTVPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 387
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW YGNNGCDGGEDFR+Y+W+M+ G +PT+++YGPYLGQD YCH N + A +TGF NV
Sbjct: 388 SWEYGNNGCDGGEDFRAYKWMMEMGGVPTEEEYGPYLGQDGYCHAKNVSLVAPITGFFNV 447
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
TPN ALK+AL KHGP+SVAIDAS K+FSFY +GVYY+ C N D LDHAVLAVGYG
Sbjct: 448 TPNDPMALKIALLKHGPISVAIDASPKTFSFYSHGVYYEPTCKNGLDELDHAVLAVGYGT 507
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 508 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 550
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 81/142 (57%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G SLK+AP+TT++ELNK++C G + ++ Q++LP+ DFK+
Sbjct: 64 GMVKTYQLANEHPFGTSLKLAPITTKSELNKVTCLQLNGTSEDPVQVQAILPNAKDFKL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
IG E G +K+RL + G+K + Y+LWVRY K
Sbjct: 123 -----------------------IGTETFLGFNCDKFRLEETIGDKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181
>gi|125979159|ref|XP_001353612.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
gi|54642377|gb|EAL31126.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 144/223 (64%), Positives = 175/223 (78%), Gaps = 5/223 (2%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
+D IP +Y+ + + ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 327 QDEIPAQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGDLVRLSQQALIDC 386
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW +GNNGCDGGEDFR YQW+M+ G+PT+++YGPYLGQD YC I N T A + GFVNV
Sbjct: 387 SWPFGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCRINNVTLVAPIKGFVNV 446
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N DGLDHAVLAVG+G
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGFGT 506
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 IKGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+ G G LK+AP+TT+ E+NK++C G + +E QS+LPD FK+
Sbjct: 64 GMVKTYQLAGEGDYGTMLKLAPITTQQEMNKLTCLQVNGTSEQKVETQSILPDAKPFKL- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
IG E G +K+RL + G+K + Y+LWVRY K
Sbjct: 123 -----------------------IGTETFLGFTCDKFRLEETIGQKKNVYTLWVRYKKSP 159
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181
>gi|158284547|ref|XP_307325.4| Anopheles gambiae str. PEST AGAP012577-PA [Anopheles gambiae str.
PEST]
gi|157021017|gb|EAA03137.4| AGAP012577-PA [Anopheles gambiae str. PEST]
Length = 547
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 139/202 (68%), Positives = 162/202 (80%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +EGAY+ K +KL SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 405
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P +D+YG YLGQD YCH+ N T A +TG+VNVT DA KLAL KHGP+S+A
Sbjct: 406 MEVGGVPMEDEYGGYLGQDGYCHVQNMTLYAPITGWVNVTSGDPDAFKLALFKHGPLSIA 465
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFSFY NGVYY+ +C N D LDHAVLAVGYG+L+G+ YW +KNSWS YWGN G
Sbjct: 466 IDAGHKSFSFYSNGVYYEPECKNKLDELDHAVLAVGYGQLNGEDYWLIKNSWSNYWGNDG 525
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++KDNNCG+ T TYV M
Sbjct: 526 YALMAVKDNNCGLTTDATYVLM 547
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+K G G+SLK+AP+TT+ ++NK +C G I+ Q +LP V DF++
Sbjct: 62 GMVKTYQLTKQGDYGVSLKLAPVTTQNQMNKETCLQVNGSASYKIDVQGILPYVRDFQL- 120
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
+G E G +K+ L G+K + Y+LWVRY K
Sbjct: 121 -----------------------LGTEQCGGYQCDKFGLTEVIGQKRNVYTLWVRYVKSP 157
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
A++ IPVRYEMKGYN+LL
Sbjct: 158 KYPAARMPIPVRYEMKGYNTLL 179
>gi|170064305|ref|XP_001867470.1| cathepsin l [Culex quinquefasciatus]
gi|167881732|gb|EDS45115.1| cathepsin l [Culex quinquefasciatus]
Length = 547
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 136/202 (67%), Positives = 164/202 (81%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +E AY++K KKL SQQAL+DCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTAGHIESAYFLKTKKLMRFSQQALVDCSWGYGNNGCDGGEDFRAYQWM 405
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P++++YG YLGQDAYC + N T A +TG+VNVT DA+KLAL KHGP+S+A
Sbjct: 406 MEVGGIPSEEEYGGYLGQDAYCRLENKTLYAPITGWVNVTSGDADAMKLALFKHGPLSIA 465
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG L+G+ YW +KNSWS WGN G
Sbjct: 466 IDAGHKSFSYYANGVYYEPECKNNLDGLDHAVLAVGYGTLNGEDYWLIKNSWSNLWGNDG 525
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++KDNNCG+ T TYV M
Sbjct: 526 YALMAMKDNNCGLATDATYVLM 547
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 78/142 (54%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL+K G G SLKVAP++T +LNK +C G D IE QS+LP+ +FK+
Sbjct: 62 GMVKTYQLTKAGEYGTSLKVAPVSTRQQLNKETCLQVNGSADNAIEVQSILPNAKEFKLE 121
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
G E NG +K+ L+ G+K + Y LWVRY K
Sbjct: 122 ------------------------GTETRNGFTCDKFTLIVDVGQKRNYYYLWVRYIKSP 157
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEMKGYN+LL
Sbjct: 158 KYPASRMPIPVRYEMKGYNTLL 179
>gi|443691408|gb|ELT93269.1| hypothetical protein CAPTEDRAFT_181131 [Capitella teleta]
Length = 541
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 138/202 (68%), Positives = 163/202 (80%), Gaps = 2/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSFGTTG VEGA ++K KL LSQQAL+DCSWG GNN CDGGEDFR+YQWI
Sbjct: 341 DQAVCGSCWSFGTTGTVEGANFLKTGKLVRLSQQALMDCSWGEGNNACDGGEDFRAYQWI 400
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHGLP+++DYG YL QD+YCHIAN + T AT++ +VNVT +E L+ A+A GP+SV
Sbjct: 401 KKHGLPSEEDYGAYLAQDSYCHIANVSNTIATISSYVNVTSTTE-GLRAAIAFKGPISVG 459
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS KS SFY NGVYY+ C N PD LDHAVLAVGYG ++G+ YW +KNSWST+WGN G
Sbjct: 460 IDASHKSLSFYANGVYYEPACGNKPDDLDHAVLAVGYGYMNGQGYWLIKNSWSTHWGNDG 519
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
YVLM+ KDNNCGV TA TYV +
Sbjct: 520 YVLMAQKDNNCGVTTAATYVVL 541
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 76/139 (54%), Gaps = 30/139 (21%)
Query: 14 GMVKTYQLSKNG--TNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
G+VKTYQL +G T G+S KVAPMTTE LN+I+CF G +A I Q+VLPD+++F
Sbjct: 64 GVVKTYQLPGDGVSTFGISRKVAPMTTEEVLNQINCFQVNGTKEAPITIQTVLPDLANF- 122
Query: 72 VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
T G E++N + W+ G+K + Y++W+
Sbjct: 123 -----------------------TKTGHEMMNSLNCTVWQYSVKIGQKKNVYTIWL---- 155
Query: 132 ASKDAIPVRYEMKGYNSLL 150
A + +PVRYEM GYNSLL
Sbjct: 156 ADETNLPVRYEMFGYNSLL 174
>gi|91992516|gb|ABE72974.1| cathepsin L [Ochlerotatus atropalpus]
Length = 313
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +E AY++K+ KL SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 111 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 170
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P++++YG YLGQD YC + N T A + G+VNVT DA+KLAL KHGP+S+A
Sbjct: 171 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDSDAMKLALFKHGPLSIA 230
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 231 IDAGHKSFSYYANGVYYEPECKNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 290
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++K+NNCG+ T TYV +
Sbjct: 291 YALMAMKNNNCGLTTDATYVNL 312
>gi|157113282|ref|XP_001657758.1| cathepsin l [Aedes aegypti]
gi|108877803|gb|EAT42028.1| AAEL006389-PA, partial [Aedes aegypti]
Length = 538
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 132/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +E AY++K+ KL SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 336 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 395
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P++++YG YLGQD YC + N T A + G+VNVT +A+K+AL KHGP+S+A
Sbjct: 396 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDAEAMKVALFKHGPLSIA 455
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 456 IDAGHKSFSYYANGVYYEPECRNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 515
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++KDNNCG+ T TYV +
Sbjct: 516 YALMAMKDNNCGLSTDATYVNL 537
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 74/141 (52%), Gaps = 29/141 (20%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
MVKTYQL+K G G SLKVAP+T +NK +C G D I Q +LPD +FK+
Sbjct: 53 MVKTYQLTKEGDYGTSLKVAPVTNRESMNKQTCLQVNGSSDNPISVQGILPDTKNFKL-- 110
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--- 131
G E NG +K+ LV G+K + YSLWVRY K
Sbjct: 111 ----------------------AGTEQRNGFNCDKFVLVDDVGQKRNYYSLWVRYIKSPK 148
Query: 132 --ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEMKGYN+LL
Sbjct: 149 YPASRMPIPVRYEMKGYNTLL 169
>gi|91992512|gb|ABE72972.1| cathepsin L [Aedes aegypti]
Length = 548
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 132/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +E AY++K+ KL SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 405
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P++++YG YLGQD YC + N T A + G+VNVT +A+K+AL KHGP+S+A
Sbjct: 406 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDAEAMKVALFKHGPLSIA 465
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 466 IDAGHKSFSYYANGVYYEPECRNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 525
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++KDNNCG+ T TYV +
Sbjct: 526 YALMAMKDNNCGLSTDATYVNL 547
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 74/141 (52%), Gaps = 29/141 (20%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
MVKTYQL+K G G SLKVAP+T +NK +C G D I Q +LPD +FK+
Sbjct: 63 MVKTYQLTKEGDYGTSLKVAPVTNRESMNKQTCLQVNGSSDNPISVQGILPDTKNFKL-- 120
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--- 131
G E NG +K+ LV G+K + YSLWVRY K
Sbjct: 121 ----------------------AGTEQRNGFNCDKFVLVDDVGQKRNYYSLWVRYVKSPK 158
Query: 132 --ASKDAIPVRYEMKGYNSLL 150
AS+ IPVRYEMKGYN+LL
Sbjct: 159 YPASRMPIPVRYEMKGYNTLL 179
>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
Length = 375
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 136/202 (67%), Positives = 159/202 (78%), Gaps = 2/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFGT G +EGAY+ K +KL SQQAL+DCSWGYGNNGCDGGEDFR+YQWI
Sbjct: 175 DQSVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALVDCSWGYGNNGCDGGEDFRAYQWI 234
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M+ G+P +D+Y YLGQD YC + N T A +TG+VNVT DA K+AL KHGP+S+A
Sbjct: 235 MQVGGVPMEDEY-EYLGQDGYCRVENVTLYAPITGWVNVTSGDPDAFKVALFKHGPLSIA 293
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA KSFSFY NGVYY+ +C N D LDHAVLAVGYG L+G+ YW +KNSWS YWGN G
Sbjct: 294 IDAGHKSFSFYSNGVYYEPECKNKLDELDHAVLAVGYGNLNGEDYWLIKNSWSNYWGNDG 353
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y LM++KDNNCG+ T TYV M
Sbjct: 354 YALMAVKDNNCGLTTDATYVLM 375
>gi|328722454|ref|XP_001951172.2| PREDICTED: counting factor associated protein D-like [Acyrthosiphon
pisum]
Length = 558
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 131/228 (57%), Positives = 169/228 (74%), Gaps = 5/228 (2%)
Query: 129 YNKASKDAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
YNK +P + + ++ DQS+CGSCWSF T G +EGAY++K+++ S+QA
Sbjct: 331 YNKTDFQNLPSEIDWRISGAVTAVKDQSICGSCWSFSTIGTIEGAYFIKYRERKNFSEQA 390
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTA--TATMT 243
L+DCS G+GN+GCDGGEDFR+Y WI KHGLPT+++YGPYL CHIAN + +T
Sbjct: 391 LVDCSLGFGNDGCDGGEDFRAYSWIEKHGLPTEEEYGPYLFSGGVCHIANISNGNLTKIT 450
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
FVNVT E+ALK+AL GP+S+AI AS K+F FY NGVYYD++C N+P L+HAVL
Sbjct: 451 SFVNVTTYDEEALKIALVNEGPISIAIYASLKTFFFYSNGVYYDKECCNNPLELNHAVLL 510
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
VGYG LDG PYW VKNSWST+WGN GY+L+S +DNNCGV+T+PTYV +
Sbjct: 511 VGYGTLDGHPYWLVKNSWSTHWGNSGYILISRRDNNCGVLTSPTYVIL 558
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 71/137 (51%), Gaps = 26/137 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GM KTYQLS G+ G LK+ P+TTET+ N + C G + I PQS LPD+SDF+
Sbjct: 79 GMAKTYQLSNKGSFGSYLKIVPVTTETKTNTLKCLEVNGSDEFHILPQSTLPDLSDFR-- 136
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
++ E L +GI++EKW + + V KY +W+ +
Sbjct: 137 ----------------------FVSESLEDGIVVEKWVYIKRVEKTVRKYIMWLY--RKD 172
Query: 134 KDAIPVRYEMKGYNSLL 150
A P++++M G+NS+
Sbjct: 173 DIAYPMKFQMNGFNSVF 189
>gi|156384930|ref|XP_001633385.1| predicted protein [Nematostella vectensis]
gi|156220454|gb|EDO41322.1| predicted protein [Nematostella vectensis]
Length = 548
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/240 (57%), Positives = 166/240 (69%), Gaps = 16/240 (6%)
Query: 128 RYNKASKDAIPVRYEMKGYNSL---------------LDQSVCGSCWSFGTTGAVEGAYY 172
RY K +P + +M N + DQ+VCGSCWSFGTTG +EG +
Sbjct: 308 RYTKGYNGGLPYKPDMASINDVPDEMNWVIRGAVTPVKDQAVCGSCWSFGTTGTIEGTLF 367
Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYC 231
+K K L LSQQ L+DCSWG GNN CDGGEDFRSYQ+IMK G + T++ YGPYLG D YC
Sbjct: 368 LKTKYLTRLSQQNLMDCSWGEGNNACDGGEDFRSYQYIMKSGGIATEESYGPYLGADGYC 427
Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
H + AT+TG+VN+T ALK A+A+ GP+SV+IDAS KS SFY GVYY+ KC
Sbjct: 428 HKKDAEIGATITGYVNITEGDLSALKTAIAQKGPISVSIDASHKSLSFYSYGVYYEPKCG 487
Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
N + LDH+VLAVGYG +DGKPYW +KNSWST+WG GYVLMS KDNNCGV TA TYV M
Sbjct: 488 NKNEDLDHSVLAVGYGTMDGKPYWMIKNSWSTHWGMNGYVLMSQKDNNCGVATAATYVLM 547
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 27/137 (19%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GM T+Q T G + ++ PM+TET+LN SCF G + + PQSVLPD++ F
Sbjct: 69 GMDSTFQRGDILTAGANFRICPMSTETKLNVRSCFQTNGTSASPVGPQSVLPDLAKF--- 125
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
+++G + W + G K +KY++ Y
Sbjct: 126 ---------------------SFVGNSTYGDVKCSIWEYKLTVGHKRNKYTM---YTSMH 161
Query: 134 KDAIPVRYEMKGYNSLL 150
P+ YEM G++ L+
Sbjct: 162 DPPAPIHYEMLGFDDLI 178
>gi|348542138|ref|XP_003458543.1| PREDICTED: counting factor associated protein D-like [Oreochromis
niloticus]
Length = 551
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 128/202 (63%), Positives = 161/202 (79%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSF TTGAVEGA ++K L VLSQQ LIDCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 349 DQAICGSCWSFATTGAVEGALFVKTGSLQVLSQQMLIDCSWGFGNNGCDGGEEWRAYEWI 408
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T + YG Y+G + +CH+ ++ TA + + NVT + ALK+AL K+GPV+V+
Sbjct: 409 MKHGGIATTETYGAYMGMNGFCHVDSSELTARIQSYTNVTSGDQLALKMALFKNGPVAVS 468
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY +GVYY+ C N+ D LDHAVLAVGYG L G+PYW +KNSWSTYWGN G
Sbjct: 469 IDASHRSFVFYSHGVYYEPACGNTVDDLDHAVLAVGYGTLSGEPYWLIKNSWSTYWGNDG 528
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T TYVT+
Sbjct: 529 YILMSMKDNNCGVATDATYVTL 550
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V TYQL+ G + K++P TTE ELN + CF G D + PQ+ LPD+ DFK
Sbjct: 67 GQVLTYQLASEQQYGTAYKISPETTEEELNVMKCFQINGTKDEAVTPQAALPDLQDFK-- 124
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
FLR ++ G + + W+ VT+ G K + Y+LW+ +++
Sbjct: 125 -----FLRVEYYR-----------------GSLCQVWQNVTTVGYKKNTYTLWMTHSERG 162
Query: 134 KDA-----IPVRYEMKGYNSLL 150
D IP+ YEM GYN+LL
Sbjct: 163 ADGGKDTPIPIHYEMMGYNTLL 184
>gi|209155876|gb|ACI34170.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
Length = 551
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 126/202 (62%), Positives = 157/202 (77%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSF TTGAVEGA ++ L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 349 DQAICGSCWSFATTGAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 408
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T + YG Y+G + CH + TA + + NVT +ALK+AL KHGPV+V+
Sbjct: 409 MKHGGIATTETYGSYMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVS 468
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA +SF FY +GVYY+ KC N+ D LDHAVLAVGYG ++ +PYW VKNSWSTYWGN G
Sbjct: 469 IDAGHRSFVFYSHGVYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDG 528
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T TYVT+
Sbjct: 529 YILMSMKDNNCGVTTDATYVTL 550
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 65/142 (45%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V TYQ+ G S K+ P TTE ELN + CF G + + PQ LPD+ F
Sbjct: 67 GQVSTYQMGMELPWGSSYKITPETTEVELNIMKCFQVNGTKEEPVTPQGSLPDLQGF--- 123
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
FLR ++ G++ E W+ VT G K + Y+LWV +
Sbjct: 124 ----MFLRMEYY-----------------GGVLCEVWQNVTVVGHKKNTYTLWVTRPEGG 162
Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
++ P+ YEM GYN+LL
Sbjct: 163 VMGGAEVTTPLHYEMMGYNTLL 184
>gi|432881828|ref|XP_004073923.1| PREDICTED: counting factor associated protein D-like [Oryzias
latipes]
Length = 563
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 126/202 (62%), Positives = 160/202 (79%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSF TTGA+EGA ++K L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 361 DQAICGSCWSFATTGAIEGALFLKTGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 420
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T + YG Y+G + +CH+ ++ A + + NVT +ALKLAL K+GPV+V+
Sbjct: 421 MKHGGIATTETYGSYMGMNGFCHMNSSELVAPIKSYTNVTSGDAEALKLALFKNGPVAVS 480
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ C N+ D LDHAVLAVGYG L+G+PYW +KNSWSTYWGN G
Sbjct: 481 IDASHRSFVFYGYGVYYEPACGNTTDDLDHAVLAVGYGTLNGEPYWLIKNSWSTYWGNDG 540
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T TYVT+
Sbjct: 541 YILMSMKDNNCGVTTDATYVTL 562
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 31/142 (21%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V TYQL+ + G++ K+ P TTETE N + CF G + ++PQ LPD+ F+
Sbjct: 81 GQVSTYQLASHQEFGVAYKITPETTETESNVMKCFQVNGTKEEAVQPQGTLPDLQGFQ-- 138
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
FLR ++ G + E W+ VT+ G K + Y+LWV K+
Sbjct: 139 -----FLRMEYY-----------------GGSLCEVWQNVTTVGHKKNTYTLWVA--KSE 174
Query: 134 KD-----AIPVRYEMKGYNSLL 150
+D A+P+ YEM GYN+LL
Sbjct: 175 RDGRKGTAVPLHYEMMGYNTLL 196
>gi|332022677|gb|EGI62958.1| Counting factor associated protein D [Acromyrmex echinatior]
Length = 525
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 129/175 (73%), Positives = 149/175 (85%), Gaps = 1/175 (0%)
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
+ VL QALIDCSWG+GNNGCDGGEDFRSYQWIMKHG LPT++DYG YLGQD YCHI N
Sbjct: 351 IKVLILQALIDCSWGFGNNGCDGGEDFRSYQWIMKHGGLPTEEDYGDYLGQDGYCHINNV 410
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
T TA +TGFVNVTP S +ALK+A+AKHGP+SVAIDAS K+FSFY +GVYYD C N+ +
Sbjct: 411 TLTAKITGFVNVTPRSVEALKIAIAKHGPISVAIDASHKTFSFYSHGVYYDPACGNTENK 470
Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
LDHAVLAVGYG ++GK YW +KNSWS YWGN GY+LM+ KD+NCGV+ APTYVTM
Sbjct: 471 LDHAVLAVGYGTMNGKGYWLIKNSWSNYWGNDGYILMAQKDDNCGVLLAPTYVTM 525
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 85/142 (59%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKT+QL+ G G S+K+AP+TTET N+ +C G D I+PQS+LPD + +
Sbjct: 70 GMVKTFQLAHKGQYGASIKIAPITTETVSNEETCLQVNGTSDFKIQPQSILPDTPEMEC- 128
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IGEE++NG++ EKWRLV G+K +KY+LW+RY K+
Sbjct: 129 -----------------------IGEEMINGLLCEKWRLVQEIGDKTNKYTLWIRYKKSP 165
Query: 134 -----KDAIPVRYEMKGYNSLL 150
K+ IP+RYEMKG+NSLL
Sbjct: 166 YLPKVKEPIPMRYEMKGFNSLL 187
>gi|198285481|gb|ACH85279.1| cathepsin l-like [Salmo salar]
Length = 444
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 126/202 (62%), Positives = 157/202 (77%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSF TTGAVEGA ++ L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 242 DQAICGSCWSFATTGAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 301
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T + YG Y+G + CH + TA + + NVT +ALK+AL KHGPV+V+
Sbjct: 302 MKHGGIATTETYGSYMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVS 361
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA +SF FY +GVYY+ KC N+ D LDHAVLAVGYG ++ +PYW VKNSWSTYWGN G
Sbjct: 362 IDAGHRSFVFYSHGVYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDG 421
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T TYVT+
Sbjct: 422 YILMSMKDNNCGVTTDATYVTL 443
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 29/98 (29%)
Query: 58 IEPQSVLPDVSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEG 117
+ PQ LPD+ F FLR ++ G++ E W+ VT G
Sbjct: 4 VTPQGSLPDLQGF-------MFLRMEYY-----------------GGVLCEVWQNVTVVG 39
Query: 118 EKVSKYSLWVRYNK-----ASKDAIPVRYEMKGYNSLL 150
K + Y+LWV + ++ P+ YEM GYN+LL
Sbjct: 40 HKKNTYTLWVTRPEGGVMGGAEVTTPLHYEMMGYNTLL 77
>gi|410911058|ref|XP_003969007.1| PREDICTED: counting factor associated protein D-like [Takifugu
rubripes]
Length = 549
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 124/202 (61%), Positives = 158/202 (78%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSF TTG +EGA ++K L VLSQQ L+DCSWG+GNN CDGGE++RSY+WI
Sbjct: 347 DQAICGSCWSFATTGTIEGALFLKTGFLQVLSQQILMDCSWGFGNNACDGGEEWRSYEWI 406
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + + YGPY+G + +CH+ ++ A + + NVT ALKLAL KHGPV+V+
Sbjct: 407 MKHGGIALAETYGPYMGMNGFCHVNSSELVAQIQSYTNVTSGDAMALKLALFKHGPVAVS 466
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY +GVYY+ C ++ D LDHAVLAVGYG L+G+PYW +KNSWSTYWGN G
Sbjct: 467 IDASHRSFVFYSHGVYYEPACGSTIDDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDG 526
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T T+VT+
Sbjct: 527 YILMSMKDNNCGVATDATFVTL 548
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 68/142 (47%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V TYQL G S K++P T E E N + CF G + + PQ+ LP + F+
Sbjct: 65 GQVSTYQLGGEKQWGTSYKISPETNEVEQNVMKCFQVNGTKEEAVAPQAALPQLQGFQ-- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKA 132
FLR ++ +G + E W+ VT+ G K + Y+LWV R +A
Sbjct: 123 -----FLRVEYY-----------------SGSLCEVWQNVTNVGYKKNTYTLWVTRAERA 160
Query: 133 S----KDAIPVRYEMKGYNSLL 150
+ +PV YEM GYN+LL
Sbjct: 161 TNGTESPVLPVHYEMMGYNTLL 182
>gi|161408099|dbj|BAF94153.1| 26,29 kDa proteinase-like cysteine protease [Plautia stali]
Length = 521
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 132/187 (70%), Positives = 153/187 (81%), Gaps = 4/187 (2%)
Query: 136 AIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P +Y+ + Y ++ DQSVCGSCWSFGTTGAVEGAY++K L LSQQALIDCSWG
Sbjct: 333 AVPDQYDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFVKTGTLVRLSQQALIDCSWG 392
Query: 193 YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
YGNNGCDGGEDFRSYQWIMKHG LP + DYG YLGQD YCH+ T TA +TG+VNVT
Sbjct: 393 YGNNGCDGGEDFRSYQWIMKHGGLPLESDYGGYLGQDGYCHVDKTNLTAKITGYVNVTTG 452
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
E+ALKLA+ KHGP+SVAIDASQK+FSFY NGVYYD +C N+P+GLDHAVLAVGYG++ G
Sbjct: 453 DENALKLAIFKHGPISVAIDASQKTFSFYSNGVYYDPECKNTPEGLDHAVLAVGYGKIHG 512
Query: 312 KPYWQVK 318
YW VK
Sbjct: 513 NSYWLVK 519
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 79/139 (56%), Gaps = 26/139 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQ+S G G SLK+APMTTE +LN +CF G + I+ Q++LPD++ F+
Sbjct: 70 GMVKTYQISDKGKYGTSLKLAPMTTEDQLNVETCFKVDGTIEDKIQAQAILPDLTGFQC- 128
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
IG+E VNG EKW T +GEK +KY +++R K
Sbjct: 129 -----------------------IGKEWVNGFNCEKWHNETIKGEKSNKYIMYIRRKKTE 165
Query: 134 KDA--IPVRYEMKGYNSLL 150
+ IPVRYEM+GYNSLL
Sbjct: 166 TGSIIIPVRYEMRGYNSLL 184
>gi|346469497|gb|AEO34593.1| hypothetical protein [Amblyomma maculatum]
Length = 557
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/289 (49%), Positives = 190/289 (65%), Gaps = 12/289 (4%)
Query: 69 DFKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
D + +I+R L F+ N + ++ + I + + RL + +G ++ +
Sbjct: 275 DRRRDIFRQNLRFIDSTNRANLGYQVAVNHLADRTPEEISVMRGRLQSRDGSSTAE--PF 332
Query: 127 VRYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
RY +K +P + + + Y ++ DQ+VCGSCWSFGTTG +EGAY+ K KL LS+
Sbjct: 333 PRYKFTAK--LPEQIDWRLYGAVTPVKDQAVCGSCWSFGTTGELEGAYFRKTGKLVRLSE 390
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATM 242
Q L+DCSW GNNGCDGGEDFR+Y++I KHGL + +DYG Y+GQD CH AT +T+
Sbjct: 391 QQLVDCSWNSGNNGCDGGEDFRAYEYIRKHGLASDEDYGAYIGQDGVCHDTKVNATISTI 450
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
++N+T + D L ALA GPVSV+IDA+ +SFSFY NGV+YD KC N D LDHAVL
Sbjct: 451 KSYINIT--NRDDLLTALANVGPVSVSIDAALRSFSFYSNGVFYDPKCRNDTDSLDHAVL 508
Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
AVGYG L +PYW +KNSWSTYWGN GYVL+S KDNNCGV T T V +
Sbjct: 509 AVGYGTLQEQPYWLIKNSWSTYWGNDGYVLISQKDNNCGVATQGTIVEL 557
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 25/146 (17%)
Query: 14 GMVKTYQLSKNGTN------GLSLKVAPMTTETELNKI-SCFHAYGIPDATIEPQSVLPD 66
G+V+T QL+ + G++ K+A M + +CF G + T+ Q+VLP+
Sbjct: 62 GIVQTVQLAPGSESAEGYPYGVNYKIAYMPDPKTWESVRTCFQVNGTENVTVSLQNVLPN 121
Query: 67 VSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVN--GIILEKWRLVTSEGEKVSKYS 124
V+ F F E W+ E L E+++ S +VSKY+
Sbjct: 122 VTGFT------------FVRKESC---WSGNDELLAQYGRKQCERFQETVSTEGRVSKYT 166
Query: 125 LWVRYNKASKDAIPVRYEMKGYNSLL 150
LW+ ++ AIP RY M+GYN+LL
Sbjct: 167 LWLSRDRDGS-AIPRRYLMRGYNTLL 191
>gi|449676370|ref|XP_002156627.2| PREDICTED: counting factor associated protein D-like [Hydra
magnipapillata]
Length = 551
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 131/246 (53%), Positives = 175/246 (71%), Gaps = 3/246 (1%)
Query: 109 KWRLVTSEGEK-VSKYSLWVRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGA 166
+ R + + EK VS +++ K S + + + G + DQ+VCGSCWSFGTTGA
Sbjct: 305 RGRRTSKKSEKNVSNAKPFIKTMKVSDLPTDINWRLFGAVTPVKDQAVCGSCWSFGTTGA 364
Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYL 225
+EGA ++K +L LS+Q L+DCSWG+GNNGCDGGE+FR+Y++IMKHG + T D YG YL
Sbjct: 365 IEGALFLKTGRLVRLSEQNLMDCSWGFGNNGCDGGEEFRAYEYIMKHGGIATDDSYGNYL 424
Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
G D YCH ++ A + +VNVT DALK+A+ +HGP++V IDA+ +F FY +GVY
Sbjct: 425 GIDGYCHQKSSVIGAKIASYVNVTSGDMDALKMAIVQHGPIAVGIDAAHLAFVFYSHGVY 484
Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTA 345
Y+ +C N P+ LDHAVLAVGYG +G+PY VKNSWST+WGN GYVLMS +DNNCGV T
Sbjct: 485 YNPECGNKPENLDHAVLAVGYGVQNGEPYTLVKNSWSTHWGNDGYVLMSQRDNNCGVATD 544
Query: 346 PTYVTM 351
T+V +
Sbjct: 545 ATFVNL 550
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 65/137 (47%), Gaps = 28/137 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G+ KT Q G G VAPM+ E +LN+I+CF G + PQS+LPD+ +F
Sbjct: 77 GISKTIQRGDKGEFGAHFLVAPMSNEVQLNQINCFELSGTSEEPALPQSLLPDMENF--- 133
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
TY GE+ ++ W+ + K + Y+L+V+ +
Sbjct: 134 ---------------------TYTGEKAFMDFKVDIWQYKYGDDNKYNVYTLYVKKDS-- 170
Query: 134 KDAIPVRYEMKGYNSLL 150
+IPV YEM GY+SLL
Sbjct: 171 --SIPVYYEMIGYDSLL 185
>gi|148970410|gb|ABR20116.1| venom cathepsin cysteine peptidase, partial [Mesobuthus eupeus]
Length = 225
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 128/202 (63%), Positives = 154/202 (76%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSFGT G +EGAY++K KKL LSQQ LID SW Y NNGCDGGED+R+Y++I
Sbjct: 19 DQAICGSCWSFGTAGTIEGAYFLKTKKLVRLSQQQLIDGSWKYENNGCDGGEDYRAYRYI 78
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G L T+DDYGP LGQD C N + TA + +VNVT DALK A+A HGP+S++
Sbjct: 79 KESGGLATEDDYGPILGQDGICRERNVSKTAKIGSYVNVTSGDLDALKFAIAYHGPISIS 138
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS KS SFY NGVYY+ C N+ D LDH+VLAVGYGEL G+ YW VKNSWSTYWGN G
Sbjct: 139 IDASHKSLSFYANGVYYEPACGNTNDDLDHSVLAVGYGELFGQKYWLVKNSWSTYWGNDG 198
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y L++ KDNNCG T PT+V +
Sbjct: 199 YFLLNQKDNNCGAATEPTFVVL 220
>gi|405976506|gb|EKC41011.1| Counting factor associated protein D [Crassostrea gigas]
Length = 349
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 139/242 (57%), Positives = 166/242 (68%), Gaps = 21/242 (8%)
Query: 120 VSKYSL--------WVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
SKYSL W + S+ A+ PV+ DQ+VCGSCWSFGTTG +EGA
Sbjct: 119 TSKYSLRDLPDSMDWRLHGYLSQRAVTPVK----------DQAVCGSCWSFGTTGTIEGA 168
Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDA 229
Y++K L LSQQ L+DCSWG GNN CDGGEDFR+YQW+MK+G L +++ YGPY QD
Sbjct: 169 YFLKTGDLVRLSQQQLMDCSWGEGNNACDGGEDFRAYQWMMKNGGLTSEELYGPYKAQDG 228
Query: 230 YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEK 289
C+ T + +VNVT ALK A+A GPVSVAIDAS S SFY NGVYY+ +
Sbjct: 229 KCN-KTITPIVQLKNYVNVTSGDLQALKFAIAHQGPVSVAIDASHLSLSFYANGVYYEPQ 287
Query: 290 CNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
C N PD LDHAVLAVGYG ++G+ YW +KNSWSTYWGN GYVLMS KDNNCGV T PT+V
Sbjct: 288 CGNKPDDLDHAVLAVGYGVMNGQAYWLIKNSWSTYWGNDGYVLMSQKDNNCGVATDPTFV 347
Query: 350 TM 351
+
Sbjct: 348 IV 349
>gi|224081608|ref|XP_002191568.1| PREDICTED: counting factor associated protein D-like [Taeniopygia
guttata]
Length = 546
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 125/199 (62%), Positives = 154/199 (77%), Gaps = 1/199 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF TTGA+EGA ++K L LSQQ LIDCSWG+GN CDGGE++R+Y+WI
Sbjct: 344 DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 403
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHG + + + YG Y GQ+ CH + A +TG+VNVT + A+K A+ KHGPV+V+
Sbjct: 404 KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 463
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS KSFSFY NGVYY+ KC+N+P LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 464 IDASHKSFSFYSNGVYYEPKCDNTPGSLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 523
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y+LM++KDNNCGV T TY
Sbjct: 524 YILMAMKDNNCGVATEATY 542
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 65/137 (47%), Gaps = 26/137 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V TYQL+ G+ K+ P TTE E+N CF G + + QSV P + FK
Sbjct: 69 GQVITYQLAAVKPYGMRYKITPETTEKEVNTRKCFQLPGSKEDVVTAQSVFPSMRGFK-- 126
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
FLR ++++ RY W+ +T +K + Y+LWV +S
Sbjct: 127 -----FLREEYYKD---RY--------------CAVWQNITRWAQKKNVYTLWV--TNSS 162
Query: 134 KDAIPVRYEMKGYNSLL 150
PV YEM+GYNSLL
Sbjct: 163 CGVAPVHYEMRGYNSLL 179
>gi|194246075|gb|ACF35529.1| midgut cysteine proteinase 2 [Dermacentor variabilis]
Length = 235
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 129/202 (63%), Positives = 153/202 (75%), Gaps = 3/202 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSFGT G ++GAY+ K KL LS+Q L+DCSW GNNGCDGGEDFR+Y++I
Sbjct: 36 DQAVCGSCWSFGTVGELKGAYFRKTGKLVRLSEQQLVDCSWNSGNNGCDGGEDFRAYEYI 95
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HGL + +DYG YLGQD CH AT A++ G++N+T + D L ALA GPVSV+
Sbjct: 96 RNHGLASDEDYGAYLGQDGVCHDTKVNATIASIKGYINIT--NRDDLLTALANVGPVSVS 153
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ +SFSFY NGV+YD C N D LDHAVLAVGYG L G+PYW VKNSWSTYWGN G
Sbjct: 154 IDAALRSFSFYSNGVFYDPNCRNDTDSLDHAVLAVGYGTLQGEPYWLVKNSWSTYWGNDG 213
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
YVL+S KDNNCGV T TYV +
Sbjct: 214 YVLISQKDNNCGVATQGTYVEL 235
>gi|327285051|ref|XP_003227248.1| PREDICTED: counting factor associated protein D-like [Anolis
carolinensis]
Length = 547
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 123/202 (60%), Positives = 155/202 (76%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF +TGA+EG+ ++K +L LSQQ LIDCSWG+GN CDGGE++++++W+
Sbjct: 345 DQAVCGSCWSFSSTGALEGSLFLKTGQLIPLSQQILIDCSWGFGNYACDGGEEWQAFEWV 404
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+KHG + + + YGPY GQ+ YCH T ++G+VNVT + ALK A+ KHGPVSV+
Sbjct: 405 LKHGGIASTESYGPYKGQNGYCHSNKTHLVGKLSGYVNVTSGNITALKAAIYKHGPVSVS 464
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS ++FSFY NGVYY+ KC N LDHAVLAVGYG L G+ YW VKNSWSTYWGN G
Sbjct: 465 IDASHRTFSFYSNGVYYEPKCGNKKGELDHAVLAVGYGVLQGELYWLVKNSWSTYWGNDG 524
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T TY M
Sbjct: 525 YILMSMKDNNCGVATDATYPLM 546
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 59/137 (43%), Gaps = 26/137 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V T QL G K+ P TTET +N CF G ++ + PQ V PD+ +F+
Sbjct: 69 GQVITLQLGCVEPYGTLFKITPATTETVVNAEKCFQLNGTKESRVRPQEVFPDMKNFQA- 127
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
+ EE G W+ ++ G K + Y++WV S
Sbjct: 128 -----------------------LREEYYKGQYCTVWQNISYWGHKKNIYTMWV--TNTS 162
Query: 134 KDAIPVRYEMKGYNSLL 150
PV YEM+GYN+LL
Sbjct: 163 CGVSPVHYEMRGYNTLL 179
>gi|241577796|ref|XP_002403652.1| midgut cysteine proteinase, putative [Ixodes scapularis]
gi|215500253|gb|EEC09747.1| midgut cysteine proteinase, putative [Ixodes scapularis]
Length = 564
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 154/209 (73%), Gaps = 10/209 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHK-------KLAVLSQQALIDCSWGYGNNGCDGGED 203
DQ+VCGSCWSFGT G +E A + K + KL LS+Q L+DCSW GNNGCDGGED
Sbjct: 358 DQAVCGSCWSFGTVGELEAALFRKARRFSMFTGKLTRLSEQQLVDCSWNQGNNGCDGGED 417
Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAK 262
FR+Y++I HGL T +DYG YLGQD CH AT T + ++NVT +++L+ ALA
Sbjct: 418 FRAYEYIRAHGLATDEDYGAYLGQDGICHDTKVNATVTTIKNYINVT--DKESLQKALAN 475
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
GPVSV+IDA+ K+F+FY NGV+YD KC N DGLDHAVLAVGYG L G+PYW +KNSWS
Sbjct: 476 VGPVSVSIDAAVKAFTFYSNGVFYDPKCRNDTDGLDHAVLAVGYGTLQGEPYWLIKNSWS 535
Query: 323 TYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
TYWGN GYVL+S KDNNCGV + TYV +
Sbjct: 536 TYWGNDGYVLISQKDNNCGVASQGTYVEL 564
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 74/145 (51%), Gaps = 22/145 (15%)
Query: 14 GMVKTYQLSKNGTNGLS-------LKVAPM-TTETELNKISCFHAYGIPDATIEPQSVLP 65
GMV+T Q++ + G S K+A M ET + +CF G + T+ Q+VLP
Sbjct: 62 GMVQTIQVAPDAEEGRSETPYGANYKIAYMPDKETWQPRRTCFLVNGTANNTVPLQAVLP 121
Query: 66 DVSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSL 125
DV+ F+ F+R ++ + T +E E+++LV + +VSKY+L
Sbjct: 122 DVTSFE-------FVR------QESCWEGTTAVDEPAERASCERFQLVVTNENRVSKYTL 168
Query: 126 WVRYNKASKDAIPVRYEMKGYNSLL 150
WV + + A+P RY M GYN+LL
Sbjct: 169 WVSRDGQGR-AVPRRYLMMGYNTLL 192
>gi|315131333|emb|CBM69275.1| venom protein Ci-80a [Chelonus inanitus]
Length = 165
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 125/162 (77%), Positives = 137/162 (84%), Gaps = 1/162 (0%)
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
CSWGYGNNGCDGGEDFRSYQWIMKHG LPT++DYG YLGQD YCH N T TA M+GFVN
Sbjct: 1 CSWGYGNNGCDGGEDFRSYQWIMKHGGLPTEEDYGGYLGQDGYCHSDNVTLTAKMSGFVN 60
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
VT N +ALK+A+AKHGPVSVAIDAS K+FSFY NGVYYD C N D LDHAVLAVGYG
Sbjct: 61 VTSNDPNALKVAIAKHGPVSVAIDASHKTFSFYANGVYYDSSCGNRIDQLDHAVLAVGYG 120
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
+L+G+ YW VKNSWS YWGN GY+LM+ KDNNCGVMT PTYV
Sbjct: 121 KLNGQDYWLVKNSWSNYWGNDGYILMAQKDNNCGVMTTPTYV 162
>gi|363746764|ref|XP_003643791.1| PREDICTED: digestive cysteine proteinase 2-like, partial [Gallus
gallus]
Length = 203
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 123/199 (61%), Positives = 152/199 (76%), Gaps = 1/199 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF TTGA+EGA ++K L LSQQ LIDCSWG+GN CDGGE++R+Y+WI
Sbjct: 1 DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 60
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHG + + + YG Y GQ+ CH + A +TG+VNVT + A+K A+ KHGPV+V+
Sbjct: 61 KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 120
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY NG+YY+ KC N P LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 121 IDASHKTFSFYSNGIYYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 180
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y+LM++KDNNCGV T TY
Sbjct: 181 YILMAMKDNNCGVATEATY 199
>gi|291223823|ref|XP_002731907.1| PREDICTED: predicted protein-like, partial [Saccoglossus
kowalevskii]
Length = 208
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 122/202 (60%), Positives = 149/202 (73%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSFG GA+EGA +MK KL LSQQ L+DCSWGYGNNGC GGE FRSYQW+
Sbjct: 7 DQAVCGSCWSFGAIGAIEGALFMKSNKLVRLSQQNLVDCSWGYGNNGCFGGEQFRSYQWV 66
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M +G + +++ YGPYLGQ+ CH + ++ + V D+LK +A +GPVSV+
Sbjct: 67 MDNGGIASEEQYGPYLGQNGLCHFNKSMVATKISNYTQVPSGDIDSLKKTIANYGPVSVS 126
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +S SFY NGVYY+E+C N P+ LDHAVLAVGYG + + YW VKNSWSTYWG G
Sbjct: 127 IDASHRSLSFYSNGVYYEEECGNKPENLDHAVLAVGYGTMANQSYWLVKNSWSTYWGMNG 186
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
+VLMS KDNNCGV T T VT+
Sbjct: 187 FVLMSQKDNNCGVATDATLVTI 208
>gi|326932936|ref|XP_003212567.1| PREDICTED: counting factor associated protein D-like [Meleagris
gallopavo]
Length = 573
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 122/199 (61%), Positives = 151/199 (75%), Gaps = 1/199 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF TTGA+EGA ++K L LSQQ LIDCSWG+GN CDGGE++R+Y+WI
Sbjct: 371 DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 430
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHG + + + YG Y GQ+ CH + A +TG+VNVT + A+K A+ KHGPV+V+
Sbjct: 431 KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 490
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY NG+YY+ KC N LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 491 IDASHKTFSFYSNGIYYEPKCANKSGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 550
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y+LM++KDNNCGV T TY
Sbjct: 551 YILMAMKDNNCGVATEATY 569
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 57/126 (45%), Gaps = 26/126 (20%)
Query: 25 GTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNIYRLFFLRPRF 84
G G+ K+ P TTE E+N CF G + ++ QSV P + FK FLR
Sbjct: 106 GPYGMRYKITPETTEKEVNVRKCFQLPGSKEDVVKAQSVFPSLDGFK-------FLR--- 155
Query: 85 HENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMK 144
EE G W+ VT +K + Y+LWV +S PV YEM+
Sbjct: 156 --------------EEYYQGQYCAVWQNVTRWEQKKNVYTLWV--TNSSCGVAPVHYEMR 199
Query: 145 GYNSLL 150
GYNSLL
Sbjct: 200 GYNSLL 205
>gi|195175517|ref|XP_002028493.1| GL18069 [Drosophila persimilis]
gi|194103698|gb|EDW25741.1| GL18069 [Drosophila persimilis]
Length = 276
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 129/223 (57%), Positives = 157/223 (70%), Gaps = 25/223 (11%)
Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
+D IP +Y+ + + ++ DQSVCGSCWSFGT G +EGA+++K+ L LSQQALIDC
Sbjct: 74 QDEIPAQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGDLVRLSQQALIDC 133
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
SW +GNNGCDGGEDFR YQW+M+ G+PT+++YGPYLGQD YC I N T A + GFVNV
Sbjct: 134 SWPFGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCRINNVTLVAPIKGFVNV 193
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T N +A KLAL KHGP+S +C N DGLDHAVLAVG+G
Sbjct: 194 TSNDPNAFKLALLKHGPLS--------------------PECKNDVDGLDHAVLAVGFGT 233
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 234 IKGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 276
>gi|62955235|ref|NP_001017633.1| uncharacterized protein LOC550326 precursor [Danio rerio]
gi|62202194|gb|AAH92817.1| Zgc:110239 [Danio rerio]
Length = 546
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 117/202 (57%), Positives = 154/202 (76%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF TTG +EGA ++K +L LSQQ L+DC+WG+GNNGCDGGE++R+++WI
Sbjct: 344 DQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWI 403
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T + YG Y+G + CH ++ A +TG+ NVT ALK A+ K GPV+V+
Sbjct: 404 MKHGGISTAESYGAYMGMNGLCHYDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVS 463
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ +SF+FY NGVYY+ +C N + LDHAVLAVGYG ++ + YW VKNSWS+YWGN G
Sbjct: 464 IDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDG 523
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+LMS+KDNNCGV T Y T+
Sbjct: 524 YILMSMKDNNCGVATDAIYATL 545
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 64/138 (46%), Gaps = 25/138 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G V T+ + + G K+ P+TTETE N + CF G D I PQ LPDV F
Sbjct: 66 GQVCTFFVGNDLDYGAVYKITPVTTETEFNTMKCFQLNGTKDEPIIPQIALPDVQGF--- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKA 132
E EKI Y G++ E W+ VT+ G K + Y LWV R
Sbjct: 123 ------------EFEKIEY---------YAGVLCEVWKNVTTVGHKKNTYRLWVTRPEGI 161
Query: 133 SKDAIPVRYEMKGYNSLL 150
A+P YEM G+N+LL
Sbjct: 162 DSSAMPHHYEMMGFNTLL 179
>gi|260821900|ref|XP_002606341.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
gi|229291682|gb|EEN62351.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
Length = 546
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 125/207 (60%), Positives = 151/207 (72%), Gaps = 6/207 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSFG A+EGA++ K L LSQQ L+DCSWGYGNN CDGGE++R+Y+W+
Sbjct: 340 DQAVCGSCWSFGAAEAIEGAWFKKTGLLVPLSQQNLMDCSWGYGNNACDGGEEWRAYEWV 399
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
MKHG + T YGPY G D CH A +TG+VNVT + ALK+ALA GPV+V
Sbjct: 400 MKHGGIATAAHYGPYTGADGKCHFDPKHVGAVVTGYVNVTSENGTALKMALANFGPVAVG 459
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGK----PYWQVKNSWSTY 324
IDA+ K+FSFY NGVYYD+ C N P+ LDHAVLAVGYG + D K YW +KNSWST+
Sbjct: 460 IDAAVKTFSFYANGVYYDKDCGNKPEDLDHAVLAVGYGSMPDSKGVMQDYWLIKNSWSTH 519
Query: 325 WGNQGYVLMSIKDNNCGVMTAPTYVTM 351
WG+ GYVL+S KDNNCGV T TYV +
Sbjct: 520 WGDNGYVLISQKDNNCGVATDATYVKV 546
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 58/136 (42%), Gaps = 25/136 (18%)
Query: 15 MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
M+ T Q S G K+A MTTE N+ SCF G + + PQS+ PD + FK
Sbjct: 63 MMSTIQKSDPKPGGAMYKMAYMTTEKVTNQKSCFQINGDKENPVMPQSIFPDTTKFKS-- 120
Query: 75 YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASK 134
IG G+ + L T EG +K + ++ Y K
Sbjct: 121 ----------------------IGTFPYKGLDCTAYMLNTVEGGG-NKNNTYILYMSKGK 157
Query: 135 DAIPVRYEMKGYNSLL 150
+PV Y MKGY+SLL
Sbjct: 158 TEVPVAYYMKGYDSLL 173
>gi|47229488|emb|CAF99476.1| unnamed protein product [Tetraodon nigroviridis]
Length = 478
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 128/256 (50%), Positives = 162/256 (63%), Gaps = 55/256 (21%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYY-------------------------------------- 172
DQ++CGSCWSF TTG +EGA +
Sbjct: 222 DQAICGSCWSFATTGTIEGALFLKTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWI 281
Query: 173 MKHKKLA----------------VLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-L 215
MKH +A VLSQQ LIDCSWG+GNN CDGGE++R+Y+WIMKHG +
Sbjct: 282 MKHGGIASAETYGPYLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGI 341
Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
+ + YGPYLG + +CH+ ++ TA + + NVT ALKLAL K+GPV+V+IDAS +
Sbjct: 342 ASAETYGPYLGMNGFCHVNSSELTAQIQSYTNVTSGDALALKLALFKNGPVAVSIDASHR 401
Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI 335
SF FY NGVYY+ C ++ + LDHAVLAVGYG L+G+PYW +KNSWSTYWGN GY+LMS+
Sbjct: 402 SFVFYSNGVYYEPACGSTVEDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDGYILMSM 461
Query: 336 KDNNCGVMTAPTYVTM 351
KDNNCGV T TYVT+
Sbjct: 462 KDNNCGVTTDATYVTL 477
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 100 ELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-----PVRYEMKGYNSLL 150
E +G + E W+ VT+ G K + Y+LWV ++ + PV YEM GYN+LL
Sbjct: 2 EYYSGSLCEVWQNVTTVGHKKNTYTLWVTRSEGDANLTEGPVSPVHYEMMGYNTLL 57
>gi|195995651|ref|XP_002107694.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
gi|190588470|gb|EDV28492.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
Length = 544
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 114/202 (56%), Positives = 152/202 (75%), Gaps = 1/202 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ VCGSCWSFGTTG +EG+ Y+K KL LSQQ +IDC+WG+GNNGCDGGE+FR+++WI
Sbjct: 342 DQGVCGSCWSFGTTGTIEGSLYLKSGKLVSLSQQNMIDCTWGFGNNGCDGGEEFRAFEWI 401
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHG + T+ YG YL QD C + T A + G+V V ++ ALKLA++ GPV+V
Sbjct: 402 AKHGGIATEKSYGQYLAQDGKCKLNKTKIGAKIRGWVQVPHGNQSALKLAVSAVGPVAVG 461
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+DA+ KSFSFY +G+YYD++C N LDHAVLAVGYG +G+ YW +KNSWST+WG+ G
Sbjct: 462 MDAALKSFSFYSSGIYYDKQCGNKEQDLDHAVLAVGYGNENGQDYWIIKNSWSTHWGDDG 521
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
YV +S+K+NNCG+ T ++V +
Sbjct: 522 YVKLSMKNNNCGIATDASFVNI 543
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 59/137 (43%), Gaps = 27/137 (19%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GM Q S +G ++ P TTE N CF G A + PQ++LPD+S F
Sbjct: 66 GMDIVLQRSDMMAHGAKYELVPETTEKFTNVRQCFSIKGSKAAPVTPQAILPDLSKF--- 122
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
+ G + N I + W + +V+ Y+++V + S
Sbjct: 123 ---------------------SKTGTKTYNKIKCDVWTYKNKQMGRVNTYTMYV--TQGS 159
Query: 134 KDAIPVRYEMKGYNSLL 150
K + PV Y M+GY+SLL
Sbjct: 160 KPS-PVHYIMEGYDSLL 175
>gi|242008549|ref|XP_002425065.1| predicted protein [Pediculus humanus corporis]
gi|212508730|gb|EEB12327.1| predicted protein [Pediculus humanus corporis]
Length = 549
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 127/191 (66%), Positives = 153/191 (80%), Gaps = 1/191 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQSVCGSCWSFG+TGA+EGAY++K+ KL +SQQALIDCSWG+GN GCDGGED S+ +I
Sbjct: 355 DQSVCGSCWSFGSTGAIEGAYFLKNHKLVRVSQQALIDCSWGFGNGGCDGGEDSYSFDYI 414
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+KHG +PT+D YGPYLGQD YCH+ + + G+V V DALK+A+A HGPV VA
Sbjct: 415 LKHGGIPTEDSYGPYLGQDGYCHVNKSMDMVKIKGYVAVPSGDVDALKVAVAVHGPVGVA 474
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY NGVYYD C N+P+ LDHAVL VG+G+L+G+ YW VKNSWS YWGN G
Sbjct: 475 IDASHKTFSFYSNGVYYDPNCGNTPNDLDHAVLVVGFGKLNGEDYWLVKNSWSNYWGNDG 534
Query: 330 YVLMSIKDNNC 340
YVLMS K+NNC
Sbjct: 535 YVLMSQKNNNC 545
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 75/142 (52%), Gaps = 29/142 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
GMVKTYQL G G KVAP+TTE N +CF G ++ +EPQ+VLPD+S+F+
Sbjct: 68 GMVKTYQLGSEGQYGTLKKVAPVTTEKFQNVDTCFVVSGSSESPVEPQTVLPDISNFE-- 125
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVR----- 128
Y GE+ VN I KW L G+KV++Y++W
Sbjct: 126 ----------------------YAGEDTVNEIPCHKWTLTNKFGDKVNRYTMWTSSKVTW 163
Query: 129 YNKASKDAIPVRYEMKGYNSLL 150
+ A+PVRYEMKG+NSLL
Sbjct: 164 KSNGETTAVPVRYEMKGFNSLL 185
>gi|390355503|ref|XP_001200245.2| PREDICTED: counting factor associated protein D-like
[Strongylocentrotus purpuratus]
Length = 509
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/224 (53%), Positives = 158/224 (70%), Gaps = 5/224 (2%)
Query: 133 SKDAIPVRYE---MKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S DA+P + + + + DQ+VCGSCWSFG+ +EGA +M+ K LSQQ L+DC
Sbjct: 283 SDDAVPDHIDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDC 342
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+W GNNGCDGGE++R Y+W+MK+G +P ++ YGPYLGQ+ CH + A A++ + NV
Sbjct: 343 TWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASIKKYYNV 402
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG- 307
T ++ LK ALA GP++V IDA+ SFSFY G YYD C N+ D LDHAVLAVGYG
Sbjct: 403 TSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGT 462
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ G+ YW +KNSWST+WGN GYV +S+KDNNCGV TA TYV +
Sbjct: 463 DSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAATYVEL 506
>gi|390363592|ref|XP_790934.3| PREDICTED: counting factor associated protein D-like
[Strongylocentrotus purpuratus]
Length = 560
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 123/225 (54%), Positives = 162/225 (72%), Gaps = 6/225 (2%)
Query: 133 SKDAIP--VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S DA+P + + ++G S + DQ+VCGSCWSFG+ +EGA +M+ K LSQQ L+DC
Sbjct: 333 SDDAVPDHIDWNVRGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDC 392
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVN 247
+W GNNGCDGGE++R Y+W+MK+ G+P ++ YGPYLGQ+ CH +T A A++ + N
Sbjct: 393 TWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYGKSTPAVASIKKYYN 452
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
VT ++ LK ALA GP++V IDA+ SFSFY G YYD C N+ D LDHAVLAVGYG
Sbjct: 453 VTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYG 512
Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ G+ YW +KNSWST+WGN GYV +S+KDNNCGV TA TYV +
Sbjct: 513 TDSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAATYVEL 557
>gi|449267783|gb|EMC78685.1| Digestive cysteine proteinase 1, partial [Columba livia]
Length = 320
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/199 (58%), Positives = 146/199 (73%), Gaps = 7/199 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+VCGSCWSF T GA+EGA ++K L LSQQ LIDC+WG+GN E++R+Y+WI
Sbjct: 124 DQAVCGSCWSFATAGAMEGALFLKTGVLTPLSQQVLIDCAWGFGN------EEWRAYEWI 177
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
KHG + T + YG Y GQ+ CH + A +TG+VNVT + A+K A+ KHGPV+V+
Sbjct: 178 KKHGGIATTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKAAIYKHGPVAVS 237
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS K+FSFY NG+YY+ KC N LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 238 IDASHKTFSFYSNGIYYEPKCANKSGSLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 297
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y+LM+++DNNCGV T TY
Sbjct: 298 YILMAMRDNNCGVATEATY 316
>gi|326430129|gb|EGD75699.1| hypothetical protein PTSG_07816 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/230 (49%), Positives = 154/230 (66%), Gaps = 8/230 (3%)
Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNS------LLDQSVCGSCWSFGTTGAVEGAYYMKHK 176
+S Y ++ PVR + + + + DQ +CGSCWSFG A+EG Y++
Sbjct: 309 FSSRTDYAYVAETPSPVRSAARDWRTTGAVTGVKDQGICGSCWSFGAAQAIEGQYFLATN 368
Query: 177 KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIAN 235
+ +SQQAL+DCSWG+GNN CDGGE FR+Y+W++++G +PT+ YGPYL D YCH
Sbjct: 369 RTVPMSQQALMDCSWGFGNNACDGGEAFRAYEWVLQNGYIPTEASYGPYLMADGYCHPEK 428
Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
+ G+VN+T + + L GP++VAIDAS KSFSFY +GVYYD C N+PD
Sbjct: 429 ADKGPGIKGYVNITSGDMNKVLDMLDNDGPLAVAIDASLKSFSFYSSGVYYDSDCGNTPD 488
Query: 296 GLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMT 344
LDHAVLAVG+G +DG+ YW +KNSWST +G++GYV MS ++NNCGV T
Sbjct: 489 DLDHAVLAVGFGTSVDGEDYWIIKNSWSTNYGDRGYVRMSRRNNNCGVAT 538
>gi|357621272|gb|EHJ73161.1| putative C1A cysteine protease precursor [Danaus plexippus]
Length = 545
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 111/221 (50%), Positives = 147/221 (66%), Gaps = 7/221 (3%)
Query: 135 DAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCS 190
D +P Y+ + Y S+ DQ CGSCW+FGTT AVEGA + +L L+ QALIDC+
Sbjct: 328 DTLPKEYDARMYGLVTSVKDQQDCGSCWTFGTTSAVEGALARINGGRLMRLANQALIDCA 387
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
WGY N GCDGG D +Y W++ +G+PT+++YGPY+ +D +C I N T T + GF NVTP
Sbjct: 388 WGYENFGCDGGTDTGAYHWMLNYGMPTEEEYGPYVNKDGFCRIHNMTQTYKIKGFTNVTP 447
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
S +ALK+AL HGP+SV+IDA+ ++Y G+Y D C S L+H V VGYGELD
Sbjct: 448 YSVEALKVALVNHGPLSVSIDATD-MLTYYNGGIYSDSDC--STTNLNHEVTLVGYGELD 504
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
G+ YW VKNSW WG GY ++ +DN+CG+ T PTYV
Sbjct: 505 GEEYWIVKNSWGRDWGVDGYFHITTRDNSCGITTEPTYVVF 545
>gi|167520318|ref|XP_001744498.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776829|gb|EDQ90447.1| predicted protein [Monosiga brevicollis MX1]
Length = 543
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 111/204 (54%), Positives = 142/204 (69%), Gaps = 2/204 (0%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E + DQ++CGSCWSFGT ++EG ++ L LSQQ+L+DCSW YGNN CDGG
Sbjct: 335 EQGAVTEVKDQAICGSCWSFGTAESIEGQVAIQKGILQPLSQQSLMDCSWSYGNNACDGG 394
Query: 202 EDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
E FR+Y+WIM++G +PT YG Y G D CH TT AT+ +VNVT + + L
Sbjct: 395 EAFRAYEWIMENGFIPTAASYGLYQGADGVCHSKGTTPGATLATYVNVTSADMNTVLDTL 454
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
A GP+++AIDAS +SF+FY +GVYYD C N+ D LDHAVLAVGYG + G YW VKN
Sbjct: 455 ATKGPLAIAIDASLRSFNFYSSGVYYDADCGNTTDALDHAVLAVGYGTDALGNDYWIVKN 514
Query: 320 SWSTYWGNQGYVLMSIKDNNCGVM 343
SWST++G+ GY+LMS KDNNCG +
Sbjct: 515 SWSTHYGDNGYILMSRKDNNCGTV 538
>gi|340378677|ref|XP_003387854.1| PREDICTED: counting factor associated protein D-like [Amphimedon
queenslandica]
Length = 540
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/230 (48%), Positives = 155/230 (67%), Gaps = 5/230 (2%)
Query: 124 SLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
S +V + S + IP V + +KG + DQ++CGSCWSFGTTG +EG+ ++K L
Sbjct: 309 SEFVYTSSVSANDIPDYVNWWLKGAVTPVKDQAICGSCWSFGTTGTLEGSLFIKTGNLVA 368
Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTAT 239
LS+QA +DCSWG GNNGCDGGE R+++W++ LPT+ Y PY+ QD+ C T
Sbjct: 369 LSEQAFVDCSWGQGNNGCDGGESERAFEWLLLGRCLPTEATY-PYMMQDSTCKDKTATCG 427
Query: 240 ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDH 299
++ + NV + LK+ + + GP+S+AIDAS +SFSFY NGVY + KC N+P+ LDH
Sbjct: 428 VRISKYYNVMSTNVTDLKVKIFERGPISIAIDASHRSFSFYANGVYKEPKCGNTPNDLDH 487
Query: 300 AVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
VLAVGYG+++G Y VKNSWST+WGN GYVLMS +N CG+ T ++
Sbjct: 488 QVLAVGYGDVNGDEYVIVKNSWSTHWGNDGYVLMSTDNNQCGIATDGSFA 537
>gi|254746342|emb|CAX16636.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 545
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 108/205 (52%), Positives = 141/205 (68%), Gaps = 4/205 (1%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
++ +Q CGSCW+FGTT AVEGA + KL LS QALIDC+W +G GCDGG D +
Sbjct: 344 AVKNQDNCGSCWTFGTTAAVEGALAQHNGGKLLSLSNQALIDCAWPFGVRGCDGGSDNAA 403
Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Y+W+M++GLPT+ +YGPYL +D YCHI N T + GF +VTP S ALK+AL HGP+
Sbjct: 404 YEWMMEYGLPTEAEYGPYLNRDNYCHIDNVTVKYPIKGFTDVTPYSVGALKVALVNHGPL 463
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
SV++ AS+ +F+ Y G++YD +C+ P LDH V VGYG D +W +KNSW WG
Sbjct: 464 SVSVHASE-AFTLYSGGIFYDTECD--PTSLDHEVALVGYGTRDEDTFWILKNSWGPMWG 520
Query: 327 NQGYVLMSIKDNNCGVMTAPTYVTM 351
GY+L+S +DNNCGV T PTYV
Sbjct: 521 IDGYILISSRDNNCGVATEPTYVNF 545
>gi|391341656|ref|XP_003745143.1| PREDICTED: uncharacterized protein LOC100900885 [Metaseiulus
occidentalis]
Length = 1356
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 110/204 (53%), Positives = 143/204 (70%), Gaps = 8/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSFGT G +EG Y++KH +L ++Q L+DCSW GN+ CDGG D+ +Y +I
Sbjct: 1158 DQAICGSCWSFGTVGHIEGQYFLKHGELVRFAEQQLVDCSWTSGNDACDGGLDYVAYDYI 1217
Query: 211 MKHGLPTQDDYGPYLGQDAYC---HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
K+GL + YGPY G D C I N T T+ + N++ + L+ A+A GP+S
Sbjct: 1218 KKYGLSSDAQYGPYRGIDGKCKDVEIENKPIT-TIQRYYNIS--GVENLRKAIAFVGPIS 1274
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDAS+ S SFY +GVY D C+++ LDHAVLAVGYG L GKPYW +KNSWSTYWGN
Sbjct: 1275 VAIDASRPSLSFYAHGVYEDPDCSSTE--LDHAVLAVGYGVLHGKPYWLIKNSWSTYWGN 1332
Query: 328 QGYVLMSIKDNNCGVMTAPTYVTM 351
GY+L+S KDN CGV + PTYV +
Sbjct: 1333 DGYILISQKDNMCGVASTPTYVEL 1356
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 127/206 (61%), Gaps = 10/206 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+Q CGSCWSF +E Y++ + K L S+Q L+DCSW + N GC GG ++
Sbjct: 377 NQGTCGSCWSFAVIAHLESQYFLNNGKENLTRFSEQQLVDCSWDFSNTGCSGGSIESAFS 436
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCH--IANTTAT-ATMTGFVNVTPNSEDALKLALAKHGP 265
++ ++GL T + YGPY ++ C + T T +T+ GF + ++ L+ +A GP
Sbjct: 437 YVKEYGLFTDEQYGPYREEEGKCRDTVTGTEPTISTLEGFNAI--GGKECLRNYIALKGP 494
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
++VAIDAS SF +Y +GVY + C L+HAVLA+GYGEL+G+PYW +KNSW W
Sbjct: 495 IAVAIDASSPSFVYYSHGVYKNPACGRD---LNHAVLAIGYGELNGEPYWLIKNSWGDIW 551
Query: 326 GNQGYVLMSIKDNNCGVMTAPTYVTM 351
G++G++L+S ++N CG+ +Y +
Sbjct: 552 GSEGFMLISQENNTCGIEDELSYADL 577
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 65/140 (46%), Gaps = 24/140 (17%)
Query: 14 GMVKTYQLSKNGTNGLSLKVA-PMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
G +KT+Q++ G G S KV ET + + +C+ A PQ +LPD+SDFK
Sbjct: 871 GEMKTFQIA--GETGGSYKVVWSPNNETHVPEQNCYAA-----GPAAPQGILPDLSDFK- 922
Query: 73 NIYRLFFLRPR--FHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYN 130
F+R F + T + L N ++ + ++VSKY W +
Sbjct: 923 ------FVRTESCFTDTS------TLLNTLLSNAAKCYRYENTVQKYDRVSKYVFWANQD 970
Query: 131 KASKDAIPVRYEMKGYNSLL 150
+ IPVRY M GY+SLL
Sbjct: 971 -GDNNVIPVRYMMMGYDSLL 989
>gi|391339556|ref|XP_003744114.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
occidentalis]
Length = 563
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/202 (54%), Positives = 140/202 (69%), Gaps = 5/202 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ++CGSCWSFG TG +EG Y KH KL S+Q LIDCSW +GN GCDGG+D+++YQ+I
Sbjct: 361 DQAICGSCWSFGATGNIEGMYARKHGKLVRFSEQQLIDCSWKFGNGGCDGGQDYQAYQYI 420
Query: 211 MKHGLPTQDDYGPYLGQDAYCH--IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
M+HGL T +YG Y+G D CH A T+ G+VNVT E+ LK A+A GP+SV
Sbjct: 421 MQHGLSTDKEYGAYMGIDGKCHDGPALKRELPTLLGYVNVT--GENDLKRAVAFVGPISV 478
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
I A+ S SFY G++ D+ C N LDHAVLAVGYG +G+ +W VKNSWST WG+
Sbjct: 479 GIFAALPSLSFYHTGIFNDKDCKNGLADLDHAVLAVGYGVSHEGEAFWIVKNSWSTLWGD 538
Query: 328 QGYVLMSIKDNNCGVMTAPTYV 349
GYV +++K+N CGV TA TY
Sbjct: 539 DGYVKIAMKNNICGVTTAATYA 560
>gi|325180654|emb|CCA15059.1| cathepsin putative [Albugo laibachii Nc14]
Length = 620
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 106/199 (53%), Positives = 132/199 (66%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSFGT+GA+EGA ++K KKL +S+Q LIDCSW +GN C+GG D++SY WI
Sbjct: 419 DQGSCGSCWSFGTSGAIEGALFLKEKKLVNVSEQNLIDCSWNFGNQACNGGFDYQSYNWI 478
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G + D YG Y +CH T A A MTGFVNVT + L AL GP+SV+
Sbjct: 479 ISSGGIEATDSYGDYRNLPDFCHFEPTRAVARMTGFVNVTGTEQ--LNDALLHVGPLSVS 536
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+Q SF FY G Y DEKC + D LDH VLAVG+ G+ + +KNSW +WG G
Sbjct: 537 IDATQPSFYFYAGGYYDDEKCASDLDSLDHLVLAVGFTTFRGQRFTLIKNSWGKFWGEDG 596
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y+++S KDN CGV TA TY
Sbjct: 597 YIMISQKDNVCGVATAATY 615
>gi|300120790|emb|CBK21032.2| unnamed protein product [Blastocystis hominis]
Length = 516
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 101/199 (50%), Positives = 141/199 (70%), Gaps = 2/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ +CGSCW++GT G +EG Y++K+ KL S+Q L+DCSW +GN+GC+GGEDFR+Y W+
Sbjct: 315 DQCMCGSCWTYGTVGVLEGQYFLKYGKLVKFSEQNLLDCSWNFGNDGCNGGEDFRAYGWM 374
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + GL T +DYG YLG D +CH + A +T +V +TP S + L+ A+A GP+SV
Sbjct: 375 LHNGGLMTDEDYGHYLGIDGWCHFNKSAAAVKITDYVLITPGSVEELEDAVANVGPISVG 434
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
I A F FY GV+ + +C+++ + HAVLAVGYG +GK YW +KNSWSTYWG+ G
Sbjct: 435 I-AVTTDFLFYAEGVFDNPECSSAVEDQAHAVLAVGYGTENGKDYWLIKNSWSTYWGDNG 493
Query: 330 YVLMSIKDNNCGVMTAPTY 348
YV ++ K+N CGV TA +Y
Sbjct: 494 YVKIARKNNICGVATAASY 512
>gi|187944195|gb|ACD40324.1| cathepsin 1-like protease [Helicoverpa armigera]
Length = 550
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 146/220 (66%), Gaps = 5/220 (2%)
Query: 136 AIPVRYEMKG---YNSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCSW 191
IP Y+ + +++ +Q CGSCW+FGTT AVEGA + KL LS QAL+DC+W
Sbjct: 332 GIPDEYDARLEGLVSNIKNQEECGSCWTFGTTAAVEGALARINGGKLMALSNQALVDCAW 391
Query: 192 GYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
YG GC+GG D +Y+W+M++GLPT +YG Y +D C+I N T T + GFV+VTP
Sbjct: 392 AYGAYGCEGGTDNAAYEWMMEYGLPTVAEYGQYTNKDGECNILNMTETFPIRGFVDVTPL 451
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
S ALK+AL HGP+SV++ ++ +F Y +GV+YD +C+ P+ L+H V VGYG+ DG
Sbjct: 452 SVGALKVALVNHGPLSVSVHVTE-AFLQYSSGVFYDTECSQLPNDLNHEVTPVGYGQRDG 510
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+W +KNSW +WG GY+ +S +DN CGV T PTYV
Sbjct: 511 DTFWILKNSWGPHWGIGGYMYISSRDNICGVSTEPTYVVF 550
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 26/150 (17%)
Query: 14 GMVKTYQLSKNGT--NGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
G VK++ SKN G+ ++ P + + ++ C G +EP ++LPD +F+
Sbjct: 67 GNVKSFATSKNKQFRYGVQYEIHPEASRDQEVEVVCNAMEGGRYDRVEPSTILPDTDEFE 126
Query: 72 VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
Y EE++N K+ + + S+ +LW YN+
Sbjct: 127 ------------------------YDDEEVMNDKDCVKFVNNVKDEQTDSEQTLWATYNE 162
Query: 132 ASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
K +PVRYE+ +NS L W F
Sbjct: 163 DDKAWVPVRYEVLEFNSWLGSKDKHEIWDF 192
>gi|156392785|ref|XP_001636228.1| predicted protein [Nematostella vectensis]
gi|156223329|gb|EDO44165.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 95/197 (48%), Positives = 138/197 (70%), Gaps = 4/197 (2%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
NS+ Q +CGSC++F GA+EGA+++K LS+Q ++DC+WG+GN GC GG +R+
Sbjct: 309 NSVKSQGICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGGYPYRA 368
Query: 207 YQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
QWI+KH GL T++ YG YL Q+ YCH NT+ A + ++++ + LKLA+A +GP
Sbjct: 369 MQWILKHGGLATEESYGRYLAQEGYCHFKNTSIGARLDKYMSIRQGNTSQLKLAVAFYGP 428
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VS+ ++ K+F FY +G+YYD +C ++ LDHA LAVGYGE G YW VKNSWS W
Sbjct: 429 VSILVNTQPKTFKFYGSGIYYDTQCTHA---LDHAALAVGYGEEKGVSYWIVKNSWSAMW 485
Query: 326 GNQGYVLMSIKDNNCGV 342
G +GY+ +++KD+NCGV
Sbjct: 486 GEEGYIKIAMKDDNCGV 502
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 58/138 (42%), Gaps = 28/138 (20%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTE-TELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
GM +TYQ G G + K+ P +E T C H G + I+ QS++P
Sbjct: 36 GMDRTYQRGDLGKYGYACKIVPEFSERTGRTFKGCLHRRGNSNFQIKAQSIIPS------ 89
Query: 73 NIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA 132
P+F + + G E G KW + KV+ Y+L Y
Sbjct: 90 ---------PKF---------FKFKGHEDFRGKNCAKWEHSFNIYNKVNTYTL---YTTP 128
Query: 133 SKDAIPVRYEMKGYNSLL 150
S+ PVRYEM GY++LL
Sbjct: 129 SRPYTPVRYEMMGYDTLL 146
>gi|254746346|emb|CAX16638.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
Length = 552
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 107/228 (46%), Positives = 149/228 (65%), Gaps = 8/228 (3%)
Query: 128 RYNKASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQ 183
+ N+ S+D +P Y+ + + + +Q CGSCW+FGTT AVEGA + KL L+
Sbjct: 329 KLNELSED-MPKEYDTRLLGLVSHVKNQEDCGSCWTFGTTAAVEGALARINGGKLMALAN 387
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMT 243
QAL+DC W +G GCDGG D +Y+W+M++GLPT+ +YGPY +D C+I N T T +
Sbjct: 388 QALVDCVWAFGAAGCDGGTDNAAYEWMMEYGLPTEAEYGPYTNKDGECNIINMTRTFPIK 447
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
G+V+VTP S ALK+AL HGP+SV++ + +FS Y +G++YD +CN L+H V
Sbjct: 448 GYVDVTPLSVGALKVALVNHGPLSVSVHVTD-TFSKYSSGIFYDTECNQL--HLNHEVTL 504
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
VGYGE DG +W +KNSW WG GY+ +S +DN CGV T PTYV
Sbjct: 505 VGYGERDGDTFWILKNSWGPQWGIGGYMYISARDNICGVATEPTYVVF 552
>gi|206725499|ref|NP_001128673.1| cathepsin L like protein precursor [Bombyx mori]
gi|198041259|dbj|BAG70408.1| cathepsin L like protein [Bombyx mori]
Length = 547
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 103/217 (47%), Positives = 145/217 (66%), Gaps = 7/217 (3%)
Query: 137 IPVRYEMK--GYNSLL-DQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCSWG 192
+P Y+++ GY S + +Q CGSCW+FGTT A EGA + +L LS QA++DC+W
Sbjct: 332 LPSEYDLRILGYVSKVKNQEDCGSCWTFGTTAAAEGALARINGGRLLSLSNQAILDCAWP 391
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
YG +GC+GG D +Y W+MK GLPT+++YG Y D C+I N + + G+ +VTP +
Sbjct: 392 YGGSGCEGGSDNAAYDWMMKFGLPTEEEYGSYTNADGICNIKNMSTIYPIRGWTDVTPLN 451
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
+ALK+A HGP+SV+IDA+ K FS Y G++YD C + L+H + VGYGE DG
Sbjct: 452 VEALKVAAVNHGPLSVSIDATDK-FSLYTGGIFYDTTC--TTKRLNHEMALVGYGERDGD 508
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
YW +KNSW WG +GY+L+S ++NNCG+ T PTYV
Sbjct: 509 TYWIIKNSWGPDWGVEGYLLISSRNNNCGIATEPTYV 545
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 31/139 (22%), Positives = 62/139 (44%), Gaps = 26/139 (18%)
Query: 14 GMVKTYQLSKNGTN--GLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
G VK+ + +N G+S ++ P T + E C G D + + +LP+ +F+
Sbjct: 65 GAVKSIIVDENAKYRYGVSYEIHPETDDGEDAIFKCNVMLGSEDNIFDLKIILPETDNFE 124
Query: 72 VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
++ P +++ + ++ E+ S+ E + ++W Y++
Sbjct: 125 -------YVGP---DSQTVENTIKFVAED--------------SDLETKTIKTIWATYDE 160
Query: 132 ASKDAIPVRYEMKGYNSLL 150
+ + PVRYE+K YN LL
Sbjct: 161 KNNNWHPVRYEVKNYNELL 179
>gi|301122899|ref|XP_002909176.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
gi|262099938|gb|EEY57990.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
Length = 598
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/202 (52%), Positives = 133/202 (65%), Gaps = 3/202 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+FGTTGA+EGA + + KKL +SQQ L+DCSW YGNN C+GG D+++Y+WI
Sbjct: 397 DQGACGSCWTFGTTGALEGALFAQQKKLYNMSQQNLLDCSWDYGNNACNGGLDYQAYEWI 456
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G L T YG Y YCH A M GFVNVT S AL ALA GP+SV+
Sbjct: 457 IANGGLETTATYGAYRNAPDYCHFNVDNAIGRMKGFVNVT--SVQALNDALATIGPLSVS 514
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ SF FY G Y D +C + D LDH+VLAVG + + Y VKNSWST+WG G
Sbjct: 515 IDANLPSFYFYGGGYYDDVECKSDLDSLDHSVLAVGVTTHNDQKYTLVKNSWSTHWGEDG 574
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+ +S K+N CGV TA TY +
Sbjct: 575 YIKISQKNNLCGVATAATYPVL 596
>gi|301090471|ref|XP_002895448.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
gi|262098628|gb|EEY56680.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
Length = 371
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 107/202 (52%), Positives = 133/202 (65%), Gaps = 3/202 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+FGTTGA+EGA + + KKL +SQQ L+DCSW YGNN C+GG D+++Y+WI
Sbjct: 170 DQGACGSCWTFGTTGALEGALFAQQKKLYNMSQQNLLDCSWDYGNNACNGGLDYQAYEWI 229
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G L T YG Y YCH A M GFVNVT S AL ALA GP+SV+
Sbjct: 230 IANGGLETTATYGAYRNAPDYCHFNVDNAIGRMKGFVNVT--SVQALNDALATIGPLSVS 287
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ SF FY G Y D +C + D LDH+VLAVG + + Y VKNSWST+WG G
Sbjct: 288 IDANLPSFYFYGGGYYDDVECKSDLDSLDHSVLAVGVTTHNDQKYTLVKNSWSTHWGEDG 347
Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
Y+ +S K+N CGV TA TY +
Sbjct: 348 YIKISQKNNLCGVATAATYPVL 369
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/200 (51%), Positives = 135/200 (67%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+EG + K KL LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 135 NQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYI 194
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T+ Y PY G+D C T+ AT +GFV++T E+AL A+A GP+SVA
Sbjct: 195 KENHGIDTEKSY-PYEGEDETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ +C S + LDH VL VGYG D + YW VKNSW T WG+ G
Sbjct: 254 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGG 311
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ +DNNCG+ T +Y
Sbjct: 312 YIKMARDQDNNCGIATQASY 331
>gi|357624858|gb|EHJ75471.1| putative C1A cysteine protease precursor [Danaus plexippus]
Length = 529
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/357 (35%), Positives = 190/357 (53%), Gaps = 48/357 (13%)
Query: 37 TTETELNKISCFHAYGIP---DATIEPQSVLPDVSDFKVNIYRLFFLRP----RFHENEK 89
T E +KI+ F+ Y P D + + D +DF ++ L + P R + +
Sbjct: 173 TGALEGHKITTFYDYRKPTKDDFNVALVTECEDATDFYKDLRVLHPMIPSDVDRLYHSYT 232
Query: 90 IRYNWTYIGEE--LVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKASKDAI--------- 137
+N Y EE L I+ + W+ V +K + L + +Y+ +K+ +
Sbjct: 233 KHHNRNYKAEEHSLRKSILEQNWQRVLLHNKKNLGFKLTLNKYSDRTKEELSFLTGTRPS 292
Query: 138 ----------------------PVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYY 172
P Y+M+ +++ +Q CGSCW+F T AVEGA
Sbjct: 293 LGTGTVSFPHTDEEVEQMVLDLPENYDMRLEGTISAVKNQGRCGSCWTFSTAAAVEGALA 352
Query: 173 MKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYC 231
K+ + LS+Q+++DC+WGY N GCDGG ++++I+ +G+PTQ +YG YLG+D YC
Sbjct: 353 RKNGGRDLDLSEQSIVDCAWGYHNAGCDGGMIDTAFKYILDYGIPTQIEYGDYLGEDGYC 412
Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
HI N T + GFV V S +A+K+AL K+GPVSVAI+A+ K Y +G+++D +CN
Sbjct: 413 HIENVTDVYNIIGFVQVPSKSVNAMKVALYKYGPVSVAINAN-KLLVAYESGIFFDPECN 471
Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
D ++HAV VGYG DG YW VKNSW WG GY+L+S DNNC ++ Y
Sbjct: 472 E--DHINHAVTVVGYGVRDGATYWIVKNSWGEDWGQDGYLLISATDNNCHILEYAYY 526
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 105/201 (52%), Positives = 140/201 (69%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG +++K KL LS+Q L+DCS +GN GC GG +++++I
Sbjct: 128 DQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYI 187
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+D Y PY QD C + AT TG+V+V SE ALK A+A GP+SVA
Sbjct: 188 KANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVA 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQ 328
IDASQ SF FY +GVYY+E C+++ LDH VLAVGYGE + G+ YW VKNSW+T WGN+
Sbjct: 247 IDASQPSFQFYHDGVYYEEGCSSTM--LDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNK 304
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ MS K NNCG+ + +Y
Sbjct: 305 GYIQMSRDKKNNCGIASQASY 325
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 109/240 (45%), Positives = 146/240 (60%), Gaps = 6/240 (2%)
Query: 111 RLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
R VT G+++ + YN D++ R E + +Q CGSCW+F TTG++EG
Sbjct: 103 RRVT--GDEIDSVTFIPAYNVTVPDSVDWRTE-GAVTEVKNQGQCGSCWAFSTTGSLEGQ 159
Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA 229
++ K+L LS+Q LIDCS YGNNGC GG ++ +I + G+ T+ Y PY G D
Sbjct: 160 HFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSY-PYEGIDD 218
Query: 230 YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEK 289
C + AT GFV++ E+ LKLA+A GP+SVAIDAS +SF FY GVYYD+
Sbjct: 219 KCRYKPQESGATDKGFVDIPQGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKG 278
Query: 290 CNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
C N + LDH VLAVGYG +GK YW VKNSW WG GY+ M+ K N+CG+ T+ +Y
Sbjct: 279 CGNGEEDLDHGVLAVGYGTENGKDYWLVKNSWGKRWGLDGYIKMARNKHNHCGIATSASY 338
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/228 (47%), Positives = 140/228 (61%), Gaps = 6/228 (2%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S + R KA+ V + KGY + DQ CGSCW+F TG++EG + K KL LS
Sbjct: 107 STFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLS 166
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS YGN GCDGG +++Q+I GL T+D Y PY QD C +T A+
Sbjct: 167 EQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSY-PYEAQDGECRFNPSTVGAS 225
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V++ E AL+ A+A GP+SVAIDA SF Y +GVY + C++S LDH V
Sbjct: 226 CTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSE--LDHGV 283
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G YW VKNSW WG QGY+LMS K N CG+ TA +Y
Sbjct: 284 LAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASY 331
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/201 (51%), Positives = 134/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH AT G+V++ E+ALK ALA GPVS+A
Sbjct: 199 KDNGGIDTEKSY-PYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C++ + LDH VLAVGYG +G+ YW VKNSW T WG+Q
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQ 315
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV M+ +DN+CGV T +Y
Sbjct: 316 GYVKMARNRDNHCGVATCASY 336
>gi|195995653|ref|XP_002107695.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
gi|190588471|gb|EDV28493.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
Length = 549
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 132/204 (64%), Gaps = 1/204 (0%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ +C SCW+ TT A+E A Y++ + LS Q+L+DCSW +GN GC GG +
Sbjct: 342 IRDQGMCSSCWALSTTQAIESALYIQTGEKVELSSQSLVDCSWPFGNQGCSGGFISHTLA 401
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
WI+K G+PT+ YG YLG++ YCH + T A + G+ + + ALK+ALAK GPV+
Sbjct: 402 WIIKQRGIPTKSSYGQYLGRNGYCHHHDLTTGAKLKGYSQIPQGNMKALKVALAKVGPVT 461
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
V+I+ S KSF FY +GV+Y+++C LDH VLA+GYG + Y +KNSWS YWG+
Sbjct: 462 VSINTSPKSFLFYSSGVFYNDQCLGDYAHLDHNVLAIGYGRQHDQDYILIKNSWSAYWGD 521
Query: 328 QGYVLMSIKDNNCGVMTAPTYVTM 351
GY+ +S ++NNCG+ T Y +
Sbjct: 522 HGYIKISTRNNNCGLATNAYYPIL 545
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 103/207 (49%), Positives = 136/207 (65%), Gaps = 6/207 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ DQ CGSCWSF +TG++EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 135 SVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 194
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY G D CH T AT TGFV++ E+A+ A+A GPV
Sbjct: 195 RYIKDNGGVDTEKSY-PYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPV 253
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
+VAIDAS +SF Y GVY D C S D LDH VL VGYG + DG+ YW VKNSW T W
Sbjct: 254 AVAIDASNESFQLYSEGVYNDPNC--SSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTW 311
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
G+QGY+ M+ +DN CG+ TA ++ T+
Sbjct: 312 GDQGYIKMARNQDNQCGIATASSFPTV 338
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/201 (52%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 144 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYI 203
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH AT GFV++ +E AL ALA GPVSVA
Sbjct: 204 KDNKGIDTEKSY-PYEAIDDECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGPVSVA 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C++ + LDH VLAVGYG DG+ YW VKNSW T WG+Q
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDS--EQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQ 320
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV M+ +DN+CG+ T +Y
Sbjct: 321 GYVKMARNRDNHCGIATTASY 341
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 101/200 (50%), Positives = 133/200 (66%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTG++EG ++ K KL LS+Q LIDCS +GNNGC+GG ++++I
Sbjct: 132 NQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYI 191
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH + AT TGFV++ E+ LK A+A GPVSVA
Sbjct: 192 KANKGIDTEQSY-PYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVA 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVY + +C++ + LDH VL VGYG DG+ YW VKNSW T WG+ G
Sbjct: 251 IDASHESFQFYSEGVYDEPECDS--EQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGG 308
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ MS KDN CG+ +A +Y
Sbjct: 309 YIYMSRNKDNQCGIASAASY 328
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/228 (47%), Positives = 143/228 (62%), Gaps = 6/228 (2%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S ++R + + V + KGY + DQ CGSCW+F TTG++EG + K KL LS
Sbjct: 107 SAYLRLPEGADLPNSVDWREKGYVTDVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLS 166
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS YGN GC GG ++++I + G+ T+D Y PY +D C + AT
Sbjct: 167 EQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSY-PYEAEDGQCRYNSANIGAT 225
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V+V EDALK ALA GPVSVAIDAS SF Y +GVY + +C++S LDH V
Sbjct: 226 CTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQLYESGVYDEPECSSSE--LDHGV 283
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G YW VKNSW WGN+GY++M+ K N CG+ TA +Y
Sbjct: 284 LAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 104/201 (51%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS YGNNGC+GG ++Q++
Sbjct: 144 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYV 203
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH AT GFV++ E ALK ALA GPVSVA
Sbjct: 204 KDNKGIDTEKAY-PYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVA 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C++ + LDH VLAVGYG DG+ YW VKNSW T WG+Q
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDS--EQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQ 320
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV M+ ++N+CG+ T +Y
Sbjct: 321 GYVKMARNRENHCGIATTASY 341
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/213 (47%), Positives = 140/213 (65%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ + KGY + +Q CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS YGNNG
Sbjct: 119 IDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I ++ G+ T+ Y PYL +D CH + A TGFV++ E+AL
Sbjct: 179 CEGGLMDNAFQYIKENGGIDTEKSY-PYLAKDGVCHYNKSAIGAKDTGFVDIPTGDENAL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ ALA GP+S+AIDASQ +F FY GVY D C+++ LDH VLAVGYG DGK YW
Sbjct: 238 QQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTR--LDHGVLAVGYGTDDGKDYWL 295
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
VKNSW WG +GY+ ++ D++ CGV + +Y
Sbjct: 296 VKNSWGPSWGEEGYIKIARNDHDKCGVASKASY 328
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 104/201 (51%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH AT G+V++ E+ALK ALA GPVS+A
Sbjct: 199 KDNGGIDTEKSY-PYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C++ + LDH VLAVGYG +G+ YW VKNSW T WG+Q
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQ 315
Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
GYV M+ DN+CGV T +Y
Sbjct: 316 GYVKMARNHDNHCGVATCASY 336
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 112/250 (44%), Positives = 154/250 (61%), Gaps = 17/250 (6%)
Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWS 160
+NG++L K + + + V + A+ DA+ R E GY + +Q CGSCW+
Sbjct: 87 MNGLLLHKHQ---------NGSTFLVPSHTAAPDAVDWRSE--GYVTPVKNQGQCGSCWA 135
Query: 161 FGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQD 219
F +TGA+EG ++ K +L LS+Q L+DCS YGNNGC+GG ++ +I + G+ T+
Sbjct: 136 FSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTET 195
Query: 220 DYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSF 279
Y PY GQD C + ++ A TGFV++ EDALK A+A GPVSVAIDAS SF F
Sbjct: 196 GY-PYEGQDGTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQF 254
Query: 280 YVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-N 338
Y +GVY + +C SP LDH VL VGYG +GK YW VKNSW T WG +GY+ MS + N
Sbjct: 255 YHSGVYDEPQC--SPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQN 312
Query: 339 NCGVMTAPTY 348
CG+ + +Y
Sbjct: 313 QCGIASKASY 322
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/200 (51%), Positives = 130/200 (65%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 140 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYI 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G + PY D CH AT GFV++ E AL A+A GPVSVAI
Sbjct: 200 KDNGGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAI 259
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQG 329
DAS +SF FY GVYY+ +C++ + LDH VLAVGYG +G+ YW VKNSW T WG+QG
Sbjct: 260 DASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 317
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
YV M+ +DN+CG+ TA +Y
Sbjct: 318 YVKMARNRDNHCGIATAASY 337
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/224 (45%), Positives = 150/224 (66%), Gaps = 9/224 (4%)
Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A + +P V + KG + DQ CGSCW+F TTG++EG +++K KL LS+Q L+
Sbjct: 99 EADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 158
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS +GN GC GG +++++I ++ G+ T++ Y PY QD C ++ AT TGFV
Sbjct: 159 DCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY-PYEAQDGKCRFDSSNVGATDTGFV 217
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
++ E++L A+A GP+SVAIDAS SF FY GVYY+++C+++ LDH VLA+GY
Sbjct: 218 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTM--LDHGVLAIGY 275
Query: 307 GEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GE DGK YW VKNSW+T WG++G++ MS K NNCG+ + +Y
Sbjct: 276 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASY 319
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/224 (45%), Positives = 150/224 (66%), Gaps = 9/224 (4%)
Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A + +P V + KG + DQ CGSCW+F TTG++EG +++K KL LS+Q L+
Sbjct: 83 EADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 142
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS +GN GC GG +++++I ++ G+ T++ Y PY QD C ++ AT TGFV
Sbjct: 143 DCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY-PYEAQDGKCRFDSSNVGATDTGFV 201
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
++ E++L A+A GP+SVAIDAS SF FY GVYY+++C+++ LDH VLA+GY
Sbjct: 202 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTM--LDHGVLAIGY 259
Query: 307 GEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GE DGK YW VKNSW+T WG++G++ MS K NNCG+ + +Y
Sbjct: 260 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASY 303
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 107/228 (46%), Positives = 143/228 (62%), Gaps = 6/228 (2%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S ++R + + V + KGY + DQ CGSCW+F TTG++EG + K KL LS
Sbjct: 107 SAYLRLPEGADLPNSVDWREKGYVTEVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLS 166
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS YGN GC GG ++++I + G+ T+D Y PY +D C + AT
Sbjct: 167 EQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSY-PYEAEDGQCRYNSANIGAT 225
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V+V EDALK A+A GPVSVAIDAS SF Y +GVY + +C++S LDH V
Sbjct: 226 CTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDEPECSSSE--LDHGV 283
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G YW VKNSW WGN+GY++M+ K N CG+ TA +Y
Sbjct: 284 LAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 101/201 (50%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTG++EG ++ K KKL LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 137 DQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYI 196
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D CH AT GFV++ E+ LK A+A GP+SVA
Sbjct: 197 KDNGGIDTEQSY-PYKAEDEKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPISVA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +F Y GVYY+ +C S + LDH VL VGYG + DG YW VKNSW WG+Q
Sbjct: 256 IDASHPTFQQYSEGVYYEPEC--SSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQ 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DNNCG+ T +Y
Sbjct: 314 GYIKMARNRDNNCGIATQASY 334
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 131/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TG++EG +++K KL LS+Q L+DCS YGN GC+GG S+ +I
Sbjct: 134 DQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYI 193
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+D Y PY +D C AT TGFV++ SE L+ A+A GPVSVA
Sbjct: 194 KANGGIDTEDSY-PYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDASQ+SF Y GVY + C S + LDH VLAVGYG +GK YW VKNSW+ WG G
Sbjct: 253 IDASQQSFQLYSEGVYDEPNC--SSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDG 310
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+LMS K+N CG+ ++ +Y
Sbjct: 311 YILMSRDKNNQCGIASSASY 330
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 132/200 (66%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG + K KL LS+Q LIDCS YGN GC+GG +++Q+I
Sbjct: 135 DQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 194
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY +D C A GFV++ ED LK A+A GPVSVA
Sbjct: 195 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ C++ D LDH VL VGYG +GK YW VKNSWS +WG++G
Sbjct: 254 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 311
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ + N+CGV +A +Y
Sbjct: 312 YIKMARNRKNHCGVASAASY 331
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 98/200 (49%), Positives = 134/200 (67%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG ++ K +KL LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 136 NQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYI 195
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH + AT TGFV++ E+ LK A+A GPVSVA
Sbjct: 196 KSNKGIDTEWSY-PYNATDGVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPVSVA 254
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVY + +C S + LDH VL VGYG DG+ YW VKNSW T WG++G
Sbjct: 255 IDASHESFQFYSEGVYDEPEC--SSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEG 312
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ KDN CG+ ++ +Y
Sbjct: 313 YIYMTRNKDNQCGIASSASY 332
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 132/200 (66%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG + K KL LS+Q LIDCS YGN GC+GG +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY +D C A GFV++ ED LK A+A GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ C++ D LDH VL VGYG +GK YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 315
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ ++ + N+CGV TA +Y
Sbjct: 316 YIKIARNRKNHCGVATAASY 335
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 104/219 (47%), Positives = 142/219 (64%), Gaps = 7/219 (3%)
Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
A V + KG + DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS YG
Sbjct: 99 AADVDWRTKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG 158
Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
N+GC GG ++ +I + G+ T+ Y PY QD C + AT TGFV V ++E
Sbjct: 159 NDGCGGGWMTSAFDYIKDNGGIDTESSY-PYEAQDRSCRFDANSIGATCTGFVEVQ-HTE 216
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
+AL A++ GP+SVAIDAS SF FY +GVYY++KC SP LDH VLAVGYG +
Sbjct: 217 EALHEAVSDIGPISVAIDASHFSFQFYSSGVYYEKKC--SPTNLDHGVLAVGYGTESTED 274
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW + WG+ GY+ MS +DNNCG+ + P+Y T+
Sbjct: 275 YWLVKNSWGSGWGDAGYIKMSRNRDNNCGIASEPSYPTV 313
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/236 (43%), Positives = 148/236 (62%), Gaps = 16/236 (6%)
Query: 116 EGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
E E + +SL + +K A+ PV+ +Q CGSCW+F TG++EG ++ K
Sbjct: 110 EPENIEDFSLPKTVDWRTKGAVTPVK----------NQGQCGSCWAFSATGSLEGQHFRK 159
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHI 233
+ LS+Q L+DCS +GNNGC+GG ++++I + G+ T+ Y PY G D CH
Sbjct: 160 SGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSY-PYNGTDGTCHF 218
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
+T AT +GFV++ SE LK A+A GP+SVAIDAS +SF FY +GVY + +C++
Sbjct: 219 KKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDS- 277
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
+ LDH VL VGYG L+G YW VKNSW T WG++GY+ MS K N CG+ ++ +Y
Sbjct: 278 -ESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSASY 332
>gi|2584884|gb|AAB82743.1| cysteine proteinase [Oncorhynchus mykiss]
Length = 151
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 114/151 (75%), Gaps = 1/151 (0%)
Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQ 218
SF TTGAVEGA ++K L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WIMKHG + T
Sbjct: 1 SFATTGAVEGALFLKSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATT 60
Query: 219 DDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFS 278
+ YG Y+G + CH+ + TA + + NVT +ALK+AL KHGPV+V+IDA +SF
Sbjct: 61 ETYGSYMGMNGLCHLNTSQLTARVQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFV 120
Query: 279 FYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
FY +G+YY+ KC N+ D LDHAVLAVGYG +
Sbjct: 121 FYSHGIYYEPKCGNTTDSLDHAVLAVGYGVM 151
>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
Length = 530
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 99/200 (49%), Positives = 133/200 (66%), Gaps = 2/200 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ VCGSCW+FG+TG++EG + KL LS+Q L+DC++ + GC+GG ++Q+I
Sbjct: 328 DQGVCGSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYI 387
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M G + PYL Q+ YC +++ + + +VNVT SE AL+ A+A GPV++A
Sbjct: 388 MNFGGIAYESTYPYLMQNGYCKDSSSQLSNIKVKSYVNVTSFSEPALQNAVATVGPVAIA 447
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS F FY +GVYY C N D LDH VLAVGYG L+G YW VKNSWST++G +G
Sbjct: 448 IDASAPDFRFYSSGVYYSSVCKNGLDDLDHEVLAVGYGTLNGADYWIVKNSWSTHYGAEG 507
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+LMS + NNCGV + PTY
Sbjct: 508 YILMSRNRGNNCGVASQPTY 527
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 104/238 (43%), Positives = 146/238 (61%), Gaps = 8/238 (3%)
Query: 116 EGEKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYY 172
G++ S+ S ++ + ++P V + KG + DQ CGSCW+F TG++EG ++
Sbjct: 95 RGQRTSRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHF 154
Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYC 231
+K +L LS+Q L+DCS +GNNGC+GG ++++I + G+ ++ Y PY D C
Sbjct: 155 LKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESY-PYEAMDDKC 213
Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
AT TGFV++ SED LK A+A GP+SVAIDA SF Y GVY + +C
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPEC- 272
Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
S + LDH VLAVGYG DGK YW VKNSW WG+ GY+LMS K+N CG+ +A +Y
Sbjct: 273 -SSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASY 329
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 99/200 (49%), Positives = 132/200 (66%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG + K KL LS+Q LIDCS YGN GC+GG +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY +D C A GFV++ ED LK A+A GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ C++ D LDH VL VGYG +GK YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 315
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ ++ + N+CG+ TA +Y
Sbjct: 316 YIKIARNRKNHCGIATAASY 335
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++ K KL LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D CH AT G+V++ +ED L+ A+A GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF Y GVYY+ +C SP LDH VL VGYG E DG YW VKNSW WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPEC--SPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DNNCG+ T +Y
Sbjct: 314 GYIKMARNRDNNCGIATEASY 334
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 110/249 (44%), Positives = 148/249 (59%), Gaps = 7/249 (2%)
Query: 107 LEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTG 165
EK+R E + L R N++ V + +GY + +Q VCGSCW+F TTG
Sbjct: 85 FEKYRGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTG 144
Query: 166 AVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPY 224
A+EG ++ + L LS+Q L+DCS YGN GC+GG ++++I GL T+ Y PY
Sbjct: 145 ALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSY-PY 203
Query: 225 LGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGV 284
G+D CH A +TGFV+V E+ALK A GPVSVAIDAS ++F FY +GV
Sbjct: 204 TGKDGTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGV 263
Query: 285 YYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGV 342
Y + C+++ LDH VL VGYG DGK YW VKNSW + WG GY+ MS K+N CG+
Sbjct: 264 YDEITCSSTS--LDHGVLVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRNKENQCGI 321
Query: 343 MTAPTYVTM 351
T +Y T+
Sbjct: 322 ATMASYPTV 330
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 99/200 (49%), Positives = 130/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG +++K L LS+Q L+DCS +GN+GC+GG ++Q+I
Sbjct: 134 NQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYI 193
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C AT TGFV++ SED LK A+A GPVSVA
Sbjct: 194 KANGGIDTEKSY-PYEAEDGECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG DGK YW VKNSW+ WG+ G
Sbjct: 253 IDASHSSFQLYSEGVYDETEC--SSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNG 310
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ MS KDN CG+ +A +Y
Sbjct: 311 YIKMSRDKDNQCGIASAASY 330
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 104/224 (46%), Positives = 148/224 (66%), Gaps = 9/224 (4%)
Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA + +P V + KG + DQ CGSCW+F TTG++EG +++K KL LS+Q L+
Sbjct: 103 KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 162
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS +GN GC GG +++++I + G+ T+D Y PY QD C + AT TG+V
Sbjct: 163 DCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 221
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
+V SE ALK A+A GP+SV IDASQ +F FY GVY+D+ C+++ LDH VLAVGY
Sbjct: 222 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 279
Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G + +G +W VKNSW+T WG++GY+ MS ++NNCG+ + +Y
Sbjct: 280 GSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASY 323
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 100/203 (49%), Positives = 136/203 (66%), Gaps = 6/203 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCW+F TGA+EG + K +L LS+Q L+DCS +GNNGC+GG +++
Sbjct: 139 IKDQGQCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFE 198
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
++ ++ G+ T++ Y PY +D CH A A GFV+V SE ALK A+A GPVS
Sbjct: 199 YVKENGGIDTEESY-PYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVS 257
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
VAIDAS +SF FY +GVY + +C SP+ LDH VL VGYG + DG YW VKNSW T WG
Sbjct: 258 VAIDASHESFQFYSHGVYIEPEC--SPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWG 315
Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
+QGYV M+ +DN CG+ ++ ++
Sbjct: 316 DQGYVKMARNRDNQCGIASSASF 338
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 139/224 (62%), Gaps = 6/224 (2%)
Query: 128 RYNKASKDAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
R+NK + V + KG S + DQ CGSCW+F +TGA+E ++K + LS+Q L
Sbjct: 107 RHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCWAFSSTGALEAHTFLKKGRRVSLSEQNL 166
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
IDCS YGNNGC+GG +++Q++ + G+ T++ Y PY G+D+ C AT GF
Sbjct: 167 IDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTEEAY-PYEGEDSECRFKKNNVGATDAGF 225
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V + E AL A+A GP+S+AIDAS SF FY GVYY+ +C+++ LDH VL VG
Sbjct: 226 VTIPSGDEQALMEAVATQGPLSIAIDASNPSFQFYSEGVYYEPECSSAQ--LDHGVLLVG 283
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YG + YW VKNSWS WG GY+ M+ KDNNCG+ T ++
Sbjct: 284 YGVEKDQKYWLVKNSWSEQWGENGYIKMARNKDNNCGIATQASF 327
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 143/219 (65%), Gaps = 8/219 (3%)
Query: 135 DAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
D +P V + KG + DQ CGSCW+F TG++EG +++K KL LS+Q L+DCS
Sbjct: 106 DTLPAEVDWRTKGAVTPVKDQKACGSCWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCST 165
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
G++GC GG ++ +I + G+ T+ Y PY D C + AT+TG+V+V
Sbjct: 166 KQGDHGCGGGLMDFAFTYIKDNGGIDTEASY-PYEATDGKCQYNPANSGATVTGYVDVEH 224
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
+SEDAL+ A+A GP+SVAIDAS+ +F FY GVYYD++C+++ LDH VLAVGYG D
Sbjct: 225 DSEDALQKAVATIGPISVAIDASRSTFHFYHKGVYYDKECSSTS--LDHGVLAVGYGTQD 282
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G YW VKNSW+ WGN G++ MS ++NNCG+ T +Y
Sbjct: 283 GTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGIATQASY 321
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 134/203 (66%), Gaps = 5/203 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG +++K+ +L LS+Q L+DCS YGN+GC GG ++ +I
Sbjct: 122 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 181
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C + A TG V + ++E+AL+ A++ GP+SVA
Sbjct: 182 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVA 240
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVYY++ C SP LDH VLAVGYG K YW VKNSW + WG+ G
Sbjct: 241 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 298
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y+ MS +DNNCG+ + P+Y T+
Sbjct: 299 YIKMSRNRDNNCGIASEPSYPTV 321
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 99/200 (49%), Positives = 130/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CG CW+F +TGA+EG + K KL L +Q LIDCS YGN GC+GG +++Q+I
Sbjct: 135 DQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYI 194
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY +D C A GFV++ ED LK A+A GPVSVA
Sbjct: 195 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVYY+ C++ D LDH VL VGYG +GK YW VKNSWS +WG+QG
Sbjct: 254 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDQG 311
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ ++ + N+CGV TA +Y
Sbjct: 312 YIKIARNRKNHCGVATAASY 331
>gi|254746348|emb|CAX16639.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
Length = 539
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 140/219 (63%), Gaps = 9/219 (4%)
Query: 135 DAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKH-KKLAVLSQQALIDCS 190
+ +P ++++ + +Q CGSCW+F TT AVEGA + +L LS+Q+L+DC+
Sbjct: 322 EELPENFDLRMEGAVTPVKNQGHCGSCWAFCTTAAVEGAVARANGDRLVDLSEQSLVDCA 381
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
WGY N GC+GG + ++++ HG+PT+++YG Y D C I N + T + GF VTP
Sbjct: 382 WGYENQGCNGGTLDGAMKYVLTHGIPTEEEYGLYEANDGLCRIQNMSTTYNIRGFAQVTP 441
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS-PDGLDHAVLAVGYGEL 309
+ +ALK+AL K+GPV+VAI AS + Y +G++YD C+ + P +H V +GYG
Sbjct: 442 RNPNALKVALNKYGPVTVAIHAS-RLMQHYASGLFYDFSCDETYP---NHGVTVIGYGTR 497
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
DG+ YW VKNSW WG G++LMS K+NNC V+ +P Y
Sbjct: 498 DGEDYWIVKNSWGESWGEDGFILMSAKNNNCFVLDSPYY 536
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/205 (50%), Positives = 137/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+EG + K +L LS+Q L+DCSW GN GC GG ++Q+I
Sbjct: 134 NQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQGNQGCHGGIVDLAFQYI 193
Query: 211 MK-HGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
++ G+ ++D Y PY +D A C ATA +TGFV++ P+SE+AL A+A GPVSV
Sbjct: 194 LQNQGIDSEDCY-PYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVSV 252
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
IDAS SF FY +G++YD KC S + LDHAVL VGYG + GK YW VKNSW +
Sbjct: 253 GIDASSTSFRFYQSGIFYDPKC--SSESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKH 310
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GYV MS + N+CG+ T +Y
Sbjct: 311 WGDRGYVYMSKDRGNHCGIATVASY 335
>gi|449670651|ref|XP_002168787.2| PREDICTED: uncharacterized protein LOC100201231 [Hydra
magnipapillata]
Length = 1244
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 136/224 (60%), Gaps = 5/224 (2%)
Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+ ++D IP + + + S+ Q +C C++F GA EG ++ K KL +S Q L+
Sbjct: 611 QVNEDEIPEEIDWREFGAVTSVKGQGICSGCYAFTAAGAAEGGWFRKTGKLIEVSSQQLV 670
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCSWGYGN+GC GG + WI HG+ T YG YLGQ+ YCH +T A + GF
Sbjct: 671 DCSWGYGNHGCKGGHYASALSWIFTHGVSTDKSYGKYLGQEGYCHFNDTLFGAQIDGFSY 730
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ P + ALK +AK+GPV+V+I+ SF FY G+Y D++C + H+ L VGYG
Sbjct: 731 IQPYNISALKRVVAKYGPVAVSINTKPLSFKFYSKGIYDDKEC--LSNQTHHSALIVGYG 788
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+L+G+ YW +KNSWS+ WG GY+ ++++++ CGV P V +
Sbjct: 789 KLNGREYWIIKNSWSSSWGEGGYMKLAMENHLCGVTENPVAVVI 832
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 105/237 (44%), Positives = 145/237 (61%), Gaps = 8/237 (3%)
Query: 117 GEKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYM 173
GE+ + S ++ + ++P V + KG + DQ CGSCW+F TG++EG +++
Sbjct: 76 GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFL 135
Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCH 232
K KL LS+Q LIDCS +GN GC GG ++++I + G+ T++ Y PY D C
Sbjct: 136 KSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESY-PYEAMDGDCR 194
Query: 233 IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNN 292
AT TGFV++ SED L+ A+A GP+SVAIDAS SF Y GVY + C
Sbjct: 195 FKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEGVYDEPNC-- 252
Query: 293 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
S + LDH VLAVGYG +GK YW VKNSW+ WG+ GY+LMS KDN CG+ ++ +Y
Sbjct: 253 SSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIASSASY 309
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 133/200 (66%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG ++ K ++ LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 159 NQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYI 218
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D CH + AT TGFV++ +E LK A+A GPVSVA
Sbjct: 219 KANGGIDTELSY-PYNGTDGICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVA 277
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVY + +C S + LDH VL VGYG DG+ YW VKNSW T WG+ G
Sbjct: 278 IDASHESFQFYSQGVYDEPEC--SSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDG 335
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ K+N CG+ ++ +Y
Sbjct: 336 YIYMTRNKENQCGIASSASY 355
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/200 (49%), Positives = 131/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG + K KL LS+Q LIDCS YGN GC+GG +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY +D C A GFV++ ED LK A+A GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY G YY+ C++ D LDH VL VGYG +G+ YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGXYYEPSCDS--DDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEG 315
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ ++ + N+CGV TA +Y
Sbjct: 316 YIKIARNRKNHCGVATAASY 335
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ K KL LS+Q L+DCS +GN+GC+GG +++++
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYV 198
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T+ Y PY D CH T+ AT GFV++ E+ L A+A GPVSVA
Sbjct: 199 KYNHGIDTEASY-PYHADDEKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF Y GVYYD +C S + LDH VL VGYG + +G+ YW VKNSW WG Q
Sbjct: 258 IDASHESFQLYSEGVYYDPEC--SSEELDHGVLVVGYGTDENGQDYWIVKNSWGESWGEQ 315
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DNNCG+ T +Y
Sbjct: 316 GYIKMARNRDNNCGIATQASY 336
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/234 (46%), Positives = 145/234 (61%), Gaps = 7/234 (2%)
Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
K +K S ++ N + V + KGY + DQ CGSCW+F TTG++EG ++ K
Sbjct: 102 KAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGK 161
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANT 236
L LS+Q L+DCS GN GCDGG +++Q+I+K G+ T++ Y PY D CH
Sbjct: 162 LVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEESY-PYKAVDGECHFKKA 220
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
AT+TG+ +VT +SE AL+ A+A GP+SVAIDAS SF Y +GVY + C+++
Sbjct: 221 NIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTL-- 278
Query: 297 LDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LDH VLAVGYG DG YW VKNSW+ WG GY+ MS KDN CG+ T +Y
Sbjct: 279 LDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASY 332
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 143/216 (66%), Gaps = 11/216 (5%)
Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
+ KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN GC+
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCN 181
Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALK 257
GG +++Q++ ++GL +++ Y PYLG D CH + A TGFV++ E AL
Sbjct: 182 GGLMDQAFQYVKDQNGLDSEESY-PYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALM 240
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DGK
Sbjct: 241 KAIAAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKK 298
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 299 YWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASY 334
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 143/216 (66%), Gaps = 11/216 (5%)
Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
+ KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN GC+
Sbjct: 95 WREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCN 154
Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALK 257
GG +++Q++ ++GL +++ Y PYLG D CH + A TGFV++ E AL
Sbjct: 155 GGLMDQAFQYVKDQNGLDSEESY-PYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALM 213
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DGK
Sbjct: 214 KAIAAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKK 271
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 272 YWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASY 307
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 106/217 (48%), Positives = 141/217 (64%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ K L LS+Q L+DCS GN G
Sbjct: 119 VDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q+I +G ++ PYLG D CH + +A TGFV+V SE AL
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERAL 238
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA +SF FY +G+YYD++C S + LDH VL VGYG ++DGK
Sbjct: 239 MKAVASVGPVSVAIDAGHESFQFYHSGIYYDKEC--SSEELDHGVLVVGYGFEGKDVDGK 296
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG++GY+ M+ K N+CG+ TA +Y
Sbjct: 297 KYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASY 333
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 133/203 (65%), Gaps = 5/203 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG +++K KL LS+Q L+DCS GN GC+GG +++++I
Sbjct: 124 NQGQCGSCWAFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYI 183
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
K+ G+ T+ Y PY D C + AT TG+V++ E+AL A+ K GPVSVA
Sbjct: 184 KKNGGIDTEASY-PYQAHDERCRFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVA 242
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ +C S LDH VLA+GYG G YW VKNSW T WG +G
Sbjct: 243 IDASHSSFQLYRSGVYYEREC--SQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEG 300
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y++MS ++NNCG+ T +Y T+
Sbjct: 301 YIMMSRNRNNNCGIATEASYPTV 323
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 105/228 (46%), Positives = 139/228 (60%), Gaps = 6/228 (2%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S + R + + V + KGY + DQ CGSCW+F TG++EG + K KL LS
Sbjct: 108 SAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKLVSLS 167
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS YGN GC+GG ++++I ++ G+ T+ Y PY +D C A
Sbjct: 168 EQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKSY-PYEAEDGQCRFKPENVGAK 226
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V+VT EDALK A+A GPVSV IDAS SF Y +GVY ++ C S LDH V
Sbjct: 227 CTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYDEQDC--SSQDLDHGV 284
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G+ YW VKNSW WG +GY++MS KDN CG+ TA +Y
Sbjct: 285 LAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDNQCGIATAASY 332
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 130/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ G+ T+ Y PY D C AT TG+V + SED LK A+A GP+SVA
Sbjct: 193 KENDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 140/213 (65%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY ++ +Q CGSCWSF TTG++EG ++ K KL LS+Q L+DCS +GN G
Sbjct: 135 VDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEG 194
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++I+ + G+ T+++Y PY + CH + AT +G V+V E L
Sbjct: 195 CNGGLMDQAFEYIITNGGIETEEEY-PYDARQERCHFKKSEVAATASGCVDVKSGDETDL 253
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K ++A+ GPVS+AIDAS +SF Y GVY + KC+++ LDH VL VGYG DG+ YW
Sbjct: 254 KNSVAEVGPVSIAIDASHQSFQLYSGGVYDEPKCSSTE--LDHGVLVVGYGTDDGQDYWL 311
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG +GYV MS +DN CGV T +Y
Sbjct: 312 VKNSWGTTWGLEGYVKMSRNQDNQCGVATQASY 344
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/201 (50%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG +Y + L LS+Q L+DCS +GNNGC+GG ++Q+I
Sbjct: 142 DQGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYI 201
Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C A A GFV+V +E+ALK A+A GPVSVA
Sbjct: 202 KVNGGIDTEKSY-PYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVA 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
IDASQ SF FY +GVY D C S + LDH VLAVGYG DG+ YW VKNSWS WG+Q
Sbjct: 261 IDASQDSFQFYQHGVYSDPDC--SAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQ 318
Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
GY+ ++ NN CG+ +A +Y
Sbjct: 319 GYIKIARNQNNMCGIASAASY 339
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SED LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++ K KL LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D CH AT G+V++ +ED L+ A+A GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF Y GVYY+ C+ S LDH VL VGYG E DG YW VKNSW WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPDCSASQ--LDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DNNCG+ T +Y
Sbjct: 314 GYIKMARNRDNNCGIATEASY 334
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SED LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/224 (45%), Positives = 147/224 (65%), Gaps = 9/224 (4%)
Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA + +P V + KG + DQ CGSCW+F TTG++EG +++K KL LS+Q L+
Sbjct: 102 KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS + N GC GG +++++I + G+ T+D Y PY QD C + AT TG+V
Sbjct: 162 DCSDKFRNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 220
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
+V SE ALK A+A GP+SV IDASQ +F FY GVY+D+ C+++ LDH VLAVGY
Sbjct: 221 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 278
Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G + +G +W VKNSW+T WG++GY+ MS ++NNCG+ + +Y
Sbjct: 279 GSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASY 322
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 138/213 (64%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTG++EG ++ K KL LS+Q L+DCS YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNG 171
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG ++ +I ++ G+ ++ Y PY +D C ++ AT TGFV++ +E+ L
Sbjct: 172 CDGGLMDNAFTYIKENKGIDSEASY-PYTAEDGKCVFKKSSVAATDTGFVDIPEGNENKL 230
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS +SF FY +GVY + C+++ LDH VL VGYG GK YW
Sbjct: 231 KEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTE--LDHGVLVVGYGTESGKDYWL 288
Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW+T WG++GY+ M N CG+ T +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASY 321
>gi|348531585|ref|XP_003453289.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 366
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ +CGSCW+F TTG +EG ++ K KL LS+Q L+DCS +GNNG
Sbjct: 154 VDWRDKGYVTEVKDQKICGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNG 213
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q+I + G+ T+ Y PY + C A TG+V V P++EDAL
Sbjct: 214 CNGGSVKRAFQYIQANGGIDTEASY-PYEAKGQQCRYKPDGIGAKCTGYVEVKPSNEDAL 272
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SV IDAS SF FY +GVY + C+ + L+H VLAVGYG +G YW
Sbjct: 273 KEAVATIGPISVGIDASHNSFRFYQSGVYDEPDCSKTV--LNHDVLAVGYGTENGHDYWL 330
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
+KNSW WG++GY+ MS K N CG+ + TY
Sbjct: 331 IKNSWGIRWGDKGYIKMSRNKSNQCGIASDATY 363
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 132/201 (65%), Gaps = 3/201 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF +TG++EG +++K L+ LS+Q L+DCS +GN+GC GG S++++
Sbjct: 125 NQGKCGSCWSFSSTGSLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYL 184
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
++ PY +D +C ++ A A TG+ ++ EDALK A+A GP+SVAI
Sbjct: 185 ETVAGDMSEEMYPYTAEDGFCRYRSSEAIAKDTGYKDIPRGDEDALKEAVATVGPISVAI 244
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA +SF Y G+YY+ C+++ LDH VLAVGYG +G+ YW VKNSW WGN+GY
Sbjct: 245 DAGHRSFQLYHEGIYYEPACSSTK--LDHGVLAVGYGTGEGEEYWLVKNSWGPSWGNEGY 302
Query: 331 VLMSI-KDNNCGVMTAPTYVT 350
V+MS ++NNCG+ T +Y T
Sbjct: 303 VMMSRNRENNCGIATQASYPT 323
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 145/218 (66%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTGA+EG + K +KL LS+Q L+DCS GN G
Sbjct: 120 VDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEG 179
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL +++ Y PYLG D C+ + A TGF+++ E A
Sbjct: 180 CNGGLMDQAFQYIQDNSGLDSEEAY-PYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHA 238
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DG
Sbjct: 239 LMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDG 296
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+LM+ + N+CG+ TA +Y
Sbjct: 297 KKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASY 334
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 136/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG +Y K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY + D CH +A TGFV+V SE L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEKDL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVA+DA KSF FY +G+YYD +C S + LDH VL VGYG ++DGK
Sbjct: 238 MKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPEC--SSEDLDHGVLVVGYGFEGEDVDGK 295
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSWS WGN GY+ + + N+CG+ TA +Y
Sbjct: 296 RYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASY 332
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 131/200 (65%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG ++ K + LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 138 NQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYI 197
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D CH + AT TGFV++ +E LK A+A GP+SVA
Sbjct: 198 KANGGIDTEKSY-PYNGTDGTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPISVA 256
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVY + +C S + LDH VL VGYG D + YW VKNSW T WG+ G
Sbjct: 257 IDASHQSFQFYSQGVYDEPEC--SSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGG 314
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ KDN CG+ ++ +Y
Sbjct: 315 YIYMTRNKDNQCGIASSASY 334
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 132/202 (65%), Gaps = 5/202 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCW+F TTG++EG + L LS+Q L+DCS GN GC+GG+ + +Q
Sbjct: 129 IKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQ 188
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I+++ G+ T+ Y PY ++ C N+ ATM+ F +VT EDALK A A GP+S
Sbjct: 189 YIIQNKGIDTEQCY-PYKAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPIS 247
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
V IDAS +SF FY +GVY + +C+++ LDH VL VGYG K YW VKNSW T WGN
Sbjct: 248 VGIDASHQSFQFYSSGVYNEFECSSTK--LDHGVLVVGYGTYGSKDYWLVKNSWGTVWGN 305
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
+GY++MS KDN CGV T ++
Sbjct: 306 EGYIMMSRNKDNQCGVATDASF 327
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 97/199 (48%), Positives = 125/199 (62%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG ++ K KL LS+Q L+DCS +GN GC+GG +Q+I
Sbjct: 134 NQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYI 193
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G ++ PY QD C AT GFV++ SED LK A+A GPVSVAI
Sbjct: 194 KANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAI 253
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y GVY + C++S LDH VL VGYG +GK YW VKNSW WG+ GY
Sbjct: 254 DASHGSFQLYSQGVYDEPDCSSSQ--LDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGY 311
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+LMS KDN CG+ ++ +Y
Sbjct: 312 ILMSRDKDNQCGIASSASY 330
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + ED LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 138/222 (62%), Gaps = 12/222 (5%)
Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
AIPV + +GY + DQ CGSCWSF TTGA+EG ++ K KL LS+Q LID
Sbjct: 130 AIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLID 189
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
CS YGNNGC+GG ++Q+I + G T+D Y PY D C AT TG+ +
Sbjct: 190 CSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSY-PYEAADGPCRFKKEYVGATDTGYTD 248
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ E+ +K A+A GPVSVAIDAS SF Y +GVY + +C+ P+GLDH VL VGYG
Sbjct: 249 LPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECD--PEGLDHGVLVVGYG 306
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G+ YW VKNSW T WG++GY+ MS K+N CG+ + +Y
Sbjct: 307 TELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSMASY 348
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/213 (49%), Positives = 142/213 (66%), Gaps = 11/213 (5%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN GC+GG
Sbjct: 125 KGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGL 184
Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+++Q+I ++GL +++ Y PY+G D CH + A TGFV++ E AL A+
Sbjct: 185 MDQAFQYIKDQNGLDSEESY-PYVGTDDQPCHYDPKYSAANDTGFVDIPSGKEHALMKAI 243
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DGK YW
Sbjct: 244 AAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKKYWI 301
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSWS WG++GYV M+ + N+CG+ TA +Y
Sbjct: 302 VKNSWSENWGDKGYVYMAKDRHNHCGIATAASY 334
>gi|323450461|gb|EGB06342.1| hypothetical protein AURANDRAFT_59223 [Aureococcus anophagefferens]
Length = 539
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWS+GTTGA+EG +++KH L LSQQ L+DCSW GNN CDGG D +Y W+
Sbjct: 338 DQGSCGSCWSYGTTGAIEGQHFLKHGALVELSQQNLMDCSWPQGNNACDGGLDSNAYDWM 397
Query: 211 MK---HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
++ + + YG Y D CH A+ A +TG+VNVT A ALA GP+S
Sbjct: 398 LEVNGGKVAAAEPYGGYKNADGACHAADGF-HAKITGYVNVT--GLPAFNDALANVGPLS 454
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
++IDA+ + F +Y +G YY + C D LDH VLAVG + Y VKNSWST+WG+
Sbjct: 455 ISIDATPEDFYYYKSGFYYSDDCKGGVDDLDHTVLAVGVKVHGDQRYTIVKNSWSTHWGD 514
Query: 328 QGYVLMSIKDNNCGVMTAPTYVTM 351
GYV +S K+N CGV TA TY T+
Sbjct: 515 AGYVYISQKNNCCGVATAATYPTI 538
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 135/212 (63%), Gaps = 4/212 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG ++ K KKL LS+Q L+DCS GNNG
Sbjct: 121 VDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++++ +G + PYLGQD C + A +TGFV++ +E AL
Sbjct: 181 CSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAECSGANVTGFVDIPSMNERALM 240
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GP+SVAIDA SF FY +GVYY+ +C++S LDH VL VGYG + YW V
Sbjct: 241 KAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQ--LDHGVLVVGYGSIGKDEYWIV 298
Query: 318 KNSWSTYWGNQGYVLMS-IKDNNCGVMTAPTY 348
KNSW WG +GYVLM+ ++N+CG+ TA +Y
Sbjct: 299 KNSWGEEWGKKGYVLMAKFRNNHCGIATAASY 330
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/225 (43%), Positives = 143/225 (63%), Gaps = 7/225 (3%)
Query: 127 VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
V +N + D++ R E + +Q +CGSCW+F TGA+EG ++ KL LS+Q L
Sbjct: 124 VPFNVKAPDSVDWR-EHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNL 182
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
+DCS YGN+GC+GG ++++I HG+ T++ Y PY+G++ CH A GF
Sbjct: 183 VDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGF 241
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V++ EDALK+A+A GP+S+AIDA +SF Y GVY+DE+C S + LDH VL VG
Sbjct: 242 VDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVG 299
Query: 306 YG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YG + + YW +KNSW T WG +GYV ++ ++N+CGV T +Y
Sbjct: 300 YGTDPEAGDYWIIKNSWGTKWGEKGYVRIARNRNNHCGVATKASY 344
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T++ Y PY+G D CH + A TGFV++ E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHA 239
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
+ A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335
>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/213 (48%), Positives = 139/213 (65%), Gaps = 7/213 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TGA+EG Y+ K KL LS+Q L+DCS + NNG
Sbjct: 119 VDWRKQGYVTHVKDQKECGSCWAFSATGALEGQYFKKTGKLVSLSEQQLVDCSRKFRNNG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GGE ++Q+I + GL T++ Y Y +D CH + A +G+VNV+P EDAL
Sbjct: 179 CEGGEPHWAFQYIRYNGGLDTEESY-HYEAKDGQCHYNPDSVGAKCSGYVNVSP-FEDAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAID S+ SF Y +GVY + C+N L+HAVLAVGYG +G YW
Sbjct: 237 KEAVATIGPISVAIDISRVSFQLYHSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW + WGN+GY+ M+ KDN CG+ T +Y
Sbjct: 295 VKNSWGSEWGNKGYIKMTRNKDNQCGIATEASY 327
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/205 (49%), Positives = 135/205 (65%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q+I
Sbjct: 97 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 156
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH +A TGFV++ E AL A+A GPVSV
Sbjct: 157 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVSV 215
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA +SF FY +G+YY++ C S + LDH VL VGYG ++DGK YW VKNSWS
Sbjct: 216 AIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 273
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 274 WGDKGYIYMAKDRKNHCGIATAASY 298
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 139/234 (59%), Gaps = 19/234 (8%)
Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKL 178
K+ K+ WV+ + PV+ DQ CGSCW+F TGA+EG ++ K K L
Sbjct: 124 KLPKHVDWVKLGAVT----PVK----------DQGHCGSCWAFSATGALEGLHFRKTKVL 169
Query: 179 AVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTT 237
LS+Q LIDCS GNNGC+GG +++Q++ + G+ T+ Y PY G + C
Sbjct: 170 VSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSY-PYEGNNDVCRYEPEN 228
Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
+ A TG+ +V EDALK A+A GPVSVAIDASQ+SF Y +GVY++ C N P+ L
Sbjct: 229 SGAIDTGYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESL 288
Query: 298 DHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
DH VL VGYG E + YW VKNSW WG GY+ M+ DN CG+ T P++
Sbjct: 289 DHGVLVVGYGTDEETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSF 342
>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
Length = 369
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 132/204 (64%), Gaps = 6/204 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TGA+EG + K +L LS+Q L+DC+ YGN GC+GG ++Q+I
Sbjct: 169 NQGQCGSCWAFSATGALEGQHARKTGQLVSLSEQNLVDCTKKYGNMGCNGGLMDNAFQYI 228
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ + Y PY + CH AT TGF +V ED LKLA+A GPVSVA
Sbjct: 229 KDNEGIDKEMTY-PYKAKAGRCHFKRNDVGATDTGFFDVAEGDEDKLKLAVATQGPVSVA 287
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y +GVY++E+CN P+ LDH VL VGYG + + YW VKNSWST+WG Q
Sbjct: 288 IDAGHRSFQLYKHGVYFEEECN--PEELDHGVLVVGYGTDPEHGDYWIVKNSWSTHWGEQ 345
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ ++NNCG+ + +Y T+
Sbjct: 346 GYIRMAPNRNNNCGIPSHASYPTV 369
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/201 (50%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTGA+EG ++ K L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 141 DQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAYGNNGCNGGLMDNAFKYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C + A GFV++ E L LALA GPVSVA
Sbjct: 201 KDNDGIDTEKTY-PYEAVDDKCRYNPKNSGAEDVGFVDIPAGDEHKLMLALATVGPVSVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ+SF Y +GVYYDE C S + LDH VL VGYG + DG YW VKNSW WG++
Sbjct: 260 IDASQESFQLYSDGVYYDENC--SSENLDHGVLVVGYGTDEDGGDYWLVKNSWGPSWGDE 317
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN+CG+ ++ +Y
Sbjct: 318 GYIKMARNRDNHCGIASSASY 338
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/202 (49%), Positives = 132/202 (65%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TG++EG + K KL LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 133 NQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G+ T+ Y PY G D +CH + GFV++ SE L+ ALA GP+SVA
Sbjct: 193 QDNNGIDTEASY-PYEGIDGHCHYDPKNKGGSDIGFVDIKKGSEKDLQKALATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDAS SF FY +GVY ++KC SP+ LDH VLAVGYG E+ G+ YW VKNSWS WG
Sbjct: 252 IDASHMSFQFYSHGVYSEKKC--SPENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGE 309
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ KDN CG+ ++ +Y
Sbjct: 310 DGYIKMARNKDNMCGIASSASY 331
>gi|440800456|gb|ELR21495.1| cathepsin Llike proteinase [Acanthamoeba castellanii str. Neff]
Length = 557
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 132/200 (66%), Gaps = 8/200 (4%)
Query: 159 WSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY------GNNGCDGGEDFRSYQWIMK 212
W+FGTTG +EGA+++K KL LS+Q ++DC+WGY GN GCDGG + QW++
Sbjct: 359 WTFGTTGTLEGAWFLKTGKLVSLSEQQIVDCAWGYEWSGPSGNLGCDGGYASGAIQWVID 418
Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
+G ++ PYL QDA+C + ++ + +VNV+ SE AL++A+A GPV+VAIDA
Sbjct: 419 NGGIALEESYPYLMQDAWCLKTDRSSGVKVQAYVNVSA-SEAALQMAVATAGPVAVAIDA 477
Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVL 332
S F FY +GV+Y +C N + LDH VLAVGYG +G+ YW VKNSWST+WG GYV
Sbjct: 478 SHPEFLFYSSGVFYLAECKNDANDLDHEVLAVGYGTENGQDYWLVKNSWSTHWGENGYVK 537
Query: 333 MSI-KDNNCGVMTAPTYVTM 351
M+ + NNCG+ T Y +
Sbjct: 538 MARNRGNNCGIATQANYAII 557
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG ++ KL LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 147 NQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 206
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY+G++ CH A GFV++ EDALK+A+A GP+S+A
Sbjct: 207 KDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIA 265
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVY+DE+C S + LDH VL VGYG + + YW +KNSW T WG +
Sbjct: 266 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEK 323
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV ++ ++N+CGV T +Y
Sbjct: 324 GYVRIARNRNNHCGVATKASY 344
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH +A TGF+++ E AL A+A GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSV 250
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DGK YW VKNSWS
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 308
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH +A TGF+++ E AL A+A GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSV 250
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DGK YW VKNSWS
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 308
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG ++ KL LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 152 NQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 211
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY+G++ CH A GFV++ EDALK+A+A GP+S+A
Sbjct: 212 KDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIA 270
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVY+DE+C S + LDH VL VGYG + + YW +KNSW T WG +
Sbjct: 271 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEK 328
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV ++ ++N+CGV T +Y
Sbjct: 329 GYVRIARNRNNHCGVATKASY 349
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T++ Y PY+G D CH + A TGFV++ E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHA 239
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
+ A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 130/204 (63%), Gaps = 6/204 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D CH T AT TGFV++ E+ +K A+A GPVSVA
Sbjct: 199 KDNGGIDTEKSY-PYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF Y GVY + +C+ LDH VL VGYG + G YW VKNSW T WG Q
Sbjct: 258 IDASHESFQLYSEGVYNEPECDEQ--NLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQ 315
Query: 329 GYVLMSIKDNN-CGVMTAPTYVTM 351
GY+ M+ NN CG+ TA +Y T+
Sbjct: 316 GYIKMARNQNNQCGIATASSYPTV 339
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q++
Sbjct: 132 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 191
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH T + TGFV++ E AL A+A GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSV 250
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DGK YW VKNSWS
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEK 308
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333
>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
Length = 376
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TG++EG + L LS+Q L+DCS YGNNGC+GG ++Q+I
Sbjct: 176 NQGMCGSCWAFSATGSLEGQHKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYI 235
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T+ Y PY + CH ++ A TGF+++ ED LK+A+A GP+SVA
Sbjct: 236 KENHGIDTETSY-PYKARQKKCHFQRSSVGADDTGFMDLPEGDEDQLKIAVATQGPISVA 294
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVYY+++C S + LDH VL VGYG + D YW VKNSW T WG Q
Sbjct: 295 IDAGHRSFQLYKTGVYYEKEC--SSEQLDHGVLVVGYGTDPDHGDYWIVKNSWGTTWGEQ 352
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GYV M+ K+N+CG+ T +Y
Sbjct: 353 GYVRMARNKNNHCGIATKASY 373
>gi|57964592|ref|XP_560736.1| Anopheles gambiae str. PEST AGAP012900-PA [Anopheles gambiae str.
PEST]
gi|55246680|gb|EAL42129.1| AGAP012900-PA [Anopheles gambiae str. PEST]
Length = 135
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/134 (64%), Positives = 103/134 (76%)
Query: 218 QDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
+D+YG Y GQD YCH+ N T A +TG+VNVT DA KLAL KHGP+S+AIDA KSF
Sbjct: 2 EDEYGGYFGQDGYCHVQNMTLYAPITGWVNVTSGDPDAFKLALFKHGPLSIAIDAGHKSF 61
Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD 337
SFY NGVYY+ +C N D LDHAVLAVGYG+L+G+ YW +KNSWS YWGN GY LM++KD
Sbjct: 62 SFYSNGVYYEPECKNKLDELDHAVLAVGYGQLNGEDYWLIKNSWSNYWGNDGYALMAVKD 121
Query: 338 NNCGVMTAPTYVTM 351
NNCG+ T TYV M
Sbjct: 122 NNCGLTTDATYVLM 135
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 140/230 (60%), Gaps = 6/230 (2%)
Query: 122 KYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
K S + R + + V + KGY + DQ CGSCW+F TG++EG Y K KL
Sbjct: 65 KGSAFFRLAEGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLVS 124
Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTAT 239
LS+Q L+DCS YGN GC GG ++++I ++ G+ T++ Y PY +D C
Sbjct: 125 LSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESY-PYEAEDGKCRFKPQNIG 183
Query: 240 ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDH 299
A TG+V+VT EDALK A+A GPVSVAIDAS SF Y +GVY + +C S + LDH
Sbjct: 184 AKCTGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELEC--SSEDLDH 241
Query: 300 AVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VLAVGYG +G+ YW VKNSW WG +GY++MS K N CG+ + +Y
Sbjct: 242 GVLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRNKHNQCGIASMASY 291
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 162/290 (55%), Gaps = 22/290 (7%)
Query: 70 FKVNIYRLFFLRPRFHENE---KIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
FK N R+ RF E K+ YN Y ++ + EK S ++ S +
Sbjct: 52 FKENAIRIAKHNDRFASGEVTFKVGYN-QYA--DMHTHEVTEKLNGYRSGLKQASAF--- 105
Query: 127 VRYNKASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL 181
+ AS D+ P V + KG + DQ CGSCWSF TG++EG ++K+K L L
Sbjct: 106 --VHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSL 163
Query: 182 SQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATA 240
S+Q L+DCSW +GN GC+GG +++++ + G+ T++ Y PY +D C
Sbjct: 164 SEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSYGGIDTEESY-PYTAEDGTCLYKAANNAG 222
Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
TG+ +V SE AL+ A+ K GPVSVAIDAS SF Y +G+YY+ C S D LDH
Sbjct: 223 VNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPAC--SSDSLDHG 280
Query: 301 VLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VLAVGYG E K +W VKNSW T WG +GY+ M+ K NNCG+ T +Y
Sbjct: 281 VLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 330
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 134/203 (66%), Gaps = 6/203 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG +++K+ +L LS+Q L+DCS YGN+GC GG ++ +I
Sbjct: 123 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 182
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C + A TG V V ++E+AL+ A++ GP+SVA
Sbjct: 183 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPISVA 240
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVYY++ C SP LDH VLAVGYG K YW VKNSW + WG+ G
Sbjct: 241 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 298
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y+ MS +DNNCG+ + P+Y T+
Sbjct: 299 YIKMSRNRDNNCGIASEPSYPTV 321
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T++ Y PY+G D CH + A TGFV++ E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHA 239
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
+ A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 162/290 (55%), Gaps = 22/290 (7%)
Query: 70 FKVNIYRLFFLRPRFHENE---KIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
FK N R+ RF E K+ YN Y ++ + EK S ++ S +
Sbjct: 52 FKENAIRIAKHNDRFASGEVTFKVGYN-QYA--DMHTHEVTEKLNGYRSGLKQASAF--- 105
Query: 127 VRYNKASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL 181
+ AS D+ P V + KG + DQ CGSCWSF TG++EG ++K+K L L
Sbjct: 106 --VHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSL 163
Query: 182 SQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATA 240
S+Q L+DCSW +GN GC+GG +++++ + G+ T++ Y PY +D C
Sbjct: 164 SEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESY-PYTAEDGTCLYKAANNAG 222
Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
TG+ +V SE AL+ A+ K GPVSVAIDAS SF Y +G+YY+ C S D LDH
Sbjct: 223 VNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPAC--SSDSLDHG 280
Query: 301 VLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VLAVGYG E K +W VKNSW T WG +GY+ M+ K NNCG+ T +Y
Sbjct: 281 VLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 330
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 134/203 (66%), Gaps = 6/203 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG +++K+ +L LS+Q L+DCS YGN+GC GG ++ +I
Sbjct: 122 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 181
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C + A TG V V ++E+AL+ A++ GP+SVA
Sbjct: 182 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPISVA 239
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVYY++ C SP LDH VLAVGYG K YW VKNSW + WG+ G
Sbjct: 240 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 297
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y+ MS +DNNCG+ + P+Y T+
Sbjct: 298 YIKMSRNRDNNCGIASEPSYPTV 320
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/236 (42%), Positives = 147/236 (62%), Gaps = 6/236 (2%)
Query: 115 SEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
S G+KV + +++ D + R + + DQ CGSCW+F TG++EG +++
Sbjct: 91 SAGKKVPRGTMYQPLVDELPDTVDWR-DKGAVTPVKDQKACGSCWAFSATGSLEGQHFLS 149
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHI 233
KL LS+Q L+DCS YGN GC GG ++++I +G+ T++ Y PY ++ C
Sbjct: 150 TGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEESY-PYEAKNGPCRF 208
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
+ AT++ +V++ SED L+ A+A+ GPVSVAIDAS +F FY G+YYDEKC++S
Sbjct: 209 NSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFYSRGIYYDEKCSSS 268
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LDH VLAVGYG D YW VKNSW+ WG+ GY+ MS ++NNCG+ + +Y
Sbjct: 269 F--LDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNRNNNCGIASQASY 322
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 140/218 (64%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+ZG + K KL LS+Q L+DCS GN G
Sbjct: 119 VDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C GG +++Q++ GL ++D Y PYLG D CH + TGFV++ E A
Sbjct: 179 CGGGLMDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKEHA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DG
Sbjct: 238 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASY 333
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 105/228 (46%), Positives = 143/228 (62%), Gaps = 9/228 (3%)
Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V + K+ IP + + KGY + +Q CGSCWSF TG++EG ++ K L LS+
Sbjct: 114 VTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q LIDCS YGNNGC+GG ++++I + GL T+ Y PY +D C + AT
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
GFV++ EDAL ALA GPVS+AIDAS + F FY GV+Y+ +C+++ LDH VL
Sbjct: 233 KGFVDIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290
Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
AVGYG + G YW VKNSW WG+QGY++M+ K NNCGV ++ +Y
Sbjct: 291 AVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASY 338
>gi|310975575|gb|ADP55136.1| truncated cathepsin L-like protein [Miichthys miiuy]
Length = 246
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q++
Sbjct: 42 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 101
Query: 211 M-KHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH +A TGF++V E AL A+A GPVSV
Sbjct: 102 KDNQGLDSEDAY-PYLGTGDQPCHYDPNYNSANDTGFIDVPSGKEHALMKAVAAVGPVSV 160
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDAS +SF FY +G+YY++ C S + LDH VL VGYG ++DGK YW VKNSWS
Sbjct: 161 AIDASHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 218
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 219 WGDKGYIYMAKDRKNHCGIATAASY 243
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/221 (47%), Positives = 144/221 (65%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y GY + + DQ CGSCW+F TTGA+EG + + KL LS+Q L+DCS GN G
Sbjct: 119 IDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T+D Y PYLG D CH + A TGFV++ E A
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDTEDAY-PYLGTDDQDCHYDPKYSAANDTGFVDIPEGKERA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+Y++++C+++ LDH VL VGYG ++DG
Sbjct: 238 LMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTE--LDHGVLVVGYGFEGEDVDG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y M
Sbjct: 296 KKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/226 (44%), Positives = 144/226 (63%), Gaps = 7/226 (3%)
Query: 129 YNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
++ S +IP + + N + DQ +CGSCW+FG+TG++EG + + +L LS+Q
Sbjct: 301 HDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQ 360
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATA-TMT 243
L+DC+ G+ GC GG ++Q++M+ G L T+ +Y PYL Q+ C T + ++T
Sbjct: 361 LVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY-PYLMQNGLCRDRTVTPSGVSIT 419
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
G+VNVT SE AL+ A+A GPV++AIDAS F +Y++GVY + C N D LDH VLA
Sbjct: 420 GYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLA 479
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
+GYG G+ Y+ VKNSWST WG GYV M+ DNN CGV + TY
Sbjct: 480 IGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 146/235 (62%), Gaps = 16/235 (6%)
Query: 116 EGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
E E + +SL + +K A+ PV+ +Q CGSCW+F TG++EG ++ K
Sbjct: 110 EPENIEDFSLPKTVDWRTKGAVTPVK----------NQGQCGSCWAFSATGSLEGQHFRK 159
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHI 233
+ LS+Q L+ CS +GNNGC+GG ++++I + G+ T+ Y PY G D CH
Sbjct: 160 SGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSY-PYNGTDGTCHF 218
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
+T AT +GFV++ SE LK A+A GP+SVAIDAS +SF FY +GVY + +C++
Sbjct: 219 KKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDS- 277
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
+ LDH VL VGYG L+G YW VKNSW T WG++GY+ MS K N CG+ ++ +
Sbjct: 278 -ESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSAS 331
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F +TGA+EG + KL LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 155 NQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 214
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T+D Y PY+G++ CH T A GFV++ E+ALK A+A GP+S+A
Sbjct: 215 KENHGVDTEDSY-PYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIA 273
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVY+DE+C S + LDH VL VGYG + + YW VKNSW WG +
Sbjct: 274 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEK 331
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 332 GYIRIARNRNNHCGVATKASY 352
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 105/213 (49%), Positives = 138/213 (64%), Gaps = 11/213 (5%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN GC+GG
Sbjct: 125 KGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGL 184
Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+++Q++ GL ++D Y PYLG D CH +A TGFV+V E AL A+
Sbjct: 185 MDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDVPSGKERALMKAV 243
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
A GPVSVAIDA +SF FY +G+YY++ C S + LDH VL VGYG ++DGK YW
Sbjct: 244 AAVGPVSVAIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGYEGEDVDGKKYWI 301
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 302 VKNSWSEKWGDKGYIYMAKDRKNHCGIATAASY 334
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)
Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V + K+ +P + + KGY + +Q CGSCWSF TG++EG ++ K L LS+
Sbjct: 114 VTFLKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q LIDCS YGNNGC+GG ++++I + GL T+ Y PY +D C + AT
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
GFV++ EDAL ALA GPVS+AIDAS + F FY GV+Y+ +C+++ LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290
Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
AVGYG + G YW VKNSW WG+QGY++M+ K NNCGV ++ +Y
Sbjct: 291 AVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASY 338
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 134/213 (62%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNG 171
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I + +G+ ++ Y PY +D C AT TGFV++ E+ L
Sbjct: 172 CNGGLMDNAFTYIKENNGIDSEASY-PYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKL 230
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS SF FY GVY + KC+++ LDH VL VGYG GK YW
Sbjct: 231 KEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTE--LDHGVLVVGYGTESGKDYWL 288
Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW+T WG++GY+ MS N CG+ T +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMSRNAKNQCGIATNASY 321
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 100/203 (49%), Positives = 132/203 (65%), Gaps = 6/203 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TGA+EG +Y KL LS+Q LIDCS YGNNGC+GG +++Q
Sbjct: 137 IKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQ 196
Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I HGL T+ Y PY ++ C AT +G+V++ +E LK A+A GPVS
Sbjct: 197 YIKDNHGLDTEISY-PYEAENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVS 255
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG-KPYWQVKNSWSTYWG 326
VAIDAS +SF FY GVYY+ +C S + LDH VL VGYG D + YW VKNSW WG
Sbjct: 256 VAIDASAESFQFYREGVYYEPRC--SSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTWG 313
Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
++GY+ M+ KDN+CG+ ++ +Y
Sbjct: 314 DEGYIKMARNKDNHCGIASSASY 336
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 103/228 (45%), Positives = 145/228 (63%), Gaps = 10/228 (4%)
Query: 126 WVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
W Y S IP V + KGY + +Q CGSCW+F TTG+VEG ++ K L LS
Sbjct: 101 WATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLS 160
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q LIDCS YGNNGC GG ++++I + G+ T+ Y PYLGQ CH +++ A
Sbjct: 161 EQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSY-PYLGQQGSCHFSSSHVGAR 219
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+TG+ ++ SE AL+ A+A GPVSVA+DASQ + FY +GVY + C+++ LDH V
Sbjct: 220 VTGYQDIPQGSEQALQSAVATVGPVSVAVDASQ--WQFYSSGVYDNPYCSSTQ--LDHGV 275
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
L +GYG +G+ YW VKNSW WG +GY++MS K+N CG+ ++ +Y
Sbjct: 276 LVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIASSASY 323
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 141/218 (64%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T++ Y PY+G D CH + A TGFV++ E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHA 239
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
+ A+A GPVSVAIDA +SF FY G+YY+++C S + LDH VL VGYG ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 140/224 (62%), Gaps = 7/224 (3%)
Query: 132 ASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
A+ ++ V + KG + DQ CGSCW+F TTG +EG +++K +L LS+Q L+DC+
Sbjct: 102 AAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCA 161
Query: 191 WG-YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
G Y N GC+GG R+ ++ +G + T+ Y PY +D C + T AT TG+V +
Sbjct: 162 GGSYYNQGCNGGWVERAIMYVRDNGGVDTESSY-PYEARDNTCRFNSNTIGATCTGYVGI 220
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
SE ALK A GP+SVAIDAS +SF Y GVYY+ C++S LDHAVLAVGYG
Sbjct: 221 AQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQ--LDHAVLAVGYGS 278
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
G+ +W VKNSW+T WG GY+ M+ ++NNCG+ T Y T+
Sbjct: 279 EGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYPTV 322
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/201 (48%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 140 DQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 199
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D CH T AT GF ++ E L A+A GPVSVA
Sbjct: 200 KDNGGIDTEKSY-PYEGIDDSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVA 258
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY + +C+ P LDH VL VGYG + +GK YW VKNSW T WG++
Sbjct: 259 IDASHESFQFYSTGVYDEPQCD--PQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDK 316
Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
G++ M+ D+N CG+ TA +Y
Sbjct: 317 GFIKMARNDDNQCGIATASSY 337
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)
Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V + K+ IP V + KGY + +Q CGSCWSF TG++EG ++ K L LS+
Sbjct: 114 VTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q LIDCS YGNNGC+GG ++++I + GL T+ Y PY +D C + AT
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
GFV++ EDAL ALA GPVS+AIDAS + F FY GV+Y+ +C+++ LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290
Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
AVG+G + G YW VKNSW WG++GY++M+ K NNCGV ++ +Y
Sbjct: 291 AVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++ + KL LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D CH AT G+V++ +ED L+ A+A GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF Y GVYY+ C+ S LDH VL VGYG E DG YW VKNSW WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPDCSASQ--LDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ ++NNCG+ T +Y
Sbjct: 314 GYIKMARNRNNNCGIATEASY 334
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 101/213 (47%), Positives = 135/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TTG++EG + K KL LS+Q L+DCS GN G
Sbjct: 124 VDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTEGNMG 183
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG + +Q+++ HG+ ++D Y PY +D CH + +A +TGF +VT E AL
Sbjct: 184 CEGGLMDQGFQYVIDNHGIDSEDCY-PYDAEDETCHYKASCDSAEVTGFTDVTSGDEQAL 242
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
A+A GPVSVAIDAS +SF Y +GVY + +C++S LDH VL VGYG GK YW
Sbjct: 243 MEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSE--LDHGVLVVGYGTDGGKDYWL 300
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG GY+ MS K N CG+ T+ +Y
Sbjct: 301 VKNSWGETWGLSGYIKMSRNKSNQCGIATSASY 333
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 99/207 (47%), Positives = 133/207 (64%), Gaps = 6/207 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +T A+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 134 AVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 193
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY G D CH + AT TGFV++ E+AL A+A GPV
Sbjct: 194 RYIKDNGGIDTEKSY-PYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPV 252
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
SVAIDAS +SF Y GVY + +C+ LDH VL VGYG + G YW VKNSW T W
Sbjct: 253 SVAIDASHESFQLYSEGVYNEPECD--AQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTW 310
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
G+QGY+ M+ +DN CG+ TA +Y T+
Sbjct: 311 GDQGYIKMARNQDNQCGIATASSYPTV 337
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SE LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF +TGA+EG +Y + L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 141 DQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D C A GFV++ E L A+A GPVSVA
Sbjct: 201 KDNRGIDTEKSY-PYEGIDDKCRYNPKNTGADDNGFVDIPSGDEGKLMAAVATVGPVSVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ SF FY +GVY+DE C++S LDH VL VGYG + +G YW VKNSW WG+
Sbjct: 260 IDASQSSFQFYSDGVYFDENCSSSS--LDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDL 317
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN+CG+ TA +Y
Sbjct: 318 GYIKMARNRDNHCGIATAASY 338
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)
Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V + K+ IP V + KGY + +Q CGSCWSF TG++EG ++ K L LS+
Sbjct: 114 VTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q LIDCS YGNNGC+GG ++++I + GL T+ Y PY +D C + AT
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
GFV++ EDAL ALA GPVS+AIDAS + F FY GV+Y+ +C+++ LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290
Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
AVG+G + G YW VKNSW WG++GY++M+ K NNCGV ++ +Y
Sbjct: 291 AVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG +++++
Sbjct: 141 DQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYV 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y Y G D CH + AT GF ++ +E L A+A GPVSVA
Sbjct: 201 KDNGGIDTEKSY-AYEGIDDSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVSVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ+SF FY GVY + C S + LDH VL VGYG E DG YW VKNSW T WG++
Sbjct: 260 IDASQQSFQFYSEGVYDEPNC--SAENLDHGVLVVGYGTEKDGSDYWLVKNSWGTTWGDK 317
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
G++ MS K+N CG+ +A +Y
Sbjct: 318 GFIKMSRNKENQCGIASASSY 338
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SE LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYKAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 102/215 (47%), Positives = 135/215 (62%), Gaps = 9/215 (4%)
Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
+ KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN GC+
Sbjct: 121 WREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCN 180
Query: 200 GGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
GG +++Q+I +G + + PYLG D CH + + A TGFV++ E AL
Sbjct: 181 GGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMK 240
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPY 314
A+ GPVSVAIDA +SF FY +G+YY+ C S + LDH VL VGYG +DGK Y
Sbjct: 241 AVTAVGPVSVAIDAGHESFQFYQSGIYYEADC--SSEDLDHGVLVVGYGYEGENVDGKKY 298
Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
W VKNSWS WGN+GY+ M+ + N+CG+ TA +Y
Sbjct: 299 WIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASY 333
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 136/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN G
Sbjct: 95 VDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKLVSLSEQNLVDCSGEFGNQG 154
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ T+D Y PY +D C AT+TG+ +++ E AL
Sbjct: 155 CNGGLMDDAFKYIKANGGIDTEDSY-PYEARDGKCRFKPADVGATVTGYTDISEGDEGAL 213
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
A+A GP+SVAIDAS +F Y +GVYY+ +C+++ LDH VLAVGYG GK YW
Sbjct: 214 TQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTE--LDHGVLAVGYGTEGGKDYWL 271
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG GY++MS K+N CG+ T+ +Y
Sbjct: 272 VKNSWGEVWGQNGYIMMSRNKNNQCGIATSASY 304
>gi|156367164|ref|XP_001627289.1| predicted protein [Nematostella vectensis]
gi|156214194|gb|EDO35189.1| predicted protein [Nematostella vectensis]
Length = 514
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 119/195 (61%), Gaps = 1/195 (0%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q +CGSC++ GAVEGAY+MK KL LS Q +IDCSWG GN GC GG ++ WI
Sbjct: 321 QGICGSCYALAAVGAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIY 380
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
HG+ + + YGPYLGQ+ C I A + F V + ALK+++A+ GP V+I+
Sbjct: 381 LHGIASAESYGPYLGQEGTCRIEGLRRAAAIDAFAFVPKYNNTALKISVARFGPAVVSIN 440
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
+ S FY G+Y D +C + H+VL VGYG DG+PYW VKNSWST WG GY+
Sbjct: 441 ENPLSLKFYSWGLYDDPECGRDTAAV-HSVLVVGYGVEDGEPYWLVKNSWSTTWGMDGYI 499
Query: 332 LMSIKDNNCGVMTAP 346
++ K N CGV P
Sbjct: 500 KIAWKRNTCGVTRNP 514
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 59/137 (43%), Gaps = 26/137 (18%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G +T+Q + G +G + K+ PM T+ + I C+H G I Q +L +
Sbjct: 45 GTDRTFQRADVGPHGEAFKIVPMYTDEKGGYIGCWHLEGTERTPIVVQPILIIST----- 99
Query: 74 IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
++W + G E+ +G KW ++ +++WV +
Sbjct: 100 ------------------HDWKFAGYEVYHGASTAKWEYRYLAFGLMNAHTIWV---TTA 138
Query: 134 KDAIPVRYEMKGYNSLL 150
PVRYEMKGY++LL
Sbjct: 139 DPPRPVRYEMKGYDNLL 155
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 136/218 (62%), Gaps = 8/218 (3%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P V + KGY + DQ CGSCW+F TTG +EG ++ K KL LS+Q L+DCS
Sbjct: 117 ALPDTVDWRDKGYVTEVKDQKQCGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHS 176
Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
+GNNGC+GG R+ Q+I + G+ T+ Y PY + C A TG+V+V P+
Sbjct: 177 FGNNGCNGGSVKRALQYIQANGGIDTETSY-PYKAKGQRCRYKPDGIGAKCTGYVHVKPS 235
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
+E+ LK A+A GP+SV IDAS+ SF FY +GVY D C+ + LDH LAVGYG +G
Sbjct: 236 NEETLKKAVATLGPISVGIDASRHSFQFYQSGVYDDPDCSKTV--LDHGALAVGYGTENG 293
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW +KNSW WG++GY+ MS K N CG+ + +Y
Sbjct: 294 HDYWLIKNSWGLRWGDKGYIKMSRNKSNQCGIASEASY 331
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SE LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SE LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG +++K+ +L LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT TG+V + SE LK A+A GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVY + +C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+LMS +NN CG+ + +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG+ ++ +Q CGSCW+F +TG++EG ++ K KL LS+Q L+DCS YGNNG
Sbjct: 113 VDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNG 172
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ T+ Y PY +D CH + AT+TG+ +V SE L
Sbjct: 173 CEGGLMDYAFKYIKNNDGIDTEQSY-PYTARDGQCHFKPGSVGATVTGYTDVQRGSEGDL 231
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDA SF Y GVY + C+++ LDH VLAVGYG DGK YW
Sbjct: 232 QSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQ--LDHGVLAVGYGAEDGKDYWL 289
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG GY+ MS KDN CG+ T +Y
Sbjct: 290 VKNSWGEGWGMNGYIKMSRNKDNQCGIATQASY 322
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 136/216 (62%), Gaps = 6/216 (2%)
Query: 136 AIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
A V + KG S + +Q CGSCWSF TG++EG + +K +L LS+Q L+DCS +G
Sbjct: 109 AASVDWRQKGVVSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFG 168
Query: 195 NNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
N+GC GG ++++++ HG+ T+ Y PY +D YC AT T + ++ SE
Sbjct: 169 NHGCKGGIMDDAFRYVISNHGVDTESSY-PYTAKDGYCRFNQNNVGATETSYRDIARGSE 227
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
+L A A+ GP+SVAIDAS +SF FY NGVYY+ C++S LDH VL VGYG G+
Sbjct: 228 SSLTQASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSSSR--LDHGVLVVGYGTEGGQD 285
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
Y+ VKNSW T WG GY++MS + NNCG+ + +Y
Sbjct: 286 YFIVKNSWGTRWGMDGYIMMSRNRRNNCGIASQASY 321
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + + KL LS Q L++CS GN G
Sbjct: 120 VDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPEGNEG 179
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ GL ++D Y PYLG D CH + A TGFV++ +E A
Sbjct: 180 CNGGLMDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPKFSAANDTGFVDIPSGNERA 238
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG ++DG
Sbjct: 239 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFQGEDVDG 296
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K +W VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 297 KKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASY 334
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 127/200 (63%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTG+VEG + K +L LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 135 DQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYI 194
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + G+ T+ Y PY +D C AT++ F ++T SE L+ A+A GPVSVA
Sbjct: 195 ITNKGIDTEASY-PYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS+ SF Y +GVY ++KC S LDH VLA GYG +G PYW VKNSW + WG G
Sbjct: 254 IDASKNSFQLYTSGVYNEKKC--SSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAG 311
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+ MS NN CG+ T+ +Y
Sbjct: 312 YIWMSRNANNQCGIATSASY 331
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 137/213 (64%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTG++EG ++ K KL LS+Q L+DCS YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNG 171
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I ++ G+ ++ Y PY +D C + AT TGFV++ +E+ L
Sbjct: 172 CNGGLMDNAFTYIKENKGIDSEASY-PYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKL 230
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS +SF FY +GVY + C+++ LDH VL VGYG GK YW
Sbjct: 231 KEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTE--LDHGVLVVGYGTESGKDYWL 288
Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW+T WG++GY+ M N CG+ T +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASY 321
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/285 (37%), Positives = 159/285 (55%), Gaps = 18/285 (6%)
Query: 79 FLRPRFHENEKI--RYNWTYIGEELVNGIILEKWRLVTSEG----------EKVSKYSLW 126
+ R F +N+K +N Y E+ + + K+ +T E + + S++
Sbjct: 39 YRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRSAPVSVF 98
Query: 127 VRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+ A V + KG + DQ CGSCW+F TTG++EG +++K L L++Q
Sbjct: 99 YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQ 158
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DCS YG GC+GG ++ +I +G+ T+ Y PY +D C + + AT +G
Sbjct: 159 LVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASY-PYEARDGSCRFDSNSVAATCSG 217
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
N+ SE L+ A+ GP+SV IDA+ SF FY +GVYY+ C SP LDHAVLAV
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAV 275
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GYG G+ +W VKNSW+T WG+ GY+ MS ++NNCG+ T +Y
Sbjct: 276 GYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASY 320
>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 340
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 3/213 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY N + +Q CGSCW+F TTGA+EG + K L LS+Q L+DC+ YGN
Sbjct: 125 VDWRTKGYVNPVKNQGQCGSCWAFSTTGALEGQTFRKTGVLPDLSEQNLVDCTQSYGNEA 184
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ ++ Y Y YC+ A+ TGFV++ EDAL
Sbjct: 185 CNGGWMDNAFKYISDNKGIDSEAGYPYYAKALGYCYYNQQFNVASDTGFVDIASGDEDAL 244
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K+A+A GP+SVAIDA++ SF Y +GVYY+ C N + LDHAVL VGYG DG+ +W
Sbjct: 245 KVAVATVGPISVAIDATKDSFMRYQSGVYYEPTCGNGLENLDHAVLVVGYGTEDGRDFWL 304
Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW WG+QGY+ MS N CG+ T +Y
Sbjct: 305 VKNSWDITWGDQGYIKMSRNMSNQCGIATKASY 337
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 144/234 (61%), Gaps = 9/234 (3%)
Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
K +K S ++ N K V + KGY + DQ CGSCW+F TG++EG ++ K K
Sbjct: 102 KAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGK 161
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
L LS+Q L+DCS N GC+GG R++Q+I+ G + T++ Y PY+ D CH
Sbjct: 162 LVSLSEQNLVDCS--DKNYGCNGGLMDRAFQYIIDAGGIDTEESY-PYIAMDGNCHFKTA 218
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
AT+TG+ +VT SE AL+ A+A GP+SVAIDAS SF Y +GVY + C+++
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTL-- 276
Query: 297 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LDH VLAVGYG +DG YW VKNSW+ WG GY+ MS KDN CG+ T +Y
Sbjct: 277 LDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASY 330
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 143/221 (64%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y GY + + DQ CGSCW+F TTGA+EG + + KL LS+Q L+DCS GN G
Sbjct: 119 IDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL T+D Y PYLG D CH + A TGFV++ E A
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDTEDAY-PYLGTDDQDCHYDPKYSAANDTGFVDIPEGKERA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA + F FY +G+Y++++C+++ LDH VL VGYG ++DG
Sbjct: 238 LMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTE--LDHGVLVVGYGFEGEDVDG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y M
Sbjct: 296 KKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 134/213 (62%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TG++EG + K KL LS+Q L+DCS YGN G
Sbjct: 122 VDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I G+ T++ Y PY +D C AT TG+V+V+ EDAL
Sbjct: 182 CGGGLMDDAFRYIQATGGIDTEESY-PYEAEDGECRYKPDAVGATCTGYVDVSSGDEDAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SV IDAS SF Y +G+Y + +C++S LDH VLAVGYG +G+ YW
Sbjct: 241 QEAVATIGPISVGIDASHISFQLYESGLYDEPQCSSSE--LDHGVLAVGYGSENGQDYWL 298
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG+QGY+ MS K N CG+ TA +Y
Sbjct: 299 VKNSWGLTWGDQGYIKMSKNKSNQCGIATAASY 331
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/214 (48%), Positives = 132/214 (61%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TTG++EG ++ + KL LS+Q L+DCS +GN G
Sbjct: 147 VDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEG 206
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I GL +DDY PY + CH+ + A TG +V EDAL
Sbjct: 207 CNGGLMDNAFEYIKSIGGLEGEDDY-PYTAKQGKCHLKKSLFKANDTGCTDVESGDEDAL 265
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
K ALA GP+SVAIDAS SF Y GVY +E+C S LDH VL VGYG E +G YW
Sbjct: 266 KDALASVGPISVAIDASHASFQSYDGGVYDEEEC--SSQNLDHGVLTVGYGTEENGGDYW 323
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG +GY+ MS KDN CG+ T +Y
Sbjct: 324 LVKNSWGEMWGEEGYIKMSRNKDNQCGIATQASY 357
>gi|308474437|ref|XP_003099440.1| CRE-CPL-1 protein [Caenorhabditis remanei]
gi|308266846|gb|EFP10799.1| CRE-CPL-1 protein [Caenorhabditis remanei]
Length = 337
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K KL LS+Q L+DCS YGN+GC+GG +++++I
Sbjct: 137 NQGMCGSCWAFSATGALEGQHARKLGKLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 196
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T+D Y PY G+D CH + A G+ ++ E+ LK+A+A GP+S+A
Sbjct: 197 RDNHGVDTEDSY-PYKGRDMKCHFSKKDVGADDKGYTDLPEGDEEQLKIAVATQGPISIA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW T WG +
Sbjct: 256 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 314 GYIRIARNRNNHCGVATKASY 334
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/217 (46%), Positives = 135/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG +Y K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY + D CH +A TGFV+V E L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA +SF FY +G+YY+ +C S + LDH VL VGYG ++DGK
Sbjct: 238 MKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVDGK 295
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSWS WG+ GY+ + + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASY 332
>gi|391328503|ref|XP_003738728.1| PREDICTED: digestive cysteine proteinase 3-like [Metaseiulus
occidentalis]
Length = 506
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 98/214 (45%), Positives = 136/214 (63%), Gaps = 6/214 (2%)
Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
Y ++GY + DQ CGSCW+F TTG++EG ++ KL LS+Q L+DCS GNNGC+
Sbjct: 296 YWLEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGDEGNNGCE 355
Query: 200 GGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
GG + + +I + G+ T++ Y PY +D C + A +TGFV++ SE AL+
Sbjct: 356 GGLMDQGFTYIKNNGGIDTEESY-PYNAEDGDCAFKSNAVGARVTGFVDIDSGSEKALQK 414
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
A+A GPVSVAIDAS SF Y G+Y + C+++ LDH VLAVGYG +G YW VK
Sbjct: 415 AVATVGPVSVAIDASNDSFQLYKEGIYDEPACSSTQ--LDHGVLAVGYGSENGVDYWLVK 472
Query: 319 NSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
NSW+T WG GY+ M+ KDN CG+ + +Y T+
Sbjct: 473 NSWNTVWGQDGYIKMARNKDNQCGIASQASYPTV 506
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 109/179 (60%), Gaps = 7/179 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y G+ + +Q +CGSCW+F TG++EG +++ L LS+Q L+DCS N G
Sbjct: 124 VDYRKSGHVTPVKNQGLCGSCWAFSATGSLEGQLSIQNGTLVSLSEQNLLDCS--RENQG 181
Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG +++++I K+G + T++ Y PY G+ C A +TG V+V E AL
Sbjct: 182 CDGGYMDKAFEYIKKNGGIDTEESY-PYTGRKGKCMFKKKNIGARVTGHVDVPAEDEQAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
KLA+AK GP+SV IDAS+ SF FY G+Y + C+ S LDH VL VGYG GK YW
Sbjct: 241 KLAVAKIGPISVGIDASKDSFRFYKEGIYDESSCSTSQ--LDHGVLVVGYGSEKGKDYW 297
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 134/216 (62%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TG++EG + K KL LS+Q L+DCS GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHG 177
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I +G+ T+ Y PY +D C + AT TGFV++ E+AL
Sbjct: 178 CEGGLMDDAFTYIKANNGIDTEASY-PYKARDGKCEFKSADVGATDTGFVDIKTKDEEAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS SF Y GVY+D C+ + LDH VLAVGYG D K YW
Sbjct: 237 KQAVATVGPISVAIDASHMSFQLYRTGVYHDWFCSQTK--LDHGVLAVGYGTEDSKDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW WG +GY+ MS + NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIQMSRNRRNNCGIATSASYPTV 330
>gi|268560858|ref|XP_002638172.1| C. briggsae CBR-CPL-1 protein [Caenorhabditis briggsae]
Length = 336
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K +L LS+Q L+DCS YGN+GC+GG +++++I
Sbjct: 136 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 195
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY G+D CH T A G+V+ E+ LK+A+A GP+S+A
Sbjct: 196 RDNHGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIA 254
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW T WG +
Sbjct: 255 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 312
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 313 GYIRIARNRNNHCGVATKASY 333
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 107/285 (37%), Positives = 159/285 (55%), Gaps = 18/285 (6%)
Query: 79 FLRPRFHENEKI--RYNWTYIGEELVNGIILEKWRLVTSEG----------EKVSKYSLW 126
+ R F +N+K +N Y E+ + + K+ +T E + + S++
Sbjct: 39 YRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRSAPVSVF 98
Query: 127 VRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+ A V + KG + DQ CGSCW+F TTG++EG +++K L L++Q
Sbjct: 99 YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQ 158
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DCS YG GC+GG ++ +I +G+ T+ Y PY +D C + + AT +G
Sbjct: 159 LVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY-PYEARDGSCRFDSNSVAATCSG 217
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
N+ SE L+ A+ GP+SV IDA+ SF FY +GVYY+ C SP LDHAVLAV
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAV 275
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GYG G+ +W VKNSW+T WG+ GY+ MS ++NNCG+ T +Y
Sbjct: 276 GYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASY 320
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 141/218 (64%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 120 LDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEG 179
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I +GL +++ Y PYLG D CH A TGFV++ E A
Sbjct: 180 CNGGLMDQAFQYIKDNNGLDSEEAY-PYLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHA 238
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+Y++++C S + LDH VL VGYG ++DG
Sbjct: 239 LMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKEC--SSEELDHGVLVVGYGFEGEDVDG 296
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 297 KKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASY 334
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/201 (45%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F +TGA+EG + KL LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 154 NQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T+D Y PY+G++ CH A GFV++ E+ALK A+A GP+S+A
Sbjct: 214 KENHGVDTEDSY-PYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIA 272
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVY+DE+C S + LDH VL VGYG + + YW VKNSW WG +
Sbjct: 273 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEK 330
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 331 GYIRIARNRNNHCGVATKASY 351
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/217 (46%), Positives = 136/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG +Y K+ LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY + D CH +A TGFV+VT SE L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSESEKDL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVA+DA +SF FY +G+YY+ +C S + LDH VL VGYG + DGK
Sbjct: 238 MNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVLVVGYGFEGEDEDGK 295
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WGN GY+ ++ + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASY 332
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 138/221 (62%), Gaps = 8/221 (3%)
Query: 133 SKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
SK A+P V + KGY + DQ CGSCW+F TTGA+EG ++ + KL LS+Q L+DC
Sbjct: 113 SKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDC 172
Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
S GN GC+GG +++++I + +G+ T+D Y PY D C AT TGF ++
Sbjct: 173 SGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSY-PYEAVDNQCRFKAANVGATDTGFTDI 231
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T E AL+ A+A GP+SVAIDA SF Y +GVY + C+ + LDH VLAVGYG
Sbjct: 232 TSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTR--LDHGVLAVGYGT 289
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GK YW VKNSW WG++GY+ M+ K N CG+ TA +Y
Sbjct: 290 DSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAASY 330
>gi|341878328|gb|EGT34263.1| CBN-CPL-1 protein [Caenorhabditis brenneri]
Length = 336
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K +L LS+Q L+DCS YGN+GC+GG +++++I
Sbjct: 136 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 195
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY G+D CH T A G+V+ E+ LK+A+A GP+S+A
Sbjct: 196 RDNHGVDTEESY-PYKGRDMKCHFNKKTIGADDKGYVDTPEGDEEQLKIAVATQGPISIA 254
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW T WG +
Sbjct: 255 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 312
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 313 GYIRIARNRNNHCGVATKASY 333
>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
Length = 531
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 132/200 (66%), Gaps = 3/200 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ VCGSCW+FG+TG++EG + + L LS+Q L+DC++ G+ GC+GG ++Q+I
Sbjct: 326 DQGVCGSCWTFGSTGSLEGTNCVTNGYLVSLSEQQLVDCAYLMGSQGCNGGFAASAFQYI 385
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVSV 268
M G + T+ DY YL Q+A C +TT + ++ +VNVT S +AL A+A GPV++
Sbjct: 386 MDAGGIATESDY-QYLMQNALCKDKSTTFSGVGVSSYVNVTAGSINALLNAVATQGPVAI 444
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AIDAS F +Y +G+Y + C N PD LDH VLA+GYG L+G YW VKNSWST WG +
Sbjct: 445 AIDASVDDFRYYQSGIYSNPSCKNGPDDLDHEVLAIGYGTLNGVDYWLVKNSWSTNWGME 504
Query: 329 GYVLMSIKDNNCGVMTAPTY 348
GY ++ +N CG + TY
Sbjct: 505 GYFMLERANNLCGPASQATY 524
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 98/201 (48%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++ K KL LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 138 DQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYI 197
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PYL +D CH + AT GFV++ +ED LK A+A GPVS+A
Sbjct: 198 KDNGGIDTEKSY-PYLAEDEKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIA 256
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS ++F Y +GVY D +C S LDH VL VGYG DG+ YW VKNSW WG
Sbjct: 257 IDASHETFQLYSDGVYSDPEC--SSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLN 314
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN CGV + +Y
Sbjct: 315 GYIKMARNQDNMCGVASQASY 335
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/258 (42%), Positives = 155/258 (60%), Gaps = 13/258 (5%)
Query: 98 GEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCG 156
GEE V ++ + S + K+ + ++ + D++ R E GY + DQ CG
Sbjct: 84 GEEFVQ--MMNGLKFDASRERQGIKFLSYAKFQ--APDSVDWRDE--GYVTPVKDQGQCG 137
Query: 157 SCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GL 215
SCW+F TTG++EG ++ L LS+Q L+DCS YGNNGC+GG ++Q+I + G+
Sbjct: 138 SCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGI 197
Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
T+D Y PY +D C + AT +G+V+V EDALK A A +GP+SVAIDAS +
Sbjct: 198 DTEDKY-PYEAEDDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHE 256
Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMS 334
SF Y +GVY +E C++ LDH VL VGYG + G YW VKNSW WG +GY+ MS
Sbjct: 257 SFQLYESGVYDEESCSSIE--LDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMS 314
Query: 335 I-KDNNCGVMTAPTYVTM 351
KDN CG+ T+ +Y T+
Sbjct: 315 RNKDNQCGIATSASYPTV 332
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/277 (40%), Positives = 152/277 (54%), Gaps = 14/277 (5%)
Query: 84 FHENEKIRYNWTYIGEELVNGII-------LEKWRLVTSEGEKVSKYSLWVRYNKASKDA 136
F +N W G + + G+ E R+ V K +L + SK
Sbjct: 55 FQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDVSKAP 114
Query: 137 IPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
V + G ++ +Q CG C+SF TTG+VEG + + K+L LS+Q ++DCS GN
Sbjct: 115 ASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGN 174
Query: 196 NGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
NGCDGG S+++I+ GL T+ Y PY G C AT+TG+ NV SE
Sbjct: 175 NGCDGGLMTNSFEYIIAVGGLDTEASY-PYEGVVGKCKFNKANIGATITGYKNVKSGSES 233
Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
L+ A+A PVSVAIDASQ SF Y +GVYY+ C+++ LDH VLAVGYG G+ Y
Sbjct: 234 DLQTAVAAQ-PVSVAIDASQNSFQLYSSGVYYEPACSSTQ--LDHGVLAVGYGSQSGQDY 290
Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
W VKNSW WG +G++LM+ K NNCG+ T +Y T
Sbjct: 291 WIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPT 327
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/217 (46%), Positives = 136/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG +Y K+ LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY + D CH +A TGFV+VT SE L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSGSEKDL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVA+DA +SF FY +G+YY+ +C S + LDH VL VGYG + DGK
Sbjct: 238 MNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVLVVGYGFEGEDEDGK 295
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WGN GY+ ++ + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASY 332
>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 691
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 131/216 (60%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTG++EG + KL S+Q L+DCS YGN G
Sbjct: 479 VDWRTKGYVTEVKDQGACGSCWAFSTTGSMEGQSFKNTGKLVSFSEQQLVDCSGSYGNMG 538
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++ +I +G+ + DY PY +D C + A AT TG+ ++ E AL+
Sbjct: 539 CGGGLMDQAFAYIEDYGIEPEADY-PYTAKDDPCSYDTSKAVATNTGYTDIATMDEKALQ 597
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQ 316
A+A GP+SVAIDAS SF Y +GVY + C+ + LDH VLAVGYG DG YW
Sbjct: 598 QAVATVGPISVAIDASHSSFRLYKSGVYDEPACSQTM--LDHGVLAVGYGTTDDGNDYWI 655
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW + WGNQGY+ MS DN CG+ T +Y M
Sbjct: 656 VKNSWGSTWGNQGYIHMSRNNDNQCGIATNASYPLM 691
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 131/205 (63%), Gaps = 7/205 (3%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F TTG+VEG Y++K+KKL S+Q L+DCS + N GC+GG ++
Sbjct: 121 AVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAF 180
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T+D Y PY D C T A ++ F +V SED LKLA+A+ GP+
Sbjct: 181 KYLIANKGIATEDTY-PYTATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPI 239
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--PYWQVKNSWSTY 324
SVAIDAS F FY GVY DE+C S LDH VLAVGYG G YW VKNSWS
Sbjct: 240 SVAIDASSGDFQFYKKGVYVDEEC--SSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSAS 297
Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
WG+QGY+ M+ N CG+ + +Y
Sbjct: 298 WGDQGYIKMARNHKNMCGIASLASY 322
>gi|395514296|ref|XP_003761355.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 262
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 136/205 (66%), Gaps = 7/205 (3%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ D+ CGSCW+F TG++EG ++ K KL LS+Q L+DCS GN+GC GG ++
Sbjct: 58 SIKDKGQCGSCWAFSATGSLEGQWFHKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAF 117
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+++ K+ G+ T++ Y PY+G+D CH + + A +TG+V++ E AL A+A GP+
Sbjct: 118 EYVKKNGGIDTEESY-PYVGKDGTCHYNSQCSGANVTGYVDIPAGVERALAKAVATVGPI 176
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTY 324
SVAIDA SF FY +GVYY+ +C S + LDH VL VG+G +GK YW VKNSW
Sbjct: 177 SVAIDAGHSSFQFYRSGVYYEPEC--SSEELDHGVLVVGFGVEGKNGKKYWIVKNSWGEE 234
Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
WG++GYVLM+ NN CG+ TA +Y
Sbjct: 235 WGDRGYVLMTRDHNNHCGIATAASY 259
>gi|195995655|ref|XP_002107696.1| hypothetical protein TRIADDRAFT_19901 [Trichoplax adhaerens]
gi|190588472|gb|EDV28494.1| hypothetical protein TRIADDRAFT_19901 [Trichoplax adhaerens]
Length = 569
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 145/252 (57%), Gaps = 15/252 (5%)
Query: 115 SEGEKVSKYSLWVRYNKASKDAIPVRYE------------MKGYNSLLD-QSVCGSCWSF 161
+E E + R+ A+K+ IP +Y+ KG + + Q+ CGSC++F
Sbjct: 313 TEKELTNMRGRLRRFYHATKNYIPWKYDPHEKPLESIDWRKKGAVTPIKYQAHCGSCFAF 372
Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
+E A + + +L LS+Q++IDCSWGYGN C+GG ++ +YQ++ KHG+P+ Y
Sbjct: 373 SAVTPIESALFRQTGRLIPLSEQSIIDCSWGYGNYACNGGMEYAAYQFVTKHGIPSAKCY 432
Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSE-DALKLALAKHGPVSVAIDASQKSFSFY 280
GPYL YCH+ N T + G+ V P+ A++LA++K GP+S+A + SF FY
Sbjct: 433 GPYLASPGYCHLLNCTHRVRLDGYSIVLPSGNVSAVRLAVSKIGPISIAFNVHSLSFHFY 492
Query: 281 VNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM-SIKDNN 339
+G++ D C N ++HA+ VGYG+L + YW ++NSWS +WG GY+ M S N
Sbjct: 493 SSGIFEDPTCYNDIHHINHALNLVGYGKLGNEHYWIIRNSWSEHWGENGYMRMKSTNANE 552
Query: 340 CGVMTAPTYVTM 351
C V T T+ +
Sbjct: 553 CAVATFATFPVL 564
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 4/63 (6%)
Query: 14 GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
G+ T +G G + K+ P+TTET+ N SC +G P PQ+ +P + VN
Sbjct: 82 GVSTTIIRRDDGPFGSAYKIMPITTETQYNVKSCVAIHGRPRHPYRPQTAIPSL----VN 137
Query: 74 IYR 76
I+R
Sbjct: 138 IFR 140
>gi|405966500|gb|EKC31778.1| Cathepsin L [Crassostrea gigas]
Length = 271
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 133/213 (62%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCWSF TG++EG ++ KKL LS+Q L+DCS GN+G
Sbjct: 59 VDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSQREGNHG 118
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I + G+ T++ Y PY ++ +CH AT TG+V++ ED L
Sbjct: 119 CQGGLMDNAFRYIESNKGIDTEESY-PYTAKNGFCHFKKENVGATDTGYVDIPHMQEDKL 177
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDA KSF Y GVY + C++S LDH VLAVGYG G YW
Sbjct: 178 QEAVATVGPISVAIDAGHKSFQLYREGVYSEPACSSSK--LDHGVLAVGYGTESGDDYWL 235
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG QGYV+M+ K N CG+ T +Y
Sbjct: 236 VKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 268
>gi|124487918|gb|ABN12042.1| putative cathepsin L precursor [Maconellicoccus hirsutus]
Length = 211
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 129/204 (63%), Gaps = 1/204 (0%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSC++F TTG++EG + K L LS+Q +IDCS YGN GC+GG ++
Sbjct: 8 VKDQGDCGSCYAFSTTGSIEGQQFRKSGTLKSLSEQQIIDCSVKYGNGGCEGGVMENAFN 67
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+++ +G + PY+ ++ C + A + F + E+ LKLA+AK GP+S+
Sbjct: 68 YVIDNGGIDSEGSYPYIDRETQCAYKPENSAANIKDFATLPVGDEEMLKLAVAKVGPISI 127
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AI+ S +SF Y +GVYYD+ C + PD L HAVL VGYG DGK YW VKNSW+T WG
Sbjct: 128 AINTSPRSFKLYKSGVYYDKDCKSDPDDLTHAVLVVGYGTEDGKDYWLVKNSWNTDWGEN 187
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ K+N+CG+ + TY T+
Sbjct: 188 GYIKMARNKNNHCGIASYATYPTV 211
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 136/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCW+F TTG++EG ++ K KL LS+Q LIDCS GN+G
Sbjct: 113 VDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDG 172
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I + +G+ T+ Y PY G+D C T A TG++++ SED L
Sbjct: 173 CGGGFMDDAFEYIKLNNGIDTEASY-PYEGRDDICRYKKTNKGAIDTGYMDIKQYSEDDL 231
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS KSF Y GVY++ +C+ + LDH VL VGYG +G+ YW
Sbjct: 232 KAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTV--LDHGVLVVGYGTENGEDYWL 289
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG GY+ MS + NNCG+ T +Y
Sbjct: 290 VKNSWGTDWGMNGYIKMSRNRSNNCGIATNASY 322
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 133/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 687 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDCV--SENDG 744
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 745 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 803
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG GK +W
Sbjct: 804 KKAVARVGPISVAIDASLSSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQKGKKHWI 861
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 862 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 897
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/210 (45%), Positives = 133/210 (63%), Gaps = 6/210 (2%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG
Sbjct: 130 EKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGG 189
Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
++++I + G+ T+ Y PY G D CH + AT GF ++ +E + A+
Sbjct: 190 LMDNAFRYIKDNGGIDTEKSY-PYEGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAV 248
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
A GPVSVAIDAS +SF FY G+Y + +CN+ LDH VL VGYG + GK YW VKN
Sbjct: 249 ATIGPVSVAIDASHESFQFYSEGIYNEPECNSQ--NLDHGVLVVGYGTDESGKDYWLVKN 306
Query: 320 SWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
SW T WG++G++ M+ +DN CG+ +A +Y
Sbjct: 307 SWGTTWGDKGFIKMARNEDNQCGIASASSY 336
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 133/204 (65%), Gaps = 5/204 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ +Q CGSCWSF +TG++EG +++ L LS+Q L+DCS YGN+GC+GG S++
Sbjct: 124 IKNQGQCGSCWSFSSTGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFR 183
Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
++ G T+D+Y PY ++ C ++ A T +V++ ED+LK A+A GP+S
Sbjct: 184 YLKSVAGDETEDNY-PYTAENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPIS 242
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDAS SF Y +GVYY C+++ LDH VLA+GYG DGK YW VKNSW T WG
Sbjct: 243 VAIDASHSSFQLYNSGVYYASTCSSTQ--LDHGVLAIGYGTEDGKDYWLVKNSWGTSWGM 300
Query: 328 QGYVLMSI-KDNNCGVMTAPTYVT 350
+GY+ MS ++NNCG+ T +Y T
Sbjct: 301 EGYIKMSRNRNNNCGIATQASYPT 324
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 101/213 (47%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TG++EG ++ K L LS+Q L+DCS YGN G
Sbjct: 122 VDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q+I + G+ T++ Y PY ++ C AT TG+ V+ EDAL
Sbjct: 182 CMGGLMDYAFQYIQANGGIDTEESY-PYEAENGKCRYNPDNIGATSTGYTEVSQGDEDAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SV IDASQ SF FY +GVY + C++ LDH VLAVGYG DG YW
Sbjct: 241 KEAVATIGPISVGIDASQMSFQFYESGVYNEPDCSSLE--LDHGVLAVGYGTEDGNDYWL 298
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG++GY+ MS K N CG+ TA +Y
Sbjct: 299 VKNSWGLEWGDKGYIKMSRNKSNQCGIATAASY 331
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 99/205 (48%), Positives = 136/205 (66%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+EG + K +L LS+Q L+DCSW GN GC+GG ++Q+I
Sbjct: 184 NQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQGNQGCNGGIVDFAFQYI 243
Query: 211 MKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+++ G+ ++D Y PY +D A C ATA +TGFV++ P+SE+AL A+A GPVSV
Sbjct: 244 LENRGIDSEDCY-PYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVSV 302
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA SF FY +G++Y+ KC S + L+HAVL VGYG + GK YW VKNSW
Sbjct: 303 AIDAHPTSFRFYQSGIFYEPKC--SSERLNHAVLVVGYGYEGEDEAGKKYWIVKNSWGKQ 360
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG+ GY +S + N+CG+ T +Y
Sbjct: 361 WGDHGYFYLSKDRGNHCGIATTASY 385
>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
Length = 823
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/202 (47%), Positives = 128/202 (63%), Gaps = 5/202 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
L+ + CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN+GC+GG +++
Sbjct: 622 LVAKGQCGSCWAFSTTGSLEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFE 681
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I G+ + DY PYL +D C + AT TG+V++ E+ALK A+A GP+S
Sbjct: 682 YIKAAPGIEGEMDY-PYLAKDGRCMFDQSKVVATDTGYVDIPSMDENALKEAVATIGPIS 740
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDA SF Y +GVY + C S + LDH VLAVGYG DG+ YW VKNSW WG
Sbjct: 741 VAIDAGHPSFQMYKSGVYNEPGC--SSERLDHGVLAVGYGTEDGQDYWLVKNSWGDSWGQ 798
Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
GY++MS NN CG+ T +Y
Sbjct: 799 AGYIMMSRNMNNQCGIATQASY 820
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/202 (48%), Positives = 127/202 (62%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW- 209
DQ CGSCW+F TG+VEG ++K KKL LS+Q L+DCS GN GC GG ++++
Sbjct: 125 DQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYF 184
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I G+ + Y PY +D C + + AT++ F +V ED LK+A+A GPVSVA
Sbjct: 185 IANKGIANEKSY-PYTAKDNDCKYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVA 243
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDAS F FY +GVYYDE C S + LDH VLAVGYG + G +W VKNSW+ WG
Sbjct: 244 IDASSSKFQFYESGVYYDENC--SSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGL 301
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ KDNNCG+ T +Y
Sbjct: 302 NGYIKMARNKDNNCGIATMASY 323
>gi|392922426|ref|NP_001256718.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
gi|3879367|emb|CAB07275.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
Length = 337
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 92/201 (45%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K +L LS+Q L+DCS YGN+GC+GG +++++I
Sbjct: 137 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 196
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY G+D CH T A G+V+ E+ LK+A+A GP+S+A
Sbjct: 197 RDNHGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA +SF Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW WG +
Sbjct: 256 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEK 313
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 314 GYIRIARNRNNHCGVATKASY 334
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EG +Y K +L LS+Q LIDCS YGNNGC+GG ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226
Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+ G+ ++ Y PY+ D C +T A +TG++N+ E AL A+A GP
Sbjct: 227 RDNEGIDSEISY-PYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGP 285
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVAI+A SFS Y +G+Y D +C ++ + LDH VL VGYG DGKPYW +KNSW W
Sbjct: 286 VSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++GYV ++ N CGV +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCGVASAASY 369
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 110/255 (43%), Positives = 149/255 (58%), Gaps = 10/255 (3%)
Query: 99 EELVNGIIL-EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCG 156
+ +NG L K +L S+ K + N D++ R E GY + DQ CG
Sbjct: 104 RQRMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKE--GYVTKVKDQGSCG 161
Query: 157 SCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGL 215
SCW+F TG++EG +Y + KL LS+Q L+DC + GC+GG ++Q++ G+
Sbjct: 162 SCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGI 221
Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
T+ Y PY G+D C + AT TGFV++ +E L+ A+A GPVSVAIDA+
Sbjct: 222 DTEASY-PYKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASF 280
Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMS 334
F FY +GVYYD C SP+ LDH VLAVGY DGK Y+ VKNSWS WG+ GY+LMS
Sbjct: 281 KFQFYSHGVYYDRSC--SPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMS 338
Query: 335 -IKDNNCGVMTAPTY 348
K+NNCG+ T +Y
Sbjct: 339 RRKNNNCGIATMASY 353
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTGA+EG ++ + +L LS+Q L++CS GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY+G D CH A TGFV++ E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+Y++ +C+++ LDH VL VGYG + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG GY+LM+ KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 137/217 (63%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K K+L LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ + GL +++ Y PY QD C + A TGF+++ P E++L
Sbjct: 178 CSGGLMDYAFQYVKDNGGLDSEESY-PYRAQDESCKYKPEQSAANDTGFMDIHPE-EESL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
KLA+A GP+S AIDAS +F FY G+YYD C S + LDH +L VGYG + + +
Sbjct: 236 KLAVATVGPISAAIDASLSTFQFYHKGIYYDPDC--SSENLDHGILVVGYGSQGEDSEKQ 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW T WG QGY+LM+ +DN+CG+ TA ++
Sbjct: 294 KYWIVKNSWGTDWGTQGYILMAKDRDNHCGIATAASF 330
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTGA+EG ++ + +L LS+Q L++CS GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY+G D CH A TGFV++ E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+Y++ +C+++ LDH VL VGYG + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG GY+LM+ KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTGA+EG ++ + +L LS+Q L++CS GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY+G D CH A TGFV++ E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+Y++ +C+++ LDH VL VGYG + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG GY+LM+ KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 127/202 (62%), Gaps = 5/202 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTG+VEG + K +L LS+Q L+DCS GN GC+GG +++Q
Sbjct: 123 IKDQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQ 182
Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I+ +G+ T+ Y PY QD C + AT+ + ++ SE L+ A+A GP+S
Sbjct: 183 YIISNNGIDTESSY-PYTAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPIS 241
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDASQ SF FY +GVY + C++S LDH VLAVGYG YW VKNSW T WG
Sbjct: 242 VAIDASQPSFQFYSSGVYNEPACSSSQ--LDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQ 299
Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
GY+ M+ NN CG+ TA +Y
Sbjct: 300 SGYIWMTRNSNNQCGIATAASY 321
>gi|348531515|ref|XP_003453254.1| PREDICTED: cathepsin L2-like [Oreochromis niloticus]
Length = 333
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 99/228 (43%), Positives = 142/228 (62%), Gaps = 7/228 (3%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S + R K +K V + +GY + Q CGSCW+F TGA+EG ++ K +KL LS
Sbjct: 107 STFNRLPKGTKLPKTVDWRKQGYVTKVKHQKECGSCWAFSATGALEGQHFRKTRKLVSLS 166
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS +GN+GC+GG ++Q+I + GL T+D Y PY +D CH + A
Sbjct: 167 EQQLVDCSRSFGNHGCNGGWMNPAFQYIRYNGGLDTEDSY-PYKAKDGICHYNPNSVGAI 225
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+G V+V+P+ E ALK A+A GP+S+A+DAS +SF Y +GVY + +CN + HA+
Sbjct: 226 CSGHVDVSPD-EAALKQAVATIGPISIAVDASHESFQLYQSGVYDEHRCNKKH--VTHAM 282
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
L VGYG G YW +KNSW WG++GY+ M+ K N CG+ TA +Y
Sbjct: 283 LVVGYGTEGGHDYWLIKNSWGLQWGDKGYIKMTRNKGNQCGIATAASY 330
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ + +GY + DQ CGSCW+F +TG++EG + K KL LS+Q L+DCS YGN G
Sbjct: 114 IDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMG 173
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++ +I G ++D Y PY G D C + AT TG+ ++ E+AL+
Sbjct: 174 CGGGWMDQAFSYIKDKGEESEDGY-PYTGTDDTCVYDASKVVATDTGYTDIPEMDENALQ 232
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
A+A GP+SVAIDA+ SF FY +GVY + +C+ + LDHAVLAVGYG +G YW
Sbjct: 233 QAVATVGPISVAIDATHSSFQFYESGVYDEPECSQT--NLDHAVLAVGYGTSEEGLDYWI 290
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSWST WG QGY+ MS KDN CG+ + +Y
Sbjct: 291 VKNSWSTGWGMQGYIEMSRNKDNQCGIASKASY 323
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EG +Y K +L LS+Q LIDCS YGNNGC+GG ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226
Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+ G+ ++ Y PY+ D C +T A +TG++N+ E AL A+A GP
Sbjct: 227 RDNKGIDSEISY-PYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGP 285
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVAI+A SFS Y +G+Y D +C ++ + LDH VL VGYG DGKPYW +KNSW W
Sbjct: 286 VSVAINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++GYV ++ N CGV +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCGVASAASY 369
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 183 bits (465), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG GK +W
Sbjct: 237 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQKGKKHWI 294
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 295 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+EG +Y K +L LS+Q L+DCS YGNNGC GG +++++
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYV 224
Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+ G+ ++ Y PY+ D C + A +TG+VN+ E AL A+A GP
Sbjct: 225 RDNEGIDSEISY-PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGP 283
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVAI+A SFS Y +G+Y D C + D LDH VL VGYGE +G+ YW +KNSW W
Sbjct: 284 VSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEW 343
Query: 326 GNQGYVLMSIKDNN-CGVMTAPTY 348
G +GY+ +S +N CGV +A +Y
Sbjct: 344 GEKGYIKISKGSHNMCGVASAASY 367
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG +Y K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 217 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 276
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH T AT GFV++ +E L A+A GPVSVA
Sbjct: 277 KDNGGIDTEKSY-PYEALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVA 335
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY + C+ LDH VL VG+G + G+ YW VKNSW T WG++
Sbjct: 336 IDASHESFQFYSEGVYVEPACD--AQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDK 393
Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
G++ ++ KDN CG+ +A +Y
Sbjct: 394 GFIKMLRNKDNQCGIASASSY 414
>gi|449663703|ref|XP_002169139.2| PREDICTED: uncharacterized protein LOC100198320 [Hydra
magnipapillata]
Length = 1092
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 123/196 (62%), Gaps = 7/196 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
QS C SCW+F GA+EGAY + + L ++ Q L+DCSWGY NNGC GG +R+ ++
Sbjct: 838 QSFCASCWAFSAVGAIEGAYTIYNGTLKEMAVQQLVDCSWGYNNNGCRGGWSWRALNFVG 897
Query: 212 KHGLPTQDDYGPYLGQDAYCHIA---NTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+HGL T++ YG Y+ Q+ YCH N M+ + + + + LK AL +GP SV
Sbjct: 898 EHGLATREAYGNYIAQEGYCHCGKNDNCDGVKKMS-YRTIEKKNAEKLKAALTLYGPGSV 956
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
A+ ++K+F FY +GVY D KC + DHAV+ +GYG + KPYW +KNSW WG+
Sbjct: 957 AVQCARKTFKFYSSGVYDDPKCT---EVTDHAVVIIGYGVENNKPYWLIKNSWGKLWGDN 1013
Query: 329 GYVLMSIKDNNCGVMT 344
GY+ + + +N CGV+T
Sbjct: 1014 GYMKIDMNNNLCGVLT 1029
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 132/205 (64%), Gaps = 6/205 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ +Q CG C++F TTG+VEG + + ++L LS+Q ++DCS GNNGCDGG S+
Sbjct: 125 AVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSF 184
Query: 208 QWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I+ GL T+ Y PY G+ C AT+TG+ NV SE L+ A+A PV
Sbjct: 185 EYIIAVGGLDTEASY-PYTGEVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQ-PV 242
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
SVAIDASQ SF Y +GVYY+ +C+++ LDH VLAVGYG G+ YW VKNSW WG
Sbjct: 243 SVAIDASQSSFQLYASGVYYEPECSSTQ--LDHGVLAVGYGSQSGQDYWIVKNSWGADWG 300
Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVT 350
G++LM+ KDNNCG+ T ++ T
Sbjct: 301 ENGFILMARNKDNNCGIATMASFPT 325
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 141 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D C + A GFV++ E+ L A+A GPVSVA
Sbjct: 201 KDNGGIDTEKAY-PYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ+SF FY +GVYYDE C+++ LDH V+ VGYG + G YW VKNSW WG+
Sbjct: 260 IDASQESFQFYSDGVYYDENCSSTD--LDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDL 317
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K+N+CG+ ++ +Y
Sbjct: 318 GYIKMARNKNNHCGIASSASY 338
>gi|291383486|ref|XP_002708337.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 141/232 (60%), Gaps = 13/232 (5%)
Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+ KA +P V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q
Sbjct: 106 FGKAVFQEVPSSVDWRDKGYVTPVKNQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQN 165
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
LIDCSW GN GC GG ++Q++ + GL ++D Y PY +D C + + A TG
Sbjct: 166 LIDCSWPAGNYGCRGGLPDHAFQYVKDNGGLDSEDSY-PYEARDGLCRYSPQESVANDTG 224
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
FV + P E+AL A+A GP++VAIDAS SF FY G+YY+ C S + LDHAVL V
Sbjct: 225 FVQI-PEQEEALMEAVATVGPIAVAIDASHSSFLFYKEGIYYEPNC--SRENLDHAVLVV 281
Query: 305 GYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
GYG E D + YW VKNSW WG GY+ M+ ++N+CG+ TA +Y T+
Sbjct: 282 GYGFEGAESDNQKYWLVKNSWGKGWGMDGYMKMAKDRNNHCGIATAASYPTV 333
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G+ +W
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGRKHWI 294
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GYVLM+ NN CG+ ++ M
Sbjct: 295 IKNSWGENWGNKGYVLMARNKNNACGIANLASFPRM 330
>gi|198432221|ref|XP_002130541.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 330
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 143/223 (64%), Gaps = 5/223 (2%)
Query: 130 NKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
+K +++ V + +GY + +Q CGSCW+F TG++EG ++ K KKL LS+Q LID
Sbjct: 106 DKFAQNPTTVDWRTQGYVTPVKNQLQCGSCWAFSATGSLEGQHFAKTKKLVSLSEQQLID 165
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
CS G+ GC GG ++ +I + G+ ++ +Y PY ++ C + AT+TG V+
Sbjct: 166 CSTKQGDLGCGGGYPDWAFAYINQVGGIESETNY-PYEAKNDVCRFNVSEVAATLTGCVD 224
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+TP+SE L+ A+ GPVSV IDAS SF Y +G+YY+++C++SP LDH VLAVGYG
Sbjct: 225 ITPDSETQLEKAVGSIGPVSVLIDASHISFQLYGSGIYYEQQCSSSPASLDHGVLAVGYG 284
Query: 308 ELDGKPYWQVKNSWSTYWGN-QGYVLMSI-KDNNCGVMTAPTY 348
+G+ YW VKNSW WG GY+ M+ K+NNCG+ T +Y
Sbjct: 285 ADNGQEYWMVKNSWGEGWGKLGGYIKMAKNKNNNCGIATQASY 327
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 6/228 (2%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S + R K + V + KGY ++ +Q CGSCW+F TG++EG ++ K KL LS
Sbjct: 114 STFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAFSATGSLEGQHFRKTGKLVSLS 173
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS +GN GC+GG ++Q+I + G+ T++ Y PY +D C + AT
Sbjct: 174 KQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDTEESY-PYEAEDGKCRYNPKSTGAT 232
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V+V P +E+ LK A+A GP+SVAIDA SF FY +GVY + C+++ LDHAV
Sbjct: 233 CTGYVDVQPANEETLKEAVATIGPISVAIDAFHPSFQFYESGVYDEPDCSSTM--LDHAV 290
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G YW VKNS WG +GY+ MS K N CG+ TA +Y
Sbjct: 291 LAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRNKSNQCGIATAASY 338
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 128/202 (63%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ + KKL LS+Q L+DCS +GNNGC+GG ++++I
Sbjct: 137 DQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYI 196
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY+G+D + AT GFV++ ED LK A+A GP+S+A
Sbjct: 197 KNNGGIDTEAAY-PYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIA 255
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDAS +SF Y NGVY D C+++ LDH VL VGYG E G YW VKNSW WG
Sbjct: 256 IDASHESFQLYSNGVYSDPTCSSTE--LDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGL 313
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN CGV T +Y
Sbjct: 314 DGYIKMARNQDNQCGVATQASY 335
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 130/212 (61%), Gaps = 4/212 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CG+CWSF GA+EG ++ K KL LS Q LIDC+ GNNG
Sbjct: 121 VDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CDGG ++Q++ +G ++ PY+ QD C + A +TGFV++ E AL
Sbjct: 181 CDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTECKYKPECSGANITGFVDIPSMDERALM 240
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GP+SV ID++ SF FY +GVYY+ C++S LDH VL VGYG + YW V
Sbjct: 241 EAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQ--LDHGVLVVGYGSIGKDEYWIV 298
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
KNSW WG+ GY+LM+ KDN+CG+ T +Y
Sbjct: 299 KNSWGEAWGDNGYILMAKDKDNHCGIATEASY 330
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/202 (48%), Positives = 131/202 (64%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TG++EG + K KL LS+Q L+DCS YGNNGC+GG ++ +I
Sbjct: 131 NQGQCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G CH + ++ GFV+V SE+ L A+A GPVSVA
Sbjct: 191 RDNKGIDTEGSY-PYEGVGGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVA 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDAS SF FY +GVY++ KC SP+ LDH VL VGYG E G+ YW VKNSWS WG+
Sbjct: 250 IDASHMSFQFYSHGVYFESKC--SPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGD 307
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ K N CG+ ++ +Y
Sbjct: 308 QGYIKMARNKKNMCGIASSASY 329
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/228 (44%), Positives = 142/228 (62%), Gaps = 9/228 (3%)
Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V + K+ IP + + KGY + +Q CGSCWSF TG++EG ++ K L LS+
Sbjct: 114 VTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q LIDCS YGNNGC+GG ++++I + GL T+ Y PY +D C + AT
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPDNSGATD 232
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
GFV++ E+AL ALA GPVS+AIDAS + F FY GV+Y+ +C+++ LDH VL
Sbjct: 233 NGFVDIPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290
Query: 303 AVGY-GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
AVG+ + G YW VKNSW WG++GY++M+ K NNCGV ++ +Y
Sbjct: 291 AVGFRTDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCWSF TG++EG ++ KKL LS+Q L+DCS GN+G
Sbjct: 119 VDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKKEGNHG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I + G+ T++ Y PY ++ +CH AT TG+V++ ED L
Sbjct: 179 CQGGLMDNAFRYIESNKGIDTEESY-PYTAKNGFCHFKAENVGATDTGYVDIPHMQEDKL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SV IDA KSF Y GVY + C++S LDH VLAVGYG G YW
Sbjct: 238 QEAVATVGPISVGIDAGHKSFQLYREGVYSEPACSSSK--LDHGVLAVGYGTESGDDYWL 295
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG QGYV+M+ K N CG+ T +Y
Sbjct: 296 VKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 328
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/199 (48%), Positives = 123/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TG++EG + K KL LS+Q LIDCS GN+GC+GG +++++I
Sbjct: 141 NQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYI 200
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
G + Y PY +D C T + AT TGFV++ E+ LK A A GP+SVAI
Sbjct: 201 KIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDIKSGDEEMLKEAAATVGPISVAI 260
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF FY NGVY + C+++ LDH VL VGYG +GK YW VKNSW WG GY
Sbjct: 261 DASHTSFQFYSNGVYSETACSSTM--LDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGY 318
Query: 331 VLMSIK-DNNCGVMTAPTY 348
+ MS DN CG+ T +Y
Sbjct: 319 IKMSRNADNQCGIATQASY 337
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 133/213 (62%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCW+F TTG++EG + K KL LS+Q L+DCS YGNNG
Sbjct: 118 VDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I ++ G+ T++ Y PY ++ C + A TGFV+VT E+AL
Sbjct: 178 CQGGLMDYAFKYIKENGGIDTEESY-PYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A GP+SVAIDA SF FY +GVY + C+++ LDH VL VGYG G YW
Sbjct: 237 KTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTS--LDHGVLVVGYGTYQGSDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG +GY++MS K+N CGV T +Y
Sbjct: 295 VKNSWGERWGMEGYIMMSRNKNNQCGVATQASY 327
>gi|405963298|gb|EKC28885.1| Cathepsin L [Crassostrea gigas]
Length = 265
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 103/216 (47%), Positives = 133/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCW+F TTG +EG +Y K KL LS+Q L+DCS N G
Sbjct: 55 VDWSKEGYVTPVKNQGQCGSCWAFSTTGGLEGQHYRKTGKLVSLSEQNLLDCS--KENMG 112
Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Y++I ++G + T++ Y PYLG+ C + AT TGFV VT E AL
Sbjct: 113 CNGGLPQKAYKYIKENGGIDTEESY-PYLGKKETCSFRPSEVGATCTGFVQVTAGDELAL 171
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP++V IDASQ SF Y GVY ++ CN P DHAVL VGYG GK YW
Sbjct: 172 KKAVASVGPITVCIDASQPSFQLYKGGVYDEQSCN--PIVFDHAVLIVGYGVYQGKDYWL 229
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSW T WG GY++MS NN CG+ Y T+
Sbjct: 230 VKNSWGTSWGMDGYIMMSRNQNNQCGIANHAVYPTV 265
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 131/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG ++ + L LS+Q LIDCS YGNNGC+GG +++ +I
Sbjct: 139 DQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY G+D C ++ A+ GFV++ E LK A+A GPVSVA
Sbjct: 199 KDNKGLDTEKTY-PYEGEDDKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVATVGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY +G+Y++ +C S LDH VL VGYG + +G+ YW VKNSW WG +
Sbjct: 258 IDASHQSFQFYSDGIYFEPEC--SSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEK 315
Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
GY+ M+ DN+CG+ ++ +Y
Sbjct: 316 GYIKMARNIDNHCGIASSASY 336
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG L G +W
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGILKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/201 (46%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D CH T AT G V++ E + A+A GPVSVA
Sbjct: 199 KDNGGIDTEKSY-PYEGIDDSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY G+Y + +C+ P LDH VL VGYG + G+ YW VKNSW T WG++
Sbjct: 258 IDASHESFQFYSEGIYNEPQCD--PQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDK 315
Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
G++ M+ DN CG+ +A +Y
Sbjct: 316 GFIKMARNADNQCGIASASSY 336
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 133/205 (64%), Gaps = 7/205 (3%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ + L LS+Q LIDCS YGNNGC+GG ++
Sbjct: 168 AIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAF 227
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I ++ GL T+ Y PY ++ C + A+ GFV++ ED LK A+A GP+
Sbjct: 228 RYIKENKGLDTEKSY-PYEAENDQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPI 286
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD--GKPYWQVKNSWSTY 324
SVAIDAS +SF FY GVYY+ +C SP LDH VL VGYG G+ YW VKNSW
Sbjct: 287 SVAIDASHESFHFYSEGVYYEPEC--SPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGET 344
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG +GY+ M+ K+N+CG+ ++ +Y
Sbjct: 345 WGEKGYIKMARNKENHCGIASSASY 369
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/214 (47%), Positives = 132/214 (61%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TTG++EG ++ K KL LS+ L+DCS +GN G
Sbjct: 147 VDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEG 206
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I GL +++DY PY + C +T AT TG V+V SE AL
Sbjct: 207 CNGGLMDNAFKYIKSVGGLESEEDY-PYKPKQGTCKFDDTKVAATDTGCVDVESGSESAL 265
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYW 315
K A+++ GPVSVAIDAS SF Y GVY + +C S + LDH VL VGYG D G+ YW
Sbjct: 266 KKAVSEVGPVSVAIDASHSSFQSYAGGVYDEPEC--SSEQLDHGVLCVGYGTDDQGQDYW 323
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG GYV MS K N CG+ T +Y
Sbjct: 324 IVKNSWGAEWGEDGYVKMSRNKKNQCGIATQASY 357
>gi|156354963|ref|XP_001623448.1| predicted protein [Nematostella vectensis]
gi|156210148|gb|EDO31348.1| predicted protein [Nematostella vectensis]
Length = 392
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/221 (40%), Positives = 133/221 (60%), Gaps = 13/221 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 177 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 236
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K G+ T YG Y GQ+ +C +N T A +T + V
Sbjct: 237 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 296
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 297 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 353
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
+G PYW +KNSWS WGN G++ IK CG+ P V
Sbjct: 354 DNGVPYWLIKNSWSHKWGNNGFI--KIKQGLCGIEKRPFVV 392
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 108/234 (46%), Positives = 141/234 (60%), Gaps = 9/234 (3%)
Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
K +K S ++ N + V + KGY + DQ CGSCW+F TTG+VEG ++ K
Sbjct: 102 KAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGK 161
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
L LS+Q L+DCS + GCDGG R++Q+I+ G + T+ Y PY D CH
Sbjct: 162 LVSLSEQNLVDCS--GRDAGCDGGFMDRAFQYIIDAGGIDTEASY-PYKAVDGKCHFKKA 218
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
AT+TG+ +VT SE AL+ A+A GP+SVAIDAS SF Y +GVY + C+++
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTV-- 276
Query: 297 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LDH VLAVGYG DG YW VKNSW+ WG GYV MS KDN CG+ T +Y
Sbjct: 277 LDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASY 330
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K+ LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 154 NQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY+G++ CH A GFV++ E+ALK+A+A GP+S+A
Sbjct: 214 KDNHGIDTEESY-PYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIA 272
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA ++F Y GVYYDE+C S + LDH VL VGYG + + YW +KNSW WG +
Sbjct: 273 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEK 330
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ + N+CGV T +Y
Sbjct: 331 GYIRIARNRSNHCGVATKASY 351
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K+ LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 154 NQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY+G++ CH A GFV++ E+ALK+A+A GP+S+A
Sbjct: 214 KDNHGIDTEESY-PYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIA 272
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA ++F Y GVYYDE+C S + LDH VL VGYG + + YW +KNSW WG +
Sbjct: 273 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEK 330
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ + N+CGV T +Y
Sbjct: 331 GYIRIARNRSNHCGVATKASY 351
>gi|17224950|gb|AAL37181.1|AF320084_1 cathepsin L-like protease [Ancylostoma caninum]
Length = 214
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/201 (44%), Positives = 132/201 (65%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + ++ LS+Q L+DCS YGN+GC+GG ++++I
Sbjct: 14 NQGMCGSCWAFSATGALEGQHARASGQMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 73
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T++ Y PY+G+D CH A G+V++ E+ALK+A+A GP+S+A
Sbjct: 74 KDNHGIDTEESY-PYVGRDMKCHFKKKDIGAVDNGYVDLPEGDEEALKIAVATQGPISIA 132
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA ++F Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW T WG +
Sbjct: 133 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGTGWGEK 190
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ ++N+CGV T +Y
Sbjct: 191 GYIRIARNRNNHCGVATKASY 211
>gi|2706547|emb|CAA75862.1| putative cathepsin L [Xenopus laevis]
Length = 231
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 134/217 (61%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+ TTGA+EG +Y K KL LS+Q L+DCS GN G
Sbjct: 14 VDWRKKGYVTPVKDQGQCGSCWAPSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEG 73
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY + D CH +A TGFV+V E L
Sbjct: 74 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDL 133
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA +SF FY +G+YY+ +C S + LDH VL VGYG ++DGK
Sbjct: 134 MKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVDGK 191
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSWS WG+ GY+ + + N+CG+ TA +Y
Sbjct: 192 KYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASY 228
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 130/200 (65%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG ++MK KL LS+Q L+DCS +GN GC+GG +++++I
Sbjct: 127 DQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYI 186
Query: 211 MKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G ++ PY+ +D C + + AT++ + ++ E AL A+ GPVSVA
Sbjct: 187 KSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVA 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS KS FY +G+Y + +C+ + LDH VLAVGYG +DG YW VKNSW + WG+ G
Sbjct: 247 IDASHKSLRFYKSGIYDEPECSRTK--LDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMG 304
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
YV M+ K+N CG+ T +Y
Sbjct: 305 YVKMTRNKNNQCGIATKASY 324
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 124 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 182 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 241 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 298
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 299 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 334
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPQGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANMASFPKM 329
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)
Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
S+ V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC
Sbjct: 114 SRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
N+GC GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ +
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
+E ALK A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 126/200 (63%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTG+VEGA+ +K + LS+Q L+DCS +GNNGCDGG ++++I
Sbjct: 131 DQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M G+ T+D Y PY C + A ++G+ +T SE L+ AL K PVS+A
Sbjct: 191 MSQGGVATEDSY-PYNAVQGKCKFTKSMVGANISGYKEITQGSELELQAALTKQ-PVSIA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDASQ+SF Y +GVY + +C S LDH VLAVGYG +GK Y+ VKNSW+ WG G
Sbjct: 249 IDASQQSFQLYKSGVYDEPEC--SSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDG 306
Query: 330 YVLMSIK-DNNCGVMTAPTY 348
Y+ MS N CGV T +Y
Sbjct: 307 YIFMSRNAKNQCGVATMASY 326
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 125/201 (62%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTGA+EG + K +L LS+Q LIDCS YGNNGC GG ++ +I
Sbjct: 136 NQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYI 195
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T++ Y PY G+ C + TGFV++ +E AL ALA GPVSVA
Sbjct: 196 KENHGIDTEESY-PYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVA 254
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY C++ LDH VLAVGYG DG+ Y+ +KNSW WG +
Sbjct: 255 IDASHESFQFYHEGVYNPPDCDSHS--LDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQE 312
Query: 329 GYVLMSIKD-NNCGVMTAPTY 348
GYVLM+ N CGV T +Y
Sbjct: 313 GYVLMARNSKNECGVATQASY 333
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 137/218 (62%), Gaps = 11/218 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 119 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C GG +++Q++ GL ++D Y PY G D CH +A TGFV+V E A
Sbjct: 179 CGGGLMDQAFQYVTDNQGLDSEDSY-PYTGTDDQPCHYDPLYNSANDTGFVDVPSGKEHA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VLAVGYG + G
Sbjct: 238 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDKMG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K +W VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 296 KKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 333
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 294
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 295 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 119 VDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 178
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ + +D+ PY G D C + TGFV++ E AL
Sbjct: 179 CNGGLMDQAFQYVKDNQGLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERAL 238
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA +SF FY +G+Y++++C S D LDH VL VGYG ++DGK
Sbjct: 239 MKAVASVGPVSVAIDAGHESFQFYQSGIYFEKEC--SSDELDHGVLVVGYGFEGEDVDGK 296
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS WG++G++ M+ + N+CG+ TA +Y
Sbjct: 297 KYWIVKNSWSEKWGDKGFIYMAKDRHNHCGIATAASY 333
>gi|156355026|ref|XP_001623478.1| predicted protein [Nematostella vectensis]
gi|156210181|gb|EDO31378.1| predicted protein [Nematostella vectensis]
Length = 306
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/221 (40%), Positives = 133/221 (60%), Gaps = 13/221 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 91 VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 150
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K G+ T YG Y GQ+ +C +N T A +T + V
Sbjct: 151 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 210
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 211 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 267
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
+G PYW +KNSWS WGN G++ IK CG+ P V
Sbjct: 268 DNGVPYWLIKNSWSHKWGNNGFI--KIKQGLCGIEKRPFVV 306
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 173 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 230
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 231 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 289
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 290 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 347
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 348 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 383
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 103/226 (45%), Positives = 137/226 (60%), Gaps = 12/226 (5%)
Query: 132 ASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
AS D+ P V + KG + DQ CGSCWSF TG++EG ++K+K L LS+Q L
Sbjct: 109 ASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNL 168
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTG 244
+DCSW +GN GC+GG +++++ + G+ T++ Y PY D C TG
Sbjct: 169 VDCSWDFGNEGCNGGLMDSAFEYVESNGGIDTEESY-PYTAVDGDSCLYKAANNAGVNTG 227
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+ +V SE AL+ A+ K GPVSVAIDAS SF Y +G+YY+ C S D LDH VLAV
Sbjct: 228 YKDVQAKSESALRDAVEKAGPVSVAIDASNWSFQMYSSGIYYESAC--SSDYLDHGVLAV 285
Query: 305 GYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GYG E K +W VKNSW T WG +GY+ M+ K NNCG+ T +Y
Sbjct: 286 GYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 331
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 92/200 (46%), Positives = 126/200 (63%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG +++K L LS+Q L+DCS GN GC GG +++++I
Sbjct: 127 NQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYI 186
Query: 211 MKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G ++ PY G+D C + + AT++ FV+V EDALK A A GP+SV
Sbjct: 187 KTNGGIDTEECYPYKGRDERKCEYKASCSGATLSSFVDVKTGDEDALKQASATIGPISVG 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVY++++C S LDH VL VGYG K YW VKNSW WG +G
Sbjct: 247 IDASHPSFQLYDHGVYHEKRC--SSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGMEG 304
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y++MS KDN CG+ T +Y
Sbjct: 305 YIMMSRNKDNQCGIATQASY 324
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 107/256 (41%), Positives = 145/256 (56%), Gaps = 13/256 (5%)
Query: 99 EELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGS 157
EE+V + K L S+ + W +K + Y KGY + +Q CGS
Sbjct: 87 EEVVQKMTGLKVPLSRSQNNDTLYFPDW-----ETKTPDSIDYRKKGYVTPVKNQGQCGS 141
Query: 158 CWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLP 216
CW+F + GA+EG K KL LS Q L+DC N+GC GG ++Q++ K+ G+
Sbjct: 142 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRGID 199
Query: 217 TQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKS 276
++D Y PY+G+D C T A G+ + SE ALK A+A+ GPV+VAIDAS S
Sbjct: 200 SEDAY-PYIGEDESCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSS 258
Query: 277 FSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIK 336
F FY GVYYDE CN+ D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+
Sbjct: 259 FQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQRGTKHWIIKNSWGEQWGNKGYILMARN 316
Query: 337 DNN-CGVMTAPTYVTM 351
NN CG+ ++ M
Sbjct: 317 KNNACGIANLASFPKM 332
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 125/201 (62%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTGA+EG + K +L LS+Q LIDCS YGNNGC GG ++ +I
Sbjct: 131 NQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYI 190
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T++ Y PY G+ C + TGFV++ +E AL ALA GPVSVA
Sbjct: 191 KENHGIDTEESY-PYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVA 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY C++ LDH VLAVGYG DG+ Y+ +KNSW WG +
Sbjct: 250 IDASHESFQFYHEGVYNPPDCDSHS--LDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQE 307
Query: 329 GYVLMSIKD-NNCGVMTAPTY 348
GYVLM+ N CGV T +Y
Sbjct: 308 GYVLMARNSKNECGVATQASY 328
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)
Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
S+ + Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
N+GC GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ +
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
+E ALK A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 128 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 185
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 186 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 244
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 245 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 302
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 303 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 338
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)
Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
S+ + Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
N+GC GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ +
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
+E ALK A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 103/226 (45%), Positives = 138/226 (61%), Gaps = 12/226 (5%)
Query: 132 ASKDAIP----VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
AS D+ P V + KG + + DQ CGSCWSF TG++EG ++K+K L LS+Q L
Sbjct: 109 ASNDSWPWSKKVDWRSKGAATPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNL 168
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTG 244
+DCSW +GN GC+GG +++++ + G+ T++ Y PY D C TG
Sbjct: 169 VDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESY-PYTAVDGDSCLYRAANNAGVNTG 227
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+ +V SE AL+ A+ K GPVSVAIDAS SF Y +G+YY+ C S D LDH VLAV
Sbjct: 228 YKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYSSGIYYESAC--SSDYLDHGVLAV 285
Query: 305 GYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
GYG E K +W VKNSW T WG +GY+ M+ K NNCG+ T +Y
Sbjct: 286 GYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 331
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG +Y + L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 141 DQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDNAFKYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C + A GF+++ E L A+A GPVSVA
Sbjct: 201 KDNKGIDTEKSY-PYEAVDDKCRYNPRNSGADDVGFIDIPSGDEGKLMAAVATVGPVSVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ++F FY +GVY+DE C+++ LDH VL VGYG + +G YW VKNSW WG+
Sbjct: 260 IDASQETFQFYSDGVYFDENCSSTS--LDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDL 317
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN+CG+ TA ++
Sbjct: 318 GYIKMARNRDNHCGIATAASF 338
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 331
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 136/214 (63%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTG++EG ++ K KL LS+Q L+DCS GN+G
Sbjct: 118 VDWRKEGYVTPVKDQKQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDA 255
C GG +++I + G+ T+ Y PY+ + + C + + AT+TG V++ SE A
Sbjct: 178 CQGGLMDLGFEYIFDNGGIDTESSY-PYMAKNEPQCMYKRSNSGATLTGCVDIKRGSESA 236
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+A GP+SVAIDA KSF Y +GVYY+ C S LDH VLAVG+G +G+ +W
Sbjct: 237 LMKAVADVGPISVAIDAGHKSFQMYKSGVYYEPSC--SSVKLDHGVLAVGFGADNGEDFW 294
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG +GY++MS +DNNCG+ T +Y
Sbjct: 295 LVKNSWGPIWGMEGYIMMSRNRDNNCGIATQASY 328
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 129/216 (59%), Gaps = 7/216 (3%)
Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
D+I R E + + DQ CGSCWSF TTGAVEGA+ +K + LS+Q L+DCS YG
Sbjct: 119 DSIDWR-EKGAVSQVKDQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYG 177
Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
N GC+GG ++++I+ + G+ T+ Y PY C + A + G+ + E
Sbjct: 178 NQGCEGGLMVNAFEYIIDNGGIATESSY-PYTAAQGRCKFTKSMNGANIIGYKEIPQGEE 236
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
D+L ALAK PVSVAIDAS SF Y +GVY + C S + LDH VLAVGYG L+GK
Sbjct: 237 DSLTAALAKQ-PVSVAIDASHMSFQLYSSGVYDEPAC--SSEALDHGVLAVGYGTLEGKD 293
Query: 314 YWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
Y+ +KNSW WG GY+ MS N CGV T +Y
Sbjct: 294 YYIIKNSWGPTWGQDGYIFMSRNAQNQCGVATMASY 329
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+EG +Y K +L LS+Q L+DCS YGNNGC GG +++++
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYV 224
Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+ G+ ++ Y PY+ D C + A +TG+VN+ E AL A+A GP
Sbjct: 225 RDNEGIDSEISY-PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGP 283
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVAI+A SFS Y +G+Y D C + D LDH VL VGYGE +G+ YW +KNSW W
Sbjct: 284 VSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEW 343
Query: 326 GNQGYVLMSIKDNN-CGVMTAPTY 348
G +GY+ +S +N CGV +A +Y
Sbjct: 344 GEKGYIKISKGSHNMCGVASAASY 367
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 TANGGLDTEESY-PYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 129/202 (63%), Gaps = 8/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 188 NQGMCGSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYI 247
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T+ Y PY G++ CH T A G+V++ E+ LK+A+A GP+SVA
Sbjct: 248 KDNHGVDTEASY-PYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVA 306
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDA SF Y GVYY+ +C S + LDH VL VGYG E+DG YW VKNSW WG
Sbjct: 307 IDAGHPSFQMYRKGVYYEPQC--SSESLDHGVLVVGYGTDEIDG-DYWIVKNSWGPGWGE 363
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
+GYV ++ +DN+CG+ + +Y
Sbjct: 364 KGYVRIARNRDNHCGIASKASY 385
>gi|38146075|gb|AAR11477.1| cathepsin L [Litopenaeus vannamei]
Length = 297
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 136/206 (66%), Gaps = 8/206 (3%)
Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA + +P V + KG + DQ CGSCW+F TTG++EG +++K KL LS+Q L+
Sbjct: 95 KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 154
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS +GN GC GG +++++I + G+ T+D Y PY QD C + AT TG+V
Sbjct: 155 DCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 213
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
+V SE ALK A+A GP+SV IDASQ +F FY GVY+D+ C+++ LDH VLAVGY
Sbjct: 214 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 271
Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYV 331
G + +G +W VKNSW+T WG++GY+
Sbjct: 272 GSDENGGDFWLVKNSWNTSWGDKGYI 297
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 142 DQGKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYI 201
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D C + A GFV++ E+ L A+A GPVSVA
Sbjct: 202 KDNGGIDTEKTY-PYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVA 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ SF FY GVYYD +C+++ LDH VL VGYG + G YW VKNSWS WG
Sbjct: 261 IDASQNSFQFYSGGVYYDTECSSTD--LDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGEL 318
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN+CG+ T +Y
Sbjct: 319 GYIKMARNRDNHCGIATDASY 339
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
Length = 219
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 9 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 66
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 67 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPQGNEKAL 125
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 126 KRAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 183
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 184 IKNSWGENWGNKGYILMARNKNNACGIANMASFPKM 219
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPVGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ + GL +++ Y PYL QD C + A TGF ++ + E++L
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESY-PYLAQDGRCKYKPEQSAANDTGFADIHQD-EESL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
L++A GP+SVAIDAS +F FY G+YYD C S + LDH VL VGYG E + K
Sbjct: 236 MLSVATVGPISVAIDASLDTFRFYYKGIYYDPNC--SSEDLDHGVLVVGYGSDEREAENK 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW T WG QGY+LM+ + N+CG+ T+ ++
Sbjct: 294 NYWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASF 330
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 190
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 249
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 250 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 307
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 308 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 343
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 11/213 (5%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + DQ CGSCW+F TGA+EG + K K+ LS+Q L++CS GN GC+GG
Sbjct: 126 KGYVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGL 185
Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+++Q++ GL +++ Y PYLG D CH TGFV++ SE AL A+
Sbjct: 186 MDQAFQYVKDNQGLDSEESY-PYLGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAV 244
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
GP+SVAIDA +SF FY +G+YY+ +C S + LDH VL VGYG ++DGK YW
Sbjct: 245 TAVGPISVAIDAGHESFQFYQSGIYYEPEC--SSEELDHGVLLVGYGFEGEDVDGKKYWI 302
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSWS WG++GYV M+ + N+CG+ TA +Y
Sbjct: 303 VKNSWSEKWGDKGYVYMAKDRQNHCGIATAASY 335
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN G
Sbjct: 136 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 195
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G ++ PY +D C A TGFV++ E AL
Sbjct: 196 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 255
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+YY+ C S + LDH VL VGYG ++DGK
Sbjct: 256 MKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 313
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 314 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 350
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TTG++EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++ +I K+ G+ T+ Y PY G D C AT++GFV+V E+AL
Sbjct: 178 CNGGLMDQAFTYIKKNGGIDTEAAY-PYTGSDGTCRFLENKVGATVSGFVDVKSGDENAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+SVAIDAS F FY GVY C+++ LDH VL VGYG GK YW
Sbjct: 237 KEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTE--LDHGVLVVGYGTEGGKDYWL 294
Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTYVTM 351
VKNSW + WG +GY+ M K N CG+ T +Y T+
Sbjct: 295 VKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYPTV 330
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 190
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 249
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 250 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 307
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 308 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 343
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN G
Sbjct: 226 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 285
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G ++ PY +D C A TGFV++ E AL
Sbjct: 286 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 345
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+YY+ C S + LDH VL VGYG ++DGK
Sbjct: 346 MKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 403
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 404 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 440
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
Length = 329
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 96/204 (47%), Positives = 133/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 133 DQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 192
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GP+SV
Sbjct: 193 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPISV 251
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VL VGYG ++ + +W VKNSW W
Sbjct: 252 AIDAGHESFQFYSSGVYDEPQC--SSEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNW 309
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS KDN CG+ T+ +Y
Sbjct: 310 GDQGYIMMSRNKDNQCGIATSASY 333
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 132/221 (59%), Gaps = 10/221 (4%)
Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
AIPV + +GY + +Q CGSCW+F TTG++EG ++ K KL LS+Q L+D
Sbjct: 129 AIPVSVPAEVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVD 188
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS YGN GC+GG ++Q+I + + PY D C + AT TG+ ++
Sbjct: 189 CSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPYEAVDGTCRFKSVCVGATCTGYTDL 248
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
E +K A+A GPVSVAIDAS SF Y +G+Y +++C SP LDHAVL VGYG
Sbjct: 249 PKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIYVEQEC--SPKQLDHAVLVVGYGT 306
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
G+ YW VKNSW T WG++GY+ M+ DN CG+ + +Y
Sbjct: 307 EQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQASY 347
>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
Length = 362
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 103/228 (45%), Positives = 132/228 (57%), Gaps = 4/228 (1%)
Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
Y WV +++ D E + DQ CGSCWSF TGA+EG KL LS
Sbjct: 124 YLDWVDADESKLDKSVDWREKGAVTEVKDQGQCGSCWSFSATGALEGQMAQVFGKLPDLS 183
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATAT 241
+Q L+DCS GN GC+GG ++Q++ +D+ PY G D C + A
Sbjct: 184 EQNLVDCSRPEGNQGCNGGLMDAAFQYVKDQDGLDGEDWYPYEGVDNKECRYDKSHREAD 243
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TGF + +E ALK ALAK GPVSVAIDAS SF FY +GVYY+ C SP+ LDH V
Sbjct: 244 DTGFKMIPEGNEKALKHALAKVGPVSVAIDASNPSFQFYQSGVYYEPNC--SPENLDHGV 301
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG DG+ Y+ VKNSWS WG+ GY+ M+ K+N+CG+ + Y
Sbjct: 302 LAVGYGTEDGEHYYLVKNSWSEAWGDNGYIKMARNKENHCGIASYAVY 349
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 104 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 161
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 162 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 220
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 221 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 278
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 279 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 314
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 138 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 195
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 196 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 254
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 255 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 312
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 313 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 348
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 132/216 (61%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F +TG++EG ++ K +L LS+Q L DCS GN G
Sbjct: 121 VDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSLSEQNLTDCSQKQGNMG 180
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++ +I + +G+ T+ Y PY D CH AT TG+ ++ E+AL
Sbjct: 181 CNGGLMDQAFTYIKENNGIDTESSY-PYKAVDEKCHFKAADVGATDTGYTDIAQQDENAL 239
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDAS SF Y +G Y + C S LDH VLAVGY DGK Y+
Sbjct: 240 QSAIATVGPISVAIDASHSSFQLYRSGAYNERAC--SATQLDHGVLAVGYDSEDGKDYYI 297
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW T WG +GY+ M+ K+N CG+ T TY T+
Sbjct: 298 VKNSWGTSWGQKGYIWMTRNKNNQCGIATMSTYPTV 333
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TG++EG + K KL LS+Q L+DCS YGN G
Sbjct: 122 VDWRDKGYVAEVKDQKNCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++++I + G+ T++ Y PY D C T AT TG+V++ E+AL
Sbjct: 182 CGGGLMDLAFEYIEDNKGIDTEESY-PYEATDGDCRFKPATVGATCTGYVDINSEDENAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDA SF Y +G+Y + C S + LDH VLAVGYG + + YW
Sbjct: 241 QKAVANIGPISVAIDAGHISFQLYGSGIYNEPNC--SSEDLDHGVLAVGYGTDNQQDYWL 298
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG+QGY+ M+ K+N CG+ TA +Y
Sbjct: 299 VKNSWGLDWGDQGYIKMTRNKNNQCGIATAASY 331
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/216 (45%), Positives = 133/216 (61%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TG++EG + K KL LS+Q L+DCS GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q+I + G+ T+ Y PY ++ C AT +GF ++ SE L
Sbjct: 178 CQGGLMDDAFQYIKDNSGIDTESSY-PYEAKNGKCRFNAANVGATDSGFTDIKSKSESDL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDAS SF Y +GVY++ C+ + LDH VLAVGYG GK YW
Sbjct: 237 QSAVATVGPISVAIDASHMSFQLYRSGVYHEFFCSETR--LDHGVLAVGYGTESGKDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW WG +GY++MS K NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIMMSRNKRNNCGIATSASYPTV 330
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 131/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 122 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SKNDG 179
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ + SE AL
Sbjct: 180 CGGGYMTNAFQYVQENRGIDSEDAY-PYIGQDESCMYNPTGKAAKCRGYREIPEGSEKAL 238
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPV+VAIDAS SF FY GVYYDE CN D L+HAVLAVGYG G +W
Sbjct: 239 KRAVARVGPVAVAIDASLSSFQFYSKGVYYDENCNG--DNLNHAVLAVGYGIQRGTKHWI 296
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ K N CG+ ++ M
Sbjct: 297 IKNSWGEEWGNKGYILMARNKKNACGIANLASFPKM 332
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 139/219 (63%), Gaps = 12/219 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG ++ K KL LS+Q LIDCS GN G
Sbjct: 120 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGPEGNQG 179
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I +G+ +++ Y PY+G+D C +A TGFV++ E A
Sbjct: 180 CNGGLMDQAFQYIKDNNGIDSEESY-PYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERA 238
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELD 310
L A+A GP+SVAIDAS SF FY +GVYY+ +CN+ LDH VL VGYG + +
Sbjct: 239 LMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEE--LDHGVLVVGYGYEGTDDDN 296
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG+QGY+ M+ + NNCG+ +A +Y
Sbjct: 297 KKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASY 335
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 129/202 (63%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TGA+EG + K KL LS+Q L+DCS +GNNGC+GG ++ +I
Sbjct: 135 NQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYI 194
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY G D +CH + GFV++ SE LK A+A GP+SVA
Sbjct: 195 RDNKGIDTEASY-PYEGIDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDAS SF FY +GVY + KC S + LDH VL VG+G + G+ YW VKNSWS WG+
Sbjct: 254 IDASHMSFQFYSHGVYVESKC--SSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGD 311
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ K+N CG+ ++ +Y
Sbjct: 312 QGYIKMARNKENMCGIASSASY 333
>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
Length = 288
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 78 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 135
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 136 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 194
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 195 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 252
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 253 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 288
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 136/222 (61%), Gaps = 8/222 (3%)
Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
+A+P V + KGY + +Q CGSCW+F TTG++EG ++ KL LS+Q L+DCS
Sbjct: 105 EALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSR 164
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
GNNGC+GG + +I ++ G+ T++ Y PY G+D C + A + GFV+V
Sbjct: 165 VEGNNGCNGGLMDNGFTYIQQNGGIDTEESY-PYTGKDGDCAFNENSVGARVKGFVDVPQ 223
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
E AL+ A+A GPVSVAIDAS SF +Y GVY + C+ S LDH VL VGYG +
Sbjct: 224 RDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQ--LDHGVLVVGYGTEN 281
Query: 311 GKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
G YW VKNSW WG GY+ +M K+N CG+ + +Y T+
Sbjct: 282 GVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASYPTV 323
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/219 (45%), Positives = 133/219 (60%), Gaps = 8/219 (3%)
Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
S+ + Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
N+GC GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ +
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCKGYREIPE 230
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
+E ALK A+A+ GP+SVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGVQK 288
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
G +W +KNSW WGN+GY+LM+ NN CG+ ++
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN G
Sbjct: 102 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 161
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G ++ PY +D C A TGFV++ E AL
Sbjct: 162 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 221
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+YY+ C S + LDH VL VGYG ++DGK
Sbjct: 222 MKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 279
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 280 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 316
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TTGA+EG ++ + +L LS+Q L++CS GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G +D PY+G D CH A TGFV++ E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GPVSVAIDA SF FY +G+Y++ +C+++ LDH VL VGYG + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSWS G GY+LM+ KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASY 335
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 96/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY ++ C + A G+V++ +E LK A+A GPVSVA
Sbjct: 199 KDNKGLDTEVTY-PYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C S + LDH VLAVGYG + +G+ YW VKNSW WG+
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDN 315
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K N+CG+ + +Y
Sbjct: 316 GYIKMARNKLNHCGIASTASY 336
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/219 (44%), Positives = 135/219 (61%), Gaps = 8/219 (3%)
Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
+A+P V + KG + +Q CGSCW+F ++EG + +K KL LS+Q L+DCS
Sbjct: 119 EALPATVDWTTKGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSA 178
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
G+ GC GG ++++++++ G+ T+ Y PY D C + AT+ FV+V
Sbjct: 179 AEGDMGCSGGWMDYAFKYVIQNRGIDTEASY-PYKAIDESCEFKRNSIGATIHSFVDVKT 237
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
E AL+ A+A GP+SVAIDASQ SF FY +GVY + C S + LDH V AVGYG L+
Sbjct: 238 GDESALQNAVASIGPISVAIDASQPSFQFYSSGVYNEPDC--STEILDHGVTAVGYGTLN 295
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G PYW+VKNSW T WG +GY+ MS K N CG+ T +Y
Sbjct: 296 GVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGIATKASY 334
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/201 (46%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 144 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKYI 203
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C + A GFV++ E+ L A+A GP+SVA
Sbjct: 204 KDNGGIDTEKSY-PYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVGPISVA 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ++F FY GVYYDE C+++ LDH V+ VGYG E +G YW VKNSW WG
Sbjct: 263 IDASQETFQFYSKGVYYDENCSSTD--LDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGEL 320
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K+N+CG+ ++ +Y
Sbjct: 321 GYIKMAHNKNNHCGIASSASY 341
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 142/221 (64%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS +GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/219 (43%), Positives = 135/219 (61%), Gaps = 8/219 (3%)
Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
+A+P V + KG + +Q CGSCW+F ++EG + +K KL LS+Q L+DCS
Sbjct: 119 EALPATVDWTTKGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSA 178
Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
G+ GC GG ++++++++ G+ T+ Y PY D C + AT+ FV+V
Sbjct: 179 AEGDMGCSGGWMDYAFKYVIQNRGIDTEASY-PYKAIDESCEFKRNSVGATIHSFVDVKT 237
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
E AL+ A+A GP+SVAIDA+Q SF FY +GVY + C S + LDH V AVGYG L+
Sbjct: 238 GDESALQNAVASIGPISVAIDAAQPSFQFYSSGVYNEPDC--STEILDHGVTAVGYGTLN 295
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G PYW+VKNSW T WG +GY+ MS K N CG+ T +Y
Sbjct: 296 GAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGIATKASY 334
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 133/216 (61%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +++++ +G ++ PYL QD C + A TGF ++ + E++L
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAANDTGFADIHQD-EESLM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
L++A GP+SVAIDAS +F FY G+YYD C S + LDH VL VGYG E + K
Sbjct: 237 LSVATVGPISVAIDASLDTFRFYYKGIYYDPNC--SSEDLDHGVLVVGYGSDEREAENKN 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW T WG QGY+LM+ + N+CG+ T+ ++
Sbjct: 295 YWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASF 330
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 71/113 (62%), Gaps = 8/113 (7%)
Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
+ A +TG VNV P E+A+ LA+A GPVS AI AS SF F G+YYD C S + L
Sbjct: 390 SAADVTGPVNV-PQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNC--SSEDL 446
Query: 298 DHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTA 345
DH VL VGYG E + K YW VKNSW T WG QGY+L+ DN+C + T+
Sbjct: 447 DHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDNHCEITTS 499
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY ++ C + A G+V++ +E LK A+A GPVSVA
Sbjct: 199 KDNKGLDTEVTY-PYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C S + LDH VLAVGYG + +G+ YW VKNSW WG+
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDN 315
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K N+CG+ + +Y
Sbjct: 316 GYIKMARNKLNHCGIASTASY 336
>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abe854
gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abi491
gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abj688
gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
Complex With Human Cathepsin K
Length = 217
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 7 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 65 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 124 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 181
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 182 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 217
>gi|281398206|gb|ADA67926.1| putative peptidase [Bombyx mori]
Length = 696
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 131/241 (54%), Gaps = 16/241 (6%)
Query: 111 RLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
R + E++ + W R+ + PVR+ Q C SCW+F GAVEGA
Sbjct: 423 RRARRDAERLPRNFDWRRHGAVT----PVRF----------QGSCSSCWAFAVVGAVEGA 468
Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY 230
+++ +L LS+Q LIDC+ YG +GC G +Y +I GL D+Y PY +
Sbjct: 469 LFLRTSRLVPLSEQDLIDCAHPYGGHGCSGTWPRAAYDYIQDQGLRALDEYLPYEAKVLQ 528
Query: 231 CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC 290
C T ++G VNVT NSE ALK A+ ++GP V +D K F ++ GV YD +C
Sbjct: 529 CAARQTRPVTHISGHVNVTRNSELALKAAIRRNGPNVVLVDGQSKGFIYHKKGVLYDNRC 588
Query: 291 NNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
+HAVLAVG+GE G+ ++ VKNSWS+ W N+GY + N CGV+T P+Y
Sbjct: 589 KKR--SRNHAVLAVGWGERRGESFFVVKNSWSSNWANEGYGWLHAPSNTCGVLTEPSYPV 646
Query: 351 M 351
+
Sbjct: 647 L 647
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CGV T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
Vinyl Sulfone Inhibitor
gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
Oxoethylcarbamate
gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
Inhibitor
gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
Inhibitor
gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
Myocrisin
gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor E-64
gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Symmetric Diacylaminomethyl
Ketone Inhibitor
gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Propanone Inhibitor
gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Symmetric Biscarbohydrazide
Inhibitor
gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Thiazolhydrazide Inhibitor
gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent
Benzyloxybenzoylcarbohydrazide Inhibitor
gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Peptidomimetic Inhibitor
gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin Sulfate
Complex.
gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
Triazine Ligand
gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
Pyrimidine Inhibitor
gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective 2-Cyano-Pyrimidine
Inhibitor
gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
Inhibitor With A Benzyl P3 Group.
gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
Inhibitor With Improved Selectivity Over Herg
gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
Length = 215
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 5 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 63 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 121
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 179
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 180 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215
>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
Norleucine Aldehyde
Length = 214
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 4 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 61
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 62 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 120
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 121 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 178
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 179 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 214
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 132/214 (61%), Gaps = 6/214 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q C SCW+F GA+EG ++ K KL LS Q L+DCS GNNG
Sbjct: 121 VDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CDGG ++Q++ +G ++ PY+ QD C + A +TGFV + E AL
Sbjct: 181 CDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNECKYQPECSGANVTGFVKIPSTDERALM 240
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYW 315
A+A GP+SVAIDA SF FY +GVYYD +C++S L+H VL VGYG +G+ YW
Sbjct: 241 KAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQ--LNHGVLVVGYGSEGKNGRKYW 298
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG+ GYVLM+ +DN+CG++T +Y
Sbjct: 299 IVKNSWGENWGDNGYVLMAKDEDNHCGIITDASY 332
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/201 (46%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++Q+I
Sbjct: 145 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 204
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY ++ C + A G++++ E LK A+A GPVSVA
Sbjct: 205 KDNKGLDTEASY-PYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATIGPVSVA 263
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C S + LDH VL +GYG +G+ YW VKNSW WGN
Sbjct: 264 IDASHQSFQFYSEGVYYEPEC--SSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNN 321
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K N+CG+ ++ +Y
Sbjct: 322 GYIKMARNKLNHCGIASSASY 342
>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
Ketoamide Warhead
Length = 213
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 3 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 60
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 61 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 119
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYG G +W
Sbjct: 120 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 177
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 178 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 213
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CGV T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/201 (46%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY ++ C + A G++++ +E LK A+A GPVSVA
Sbjct: 199 KDNKGLDTEASY-PYEAENDKCRYNPANSGAIDVGYIDIPTGNEKLLKAAVATIGPVSVA 257
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVYY+ +C S + LDH VL +GYG +G+ YW VKNSW WGN
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNN 315
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K N+CG+ ++ +Y
Sbjct: 316 GYIKMARNKLNHCGIASSASY 336
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 134/204 (65%), Gaps = 8/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGAVEGA+ +K + S+Q L+DCS YGNNGCDGG ++++I
Sbjct: 136 DQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYI 195
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVSV 268
+ + G+ T++ Y PY C + NTT T ++G+ +V SE AL A++K PV+V
Sbjct: 196 IDNDGIATEEAY-PYTATQNRC-VYNTTMLGTAISGYKDVPRGSESALTAAISKQ-PVAV 252
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AIDAS +F Y +GVY + C S L+H VLAVGYG L+GK Y+ VKNSW+ WGNQ
Sbjct: 253 AIDASPITFQLYKSGVYQEATC--SSYRLNHGVLAVGYGTLEGKDYYIVKNSWAETWGNQ 310
Query: 329 GYVLMSIKDNN-CGVMTAPTYVTM 351
GY+LM+ NN CG+ T +Y ++
Sbjct: 311 GYILMARNANNHCGIATMASYASV 334
>gi|392922428|ref|NP_001256719.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
gi|379657173|emb|CCG28194.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
Length = 198
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/198 (45%), Positives = 129/198 (65%), Gaps = 6/198 (3%)
Query: 154 VCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-K 212
+CGSCW+F TGA+EG + K +L LS+Q L+DCS YGN+GC+GG +++++I
Sbjct: 1 MCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDN 60
Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
HG+ T++ Y PY G+D CH T A G+V+ E+ LK+A+A GP+S+AIDA
Sbjct: 61 HGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDA 119
Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYV 331
+SF Y GVYYDE+C S + LDH VL VGYG + + YW VKNSW WG +GY+
Sbjct: 120 GHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYI 177
Query: 332 LMSI-KDNNCGVMTAPTY 348
++ ++N+CGV T +Y
Sbjct: 178 RIARNRNNHCGVATKASY 195
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/213 (46%), Positives = 132/213 (61%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TGA+EG ++ K L LS+Q L+DCS YGN G
Sbjct: 122 VDWREQGYVTGVKDQKQCGSCWAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ T+ Y PY +D C + AT +G+V+V E+AL
Sbjct: 182 CNGGWMDSAFRYIEANGGIDTEASY-PYEAEDWLCRYNPASVGATCSGYVDVNKYDEEAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GPVSVAIDAS SF FY +GVY + C++ LDH VLAVGYG +G YW
Sbjct: 241 KEAVATIGPVSVAIDASHASFQFYTSGVYDEPGCSSIE--LDHGVLAVGYGTENGHDYWL 298
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG GY+ MS K N CG+ +A +Y
Sbjct: 299 VKNSWGRGWGEMGYIKMSRNKHNQCGIASAASY 331
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V ++E ALK A+A GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VL VGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 133/216 (61%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TG++EG + K KL LS+Q L+DCS GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHG 177
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q+I +G+ T+ Y PY ++ C AT +GF ++ SE L
Sbjct: 178 CQGGLMDDAFQYIKDNNGIDTESSY-PYEAKNGKCRFNAANVGATDSGFTDIKSKSESDL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP++VAIDAS SF Y +GVY++ C+ + LDH VLAVGYG GK YW
Sbjct: 237 QSAVATVGPIAVAIDASHMSFQLYKSGVYHEFFCSETR--LDHGVLAVGYGTESGKDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW WG +GY++MS K NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIMMSRNKRNNCGIATSASYPTV 330
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 120/189 (63%), Gaps = 4/189 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ +Q CGSCW+F TTG++EG + +K KL LS+Q L+DCS GN+GCDGG ++
Sbjct: 128 IKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFT 187
Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I K +G+ T+ Y PY G+D C + AT+TGFV+VT SE L+ A A GP+S
Sbjct: 188 YIKKNNGIDTEQSY-PYTGEDGTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPIS 246
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDAS F Y +GVY C+ + LDH VL VGYG DG YW VKNSW T WG+
Sbjct: 247 VAIDASSWDFQLYESGVYDVSDCSTTE--LDHGVLVVGYGTDDGTAYWLVKNSWGTDWGH 304
Query: 328 QGYVLMSIK 336
GY+ MS K
Sbjct: 305 HGYIQMSRK 313
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EG +Y K +L LS+Q LIDCS YGNNGC+GG ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226
Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+ G+ ++ Y PY+ D C T A +TG++N+ E AL A+ GP
Sbjct: 227 RDNEGIDSEISY-PYISGDGDENVRCLFNFTNIMAQVTGYINIHEGDERALMNAVTTIGP 285
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVAI+A SFS Y +G+Y D +C ++ + LDH VL VGYG DGKPYW +KNSW W
Sbjct: 286 VSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++GYV ++ N C V +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCSVASAASY 369
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 139/215 (64%), Gaps = 9/215 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG ++ K KL LS+Q L+DCS GN+G
Sbjct: 152 VDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSG 211
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ ++ G+ T++ Y PY+ D C + A +TG+V++ E AL
Sbjct: 212 CQGGLMDNAFEYVKENGGIDTEESY-PYIAADDTCQYKPQYSGANITGYVDIPSRMEKAL 270
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--PY 314
+ A+A GP+SVAIDA SF FY +GVYY+ +C S + LDH VLAVGYG + GK Y
Sbjct: 271 EKAVATVGPISVAIDAGHSSFQFYRSGVYYEPEC--SSEDLDHGVLAVGYG-VQGKNGKY 327
Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
W VKNSW WG+ GY+LM+ ++N+CG+ TA +Y
Sbjct: 328 WIVKNSWGEEWGDSGYILMARDRNNHCGIATAASY 362
>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
Length = 215
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 131/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 5 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 63 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 121
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE C S D L+HAVLAVGYG G +W
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENC--SSDNLNHAVLAVGYGIQKGNKHWI 179
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 180 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 215
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 168 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 227
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH T AT GF ++ E + A+A GPV
Sbjct: 228 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 286
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
SVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW VKNSW T W
Sbjct: 287 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 344
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ K+N CG+ +A +Y
Sbjct: 345 GDKGFIKMLRNKENQCGIASASSY 368
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 126/201 (62%), Gaps = 7/201 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F G++EG + K KL LS Q L+DCSW GN GCDGG ++Q++
Sbjct: 132 DQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYV 191
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T Y PY + C + AT+TGFVNV +SEDAL A+A GP+SV
Sbjct: 192 KDNGGLDTSVSY-PYEALNGTCRYNPKNSAATVTGFVNVQ-SSEDALMKAVATVGPISVG 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID KSF FY G+YY+ C+++ LDHAVL VGYG E DG+ YW VKNSW WG
Sbjct: 250 IDTKHKSFQFYKEGMYYEPDCSSTV--LDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMN 307
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ ++NNCG+ + +Y
Sbjct: 308 GYIKMAKDRNNNCGIASDASY 328
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 144 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 203
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C + A GFV++ E+ L A+A GP+SVA
Sbjct: 204 KDNGGIDTEKSY-PYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLMQAVATVGPISVA 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDASQ++F FY GVYYDE C+++ LDH V+ VGYG E DG W VKNSW WG
Sbjct: 263 IDASQETFQFYSKGVYYDENCSSTD--LDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGEL 320
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K+N+CG+ ++ +Y
Sbjct: 321 GYIKMARNKNNHCGIASSASY 341
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 137/218 (62%), Gaps = 10/218 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTG++EG ++ K KL LS+Q L+DCS GN G
Sbjct: 121 VDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++I +G ++ PY+ +D C + A TGFV+V E AL
Sbjct: 181 CNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERAL 240
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDG 311
A+A GPVSVAIDAS +F FY +G+YYD C S + LDH VL VGYG + +
Sbjct: 241 MKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDC--SSEELDHGVLVVGYGFEGTDDDNK 298
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSWS WG++GY+LM+ ++N+CG+ TA +Y
Sbjct: 299 KKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASY 336
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/234 (45%), Positives = 138/234 (58%), Gaps = 9/234 (3%)
Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
K +K S ++ N K V + KGY + DQ CGSCW+F TG++EG + K K
Sbjct: 102 KAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGK 161
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
L LS+Q L+DCS Y N GC GG R++Q+I+ G + T+ Y Y D CH
Sbjct: 162 LVSLSEQNLVDCS--YRNYGCHGGFMDRAFQYIIDAGGIDTEATY-SYRAVDGNCHFKKA 218
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
AT+TG+ +VT SE AL+ A+A GP+SVAIDAS K F FY +GVY + C+ +
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTR-- 276
Query: 297 LDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
L HAVL VGYG DG YW VKNSW+ WG GY+ MS KDN CG+ + +Y
Sbjct: 277 LGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASY 330
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 172 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 231
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH T AT GF ++ E + A+A GPV
Sbjct: 232 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 290
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
SVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW VKNSW T W
Sbjct: 291 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 348
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ K+N CG+ +A +Y
Sbjct: 349 GDKGFIKMLRNKENQCGIASASSY 372
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 129/205 (62%), Gaps = 7/205 (3%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ L LS+Q LIDCS YGNNGC+GG ++
Sbjct: 133 AVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAF 192
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Q+I + GL T+ Y PY ++ C + AT G+V++ E+ LK A+A GP+
Sbjct: 193 QYIKDNKGLDTEKTY-PYEAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPI 251
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTY 324
SVAIDAS +SF Y GVYYD C S + LDH VL VGYG E G YW VKNSW
Sbjct: 252 SVAIDASHESFQLYSEGVYYDPDC--SAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKT 309
Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
WG +GY+ M+ K+N+CG+ ++ +Y
Sbjct: 310 WGQKGYIKMARNKNNHCGIASSASY 334
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 142/221 (64%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +G+ + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CGV T+ +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPLM 336
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 127/200 (63%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
D CGSCW+F +TG++ G ++K+KKL LS+Q L+DCS YGN+GCDGG +++Q+I
Sbjct: 129 DPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYI 188
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY +D C + T G+V++ E+ALK A+A+ GP+SVA
Sbjct: 189 KGNGGIDTEGSY-PYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVA 247
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA SF FY G+Y + C+N+ LDH VL VGYG +G+ YW VKNSW WG G
Sbjct: 248 IDAGNLSFQFYSEGIYDEPFCSNTE--LDHGVLVVGYGTENGQDYWLVKNSWGPSWGENG 305
Query: 330 YVLMSIKDNN-CGVMTAPTY 348
Y+ ++ NN CG+ + +Y
Sbjct: 306 YIKIARNHNNHCGIASMASY 325
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 140/221 (63%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS +GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C + LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQ---LDHAVLVVGYGYQGADVAG 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 131/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SDNDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SV IDAS SF FY GVYYDE CN+ D ++HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPISVGIDASLTSFQFYSKGVYYDESCNS--DNVNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 133/221 (60%), Gaps = 8/221 (3%)
Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S D +P V + KG + DQ CGSCWSF TG++EG ++K KL LS+Q L+DC
Sbjct: 110 SNDKLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDC 169
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
S YGNNGC+GG +++Q++ + G+ T+ Y PY +D C T G+V++
Sbjct: 170 SKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSY-PYEARDYACRFKKDKVGGTDKGYVDI 228
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
E AL+ ALA GP+SVAIDAS +SF FY GVY + C S LDH VLAVGYG
Sbjct: 229 PEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYC--SSYDLDHGVLAVGYGT 286
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
+G+ YW VKNSW WG GY+ ++ N+CG+ + +Y
Sbjct: 287 ENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASY 327
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH T AT GF ++ E + A+A GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPV 256
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
SVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G YW VKNSW T W
Sbjct: 257 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTW 314
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ KDN CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKDNQCGIASASSY 338
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/204 (47%), Positives = 133/204 (65%), Gaps = 9/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CG CW+F TTG++EG + K KL LS+Q L+DCS +GN GC GG +++Q+I
Sbjct: 131 DQGECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D C N++ AT+ G+ +V +E ALK A+A GPVSV
Sbjct: 191 PANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
AIDA +SF FY +GVY + +C S + LDH VLAVGYG ++ + +W VKNSW W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G+QGY++MS K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFEYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVY+DE CN+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSRGVYFDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH T AT GF ++ E + A+A GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 256
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
SVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW VKNSW T W
Sbjct: 257 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 314
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ K+N CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKENQCGIASASSY 338
>gi|188501707|gb|ACD54818.1| cathepsin L precursor-like protein [Adineta vaga]
Length = 331
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 8/213 (3%)
Query: 139 VRYEMKGYNSLL-DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + + DQ CGSCW+F TTGA+EG ++ K +L LS+Q L+DCS N G
Sbjct: 121 VDWRTKGYVTYVKDQGQCGSCWAFSTTGALEGQHFAKTSQLIPLSEQNLVDCS--LLNFG 178
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +Y +I M HG+ +++ Y PY + C AT+TG + ++E L
Sbjct: 179 CNGGNQDLAYDYIKMHHGISSEESY-PYWAERDICQFNKIHIAATITGHARIKKHNETDL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP++V+IDA Q SF FY +GVY + C S LDHAVLAVGYG L+ K Y+
Sbjct: 238 QAAIATIGPIAVSIDAGQGSFQFYSSGVYDEPNC--STKRLDHAVLAVGYGTLNSKDYYI 295
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW + WG +GY+LMS K N CG+ T+ Y
Sbjct: 296 VKNSWGSSWGIRGYILMSRNKQNQCGIATSALY 328
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/201 (45%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG +Y K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH T AT GFV++ +E + A+A GPV+VA
Sbjct: 201 KDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVAVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY + C+ LDH VL VG+G + G+ YW VKNSW T WG++
Sbjct: 260 IDASHESFQFYSEGVYNEPACD--AQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDK 317
Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
G++ ++ K+N CG+ +A +Y
Sbjct: 318 GFIKMLRNKENQCGIASASSY 338
>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 282
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 136/218 (62%), Gaps = 9/218 (4%)
Query: 135 DAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
DA+ R E GY + +Q +CGSCW+F TG++EG + KL LS+Q L+DCS +
Sbjct: 67 DAVDWRDE--GYVTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADF 124
Query: 194 GNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
GNNGC+GG +++++ + HG+ T++ Y PY + CH A TGFV++
Sbjct: 125 GNNGCNGGLMDFAFEYVKQNHGIDTEESY-PYKAKQKKCHFQKANVGADDTGFVDLPEAD 183
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDG 311
E+ LK A+A GPVSVAIDA +SF Y GVYY++ C SP+ LDH VL VGYG + +
Sbjct: 184 EEQLKAAVASQGPVSVAIDAGHRSFRLYKTGVYYEKHC--SPEQLDHGVLVVGYGTDPEH 241
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG +GYV ++ ++N+CG+ + +Y
Sbjct: 242 GDYWIVKNSWGEEWGEKGYVRIARNRNNHCGIASKASY 279
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 140/221 (63%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGLMDLAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CGV T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|156368930|ref|XP_001627944.1| predicted protein [Nematostella vectensis]
gi|156214907|gb|EDO35881.1| predicted protein [Nematostella vectensis]
Length = 315
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 112 VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 171
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K GL T YG Y GQ+ +C +N T A +T + V
Sbjct: 172 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 231
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 232 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 288
Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
+G PYW +KNSWS WGN G++
Sbjct: 289 DNGVPYWLIKNSWSHKWGNNGFI 311
>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
Length = 618
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 130/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K +L LS Q L+DC N+G
Sbjct: 408 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGRLLDLSPQNLVDCV--ASNDG 465
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ G+ ++D Y PY+GQD C + T A G+ V E AL
Sbjct: 466 CGGGYMTNAFQYVHDNRGIDSEDAY-PYVGQDEPCRYSPTGKAAKCRGYREVPVGDEKAL 524
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPV+VAIDAS SF FY GVY+DE CN + L+HA+LAVGYG G +W
Sbjct: 525 KRAVARVGPVAVAIDASLSSFQFYSKGVYFDENCNGA--NLNHALLAVGYGAQKGAKHWI 582
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GYVLM+ NN CG+ + ++ M
Sbjct: 583 IKNSWGEEWGNKGYVLMARNKNNACGIASLASFPKM 618
>gi|340368360|ref|XP_003382720.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 326
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 133/220 (60%), Gaps = 5/220 (2%)
Query: 134 KDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
K A V + KG ++ +Q CGSCWSF TTG++EG +++K L LS+Q +DCS
Sbjct: 106 KVAATVDWREKGAVTAIKNQGKCGSCWSFSTTGSLEGQHFLKTGTLLSLSEQQFVDCSTK 165
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
+GN+GC GG +++++ + PY +D +C +T G+ ++ +
Sbjct: 166 FGNHGCKGGTMDNAFRYLETVSGDETEMMYPYTAEDGFCKFRSTEGKVKCEGYKDIPRDD 225
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG- 311
EDAL+ A+A GP+SVAIDA SF Y GVYY+ C+++ LDH VLAVGYG +G
Sbjct: 226 EDALREAVATVGPISVAIDAGHSSFQLYKEGVYYNPTCSSTK--LDHGVLAVGYGTYEGS 283
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
+ YW VKNSW WG +GY++MS ++NNCG+ T +Y T
Sbjct: 284 EEYWLVKNSWGPSWGMEGYIMMSRNRENNCGIATMASYPT 323
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 126/215 (58%), Gaps = 6/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--SENYG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG ++Q++ ++G +D PY+GQD C T A G+ + +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A+ GPVSV+IDAS SF FY GVYYDE C+ D ++HAVL VGYG G YW +
Sbjct: 237 RAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGNKYWII 294
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
KNSW WGN+GYVL++ NN CG+ ++ M
Sbjct: 295 KNSWGESWGNKGYVLLARNKNNACGITNLASFPKM 329
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 139/221 (62%), Gaps = 11/221 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ ++ GL ++ Y PYL +D C A TGFV++ +E A
Sbjct: 179 CNGGLMDLAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKSTGFVDIPSGNEPA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C++S LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CGV T +Y M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 142/227 (62%), Gaps = 11/227 (4%)
Query: 133 SKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
S DA P V + KG + DQ CGSCW+F TG++EG +++K+ +L L++Q L+
Sbjct: 101 STDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSATGSLEGQHFLKYGELVSLAEQQLV 160
Query: 188 DCSWG-YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
DC+ G Y N GC+GG +++++I +G + T+ Y PY +D C + + AT +GF
Sbjct: 161 DCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSY-PYEARDNTCRFNSNSVAATCSGF 219
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V++ SE GP+SVAIDA+ +SF Y +GVYY+ C++S LDHAVLAVG
Sbjct: 220 VSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSSGVYYEPSCSSSQ--LDHAVLAVG 277
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YG G+ +W VKNSW T WG+ GY+ M+ ++NNCG+ T +Y T+
Sbjct: 278 YGSEGGQDFWLVKNSWGTSWGSAGYINMARNRNNNCGIATDASYPTV 324
>gi|156368926|ref|XP_001627942.1| predicted protein [Nematostella vectensis]
gi|156214905|gb|EDO35879.1| predicted protein [Nematostella vectensis]
Length = 310
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 107 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 166
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K GL T YG Y GQ+ +C +N T A +T + V
Sbjct: 167 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 226
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS +Y +G+ D+ C+N DHAVL +GYG
Sbjct: 227 KRFNPIALKKALSYHGPATISINANPKSLKYYSDGIMSDKHCSNK---TDHAVLLIGYGS 283
Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
+G PYW +KNSWS WGN G++
Sbjct: 284 ENGVPYWLIKNSWSNKWGNNGFI 306
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 120/212 (56%), Gaps = 4/212 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF TG++EG ++ K L LS+Q L+DCSW YGN G
Sbjct: 112 VDWRNQGYVTPVKDQGQCGSCWSFSATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYG 171
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +Y +I G + PY Q+ CH + A AT TG V + E +L
Sbjct: 172 CSGGLMESAYDYIRDAGGVQLESAYPYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLM 231
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+ GPV+VAIDAS F Y +GVY +C++S LDH VLA GYG G YW V
Sbjct: 232 QAVGTVGPVAVAIDASGYDFQLYESGVYDRSRCSSSS--LDHGVLAAGYGTEGGNDYWLV 289
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
KNSW WG QGY+ MS K N CG+ T Y
Sbjct: 290 KNSWGPGWGAQGYIKMSRNKSNQCGIATMACY 321
>gi|1272388|gb|AAB17051.1| cysteine protease, partial [Spirometra mansonoides]
Length = 216
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 126/212 (59%), Gaps = 4/212 (1%)
Query: 141 YEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDG 200
+E S+ +Q CGSCWSF GA+EGA +K L LS+Q L+DCSW YGN GC+G
Sbjct: 8 HEKGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKMGILPTLSEQQLVDCSWEYGNQGCNG 67
Query: 201 GEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
G ++Q+ ++G+ + DY Y +D +C A +TG+ + E +L+ A+
Sbjct: 68 GFMSLAFQYAQRYGVEAEVDY-RYTAKDGFCRYQQDMVVANVTGYAELPQGDEASLQRAV 126
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
A GP+SV IDA+ F Y +GV+ + C SPD ++H VL +GYG + +PYW VKNS
Sbjct: 127 AVIGPISVGIDANDPGFMSYSHGVFVSKTC--SPDDINHGVLVIGYGTENDEPYWLVKNS 184
Query: 321 WSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
W WG QGYV M+ NN CG+ + +Y T+
Sbjct: 185 WGRSWGEQGYVKMARNKNNMCGIASVASYPTV 216
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 98/216 (45%), Positives = 130/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE C S D ++HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENC--SSDNVNHAVLAVGYGIQKGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 294 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 329
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C + LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C + LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 132/214 (61%), Gaps = 7/214 (3%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNG
Sbjct: 128 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 187
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ T+ Y PY D CH T AT GF ++ E +
Sbjct: 188 CNGGLMDNAFRYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKM 246
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GPVSVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G YW
Sbjct: 247 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGDDYW 304
Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
VKNSW T WG++G++ ++ K+N CG+ +A +Y
Sbjct: 305 LVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY 338
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 130/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 121 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 178
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++ ++ K G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 179 CGGGYMTNAFHYVQKNQGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYD+ CN+ D L+HAVLAVGYG K +W
Sbjct: 238 KRAVARVGPISVAIDASLTSFQFYSKGVYYDKNCNS--DNLNHAVLAVGYGIQKRKKHWI 295
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 296 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 331
>gi|156355008|ref|XP_001623469.1| predicted protein [Nematostella vectensis]
gi|156210172|gb|EDO31369.1| predicted protein [Nematostella vectensis]
Length = 301
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 82/185 (44%), Positives = 119/185 (64%), Gaps = 8/185 (4%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGGEDFRS 206
Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W +GNNGC GG +++
Sbjct: 116 QGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTWKA 175
Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+ W+ K GL T YG Y GQ+ +C +N T A +T + V + ALK AL+ HGP
Sbjct: 176 FSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALSYHGPA 235
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+++I+A+ KS FY +G+ D+ C+N DHAVL +GYG +G PYW +KNSWS WG
Sbjct: 236 TISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSNNGVPYWLIKNSWSHKWG 292
Query: 327 NQGYV 331
N G++
Sbjct: 293 NNGFI 297
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C + LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF +TGA+EG + K KL +S+Q L+DCS GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q++ ++ GL ++ Y PYL +D C A +TGFV++ +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDAS +S FY +G+YY+ C + LDHAVL VGYG ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSWS WG++GY+ M+ K+N+CG+ T +Y M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|28932702|gb|AAO60045.1| midgut cysteine proteinase 2 [Rhipicephalus appendiculatus]
Length = 564
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/292 (40%), Positives = 168/292 (57%), Gaps = 15/292 (5%)
Query: 69 DFKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
D + +I+R L F+ + N ++ + I + + RL + +G S+ +
Sbjct: 279 DRRRDIFRQNLRFIDSKNRANLGYNLAVNHLADRTREEISVLRGRLQSKDGS--SRAEPF 336
Query: 127 VRYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
R+ +K +P + + + Y ++ DQ+VCGSCWSFGT G +EGAY+ K +L LS+
Sbjct: 337 PRHRFTAK--LPDQIDWRPYGAVTPVKDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSE 394
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATM 242
Q L+DCSW GNNGCDGGEDFR+Y++I HGL + +DYG Y+GQD CH + +T +++
Sbjct: 395 QQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLASDEDYGAYIGQDGVCHDSKVNSTISSI 454
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
+VN+T + D L ALA GPVSV+IDA+ +SFSFY D LDH+VL
Sbjct: 455 KSYVNIT--NRDDLPTALANVGPVSVSIDAALRSFSFYPTVSSMIPTAAMDTDSLDHSVL 512
Query: 303 AVGYGE-LDGKPYWQVKNSWSTYWGN-QGYVLMSIKDNNC-GVMTAPTYVTM 351
L G+PYW VKNSW G GYVL+S K GV T TYV +
Sbjct: 513 RQSATRTLQGEPYWGVKNSWVYLLGEMMGYVLISPKGTTTGGVATQGTYVEL 564
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLL 150
E+++L ++VSKY+LWV + + A+P RY M+GYN+LL
Sbjct: 154 ERFQLTVPTRDRVSKYTLWVSRDSQGR-AVPRRYLMRGYNTLL 195
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 128/201 (63%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F +TGA+EG +Y K L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH + AT GFV++ +E + A+A GPV+VA
Sbjct: 201 KDNGGIDTEKSY-PYEAIDDSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVAVA 259
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +SF FY GVY + C+ LDH VL VG+G + G+ YW VKNSW T WG++
Sbjct: 260 IDASHESFQFYSEGVYNEPACD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 317
Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
G++ ++ K+N CG+ +A +Y
Sbjct: 318 GFIKMLRNKENQCGIASASSY 338
>gi|344258279|gb|EGW14383.1| Cathepsin L1 [Cricetulus griseus]
Length = 295
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 134/217 (61%), Gaps = 8/217 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + GY + DQ CG+CW+F G++ G + K KL LS+Q L+DCSW +GN G
Sbjct: 83 VDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPLSEQNLVDCSWSHGNIG 142
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++M + GL T + Y PY ++ C + A +TGFV + P +E +L
Sbjct: 143 CHGGLMQNAFQYVMDNGGLDTSESY-PYESRNTTCRYNPENSAANVTGFVKI-PANEYSL 200
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GP+S AID SF FY G+YY+ +C++S LDHAVL VGYG E DG+ YW
Sbjct: 201 MKAVAIVGPISAAIDTKHHSFQFYRGGMYYEPECSSS--NLDHAVLVVGYGEESDGRKYW 258
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW TYWG GY+ M+ ++NNCG+ T Y T+
Sbjct: 259 LVKNSWGTYWGMNGYIKMARDRNNNCGIATYAMYPTV 295
>gi|21617827|sp|P09648.1|CATL1_CHICK RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain
Length = 218
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 129/213 (60%), Gaps = 5/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ KL LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGNQG 64
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ +G ++ PY +D C A TGFV++ E AL
Sbjct: 65 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 124
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
A+A GPVSVAIDA SF FY +G+YY+ C S + LDH VL VGYG GK YW
Sbjct: 125 MKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGGKKYWI 182
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 183 VKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 215
>gi|156325009|ref|XP_001618537.1| hypothetical protein NEMVEDRAFT_v1g44827 [Nematostella vectensis]
gi|156199245|gb|EDO26437.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 121/192 (63%), Gaps = 8/192 (4%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W +GNNGC GG
Sbjct: 93 NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPEVYHGNNGCLGG 152
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++++ W+ K GL T YG Y GQ+ +C +N T A +T + V + ALK AL+
Sbjct: 153 WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALS 212
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG +G PYW +KNSW
Sbjct: 213 YHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSDNGVPYWLIKNSW 269
Query: 322 STYWGNQGYVLM 333
S WGN G++ +
Sbjct: 270 SHKWGNNGFIKI 281
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F G++EG + K KL LS+Q L+DCSW YGN GC+GG ++Q++
Sbjct: 150 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 209
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T++ Y Y D C + +TGFV V P SEDAL A+A GPVSV
Sbjct: 210 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 267
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID SF FY G YY+ C+++ LDHAVL VGYG E DG+ YW VKNSW WG
Sbjct: 268 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 325
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ +DNNCG+ T Y T+
Sbjct: 326 GYIKMAKDRDNNCGIATYAIYPTV 349
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F G++EG + K KL LS+Q L+DCSW YGN GC+GG ++Q++
Sbjct: 139 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 198
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T++ Y Y D C + +TGFV V P SEDAL A+A GPVSV
Sbjct: 199 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 256
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID SF FY G YY+ C+++ LDHAVL VGYG E DG+ YW VKNSW WG
Sbjct: 257 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 314
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ +DNNCG+ T Y T+
Sbjct: 315 GYIKMAKDRDNNCGIATYAIYPTV 338
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F G++EG + K KL LS+Q L+DCSW YGN GC+GG ++Q++
Sbjct: 131 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T++ Y Y D C + +TGFV V P SEDAL A+A GPVSV
Sbjct: 191 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID SF FY G YY+ C+++ LDHAVL VGYG E DG+ YW VKNSW WG
Sbjct: 249 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 306
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ +DNNCG+ T Y T+
Sbjct: 307 GYIKMAKDRDNNCGIATYAIYPTV 330
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH T AT GF ++ E + A+A GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPV 256
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
+VAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW VKNSW T W
Sbjct: 257 AVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 314
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ K+N CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKENQCGIASASSY 338
>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
tropicalis]
gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
Length = 329
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 126/215 (58%), Gaps = 6/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q +CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVHNQGICGSCWAFSSVGALEGQLMKKTGKLVSLSPQNLVDCD--TDNYG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++ ++ +G D PY+GQD CH AT G+ + SE ALK
Sbjct: 177 CEGGYMTNAFGYVRDNGGIDSDAEYPYVGQDEGCHYNPADKAATCKGYKEIPVGSEKALK 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GPVSV+IDAS SF FY GVYYD CN PD ++HAVL VGYG G +W +
Sbjct: 237 RAVANVGPVSVSIDASLPSFQFYKKGVYYDSSCN--PDAVNHAVLVVGYGNEKGIKHWII 294
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
KNSW +WG +GYVL++ K N CG+ + ++ M
Sbjct: 295 KNSWGDWWGKKGYVLLARDKKNACGIASLASFPVM 329
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 127/204 (62%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNGC+GG ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I + G+ T+ Y PY D CH AT GF ++ E + A+A GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPV 256
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
+VAIDAS +SF FY GVY + +C+ LDH VL VGYG + G YW VKNSW T W
Sbjct: 257 AVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTW 314
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G++G++ ++ KDN CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKDNQCGIASASSY 338
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F G++EG + K KL LS+Q L+DCSW YGN GC+GG ++Q++
Sbjct: 131 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T++ Y Y D C + +TGFV V P SEDAL A+A GPVSV
Sbjct: 191 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID SF FY G YY+ C+++ LDHAVL VGYG E DG+ YW VKNSW WG
Sbjct: 249 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 306
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ +DNNCG+ T Y T+
Sbjct: 307 GYIKMAKDRDNNCGIATYAIYPTV 330
>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
Length = 376
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/242 (43%), Positives = 136/242 (56%), Gaps = 43/242 (17%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTG+VEGA+ +K KL LS+Q L+DCS GN GCDGG ++
Sbjct: 137 IKDQGQCGSCWSFSTTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGCDGGLMDNAFI 196
Query: 209 WIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+I+++ G+ T+ Y PY Q C T+ AT++G+VN+T SE L+ A+AK+GPV
Sbjct: 197 YIIQNKGIDTESSY-PYKAQSGTKCLFKPTSIGATLSGYVNITAGSESQLETAVAKNGPV 255
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDGKP-------- 313
SVAIDAS SF Y +GVYY+ KC SP LDH VL VGYG E + P
Sbjct: 256 SVAIDASHNSFQLYSSGVYYEPKC--SPTELDHGVLVVGYGVAKKDENNASPNKHQIRIR 313
Query: 314 ------------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG QG++ MS + NNCG+ + +Y
Sbjct: 314 HNDDFGIDEIVTDSSSDDGRKTSQYWLVKNSWGVSWGMQGFIQMSKNRKNNCGIASCASY 373
Query: 349 VT 350
T
Sbjct: 374 PT 375
>gi|297287735|ref|XP_002803218.1| PREDICTED: putative cathepsin L-like protein 6-like [Macaca
mulatta]
Length = 270
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/259 (39%), Positives = 151/259 (58%), Gaps = 12/259 (4%)
Query: 101 LVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGS 157
++ + L ++R V + G + K+ + + IP V + KGY + +Q +CGS
Sbjct: 16 IIGSLALTEFRQVMN-GFQYQKHRKGKEFQEQHLLEIPTSVDWREKGYVTPVKNQGMCGS 74
Query: 158 CWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPT 217
CW+F TGA+EG + K KL LS+Q L+DCSW GN G +GG S++++ ++G
Sbjct: 75 CWAFSATGALEGQMFWKTGKLISLSEQNLVDCSWPQGNEGYNGGFMDNSFRYVQENGGLD 134
Query: 218 QDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
+ PY G+ C + A TGFV++ P+ E L A+A GP+SVA+DAS SF
Sbjct: 135 SEASYPYEGKVKTCRYNPKYSVANDTGFVDI-PSREKDLAKAVATVGPISVAVDASHFSF 193
Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLM 333
FY G+Y++ +C+ P+GLDHA+L VGYG + D YW VKNSW WG GY+ M
Sbjct: 194 QFYKKGIYFEPRCD--PEGLDHAMLTVGYGYEGADSDNNKYWLVKNSWGKNWGMDGYIKM 251
Query: 334 SI-KDNNCGVMTAPTYVTM 351
+ + NNCG+ TA +Y T+
Sbjct: 252 AKDRRNNCGIATAASYPTV 270
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 135/220 (61%), Gaps = 11/220 (5%)
Query: 137 IPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
+ V + KG S + +Q CGSCW+F TTG++EG + K + LS+Q L+DCS YGN
Sbjct: 131 VEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGN 190
Query: 196 NGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
+GC+GG ++++I + G+ T++ Y PY G+D C AT+TGFV + +E
Sbjct: 191 DGCNGGLMTNAFKYIKDNKGIDTEEAY-PYAGRDGDCKFKKNKVGATVTGFVEIPAGNEK 249
Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
L+ ALA GPVSVAIDA+ +SF Y +GVY + +C+++ LDH VLAVGYG + GK Y
Sbjct: 250 KLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPECDSAQ--LDHGVLAVGYGSIHGKDY 307
Query: 315 WQVKNSWSTYWGNQGYVLMSIK------DNNCGVMTAPTY 348
+ VKNSW T WG QGY+ S CG++ +Y
Sbjct: 308 YIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILLDASY 347
>gi|320543907|ref|NP_001188921.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
gi|318068589|gb|ADV37168.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
Length = 249
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 133/214 (62%), Gaps = 7/214 (3%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNG
Sbjct: 36 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 95
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++I + G+ T+ Y PY D CH T AT GF ++ E +
Sbjct: 96 CNGGLMDNAFRYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKM 154
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GPVSVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW
Sbjct: 155 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 212
Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
VKNSW T WG++G++ ++ K+N CG+ +A +Y
Sbjct: 213 LVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY 246
>gi|156354961|ref|XP_001623447.1| predicted protein [Nematostella vectensis]
gi|156210147|gb|EDO31347.1| predicted protein [Nematostella vectensis]
Length = 294
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 91 VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 150
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K G+ T YG Y GQ+ +C +N T A +T + V
Sbjct: 151 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 210
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 211 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 267
Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
+G PYW +KNSWS WGN G++
Sbjct: 268 DNGVPYWLIKNSWSHKWGNNGFI 290
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 133/220 (60%), Gaps = 8/220 (3%)
Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
KD++P V + +GY + DQ CGSCW+F TTG++EG ++ K+L LS+ L+DCS
Sbjct: 105 KDSLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTGSLEGQHFKATKQLVSLSESNLVDCS 164
Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
+GN GC+GG ++++I + G+ T+ Y PY +D C+ AT + ++T
Sbjct: 165 KKWGNQGCNGGLMDNAFKYIADNKGIDTEKSY-PYKPEDRKCNFKKANVGATDKLYKDIT 223
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
SEDAL+ A+A GP+SVAIDAS SF Y GVY ++ C S LDH VLAVGY
Sbjct: 224 SGSEDALQEAVATIGPISVAIDASHDSFQLYSGGVYNEKAC--STKTLDHGVLAVGYDSK 281
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
+G YW VKNSW WG GY+ MS K N CG+ T +Y
Sbjct: 282 NGDDYWIVKNSWGKSWGIDGYIWMSRNKKNQCGIATMASY 321
>gi|327289219|ref|XP_003229322.1| PREDICTED: cathepsin K-like, partial [Anolis carolinensis]
Length = 289
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 127/216 (58%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+E MK KL LS Q L+DC N+G
Sbjct: 79 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEAQLKMKTGKLLNLSPQNLVDCV--SNNDG 136
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ + G+ + D Y PY+GQD C T A G+ + E AL
Sbjct: 137 CGGGYMTNAFEYVHVNRGIDSDDTY-PYIGQDENCMYNPTGKAAKCRGYKEIPEGDEKAL 195
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSV IDAS SF FY GVYYDE CN D ++HAVLAVGYG G +W
Sbjct: 196 KRAVARKGPVSVGIDASLASFQFYSRGVYYDENCN--ADNINHAVLAVGYGSQKGTKHWI 253
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSW WG++GY+LM+ NN CG+ ++ M
Sbjct: 254 VKNSWGEDWGDKGYILMARNMNNACGIANLASFPKM 289
>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
Length = 334
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 130/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF VTP E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVTPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|45384464|ref|NP_990302.1| cathepsin K precursor [Gallus gallus]
gi|25089842|sp|Q90686.1|CATK_CHICK RecName: Full=Cathepsin K; AltName: Full=JTAP-1; Flags: Precursor
gi|1017831|gb|AAC59739.1| JTAP-1 [Gallus gallus]
Length = 334
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/223 (42%), Positives = 132/223 (59%), Gaps = 8/223 (3%)
Query: 132 ASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
+S+ V + KGY + DQ CGSCW+F + GA+EG + KL LS Q L+ C
Sbjct: 117 SSRAPAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCV 176
Query: 191 WGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
NNGC GG +++++ + G+ ++D Y PY+GQD C + T A G+ +
Sbjct: 177 --SNNNGCGGGYMTNAFEYVRLNRGIDSEDAY-PYIGQDESCMYSPTGKAAKCRGYREIP 233
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
++E ALK A+A+ GPVSV IDAS SF FY GVYYD CN P+ ++HAVLAVGYG
Sbjct: 234 EDNEKALKRAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCN--PENINHAVLAVGYGAQ 291
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
G +W +KNSW T WGN+GYVL++ CG+ ++ M
Sbjct: 292 KGTKHWIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASFPKM 334
>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
Length = 316
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/190 (48%), Positives = 125/190 (65%), Gaps = 9/190 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q++
Sbjct: 128 DQGQCGSCWAFSTTGALEGQHFRKSGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 187
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
GL ++D Y PYLG D CH +A TGFV++ E AL A+A GPVSV
Sbjct: 188 KDNQGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDIPSGKEHALMKAVAAVGPVSV 246
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
AIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DGK YW V NSWS
Sbjct: 247 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGQDVDGKKYWIVNNSWSEK 304
Query: 325 WGNQGYVLMS 334
WG++GY+ ++
Sbjct: 305 WGDEGYIYLA 314
>gi|156355022|ref|XP_001623476.1| predicted protein [Nematostella vectensis]
gi|156210179|gb|EDO31376.1| predicted protein [Nematostella vectensis]
Length = 195
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 83/190 (43%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W +GNNGC GG
Sbjct: 7 NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGG 66
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++++ W+ K GL T YG Y GQ+ +C +N T A +T + V + ALK AL+
Sbjct: 67 WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALS 126
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG +G PYW +KNSW
Sbjct: 127 YHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSDNGVPYWLIKNSW 183
Query: 322 STYWGNQGYV 331
S WGN G++
Sbjct: 184 SHKWGNNGFI 193
>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
griseus]
Length = 1632
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 83/198 (41%), Positives = 121/198 (61%), Gaps = 3/198 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC+GG +++++I+
Sbjct: 1432 QGSCGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYIL 1491
Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
G+ +D Y PY G+D +C A A + N+T N E A+ A+A + PVS A
Sbjct: 1492 YNKGIMGEDTY-PYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAF 1550
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+ + F Y G+Y C+ +PD ++HAVLAVGYGE DG PYW VKNSW T WG++GY
Sbjct: 1551 EVTD-DFMLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWGDKGY 1609
Query: 331 VLMSIKDNNCGVMTAPTY 348
L+ N CG+ +Y
Sbjct: 1610 FLIERGKNMCGLAACASY 1627
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 131/216 (60%), Gaps = 10/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY ++ +Q CGSCW+F TGA+EG + K KL LS+Q L+DCSW GN G
Sbjct: 118 VDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I + GL +++ Y PY G+D C ++ A TG+V++ P E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESY-PYFGKDGSCKYKPQSSAANDTGYVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY---GELDGKP 313
A+A GP+SV IDAS +SF FY G+Y++ +C S + LDH VL VGY G
Sbjct: 236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQC--SSEDLDHGVLVVGYGVEGAHSNNK 293
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
YW VKNSW WG GY+ M+ NN CG+ T +Y
Sbjct: 294 YWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASY 329
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ V +E AL
Sbjct: 177 CGGGYMTNAFQYVQQNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREVPVGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GP+SVAIDAS SF FY GVYYDE C+ D L+HAVLAVGYG G +W
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCDG--DNLNHAVLAVGYGIQRGHKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+KNSW WGN+GYVL++ K+N CG+ ++ M
Sbjct: 294 LKNSWGENWGNKGYVLLARNKNNTCGIANLASFPKM 329
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 122/202 (60%), Gaps = 6/202 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG KL LS Q L+DCS YGN+GC+GG R++Q+++
Sbjct: 137 QGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVI 196
Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
HG+ + Y PY+G+D CH T A + + + E+ALK LA GP+SVAI
Sbjct: 197 DNHGIDSDASY-PYIGRDDQCHYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA + FSFY +GVY D C ++H VLAVGYG L+G+ YW VKNSW T +G+QGY
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQK---VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGDQGY 312
Query: 331 VLMSIK-DNNCGVMTAPTYVTM 351
+ M+ N CG+ P Y M
Sbjct: 313 IRMARNTGNQCGIALYPCYPVM 334
>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
Length = 215
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 130/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 5 VDYREKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ K+ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 63 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 121
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN+ D L+HAVLAVGYGE G +W
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGESKGNKHWI 179
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WG GY+ M+ NN CG+ ++ M
Sbjct: 180 IKNSWGENWGMGGYIKMARNKNNACGIANLASFPKM 215
>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
Length = 334
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF VTP E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVTPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKKNHCGIATAASY 331
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 130/216 (60%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F +TG++EG + K+ KL LS+Q L+DCS GN G
Sbjct: 114 VDWRTKGYVTEVKNQGQCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMG 173
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++ +I + G+ T+ Y PY C A TG+ ++ SE L
Sbjct: 174 CGGGLMDQAFTYIKVNDGIDTETSY-PYEAASGKCRFNKANVGANDTGYTDIKSKSESDL 232
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP++VAIDAS SF Y +GVY+ C+ + LDH VLAVGYG GK YW
Sbjct: 233 QSAVATVGPIAVAIDASHMSFQLYKSGVYHYIFCSQTR--LDHGVLAVGYGTDSGKDYWL 290
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW WG QGY++MS +DNNCG+ T +Y T+
Sbjct: 291 VKNSWGATWGQQGYIMMSRNRDNNCGIATQASYPTV 326
>gi|47076309|emb|CAD89795.1| putative cathepsin L protease [Meloidogyne incognita]
Length = 383
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 136/222 (61%), Gaps = 12/222 (5%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
AIP V + KG S+ +Q +CGSCW+F TGA+EG + K L LS+Q LIDC+ G
Sbjct: 162 AIPESVDWRDKGLVTSVKNQGMCGSCWAFSATGALEGQHSRKLGTLVSLSEQNLIDCTKG 221
Query: 193 --YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNV 248
YGN GC+GG ++Q+I + G+ T++ Y PY ++ C + AT TG+V++
Sbjct: 222 EPYGNMGCNGGLMDNAFQYIEDNKGVDTENSY-PYKAKNGKKCLFKRSNVGATDTGYVDL 280
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
ED LK+A+A GP+SVAIDA +SF Y +GVY +E C SPD L H VL VGYG
Sbjct: 281 PSGDEDKLKIAVATQGPISVAIDAGHRSFQLYAHGVYDEEAC--SPDNLGHGVLVVGYGT 338
Query: 309 LD-GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
D YW VKNSW +WG GY+ MS KDN CG+ + +Y
Sbjct: 339 DDIHGDYWLVKNSWGEHWGENGYIRMSRNKDNQCGIASKASY 380
>gi|37963625|gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor]
Length = 330
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 133/218 (61%), Gaps = 7/218 (3%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + DQ CGSCWSF TTGAVEG ++ +L LS+Q LIDCS YGN
Sbjct: 118 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 177
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GCDGG ++ +I +G+ ++ Y PY Q YC ++ + T++G+ ++ E++
Sbjct: 178 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 236
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+ + GPV+VAIDA+ + FY G++YD+ CN S L+H VL VGYG +G+ YW
Sbjct: 237 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 293
Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
+KNSW + WG GY +++ NNCG+ TA +Y +
Sbjct: 294 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 330
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 139/251 (55%), Gaps = 14/251 (5%)
Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
+ GI+L K R K ++ V + D++ R E S+ +Q CGSCWSF
Sbjct: 99 LRGIVLTKLR---------RKEAVSVPLKENLPDSVNWR-ERGAVTSVKNQGQCGSCWSF 148
Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
GA+EGA +K L LS+Q L+DCSW YGN GC+GG +++Q+ ++G+ + DY
Sbjct: 149 SANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY 208
Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
Y +D C A +TG+ + E L+ A+A GP+SV IDA+ F Y
Sbjct: 209 -RYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267
Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-C 340
+GV+ + C SP +DH VL VGYG +G+ YW VKNSW + WG GYV M+ NN C
Sbjct: 268 HGVFVSKTC--SPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSWGEGGYVKMARNRNNMC 325
Query: 341 GVMTAPTYVTM 351
G+ + +Y T+
Sbjct: 326 GIASMASYPTV 336
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 6/204 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F G++EG ++ KL LS+Q L+DCS GN+GC+GG ++++++
Sbjct: 185 NQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYV 244
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ T+D Y PY+G D CH N + AT+ GF++V E+AL+ A+ GPVSVA
Sbjct: 245 KDNHGIDTEDSY-PYVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVA 303
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS F FY GVY C+ S LDH VL VGYG + GK +W VKNSW WG
Sbjct: 304 IDASSMLFQFYRGGVYNVPWCSTSE--LDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIY 361
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ MS K N CG+ + + T+
Sbjct: 362 GYIEMSRNKGNQCGIASKASIPTV 385
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 134/218 (61%), Gaps = 8/218 (3%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P V + GY + +Q C SCW+F TGA+EG + K KL LS+Q L+DCS
Sbjct: 115 ALPNTVNWTQMGYVTEVEEQKQCNSCWAFSATGALEGQTFKKTGKLVSLSKQQLVDCSKK 174
Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
+GNNGC GG +++++ ++ GL T++ Y PY +D C T T TG V +
Sbjct: 175 FGNNGCKGGLMNWAFEYVKENGGLHTEESY-PYEAKDGSCRDNLGTVGVTCTGHVQINSE 233
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
E+AL+ A+A GP+SVAIDA+ SF Y +G+Y + C S ++H VLAVGYG DG
Sbjct: 234 DENALQEAVATIGPISVAIDANHTSFQLYESGLYDEPDC--SCTDMNHGVLAVGYGTDDG 291
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW +KNSW WG++GY+ MS K+N CG+ TA +Y
Sbjct: 292 KDYWLIKNSWGINWGDKGYIKMSRNKNNQCGIATAASY 329
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/222 (43%), Positives = 140/222 (63%), Gaps = 7/222 (3%)
Query: 134 KDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
++ + + +GY + +Q CGSCW+F TTG++EG ++ K K L LS+Q L+DCS+
Sbjct: 109 ENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLEGQHFAKTKNLVSLSEQQLMDCSFK 168
Query: 193 YGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
G+ GC GG ++ +I + G+ ++ DY PY ++ +C N++ AT+TG V+VT
Sbjct: 169 EGDEGCGGGIMDYAFDYIFLAGGVESEADY-PYEARNDHCRFDNSSIAATLTGCVDVTSG 227
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
SE L+ A+ GPVSVAIDAS SF Y +GV Y+ C+ + LDH VLAVGYG +G
Sbjct: 228 SETQLEKAVGSIGPVSVAIDASHISFQLYGSGVNYEPMCSTTT--LDHGVLAVGYGADNG 285
Query: 312 KPYWQVKNSWSTYWGN-QGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG+ GY+ MS ++NNCG+ T +Y T+
Sbjct: 286 NEYWIVKNSWGEGWGHLNGYIKMSKNRNNNCGIATQASYPTV 327
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 126/197 (63%), Gaps = 6/197 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF +G++EG ++ K KL LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 138 DQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYI 197
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PYL +D CH + AT GFV++ +ED LK A+A GP+S+A
Sbjct: 198 KDNGGIDTEQSY-PYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIA 256
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS ++F Y +GVY D +C + LDH VL VGYG DG+ YW VKNSW G
Sbjct: 257 IDASYETFQLYSDGVYSDPECISQE--LDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLN 314
Query: 329 GYVLMSI-KDNNCGVMT 344
GY+ M+ +DN CGV +
Sbjct: 315 GYIKMARNQDNMCGVAS 331
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 98/214 (45%), Positives = 132/214 (61%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F T GA+EG +++K L LS+Q L+DCS GN+G
Sbjct: 121 VDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDG 180
Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG + ++I +G + T+ Y PY G D CH + AT+TGF V +SE AL
Sbjct: 181 CNGGWPAWADEYIKSNGGIDTEVGY-PYEGVDDSCHYRTSDVGATITGFAEVEADSEKAL 239
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
+ ALA+ GP+SV IDA+Q SF Y +GVY + C+++ LDH V AVGY DG Y+
Sbjct: 240 EKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSST--ALDHCVTAVGYDSTADGDKYY 297
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG +GY+ MS K CG+ T TY
Sbjct: 298 IVKNSWGTTWGQEGYIWMSRDKQKQCGIATNATY 331
>gi|86279347|gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio molitor]
Length = 328
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 133/218 (61%), Gaps = 7/218 (3%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + DQ CGSCWSF TTGAVEG ++ +L LS+Q LIDCS YGN
Sbjct: 116 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 175
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GCDGG ++ +I +G+ ++ Y PY Q YC ++ + T++G+ ++ E++
Sbjct: 176 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 234
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+ + GPV+VAIDA+ + FY G++YD+ CN S L+H VL VGYG +G+ YW
Sbjct: 235 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 291
Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
+KNSW + WG GY +++ NNCG+ TA +Y +
Sbjct: 292 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 328
>gi|186688053|gb|ACC86112.1| cathepsin K [Paralichthys olivaceus]
Length = 330
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 105/250 (42%), Positives = 140/250 (56%), Gaps = 18/250 (7%)
Query: 105 IILEKWRLVT-SEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGT 163
+ LE+ R T + E+VSK +V Y K PV+ +Q CGSCW+F +
Sbjct: 96 VPLERQRSFTMALDERVSKLPKFVDYRKEGM-VTPVK----------NQGSCGSCWAFSS 144
Query: 164 TGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGP 223
GA+EG K +L LS Q +DC NNGC GG ++Q++ ++G ++ P
Sbjct: 145 AGALEGQLAKKTGQLMDLSPQNPVDCV--TENNGCGGGYMTNAFQYVQENGGIDSEEAYP 202
Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
Y+G+D C ++ A G+ V E AL +AL K GPVSV IDASQ SF FY G
Sbjct: 203 YVGEDQSCRYNSSGMAAQCKGYKEVPVGDEHALAVALFKVGPVSVGIDASQSSFQFYQRG 262
Query: 284 VYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCG 341
VYYD CN D ++HAVLAVGYG GK YW +KNSWS WG +GY+LM+ +DN CG
Sbjct: 263 VYYDRNCNK--DDINHAVLAVGYGISSKGKKYWIIKNSWSENWGKKGYILMARNRDNLCG 320
Query: 342 VMTAPTYVTM 351
+ +Y M
Sbjct: 321 IANLASYPIM 330
>gi|86279345|gb|ABC88768.1| putative cathepsin L-like proteinase [Tenebrio molitor]
Length = 328
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 130/217 (59%), Gaps = 5/217 (2%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + DQ CGSCWSF TTGAVEG ++ L LS+Q LIDCS YGN
Sbjct: 116 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGGLTSLSEQNLIDCSSSYGN 175
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GCDGG ++ +I +G+ ++ Y PY QD YC ++ + T++G+ ++ E++
Sbjct: 176 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQDDYCRFDSSQSVTTLSGYYDLPSGDENS 234
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+ + GPV+VAIDA+ + FY G++YD+ CN S L+H V VGYG +G+ YW
Sbjct: 235 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVFVVGYGSDNGQDYW 291
Query: 316 QVKNSWSTYWGNQGY-VLMSIKDNNCGVMTAPTYVTM 351
+KNSW + WG GY + NNCG+ TA +Y +
Sbjct: 292 ILKNSWGSGWGENGYWTQVRNYGNNCGIATAASYPAL 328
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 97/222 (43%), Positives = 129/222 (58%), Gaps = 7/222 (3%)
Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
SK V Y KG S+ +Q CGSCW+F GA+EG KL LS Q L+DCS
Sbjct: 113 SKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSG 172
Query: 192 GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
YGN+GC+GG R++Q+++ HG+ + Y PY G+D C T A + + +
Sbjct: 173 KYGNHGCNGGFMTRAFQYVIDNHGIDSDASY-PYTGRDEQCRYNPATRAANCSSYQFLPE 231
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
E+ALK ALA GP+SVAIDA + FSFY +GVY D C ++H VLAVGYG L+
Sbjct: 232 GDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQE---VNHGVLAVGYGSLN 288
Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
G+ YW VKNSW + +G+QGY+ M+ N CG+ Y M
Sbjct: 289 GQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPVM 330
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 126/203 (62%), Gaps = 7/203 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F T+GA+EG +++K+ +L LS+Q ++DCSW + GC+GG+ + +++
Sbjct: 272 DQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDCSWL--DFGCNGGQPMLAMEYV 329
Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
GL + Y PY G CH +A A +TGF SE AL+ A+AK GP+SV
Sbjct: 330 RFNGGLELETAY-PYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVG 388
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+DAS + F Y +G+Y E C S GLDHAVLAVGYG D YW VKNSW+T WG +G
Sbjct: 389 MDASGEDFQHYKSGIYNPESC--SSIGLDHAVLAVGYGTSDDGDYWLVKNSWNTSWGEKG 446
Query: 330 YV-LMSIKDNNCGVMTAPTYVTM 351
Y L K N CG+ T P Y T+
Sbjct: 447 YFKLPRNKGNKCGIATTPIYPTV 469
>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 131/213 (61%), Gaps = 7/213 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG ++ K +L LS+Q L+DCS +GN G
Sbjct: 122 VDWRKKGYVTKVKNQQQCGSCWAFSATGALEGQHFKKTGRLVYLSEQQLVDCSRNFGNRG 181
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG ++++I + G+ T+ Y PY D CH + A G+V+V+P+ E+AL
Sbjct: 182 CDGGWMNNAFKYIKDNGGIQTEASY-PYQAMDGLCHYNPNSVGAICNGYVDVSPD-EEAL 239
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GP+S+A+DAS +SF Y +GVY + +CN+ L H +L VGYG G YW
Sbjct: 240 KEAVATIGPISIAMDASHESFQLYQSGVYDEHRCNDYY--LSHGMLVVGYGTEGGLDYWL 297
Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTY 348
+KNSW WG GY+ M K N CG+ TA +Y
Sbjct: 298 IKNSWGLGWGKMGYIKMVRNKRNQCGIATAASY 330
>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
Length = 358
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 122/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEEAY-PYTGKDGTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ KSF Y +GVY D C N+P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVV-KSFRLYKSGVYTDSHCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354
>gi|449681105|ref|XP_002158608.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 339
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 91/213 (42%), Positives = 131/213 (61%), Gaps = 3/213 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q CGSCW+F +TGA+EG + K L +S+Q L+DC+ YGN
Sbjct: 124 VDWRTEGYVTRVKNQGQCGSCWAFSSTGALEGQTFRKTGVLQEISEQNLVDCTQSYGNEA 183
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I + G+ ++ Y Y YC+ A+ TGFV++ E+AL
Sbjct: 184 CNGGWMDNAFTYIKDNKGIDSEVGYPYYARALGYCYYNQQYNVASDTGFVDIPSGDENAL 243
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K+A+A GP+SVAIDA++ SF Y +GVY + C N + LDHAVL VGYG +G+ +W
Sbjct: 244 KVAVATVGPISVAIDATKASFMSYQSGVYNEPTCGNGIENLDHAVLVVGYGTEEGRDFWI 303
Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW T WG+QGY+ MS N CG+ T +Y
Sbjct: 304 VKNSWDTTWGDQGYIKMSRNMSNQCGIATKASY 336
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 131/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTLKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ ++G ++ PYLG D C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYVKENGGLDSEESYPYLGTDTDSCKYKPECSAANDTGFVDI-PQREKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHQSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGTNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
Length = 334
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVAPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAVDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/202 (46%), Positives = 128/202 (63%), Gaps = 6/202 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTG+ EGA ++K +L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 131 NQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + G+ T+ Y PY C + ++T + +V+ E+AL A+A P SVA
Sbjct: 191 INNKGIDTEASY-PYQTAQYTCQYNPANSGGSLTSYTDVSSGDENALLNAVATE-PTSVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY GVYY+ C+++ LDH VLAVG+G DG+ YW VKNSW WG G
Sbjct: 249 IDASHNSFQFYSGGVYYESACSSTQ--LDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAG 306
Query: 330 YVLMSI-KDNNCGVMTAPTYVT 350
Y+ M+ + NNCG+ T+ +Y T
Sbjct: 307 YIKMARNRSNNCGIATSASYPT 328
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 99/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY G D C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVKDNSGLDSEESY-PYEGMDGTCKYKPECSVANDTGFVDI-PGHEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+S AIDA SF FY +G+YYD C S LDH +L VGYG +
Sbjct: 236 LRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDC--SSKDLDHGILVVGYGFEGTNSNAT 293
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW T WG++GYV ++ KDN+CG+ TA +Y T+
Sbjct: 294 KYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPTV 333
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 125/197 (63%), Gaps = 6/197 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF +G++EG ++ K KL LS+Q L+DCS YGN GC+GG ++++I
Sbjct: 138 DQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYI 197
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PYL +D CH + AT GFV++ +ED LK A+A GPVS+A
Sbjct: 198 KDNGGIDTEQSY-PYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIA 256
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS ++F Y +GVY D +C S LDH VL VGYG DG+ YW VKNSW G
Sbjct: 257 IDASYETFQLYSDGVYSDPEC--SSQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLN 314
Query: 329 GYVLMSI-KDNNCGVMT 344
GY+ M+ +DN CGV +
Sbjct: 315 GYIKMARNQDNMCGVAS 331
>gi|1149525|emb|CAA64218.1| preprocathepsin K [Mus musculus]
Length = 329
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 125/215 (58%), Gaps = 6/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--TENYG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG ++Q++ ++G +D PY+GQD C T A G+ + +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAFPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A+ GP+SV+IDAS SF FY GVYYDE C+ D ++HAVL VGYG G +W +
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGSKHWII 294
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
KNSW WGN+GY L++ NN CG+ ++ M
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFPKM 329
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 96/204 (47%), Positives = 128/204 (62%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F G++EG + K KL LS+Q L+DCSW YGN GC+GG ++Q++
Sbjct: 131 NQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T + Y Y QD C + A +TGFV V P SED L A+A GPVSV
Sbjct: 191 KENRGLDTGESYA-YEAQDGLCRYNPKYSAANVTGFVKV-PLSEDDLMSAVASVGPVSVG 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
ID+ +SF FY G+YY+ C+++ +DHAVL VGYG E DG YW VKNSW WG
Sbjct: 249 IDSHHQSFRFYSGGMYYEPDCSSTE--MDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMD 306
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ ++NNCG+ T Y T+
Sbjct: 307 GYIKMAKDQNNNCGIATYAIYPTV 330
>gi|348531521|ref|XP_003453257.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 99/228 (43%), Positives = 140/228 (61%), Gaps = 7/228 (3%)
Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
S ++R + + V + KGY + +Q CGSCW+F GA+EG + K KL LS
Sbjct: 107 STFLRLPEGTDLPDTVDWRDKGYVTDVQNQMQCGSCWAFSAIGALEGQNFRKTGKLVSLS 166
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS +GN+GC+GG ++++I G+ T+ Y PY ++ CH T AT
Sbjct: 167 KQQLVDCSQSFGNHGCNGGWMDWAFKYIQATGGIDTEASY-PYEAEEGNCHYNPETVGAT 225
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TG+V+V+PN EDALK A+A GP+S+A+DAS +SF FY +GVY + C S HA+
Sbjct: 226 CTGYVDVSPN-EDALKEAVATIGPISIAMDASHESFQFYQSGVYDEPSCITSR--FSHAM 282
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
LAVGYG +G YW VKNS+ WG +GY+ MS K N CG+ + +Y
Sbjct: 283 LAVGYGTENGHDYWLVKNSFGLGWGEKGYIKMSRNKSNQCGIASKASY 330
>gi|340370384|ref|XP_003383726.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
Length = 337
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 90/228 (39%), Positives = 134/228 (58%), Gaps = 8/228 (3%)
Query: 129 YNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
Y++ +P + + N++ DQ CGSC++F GA+EGA + H KL LS+Q
Sbjct: 113 YSRPDDPNLPEEVDWRTKNAVTGVKDQGQCGSCYAFSAVGALEGAQALAHDKLVHLSEQN 172
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
++DCS YGN GC+GG + S+++I+ + G+ +D Y Y G+ C G
Sbjct: 173 IVDCSIPYGNKGCNGGNMYESFRYIIDNDGIDREDGY-KYTGRQGQCKFDRKAIGGRQVG 231
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+++ SE L+ ALA GPVSVAID S +F FY GV+ + C+ + L HA L +
Sbjct: 232 IIHIPTGSEAELQSALATAGPVSVAIDGSSNAFRFYEKGVFDEPNCSTTK--LTHAGLII 289
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
GYG+ GKPYW VKNSW +WG +GY++M+ K N CG+ TA ++ T+
Sbjct: 290 GYGKKKGKPYWLVKNSWGPHWGMKGYIMMARNKANQCGIATAASFPTL 337
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 135/225 (60%), Gaps = 8/225 (3%)
Query: 131 KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A IP ++ + ++ +Q CGSCWSF TTG+ EGA ++K +L LS+Q LI
Sbjct: 108 EAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLI 167
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS YGNNGC+GG ++++I+ + G+ T+ Y C ++TG+
Sbjct: 168 DCSVSYGNNGCNGGLMDYAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYT 227
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
+VT E+AL A K PVSVAIDAS SF FY GVYY+ C+++ LDH VL VG+
Sbjct: 228 DVTSGDENALLNAAVKE-PVSVAIDASHNSFQFYSGGVYYESACSSTQ--LDHGVLVVGW 284
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
G +G+ +W VKNSW WG GY+ MS ++NNCG+ TA +Y T
Sbjct: 285 GSENGQDFWWVKNSWGASWGLNGYIKMSRNQNNNCGIATAASYPT 329
>gi|156354996|ref|XP_001623463.1| predicted protein [Nematostella vectensis]
gi|156210166|gb|EDO31363.1| predicted protein [Nematostella vectensis]
Length = 312
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 124/203 (61%), Gaps = 11/203 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 109 VPDQLDWRDYGAVNPAKGQGTCGSCWPFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 168
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNNGC GG ++++ W+ K GL T YG Y GQ+ +C +N T A +T + V
Sbjct: 169 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 228
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 229 KRFNPIALKKALYYHGPATISINANPKSLKFYSDGIISDKHCSNK---TDHAVLLIGYGS 285
Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
+G PYW ++NSWS WGN G++
Sbjct: 286 DNGVPYWLIRNSWSHKWGNNGFI 308
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 124/202 (61%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG + K +L LS+Q L+DCS YGNNGC+GG ++++I
Sbjct: 197 DQGQCGSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYI 256
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
HG+ T++ Y PY G + CH A G+ ++ E+ALK+A+A GP+SV
Sbjct: 257 KDNHGIDTEESY-PYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEALKVAVATIGPISV 315
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDA SF Y G+Y + +C SP+ LDH VL VGYG + + YW VKNSW T WG
Sbjct: 316 AIDAGHISFQNYRKGIYTENEC--SPEDLDHGVLVVGYGTDENAGDYWIVKNSWGTRWGE 373
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ K N CG+ + +Y
Sbjct: 374 HGYIRMARNKRNQCGIASKASY 395
>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
Length = 333
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 129/216 (59%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ C CW+F TGA+EG + K KL LS+Q L+DCSW GN G
Sbjct: 118 VDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++Q++ +G ++ PYL ++ C + A +T F + N ED L
Sbjct: 178 CNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPIL-NEEDGLM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
+A GPVS A+D+S +SF FY G+YYD KC+N L+H VL VGYG E D K
Sbjct: 237 TTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKL--LNHGVLVVGYGFEGAESDNKK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW T WG QGY+L++ +DN+CG+ T +Y
Sbjct: 295 YWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASY 330
>gi|340374922|ref|XP_003385986.1| PREDICTED: cathepsin L-like [Amphimedon queenslandica]
Length = 325
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/235 (40%), Positives = 141/235 (60%), Gaps = 4/235 (1%)
Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKH 175
EG K + + + N A D + S+ +Q CGSCW+F TGA+E +++
Sbjct: 91 EGFKRNMMNESLHANNAKADTAVDWRDRGAVTSVKNQGQCGSCWAFSATGALEAMHFLNK 150
Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIAN 235
L LS+Q L+DCS+ YGN GC GG R++ ++ G ++ PY+G C ++
Sbjct: 151 GTLVSLSEQQLVDCSYRYGNAGCQGGLMDRAFAYVRNKGYICSEESYPYVGYMWACKSSS 210
Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
+ A+ G+ N+ E++L A++ G VSVAIDAS+ SF FY +GV+YD C++S
Sbjct: 211 CSPAASCRGYKNIASGDEESLTNAVSSVGTVSVAIDASRYSFQFYSSGVFYDSDCSSSR- 269
Query: 296 GLDHAVLAVGYGE-LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
L+H VL VGYG DG Y+ VKNSWS+ WG++GYVLM+ KDNNCG+ +A +Y
Sbjct: 270 -LNHGVLVVGYGTYYDGTEYYIVKNSWSSDWGDKGYVLMARNKDNNCGIASAASY 323
>gi|313235898|emb|CBY11285.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 106/244 (43%), Positives = 140/244 (57%), Gaps = 25/244 (10%)
Query: 126 WVRYN------KASKDAIP------------VRYEMKGY-NSLLDQSVCGSCWSFGTTGA 166
WV YN KA+ D P V + +GY + DQ CGSCW+F TTG+
Sbjct: 84 WVEYNRLRNGAKANSDCKPMPSSSAQANPDAVDWRPQGYVTPIKDQGQCGSCWAFSTTGS 143
Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG-EDFRSYQWIMKHGLPTQDDYGPYL 225
EGA++ K KL +LS+Q L+DCS G++GC+GG DF I G+ T+ Y PY
Sbjct: 144 TEGAHFKKTGKLVMLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTESAY-PYK 202
Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
QD C + TA AT++ +V SE L+ A+A GP+SVAIDA SF Y G+Y
Sbjct: 203 AQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRLYKQGIY 261
Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
+D C+++ LDH VLAVGY YW VKNSW+T WGN+GY+ M+ K N CG+ T
Sbjct: 262 HDRLCSSTR--LDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTCGIAT 319
Query: 345 APTY 348
A +Y
Sbjct: 320 AASY 323
>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
Length = 221
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 5 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 65 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVAQDTGFTVVAPGKEKAL 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 124 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNS 181
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 182 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218
>gi|254746344|emb|CAX16637.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 541
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
+Q CGSCW+F + AVE +K+ + LS+Q+L+DC+WG+ GC+G +++
Sbjct: 343 NQGNCGSCWAFSSVAAVEATLALKNGGRNLELSEQSLVDCAWGFEAMGCNGASPDSGFKY 402
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I++HG+PT +YGPYL ++ +C N + +TGF VTP + + K+ L ++GPV VA
Sbjct: 403 ILEHGVPTDMEYGPYLEKNGFCEARNMSKLYHITGFGRVTPRNPEITKVVLNRYGPVLVA 462
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
I A S Y +GV+YD C NS +H V VGYG DG+ YW VKNSW WG G
Sbjct: 463 IHAGN-SMKLYSSGVFYDVTCENSTP--NHGVALVGYGVRDGELYWMVKNSWGEDWGEDG 519
Query: 330 YVLMSIKDNNCGVMTAPTYV 349
Y+LMS +NNC +M + Y
Sbjct: 520 YILMSATNNNCLLMDSGFYA 539
>gi|313241067|emb|CBY33367.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 106/244 (43%), Positives = 139/244 (56%), Gaps = 25/244 (10%)
Query: 126 WVRYN------KASKDAIP------------VRYEMKGY-NSLLDQSVCGSCWSFGTTGA 166
WV YN KA+ D P V + +GY + DQ CGSCW+F TTG+
Sbjct: 84 WVEYNRLRNGAKANSDCKPMPSSSAEANPDSVDWRNEGYVTPIKDQGQCGSCWAFSTTGS 143
Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG-EDFRSYQWIMKHGLPTQDDYGPYL 225
EGA++ K KL LS+Q L+DCS G++GC+GG DF I G+ T+ Y PY
Sbjct: 144 TEGAHFKKTGKLVTLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTESAY-PYK 202
Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
QD C + TA AT++ +V SE L+ A+A GP+SVAIDA SF Y G+Y
Sbjct: 203 AQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRLYKQGIY 261
Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
+D C+++ LDH VLAVGY YW VKNSW+T WGN+GY+ M+ K N CG+ T
Sbjct: 262 HDRLCSSTR--LDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTCGIAT 319
Query: 345 APTY 348
A +Y
Sbjct: 320 AASY 323
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 96/251 (38%), Positives = 138/251 (54%), Gaps = 14/251 (5%)
Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
+ GI+L K R K ++ V + D++ R E S+ +Q CGSCWSF
Sbjct: 99 LRGIVLTKLR---------RKEAVSVPLKENLPDSVNWR-ERGAVTSVKNQGQCGSCWSF 148
Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
GA+EGA +K L LS+Q L+DCSW YGN GC+GG +++Q+ ++G+ + DY
Sbjct: 149 SANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY 208
Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
Y +D C A +TG+ + E L+ A+A GP+SV IDA+ F Y
Sbjct: 209 -RYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267
Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-C 340
+GV+ + C SP +DH VL VGYG +G YW VKNSW + WG GY+ M+ NN C
Sbjct: 268 HGVFVSKTC--SPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSWGEDGYLKMARNRNNMC 325
Query: 341 GVMTAPTYVTM 351
G+ + +Y T+
Sbjct: 326 GIASMASYPTV 336
>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
Length = 329
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 125/215 (58%), Gaps = 6/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--TENYG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG ++Q++ ++G +D PY+GQD C T A G+ + +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A+ GP+SV+IDAS SF FY GVYYDE C+ D ++HAVL VGYG G +W +
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGSKHWII 294
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
KNSW WGN+GY L++ NN CG+ ++ M
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFPKM 329
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 153 DQGNCGSCWTFSTTGALEAAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 212
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C + +N+T +ED LK A+A PVSVA
Sbjct: 213 KYNGGLDTEEAY-PYTGKDGVCKFTAKNVAVRVIDSINITLGAEDELKQAVAFVRPVSVA 271
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + K F FY NGVY C ++P ++HAVLAVGYG DG PYW +KNSW + WG+ G
Sbjct: 272 FEVA-KDFRFYNNGVYTSTICGSTPMDVNHAVLAVGYGVEDGVPYWIIKNSWGSNWGDNG 330
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 331 YFKMELGKNMCGVATCASY 349
>gi|348545637|ref|XP_003460286.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 133/218 (61%), Gaps = 8/218 (3%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P V + KGY + DQ CGSCW+F TGA+EG ++ K L LS+Q L+DCS
Sbjct: 117 ALPNTVDWRDKGYVTEVKDQKQCGSCWAFSATGALEGQHFRKTGTLVSLSEQQLVDCSSN 176
Query: 193 YGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
+GN+GC GG ++++I G+ T++ Y PY ++ C + AT +G++ V
Sbjct: 177 FGNSGCMGGWMDFAFKYIKYNRGIDTEEFY-PYEAKNGLCRYKRDSIGATCSGYIIVKRF 235
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
E ALK A+A GP+SV IDAS+ SF Y +GVYYD+ C + L+HAVLAVGYG +G
Sbjct: 236 EEQALKEAVATVGPISVTIDASRPSFQLYESGVYYDDGCGSIF--LNHAVLAVGYGTENG 293
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG +GY+ MS K N CG+ + Y
Sbjct: 294 HDYWLVKNSWGLGWGEKGYIRMSRNKKNQCGIASVARY 331
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 95/202 (47%), Positives = 126/202 (62%), Gaps = 6/202 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTG+ EGA ++K L LS+Q LIDCS YGNNGC+GG ++++I
Sbjct: 132 NQGQCGSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI 191
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + G+ T+ Y PY C + ++T + +V+ E+AL A+A P SVA
Sbjct: 192 INNKGIDTEASY-PYETAQYNCRYNPANSGGSLTSYTDVSSGDENALLNAVAIE-PTSVA 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY GVYY+ C S LDH VLAVG+G +G+ YW VKNSW WG QG
Sbjct: 250 IDASHNSFQFYSGGVYYESSC--SSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQG 307
Query: 330 YVLMSI-KDNNCGVMTAPTYVT 350
Y+ M+ + NNCG+ TA +Y T
Sbjct: 308 YIKMARNRHNNCGIATAASYPT 329
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TG++EG +++K L LS+Q L+DCS GN GC GG +++++I
Sbjct: 108 NQEQCGSCWAFSATGSLEGQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYI 167
Query: 211 -MKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
M G+ T++ Y Y G+D + C ++ + AT++ + ++ E AL A++ GP+SV
Sbjct: 168 KMNGGIDTEECY-SYRGRDESMCRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISV 226
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AIDA KSF Y +GVY + KC+++ LDH VLAVGYG +G YW VKNSW T WG +
Sbjct: 227 AIDAGHKSFQLYHHGVYDEPKCSSTH--LDHGVLAVGYGSSNGSDYWLVKNSWGTEWGME 284
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY++MS K N CG+ T Y
Sbjct: 285 GYIMMSRNKHNQCGIATRAIY 305
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 130/214 (60%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG Y+ K+ KL LS+ L+DCS +GN G
Sbjct: 170 VDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNEG 229
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ G+ ++ DY PY + C T AT++G V+V SE +L
Sbjct: 230 CNGGFMENAFKYVKSVGGIESESDY-PYKARQRTCAFDKTKVIATVSGCVDVESGSESSL 288
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
K +++ GPVSVAIDA SF Y GVY + C+ S L+H VL VGYG L GK YW
Sbjct: 289 KEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEPLCSTSR--LNHGVLCVGYGTSLQGKDYW 346
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG +GY+ MS K+N CG+ + +Y
Sbjct: 347 IVKNSWGVRWGVEGYIKMSRNKNNQCGIASEASY 380
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 127/200 (63%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF +TGA+EG + K +L LS+Q L+DCS YGN GC+GG ++++I
Sbjct: 135 NQGSCGSCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYI 194
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ K G+ T+D Y PY GQ C AT TG+ ++ +E ALK A+A GPVSVA
Sbjct: 195 VNKGGIHTEDSY-PYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
I AS +SF Y +GVY + C S LDHAVL VGYG G+ YW VKNSW WG+QG
Sbjct: 254 IHASDQSFQLYHSGVYNNPYC--SGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQG 311
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ MS + N CG+ +A ++
Sbjct: 312 YIKMSRNRYNQCGIASAASF 331
>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
Length = 343
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 126/210 (60%), Gaps = 4/210 (1%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E+ + +Q C CW+F GA+EG + K +L LS Q LIDCS YGN+GC GG
Sbjct: 132 EVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRKTGRLVELSPQNLIDCSTNYGNDGCSGG 191
Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+Y+++ + G+ T+D Y PY ++ C T A TG+V++ E L+ A+
Sbjct: 192 LMNPAYEYVRTNPGIDTEDSY-PYEARNGPCRFRPETVGAYCTGYVDIAEGDEQGLEAAI 250
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
A GPVS A+DA ++SF FY +G+YYD +C N PD ++HAVL VGYG E +G+ YW VKN
Sbjct: 251 ATLGPVSAAMDAGRQSFQFYSDGIYYDPQCGNRPDDVNHAVLVVGYGTEPNGQKYWLVKN 310
Query: 320 SWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
S+ WG GYV ++ NN CG+ +Y
Sbjct: 311 SYGPQWGIGGYVKLAKDANNHCGIAIQASY 340
>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
Length = 334
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/225 (44%), Positives = 137/225 (60%), Gaps = 13/225 (5%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
+IP V + KGY + +Q CGSCW+F TTGA+EG + K KL LS+Q L+DCS
Sbjct: 113 SIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQP 172
Query: 193 YGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
GN GC GG ++Q+++ GL +++ Y PY G C+ + A TGFV++ P
Sbjct: 173 EGNRGCHGGLMDNAFQYVLDVGGLDSEESY-PYTGLVGTCNYNPKNSAANETGFVDL-PK 230
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---- 307
E+AL A+A GP+SVA+DAS SF FY +G+YY+ KC + + +DH VL VGYG
Sbjct: 231 QENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKS--ESVDHGVLVVGYGFEGA 288
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+ D YW VKNSW +WG GY+ M+ NN CG+ T +Y T+
Sbjct: 289 DSDDNKYWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPTV 333
>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
Full=Cathepsin V; Flags: Precursor
gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
Length = 334
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 131/216 (60%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQ+ C T A G+ + +E AL
Sbjct: 177 CGGGYMTNAFQYVQENRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPVGNEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A+ GPVSVAIDAS SF FY GVYYDE CN + L+HA+LAVGYG G +W
Sbjct: 236 KRAVARVGPVSVAIDASLSSFQFYSKGVYYDESCNG--EDLNHALLAVGYGMQRGNKHWI 293
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN+GYVL++ NN CG+ ++ M
Sbjct: 294 LKNSWGENWGNKGYVLLARNKNNACGIANLASFPKM 329
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/286 (36%), Positives = 159/286 (55%), Gaps = 17/286 (5%)
Query: 70 FKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV 127
++ N++R ++ H+N+ G+ L N + ++ ++ ++ + S
Sbjct: 48 YRWNVWRENYLYIEAHNHQNKSFHLAMNKFGD-LTNAEFNKLFKGLSITADQAKQES--- 103
Query: 128 RYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
+ A +P ++ + ++ +Q CGSCWSF TTG+ EGA ++KH +L LS+Q
Sbjct: 104 --DIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQ 161
Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMT 243
L+DCS YGN+GC+GG ++++I+++ G+ T++ Y PY C + +
Sbjct: 162 NLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESY-PYHASQGTCRYNKQHSGGELV 220
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
+ NV +E AL A+A P SVAIDAS SF FY GVY + C++S LDH VLA
Sbjct: 221 SYTNVPSGNEGALLNAVATQ-PTSVAIDASHSSFQFYKGGVYDEPACSSSR--LDHGVLA 277
Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VG+G DGK YW VKNSW WG GY+ MS K N CG+ TA ++
Sbjct: 278 VGWGVRDGKDYWLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASH 323
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG ++Q+I ++ GL +++ Y PY D C + A TGFV++ P E AL
Sbjct: 178 CDGGLMDYAFQYIKENGGLDSEESY-PYDAMDESCKYRPEYSVANDTGFVDI-PKEEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY GVY++ +C S D +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGVYFEPEC--SSDNVDHGVLVVGYGYEETESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+W VKNSW WG GY+ M+ + N+CG+ TA +Y T+
Sbjct: 294 KFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPTV 333
>gi|215259621|gb|ACJ64302.1| cathepsin l [Culex tarsalis]
Length = 125
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/123 (65%), Positives = 97/123 (78%)
Query: 227 QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYY 286
QDAYC + N T A +TG+VNVT DA+K+AL KHGP+S+AIDA KSFS+Y NGVYY
Sbjct: 1 QDAYCRLENKTLYAPITGWVNVTSGDTDAMKVALFKHGPLSIAIDAGHKSFSYYANGVYY 60
Query: 287 DEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
+ +C N+ DGLDHAVLAVGYG+L+G+ YW +KNSWS WGN GY LMS+KDNNCG+ T
Sbjct: 61 EPECKNNLDGLDHAVLAVGYGKLNGEDYWLIKNSWSNLWGNDGYALMSMKDNNCGLATDA 120
Query: 347 TYV 349
TYV
Sbjct: 121 TYV 123
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 125/200 (62%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG +++K L LS+Q L+DCS GN GC GG +++++I
Sbjct: 127 NQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMDQAFKYI 186
Query: 211 MKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G ++ PY G++ C ++ + AT++ +V++ EDAL A A GP+SV
Sbjct: 187 KTNGGIDTEECYPYKGKNERKCEYKSSCSGATLSSYVDIKTGDEDALMQASATIGPISVG 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVY++++C S LDH VL VGYG K YW VKNSW WG +G
Sbjct: 247 IDASHPSFQLYDHGVYHEKRC--SSKKLDHGVLVVGYGTDGEKDYWLVKNSWGEEWGMEG 304
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ MS KDN CG+ T +Y
Sbjct: 305 YIKMSRNKDNQCGIATQASY 324
>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
Length = 334
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 126/216 (58%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +++Q++ ++G ++ PY+ D C + A TGF + P E AL
Sbjct: 178 CNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFTVILPGKEKALM 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 238 KAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNSK 295
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 296 YWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|156354979|ref|XP_001623456.1| predicted protein [Nematostella vectensis]
gi|156210156|gb|EDO31356.1| predicted protein [Nematostella vectensis]
Length = 310
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 124/203 (61%), Gaps = 11/203 (5%)
Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
+P + + + Y N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W
Sbjct: 107 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 166
Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+GNN C GG ++++ W+ K GL T YG Y GQ+ +C +N A +T + V
Sbjct: 167 PGVYHGNNCCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLIVGARITSYRRV 226
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+ ALK AL+ HGP +++I+A+ KS FY +G+ D+ C+N DHAVL +GYG
Sbjct: 227 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 283
Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
+G PYW +KNSWS WGN G++
Sbjct: 284 DNGVPYWLIKNSWSHKWGNNGFI 306
>gi|156394055|ref|XP_001636642.1| predicted protein [Nematostella vectensis]
gi|156223747|gb|EDO44579.1| predicted protein [Nematostella vectensis]
Length = 200
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 121/195 (62%), Gaps = 5/195 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F G +EGAY +K KL S+Q L+DCSWG+GNNGC GG +R W
Sbjct: 9 QGRCGSCWTFSAMGPIEGAYALKTGKLKDFSEQQLLDCSWGFGNNGCRGGFSWRGLVWAR 68
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATM--TGFVNVTPNSEDALKLALAKHGPVSVA 269
K G+ + YG YL Q+ YCH A T T+ + V ++ AL+ LA +GP S+A
Sbjct: 69 KFGVTSSRTYGRYLAQEGYCHCAKTDKCDTVHFERVMTVPKYNQTALRRGLALYGPASIA 128
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
I+ ++KS FY GVY D +C P +H V+ VGYG +G+PYW VKNSW +WG QG
Sbjct: 129 INVNRKSLKFYSGGVYDDPQC---PPNTNHGVVVVGYGVEEGQPYWIVKNSWGKFWGEQG 185
Query: 330 YVLMSIKDNNCGVMT 344
++ +S K N CGV++
Sbjct: 186 FIRISRKKNQCGVVS 200
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++K KL LS+Q L+DCS YGNNGC+GG +++Q++
Sbjct: 131 DQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY ++ C T G V++ E AL+ ALA GP+SVA
Sbjct: 191 SDNKGIDTEASY-PYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVA 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA+ SF FY GVY + C S LDH VLAVGYG +G+ YW VKNSW WG G
Sbjct: 250 IDANHGSFQFYSKGVYNEPNC--SSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENG 307
Query: 330 YVLMSIK-DNNCGVMTAPTY 348
Y+ ++ N+CG+ + +Y
Sbjct: 308 YIKIARNHSNHCGIASMASY 327
>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
Length = 345
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 126/216 (58%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 129 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 188
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +++Q++ ++G ++ PY+ D C + A TGF + P E AL
Sbjct: 189 CNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFTVILPGKEKALM 248
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 249 KAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNSK 306
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 307 YWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 342
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 124/200 (62%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TG++EG ++K +L LS+Q L+DCS YGN+GC+GG +++Q++
Sbjct: 117 DQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYV 176
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY ++ C T G+V++ SE L+ A+A GP+SV
Sbjct: 177 RDNKGIDTEASY-PYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVR 235
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GVY ++ C SP LDH VL VGYG +G+ YW VKNSW WG G
Sbjct: 236 IDASHESFQFYSEGVYKEQYC--SPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESG 293
Query: 330 YVLMSIK-DNNCGVMTAPTY 348
Y+ ++ N+CG+ + +Y
Sbjct: 294 YIKIARNHKNHCGIASMASY 313
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 129/214 (60%), Gaps = 12/214 (5%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E G + +Q +CGSCW+F TTGA+EGA ++ K+L +S+Q L+DC G+ GC+GG
Sbjct: 124 EQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN-GDMGCNGG 182
Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+++W+ H GL ++DY PY ++ C + +T F +V N E ALK A+
Sbjct: 183 LMDNAFKWVKTHKGLCKEEDY-PYHAKEGTCALKKCKPVTKVTAFHDVPANDEQALKAAV 241
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
AK PVSVAI+A Q F FY +GV+ D+ C LDH VL VGYGE GK YW+VKNS
Sbjct: 242 AKQ-PVSVAIEADQPEFQFYKSGVF-DKSCGTK---LDHGVLVVGYGEEGGKKYWKVKNS 296
Query: 321 WSTYWGNQGYVLMSI----KDNNCGVMTAPTYVT 350
W WG++GY+ ++ + CGV P+Y T
Sbjct: 297 WGADWGDKGYIKLAREFGPETGQCGVAMVPSYPT 330
>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
Length = 334
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 178 CNGGFMGKAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVPPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY G+Y++ C S + LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYNQGIYFEPDC--SSENLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|431897851|gb|ELK06685.1| Cathepsin L1 [Pteropus alecto]
Length = 331
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 127/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 116 VDWRQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSQSQGNEG 175
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CDGG ++Q++ +G ++ PYL +D C + A +GFV++ E +L
Sbjct: 176 CDGGLMDNAFQYVKDNGGLDSEESYPYLARDESCKYKPEFSAANDSGFVDIH-KQERSLM 234
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SV IDAS SF FY G+YY+ +C S + L+H VL VGYG E +
Sbjct: 235 KAVASVGPISVGIDASYSSFQFYEKGIYYEPEC--SSEDLNHGVLVVGYGFERAESNKNK 292
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
YW VKNSW T WG GY+ M+ NN CG+ TA +Y
Sbjct: 293 YWIVKNSWGTNWGMNGYINMAKDQNNHCGIATAASY 328
>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
Length = 329
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/215 (43%), Positives = 125/215 (58%), Gaps = 6/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG K KL LS Q L+DC N G
Sbjct: 119 IDYRKKGYVTPVKNQGECGSCWAFSSAGALEGQLKKKTGKLLNLSPQNLVDCV--SENYG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++++ +G +D PY+GQD C T A G+ + SE ALK
Sbjct: 177 CGGGYMTTAFRYVQTNGGIDSEDAYPYVGQDQSCMYNPTAKAAKCRGYREIPVGSEKALK 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A+ GP+SV+IDAS SF FY GVYYDE C+ D ++HAVL VGYG G +W +
Sbjct: 237 RAVARVGPISVSIDASLTSFQFYSRGVYYDENCDG--DNVNHAVLVVGYGAQKGNKHWII 294
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
KNSW WGN+GYVL++ NN CG+ ++ M
Sbjct: 295 KNSWGESWGNKGYVLLARNRNNACGITNLASFPKM 329
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 132/214 (61%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG ++ K KL LS+Q L+DCS GN G
Sbjct: 121 VDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEGNMG 180
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++Q+I+ G+ T+ Y PY D CH AT TG+ +VT SE AL
Sbjct: 181 CEGGLMDQAFQYILDVGGIDTEMSY-PYTAMDGQCHFNKANIGATDTGYTDVTTGSESAL 239
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
++A+A GP+SVAIDAS +SF Y +GVY + C+++ LDH VLAVGYG DG Y+
Sbjct: 240 QMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTL--LDHGVLAVGYGTSSDGTDYF 297
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
+SW WG GY+ MS KDN CG+ T +Y
Sbjct: 298 FFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASY 331
>gi|345493482|ref|XP_001602523.2| PREDICTED: cathepsin L-like [Nasonia vitripennis]
Length = 514
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 125/202 (61%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TG++EG ++ + L LS+Q L+DCS +GN+GCDGG ++ ++
Sbjct: 312 NQGNCGSCWAFSATGSLEGQHFRHNGSLISLSEQNLVDCSGRFGNDGCDGGLMNNAFTYV 371
Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL ++ Y PY +D C + A G+VN+ SE L+ A+A GP+SVA
Sbjct: 372 KVNRGLDSEKSY-PYEAEDDRCRYNPKNSAADDAGYVNIPTGSESKLQAAVATVGPISVA 430
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
IDA SF FY +GVYY+ C+ + LDH VLA+GYG GK +W VKNSW WG
Sbjct: 431 IDADSDSFMFYHSGVYYEPDCSRTD--LDHGVLAIGYGTDSKTGKQFWLVKNSWGEDWGE 488
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
+GY+ MS + NNCG+ TA +Y
Sbjct: 489 KGYIRMSRNRHNNCGIATAASY 510
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 87/195 (44%), Positives = 115/195 (58%), Gaps = 16/195 (8%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TGA+EG ++ + KL LS+Q LIDCS YGNNGC+GG +++
Sbjct: 137 IKDQGHCGSCWSFSATGALEGQHFRQTGKLVSLSEQNLIDCSGKYGNNGCNGGLMDNAFK 196
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I + GL T+ Y PY +D C + A GFV++ E+ LK A+A GPVS
Sbjct: 197 YIRDNKGLDTESTY-PYEAEDDECRYNARNSGAEDVGFVDIPEGDEEKLKAAIATIGPVS 255
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
VAIDAS ++F FY GVYY+ +C+++ LDH VL VGYG DG+ YW+
Sbjct: 256 VAIDASHQTFQFYSTGVYYEPECSSTE--LDHGVLVVGYGTSEDGQDYWK---------- 303
Query: 327 NQGYVLMSIKDNNCG 341
QG V NCG
Sbjct: 304 -QGAVTPVKNQGNCG 317
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 135/222 (60%), Gaps = 14/222 (6%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ + GL +++ Y PYL D + C+ + A TGFV++ P E A
Sbjct: 178 CNGGLMDNAFQYVQDNGGLDSEESY-PYLATDTHTCNYKPECSAANDTGFVDI-PQREKA 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--- 312
L A+A GP+SVAIDA +SF FY +G+YY+ C S LDH VL VGYG +GK
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYEPGC--SSKDLDHGVLLVGYG-FEGKDSE 292
Query: 313 --PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW T WG GYV M+ NN CG+ TA +Y T+
Sbjct: 293 NNKFWIVKNSWGTSWGTNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 126/202 (62%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F +TGA+E + + +L LS+Q LIDCS YGN GC+GG ++Q+I
Sbjct: 178 NQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 237
Query: 211 M-KHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+G+ + DY PY + C AT TGF ++ E+ LK+A+A GP SV
Sbjct: 238 KDNNGVDKELDY-PYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASV 296
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDA +SF Y +GVY++++C SP+ LDH VL VGYG + YW VKNSW +WG
Sbjct: 297 AIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGE 354
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ + NNCG+ + +Y
Sbjct: 355 QGYIRMARNRKNNCGIASHASY 376
>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 126/202 (62%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F +TGA+E + + +L LS+Q LIDCS YGN GC+GG ++Q+I
Sbjct: 178 NQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 237
Query: 211 M-KHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+G+ + DY PY + C AT TGF ++ E+ LK+A+A GP SV
Sbjct: 238 KDNNGVDKELDY-PYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASV 296
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDA +SF Y +GVY++++C SP+ LDH VL VGYG + YW VKNSW +WG
Sbjct: 297 AIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGE 354
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ + NNCG+ + +Y
Sbjct: 355 QGYIRMARNRKNNCGIASHASY 376
>gi|55740404|gb|AAV63978.1| cathepsin L2 precursor [Artemia franciscana]
Length = 226
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 123/199 (61%), Gaps = 3/199 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q C SCW+F +TGA+E + K KL LS+Q LIDCS YGN GC GG +++++I
Sbjct: 26 QGQCASCWAFSSTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIK 85
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T++ Y Y ++ +C A GFVN+ ED LK A+A GPVS I
Sbjct: 86 DNKGIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVI 144
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
D S + F FY GVYY+ C S + L+HAVL +GYG +G+ YW VKNSWS +WG++GY
Sbjct: 145 DVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGYGSDNGEDYWLVKNSWSKHWGDEGY 204
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ ++ + N+CGV TA Y
Sbjct: 205 LKIARNRKNHCGVATAALY 223
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG+ + +Q CGSCWSF TG++EG ++ L LS+Q L+DCS GN+G
Sbjct: 112 VDWSKKGWVTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHG 171
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++++K +G+ T+ Y PY D+ C AT++G+V+VT +SE L
Sbjct: 172 CNGGLMDDAFEYVIKNNGIDTEASY-PYRAVDSTCKFNTADVGATISGYVDVTKDSESDL 230
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
++A+A GPVSVAIDAS SF FY +GVY C S LDH VLAVGYG K YW
Sbjct: 231 QVAVATIGPVSVAIDASHISFQFYSSGVYDPLIC--SSTNLDHGVLAVGYGTDGSKDYWL 288
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
VKNSW WG GY+ M NN CG+ T+ +Y
Sbjct: 289 VKNSWGASWGMSGYIEMVRNHNNKCGIATSASY 321
>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
Length = 337
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/202 (44%), Positives = 124/202 (61%), Gaps = 6/202 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG K KL LS Q L+DCS YGN+GC+GG +++Q+++
Sbjct: 140 QGACGSCWAFSAAGALEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVI 199
Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
G+ ++D Y PY G+D C T A + + + E ALK A+A GP+SVAI
Sbjct: 200 DNQGIDSEDSY-PYRGRDQQCQYNPATRAANCSRYDFLPEGDEQALKEAIATIGPISVAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA + F+FY +GVY D C + ++HAVLAVGYG L G+ YW VKNSW T +G+QGY
Sbjct: 259 DARRPRFAFYRSGVYDDSSCTQN---VNHAVLAVGYGSLGGQDYWLVKNSWGTSFGDQGY 315
Query: 331 VLMSI-KDNNCGVMTAPTYVTM 351
+ M+ K++ CG+ Y M
Sbjct: 316 IRMARNKNDQCGIALYACYPIM 337
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 11/243 (4%)
Query: 118 EKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
+ ++ S RY+ + +A+P V + KG + +Q CGSCW+F TTG++EG ++K
Sbjct: 89 RRSARKSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLK 148
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIA 234
L LS+Q L+DCS YGN+GC GG ++++I +G + PY ++ C
Sbjct: 149 KGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQ 208
Query: 235 NTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSP 294
+ AT TG+ ++ + D L+ A+A GP+SVA+DAS SF Y GVY C+++
Sbjct: 209 QSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTR 268
Query: 295 DGLDHAVLAVGYG-ELDG-----KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
LDH VLAVGYG E G KPYW VKNSW WG QGY + KDN CG+ T +Y
Sbjct: 269 --LDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASY 326
Query: 349 VTM 351
T+
Sbjct: 327 PTV 329
>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 133/213 (62%), Gaps = 7/213 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + Q CGSCW+F TGA+EG ++ K L LS+Q L+DCS Y NNG
Sbjct: 119 VDWREQGYVTDVKHQQQCGSCWAFSATGALEGQHFKKTGTLVPLSEQQLVDCSRKYRNNG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGGE ++Q+I + G+ T+ Y Y +D C + + A G+V+V+P E+AL
Sbjct: 179 CDGGEPNWAFQYIRDNGGVDTEKSY-RYEAKDGQCRYRSNSIGAKCNGYVDVSP-FEEAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
A+A GP+SV+ID S+ SF Y +GVY + C+N L+HAVLAVGYG +G YW
Sbjct: 237 MEAVATIGPISVSIDDSRVSFQLYQSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWL 294
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW + WGN+GY+ M+ K N CG+ T +Y
Sbjct: 295 VKNSWGSGWGNKGYIKMTRNKGNQCGIATEASY 327
>gi|156354971|ref|XP_001623452.1| predicted protein [Nematostella vectensis]
gi|156210152|gb|EDO31352.1| predicted protein [Nematostella vectensis]
Length = 301
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 118/190 (62%), Gaps = 8/190 (4%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
N Q CGSCW+F T GAVE A++++ +L L++Q L+DC+W +GNN C GG
Sbjct: 111 NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNCCLGG 170
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++++ W+ K GL T YG Y GQ+ +C +N T A +T + V + ALK AL+
Sbjct: 171 WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNRTVGARITSYRRVKRFNPIALKKALS 230
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
HGP +++I+A+ KS FY NG+ D+ C+N HAVL +GYG +G PYW +KNSW
Sbjct: 231 YHGPATISINANPKSIKFYSNGIMSDKHCSNK---TYHAVLLIGYGSDNGVPYWLIKNSW 287
Query: 322 STYWGNQGYV 331
S WGN G++
Sbjct: 288 SHKWGNNGFI 297
>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
Length = 359
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 219 KSNGGLDTEEAY-PYTGEDGTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLRPVSIA 277
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 278 FEVIH-SFRLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 336
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 337 YFKMEMGKNMCGIATCASY 355
>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
Length = 328
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 127/217 (58%), Gaps = 5/217 (2%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + K + DQ CGSCWSF TTGAVEG + K L LS+Q L+DCS YGN
Sbjct: 116 AAEVDWRSKAVTEVKDQGQCGSCWSFSTTGAVEGQLAISGKGLTSLSEQNLVDCSSQYGN 175
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GC+GG ++ +I +G+ ++ Y PY D C + + ++ G+ ++ E A
Sbjct: 176 AGCNGGWMDSAFDYIHDNGIMSESAY-PYTAMDGNCRFDASQSVTSLQGYYDIPSGDESA 234
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L+ A+A +GPV+VA+DA+++ Y GV YD C S L+H VL VGYG G+ YW
Sbjct: 235 LQDAVANNGPVAVALDATEE-LQLYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYW 291
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW + WG QGY + ++NNCG+ TA +Y +
Sbjct: 292 IVKNSWGSGWGEQGYWRQARNRNNNCGIATAASYPAL 328
>gi|1705639|sp|Q10991.1|CATL1_SHEEP RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
Length = 217
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 131/217 (60%), Gaps = 8/217 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+D S GN G
Sbjct: 5 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I ++ GL +++ Y PY D C+ + A TGFV++ P E AL
Sbjct: 65 CNGGLMDNAFQYIKENGGLDSEESY-PYEATDTSCNYKPEYSAAKDTGFVDI-PQREKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG E +W
Sbjct: 123 MKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTNNKFW 180
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSW WGN+GYV M+ NN CG+ TA +Y T+
Sbjct: 181 IVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 123/199 (61%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ +CGSCW+F TTG+VE A+++K L LS+Q L+DC+ GC GG ++ ++I
Sbjct: 127 DQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTCY-GCGGGWMDKALEYI 185
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
K G+ ++ DY PY G D C + A ++ F + N E+ LK A+A GP+SVAI
Sbjct: 186 EKGGIMSEKDY-PYEGVDDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAI 244
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS +F YV+G+ D +C+N D L+H VL VGYG +GK YW +KNSW WG GY
Sbjct: 245 DASA-TFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTENGKDYWIIKNSWGVNWGMDGY 303
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ MS K+N CG+ T Y
Sbjct: 304 IRMSRNKNNQCGITTDGVY 322
>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
Length = 333
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 127/203 (62%), Gaps = 6/203 (2%)
Query: 149 LLDQSVCGSCWSF-GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
+ +Q CGSCW+F ++EG + +K KL LS+Q L+DCS GN GC+GG +++
Sbjct: 131 IKNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAF 190
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Q+++ + G+ T+ Y PY D + AT+ +V+V SE +L+ A+A GP+
Sbjct: 191 QYVIANKGIDTEMSY-PYKAIDESWEFKKNSVGATIKSYVDVKTGSESSLQSAVATVGPI 249
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
SV IDASQ SF FY +GVY + C+ + LDH V AVGYG L+G PYW+VKNSW T WG
Sbjct: 250 SVGIDASQLSFQFYSSGVYEEPACSTTI--LDHGVTAVGYGALNGTPYWKVKNSWGTSWG 307
Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
GY+ MS K N CG+ TA ++
Sbjct: 308 MSGYIFMSRNKQNQCGIATAASW 330
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ ++ GL +++ Y PY+ D C + + A TGF V E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRSENSVANDTGFEVVPAGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG+ GYV + KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
Length = 335
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ +Q+ CGSCW+F +TG++EGA KL S+Q L+DCS +GN+GC+GG S+
Sbjct: 132 AVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSF 191
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+++ + GL ++ Y PY Q C + T++ F +V+ E LK A+ GPV
Sbjct: 192 NYLIHNKGLESEASY-PYEAQKKECRYKKALSKGTISSFTDVSQFDEKDLKRAVGLVGPV 250
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYW 325
S+AIDASQ SF Y +GVY +E C+ + L+H VLAVGYG +G YW+VKNSW+ W
Sbjct: 251 SIAIDASQFSFHLYDSGVYDEEDCSQTM--LNHGVLAVGYGTTPEGLDYWKVKNSWTNTW 308
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G +GY+LMS KDN CGV T +Y
Sbjct: 309 GMEGYILMSRNKDNQCGVATVASY 332
>gi|377656292|pdb|3QT4|A Chain A, Structure Of Digestive Procathepsin L 3 Of Tenebrio
Molitor Larval Midgut
Length = 329
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 132/218 (60%), Gaps = 7/218 (3%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + DQ CGS WSF TTGAVEG ++ +L LS+Q LIDCS YGN
Sbjct: 117 AASVDWRSNAVSEVKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 176
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GCDGG ++ +I +G+ ++ Y PY Q YC ++ + T++G+ ++ E++
Sbjct: 177 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+ + GPV+VAIDA+ + FY G++YD+ CN S L+H VL VGYG +G+ YW
Sbjct: 236 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 292
Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
+KNSW + WG GY +++ NNCG+ TA +Y +
Sbjct: 293 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 329
>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 359
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 219 KSNGGLDTEEAY-PYTGEDGTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVRPVSIA 277
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 278 FEVIH-SFRLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 336
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 337 YFKMEMGKNMCGIATCASY 355
>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
Length = 334
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ ++ GL +++ Y PY+ D C + + A TGF V E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRSENSVANDTGFKVVPAGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG+ GYV + KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
Length = 358
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C ++P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCSW GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|118404242|ref|NP_001072435.1| cathepsin K precursor [Xenopus (Silurana) tropicalis]
gi|113197688|gb|AAI21683.1| hypothetical protein MGC147539 [Xenopus (Silurana) tropicalis]
Length = 331
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 106/254 (41%), Positives = 145/254 (57%), Gaps = 20/254 (7%)
Query: 113 VTSEGEKVSKYSLWV-RYNKAS---------KDAIP--VRYEMKGY-NSLLDQSVCGSCW 159
+TSE + LWV R N+++ ++ IP + Y KGY + +Q CGSCW
Sbjct: 83 MTSEEVVRTMTGLWVHRRNRSTNFTSEDNTAQEKIPDSIDYRKKGYVTPIRNQGSCGSCW 142
Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQ 218
+F + GA+EG K KL LS Q L+DC N+GC GG +++++ + G+ ++
Sbjct: 143 AFSSVGALEGQLKKKKGKLVDLSPQNLVDCV--KKNDGCGGGYMTNAFEYVRDNKGIDSE 200
Query: 219 DDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFS 278
+ Y PY+G+D C T A+ GF V SE ALK A+ GPVSV IDA SF
Sbjct: 201 NAY-PYVGEDQECMYNATGKAASCKGFKEVQEGSEKALKKAVGLVGPVSVGIDAGLSSFQ 259
Query: 279 FYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KD 337
FY GVYYD+ CN + ++HAVLAVGYG YW VKNSW WGN+GY+LM+ KD
Sbjct: 260 FYSKGVYYDKDCN--AENINHAVLAVGYGTQKKTKYWIVKNSWGEDWGNKGYILMAREKD 317
Query: 338 NNCGVMTAPTYVTM 351
N CG+ + +Y M
Sbjct: 318 NACGISSLASYPVM 331
>gi|1093503|prf||2104214A Cys protease
Length = 255
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 130/214 (60%), Gaps = 7/214 (3%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNG
Sbjct: 42 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 101
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I + G+ T+ Y PY D CH AT GF ++ E +
Sbjct: 102 CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 160
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GPVSVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW
Sbjct: 161 PEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 218
Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
VKNSW T WG++G++ ++ K+N CG+ + +Y
Sbjct: 219 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 252
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 132/209 (63%), Gaps = 14/209 (6%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
G + +Q +CGSCW+F TTGAVEGA ++ KL LS+Q L+DC G+ GC+GG
Sbjct: 128 GVTPVKNQGMCGSCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHN-GDMGCNGGLMD 186
Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++QWI HG + ++DDY Y + C ++ +TGF +V P E ALK+A+A+
Sbjct: 187 HAFQWIEDHGGICSEDDY-EYKAKAQVCRKCDSVVK--VTGFQDVNPQDEHALKVAVAQQ 243
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
PVSVAI+A QK+F FY +GV+ + C LDH VLAVGYG +G+ +W+VKNSW
Sbjct: 244 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGNDNGQKFWKVKNSWGA 298
Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
WG QGY+ ++ ++N CG+ + P+Y
Sbjct: 299 SWGEQGYIRLAREENGPAGQCGIASVPSY 327
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TTG+VEG + K L LS+Q L+DCS GN GC+GG ++++I
Sbjct: 125 NQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYI 184
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+K+ G+ T+ Y PY C AT+ + ++ SE L+ A+A GPVSVA
Sbjct: 185 IKNGGIDTEASY-PYTATTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVA 243
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDAS +F FY GVY ++KC+ + LDH VLAVGYG +GK YW VKNSW WG
Sbjct: 244 IDASHINFQFYFTGVYNEKKCSTTQ--LDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKA 301
Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
GY+ MS DN CG+ T+ +Y
Sbjct: 302 GYIWMSRNADNQCGIATSASY 322
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 125/203 (61%), Gaps = 6/203 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++
Sbjct: 139 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 198
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I + G+ T+ Y PY G D C A GFV++ E L A+A GPVS
Sbjct: 199 YIKDNGGIDTEQTY-PYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS 257
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
VAIDAS SF Y +GVY +E+C+++ LDH VL VGYG + G YW VKNSW WG
Sbjct: 258 VAIDASHTSFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWLVKNSWGRSWG 315
Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
GY+ ++ K+N CG+ ++ +Y
Sbjct: 316 ELGYIKMIRNKNNRCGIASSASY 338
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 135/232 (58%), Gaps = 6/232 (2%)
Query: 120 VSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKL 178
+ + L +Y S+ V + KGY + +Q CGSC++F TGAVEG ++ K KL
Sbjct: 102 IERRGLVFKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEGQHFRKTGKL 161
Query: 179 AVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTT 237
LS+Q ++DCS+ GN GC GG +S+ +I +G+ T++ Y PY +D C +
Sbjct: 162 VSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAY-PYEARDGPCRFRRSE 220
Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
AT+ G+V++ N E AL+ A+ GP+SVAID +F FY +GV+ + C S +
Sbjct: 221 VGATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDNPNC--SKTKI 278
Query: 298 DHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
+H VL VGYG DG YW VKNSW WG +GY+LMS DN C + A +Y
Sbjct: 279 NHGVLVVGYGTRDGLDYWLVKNSWGERWGAEGYILMSRNNDNQCCITCAASY 330
>gi|449679414|ref|XP_002161570.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 353
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 129/214 (60%), Gaps = 5/214 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q +C SCW+F TTGA+EG + K L LS+Q L+DCS YGN G
Sbjct: 138 VDWRTEGYVTPVKNQGLCRSCWAFSTTGALEGQTFRKTGILPTLSEQNLVDCSKSYGNQG 197
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDA 255
CDGG ++++I + GL +++ Y PY ++ YC+ A+ +GFV + EDA
Sbjct: 198 CDGGWTNNAFEYIKDNDGLDSENGY-PYDAKELGYCYYDEKYKEASDSGFVEIPYGDEDA 256
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
LK A+A GP++V IDAS+ SF Y +GVY + C N L HAVL VGYG G +W
Sbjct: 257 LKEAVATVGPIAVNIDASKPSFQSYKSGVYNEPTCGNGITNLTHAVLVVGYGTEKGHKFW 316
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WG+ GY+ MS K N CG+ T ++
Sbjct: 317 LVKNSWGKTWGDHGYIKMSRNKSNQCGIATRASF 350
>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
Length = 350
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C + + G VN+T SED LK A+A PVSVA
Sbjct: 210 KYNGGLETEETY-PYTGSNGLCKFTSENVALKVLGSVNITLGSEDELKHAVAFARPVSVA 268
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY C N+P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGNTPMDVNHAVLAVGYGIEDGIPYWHIKNSWGGDWGDHG 327
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 128/217 (58%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY QD C + A TGF ++ P E AL
Sbjct: 178 CNGGLMNNAFQYVKDNGGLDSEESY-PYHAQDESCKYKPQDSAANDTGFFDI-PQQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG----K 312
+A+A GP+SV IDAS +F FY G+YYD C S + LDH VL +GYG G K
Sbjct: 236 MVAVATKGPISVGIDASHFTFQFYHEGIYYDPDC--SSEDLDHGVLVIGYGTEIGQSINK 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG GY+ M+ + N+CG+ T ++
Sbjct: 294 TYWIVKNSWGANWGIDGYIKMAKDRKNHCGIATMASF 330
>gi|218478060|dbj|BAH03396.1| cathepsin L-like cysteine peptidase [Taenia saginata]
Length = 338
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 124/200 (62%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EGA+ K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+H + + Y PY D C + T+T ++ +E AL A+A GP+S+AI
Sbjct: 200 EEHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLA+GYG+ DGKPYW VKNSW T WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGTRWGMKGY 316
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CGV + +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335
>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
Length = 336
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 138/241 (57%), Gaps = 2/241 (0%)
Query: 109 KWRLVTSEGEKVSK-YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAV 167
K R + SE + S S ++R + ++ R + + + + +Q CGSCW+F TTGA+
Sbjct: 92 KRRYLWSEPQNCSATKSNYLRGHGPYPTSVDWRKKGRFVSPVKNQGGCGSCWTFSTTGAL 151
Query: 168 EGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ 227
E A +K K+ LS+Q L+DC+ + N+GC GG +++++I + ++D PY G+
Sbjct: 152 ESAIAIKTGKMLSLSEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMEEDSYPYEGK 211
Query: 228 DAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYD 287
D+ C A A + N+T N E A+ A+A + PVS A + + F Y G+Y
Sbjct: 212 DSNCRFQPEKAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTS-DFMLYRKGIYSS 270
Query: 288 EKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPT 347
C+ +PD ++HAVLAVGYGE +GKPYW VKNSW YWG GY L+ N CG+ +
Sbjct: 271 TSCHKTPDKVNHAVLAVGYGEQNGKPYWIVKNSWGPYWGMNGYFLIERGTNMCGLAACAS 330
Query: 348 Y 348
Y
Sbjct: 331 Y 331
>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
Full=Senescence-associated gene product 2; Flags:
Precursor
gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 358
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C ++P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354
>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
Length = 942
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 92/200 (46%), Positives = 122/200 (61%), Gaps = 7/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG +EG ++ K KKL LS+Q L+DC N GC GG +Y++I
Sbjct: 745 NQGYCGSCWAFATTGGLEGQHFRKTKKLVSLSEQNLVDCC--KENLGCTGGLPVTAYKYI 802
Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++G + T++ Y PYLG++ C AT GFV V E L+ A+A GPV+V+
Sbjct: 803 ARNGGIDTEESY-PYLGKNGNCTFRPPKIGATCQGFVRVPAGDEVGLQKAVASVGPVTVS 861
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS KSF Y GVY D+KC S +H VL VGYG+ GK YW VKNSW +G G
Sbjct: 862 IDASLKSFYLYKEGVYDDKKC--SKKMFNHFVLIVGYGKHLGKEYWLVKNSWGMSFGMDG 919
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y++M+ +DN CG+ P Y
Sbjct: 920 YIMMARNQDNQCGISNQPVY 939
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ ++ GL +++ Y PY+ D C + A TGF V E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRPENSVANDTGFEVVPAGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG+ GYV + KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
Bound To Cathepsin K
Length = 215
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/216 (44%), Positives = 127/216 (58%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + +Q CGSCW+F + GA+EG L L+ Q L+DC N+G
Sbjct: 5 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCV--SENDG 62
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ ++ G+ ++D Y PY+GQD C T A G+ + +E AL
Sbjct: 63 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEAAL 121
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+A GPVSVAIDAS SF FY GVYYDE C S D L+HAVLAVGYG G +W
Sbjct: 122 KRAVAAVGPVSVAIDASLTSFQFYSAGVYYDENC--SSDALNHAVLAVGYGIQAGNKHWI 179
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+KNSW WGN GY+LM+ NN CG+ ++ M
Sbjct: 180 IKNSWGESWGNAGYILMARNKNNACGIANLASFPKM 215
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 209
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY GQ+ C + + G VN+T +ED LK A+A PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGQNGPCKFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVSVA 268
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y GVY C N+P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 269 FEVVD-DFRLYKKGVYTSTTCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWGDHG 327
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346
>gi|312381834|gb|EFR27484.1| hypothetical protein AND_05795 [Anopheles darlingi]
Length = 508
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 124/201 (61%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGAVEG ++ K KL LS+Q L+DC+ Y N GC GG +RS+Q+I
Sbjct: 308 EQGKCGSCWAFSSTGAVEGQHFRKTNKLVSLSEQNLVDCTSNYRNKGCKGGAIYRSFQYI 367
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ HG+ T+ Y PY ++ C A + G+V++ EDAL A+A GP+S+
Sbjct: 368 EQNHGIDTEKSY-PYQAKEGPCAYNPKAIGAKVKGYVHIPTGDEDALMKAVATVGPISIV 426
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
+D+ +F Y +GVYYD +C S L HA+L VGYG G+ +W VKNSW T WG +
Sbjct: 427 VDSRHHTFKHYADGVYYDSQC--SATNLTHAMLVVGYGTSKKGEDFWLVKNSWGTSWGIK 484
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ ++N+CG+ Y
Sbjct: 485 GYIKMARNRNNSCGIANKAYY 505
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/55 (50%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
EDAL A+A GP+++++DA KSF Y GVYY+ +CN P L H++L VGYG
Sbjct: 253 EDALMKAVATIGPIAISMDAGHKSFQSYGAGVYYEPQCN--PSSLTHSMLVVGYG 305
>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
Length = 334
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 133/221 (60%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY ++ +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ + GL T++ Y PYLG++ C + A TGFV++ P E A
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDTEESY-PYLGRETNSCTYKPECSAANDTGFVDI-PQREKA 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 236 LMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNS 293
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 294 SKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>gi|2239109|emb|CAA70694.1| cathepsin S-like cysteine proteinase [Heterodera glycines]
Length = 353
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 9/203 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKH-KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
DQ CGSCW+F TGA+EGA K K+ LS+Q L+DCS YGN GCDGG ++++
Sbjct: 152 DQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEY 211
Query: 210 IM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ +GL T++ Y PY C N T T+ F ++ E+ LK+A+A GP+SV
Sbjct: 212 VRDNNGLDTEESY-PYEAVTGKCQFKNETVGGTVVSFKDLKKGDEEQLKIAVATIGPISV 270
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWG 326
A+DAS SF FY GVYY+ C+N LDH VL VGYG E G YW VKNSW +WG
Sbjct: 271 ALDASNLSFQFYKTGVYYERWCSNRY--LDHGVLLVGYGTDETHGD-YWLVKNSWGPHWG 327
Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
GY+ ++ K N+CG+ T +Y
Sbjct: 328 ENGYIRIARNKQNHCGIATMASY 350
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 131/209 (62%), Gaps = 14/209 (6%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
G + +Q +CGSCW+F TTGAVEGA ++ KL LS+Q L+DC G+ GC+GG
Sbjct: 128 GVTPVKNQGMCGSCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHN-GDMGCNGGLMD 186
Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++QWI HG + ++DDY Y + C ++ +TGF +V P E ALK+A+A+
Sbjct: 187 HAFQWIEDHGGICSEDDY-EYKAKAQVCRECDSVVK--VTGFQDVNPQDEHALKVAVAQQ 243
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
PVSVAI+A QK+F FY +GV+ + C LDH VLAVGYG +G +W+VKNSW
Sbjct: 244 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGNDNGHKFWKVKNSWGA 298
Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
WG QGY+ ++ ++N CG+ + P+Y
Sbjct: 299 SWGEQGYIRLAREENGPAGQCGIASVPSY 327
>gi|1698586|gb|AAB37252.1| cathepsin L, partial [Oncorhynchus mykiss]
Length = 196
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/196 (47%), Positives = 129/196 (65%), Gaps = 10/196 (5%)
Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQ 218
+F TTGA+EG + K KL LS+Q L+DCS GN GC+GG +++Q+I + GL T+
Sbjct: 1 AFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTE 60
Query: 219 DDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
+ Y PY+G D CH + A TGFV++ E A+ A+A GPVSVAIDA +SF
Sbjct: 61 ESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESF 119
Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLM 333
FY +G+YY+++C S + LDH VL VGYG ++DGK YW VKNSWS WG++GY+ M
Sbjct: 120 QFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYM 177
Query: 334 SI-KDNNCGVMTAPTY 348
+ + N+CG+ TA +Y
Sbjct: 178 AKDRKNHCGIATASSY 193
>gi|291383484|ref|XP_002708316.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 135/232 (58%), Gaps = 13/232 (5%)
Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+ KA +P V + KGY + Q CGSCW+F TGA+EG + K +L LS+Q
Sbjct: 106 FGKAVFQEVPSSVDWRDKGYVTPVKKQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQN 165
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
LIDCSW GN+GC GG ++Q++ + GL ++D Y PY ++ C + A TG
Sbjct: 166 LIDCSWPAGNHGCRGGLTDHAFQYVKDNGGLDSEDSY-PYEARNLPCRYDPQKSVANGTG 224
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
FV + P E+AL A+A GP++VAIDA SF FY G+YY+ C S +HAVL V
Sbjct: 225 FVRI-PRQENALMEAVATVGPIAVAIDAGHPSFQFYKEGIYYEPNC--SSKHHNHAVLVV 281
Query: 305 GYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
GYG E D YW VKNSW WG GY+ ++ ++N+CG+ + +Y T+
Sbjct: 282 GYGYEGAESDSNKYWLVKNSWGKRWGEAGYIRIAKDRNNHCGIASHASYPTV 333
>gi|189053498|dbj|BAG35664.1| unnamed protein product [Homo sapiens]
Length = 334
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/217 (43%), Positives = 128/217 (58%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q C SCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCVSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG +
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|261824891|pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824892|pdb|3H6S|B Chain B, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824893|pdb|3H6S|C Chain C, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824894|pdb|3H6S|D Chain D, Strucure Of Clitocypin - Cathepsin V Complex
gi|310942696|pdb|3KFQ|A Chain A, Unreduced Cathepsin V In Complex With Stefin A
gi|310942697|pdb|3KFQ|B Chain B, Unreduced Cathepsin V In Complex With Stefin A
Length = 221
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 128/217 (58%), Gaps = 10/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 5 VDWRKKGYVTPVKNQKQCGSXWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++Q++ ++ GL +++ Y PY+ D C + A TGF V P E AL
Sbjct: 65 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVAQDTGFTVVAPGKEKAL 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 124 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNS 181
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW VKNSW WG+ GYV + K+N+CG+ TA +Y
Sbjct: 182 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ ++ GL +++ Y PY+ D C + A TGF V E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRPENSVANDTGFEVVPAGKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA SF FY +G+Y++ C S LDH VL VGYG D
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG+ GYV + KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|330801846|ref|XP_003288934.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
gi|325081026|gb|EGC34558.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
Length = 334
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 122/214 (57%), Gaps = 7/214 (3%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E + + DQ CGSC+SF GA E AYY+ + ++ LS+Q ++DCS YGN GC+GG
Sbjct: 122 EKGAVHDIKDQGHCGSCYSFSAIGAAESAYYIANGEMLTLSEQNILDCSRSYGNEGCNGG 181
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI--ANTTATATMTGFVNVTPNSEDALKLA 259
S+Q+++ G + PY +DA C T AT G V + E L+ A
Sbjct: 182 YMLESFQFLLDQGGAVSEASYPYEAKDASCRFDSVKTPIVATFNGTVEIRRGDEGDLQQA 241
Query: 260 LAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY--GELDGKPYWQV 317
+A HGPV+VAIDA SF Y GVYY+ C S L HAVLAVGY + GK YW V
Sbjct: 242 IATHGPVAVAIDAGHISFQLYKTGVYYEPYC--SSYSLSHAVLAVGYDTDSVTGKDYWIV 299
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
NSW WG+ G++ M+ + N+CG+ T +Y+T
Sbjct: 300 ANSWGLKWGDSGFIKMARNRGNHCGISTMSSYIT 333
>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
Length = 333
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 125/203 (61%), Gaps = 3/203 (1%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
+++ +Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC+GG ++
Sbjct: 128 SAVKNQGSCGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQA 187
Query: 207 YQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+++I+ G+ +D Y PY G+D +C A A + N+T N E A+ A+A + P
Sbjct: 188 FEYILYNKGIMGEDTY-PYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNP 246
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VS A + + F Y G+Y C+ +PD ++HAVLAVGYGE DG PYW VKNSW T W
Sbjct: 247 VSFAFEVTD-DFMLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNW 305
Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
G++GY L+ N CG+ +Y
Sbjct: 306 GDKGYFLIERGKNMCGLAACASY 328
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 131/239 (54%), Gaps = 27/239 (11%)
Query: 135 DAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
D+ V + KG S + DQ CGSCWSF TTG+VEGA+ +K + LS+Q L+DCS
Sbjct: 266 DSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSE 325
Query: 194 GNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
GN GC+GG ++++I+ +G+ T+ Y C + AT++ + N+T S
Sbjct: 326 GNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKANSGATISSYKNITAGS 385
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE---- 308
E L A+ GPVSVAIDAS SF Y +G+YYD C S LDH VL VGYG
Sbjct: 386 ESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASC--SSVNLDHGVLVVGYGSGTPD 443
Query: 309 ------------------LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
D K YW VKNSW T WG++G++ MS +DNNCG+ + +Y
Sbjct: 444 SDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCASY 502
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 133/218 (61%), Gaps = 8/218 (3%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P V + KGY + +Q CGSCW+F TG++EG ++ KL LS+Q L+DCS
Sbjct: 102 ALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCSSA 161
Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
GN GC+GG ++++++K+ G+ T+ Y PY+ +D CH ++ +T + +V++
Sbjct: 162 EGNEGCNGGLPDDAFKYVIKNGGIDTEASY-PYVARDEKCHYSSANIGSTCSSYVDIESK 220
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
SE L++A A GP+ V IDAS F Y GVY+ + C+ + LDH VL VGYG
Sbjct: 221 SEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQTR--LDHGVLVVGYGVYKE 278
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
K YW VKNSW T WG G ++MS +DNNCG+ T +Y
Sbjct: 279 KDYWMVKNSWGTNWGISGDMMMSRNRDNNCGIATMASY 316
>gi|75060921|sp|Q5E998.1|CATL2_BOVIN RecName: Full=Cathepsin L2; Flags: Precursor
gi|59858409|gb|AAX09039.1| cathepsin L2 preproprotein [Bos taurus]
Length = 334
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I +G ++ PYL D C+ + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 116/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY H K LS+Q L+DC G+ N GC+GG +++++I
Sbjct: 156 DQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYI 215
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G D C + VN+T +ED LK A+A PVSVA
Sbjct: 216 KYNGGLDTEEAY-PYTGVDGSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVA 274
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y GVY C ++P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 275 FEVVS-GFRLYSKGVYTSNSCGSTPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDNG 333
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 334 YFKMEMGKNMCGVATCASY 352
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 122/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG +E A + KL L++Q L+DC+ + N+GC+GG +++++I
Sbjct: 133 NQGGCGSCWTFSTTGGLESAVAIATGKLLSLAEQQLVDCAQDFNNHGCNGGLPSQAFEYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
M G+ +D Y PY G+D C A A + N+T E+A+ A+A H PVS A
Sbjct: 193 MYNKGIMGEDTY-PYEGKDGTCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNPVSFA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + S++ G+Y + KC+ SPD ++HAVLAVGYG+ +G PYW VKNSW T WGN G
Sbjct: 252 FEVTDDFLSYH-KGIYSNPKCSKSPDKVNHAVLAVGYGKENGIPYWIVKNSWGTSWGNNG 310
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 311 YFLIERGKNMCGLADCASY 329
>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
Length = 245
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 120/200 (60%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 46 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 105
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT +G++ + EDALK A+A GPVSV
Sbjct: 106 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSGYIQLPFGDEDALKEAVATKGPVSVG 165
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 166 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 222
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 223 YIRMARNNKNHCGIASYCSY 242
>gi|13905172|gb|AAH06878.1| Cathepsin H [Mus musculus]
Length = 333
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 124/200 (62%), Gaps = 1/200 (0%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+++Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG ++++
Sbjct: 130 VINQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFE 189
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+I+ + ++D PY+G+D+ C A A + VN+T N E A+ A+A + PVS
Sbjct: 190 YILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSF 249
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
A + ++ F Y +GVY + C+ +PD ++HAVLAVGYGE +G YW VKNSW + WG
Sbjct: 250 AFEVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGEN 308
Query: 329 GYVLMSIKDNNCGVMTAPTY 348
GY L+ N CG+ +Y
Sbjct: 309 GYFLIERGKNMCGLAACASY 328
>gi|431917800|gb|ELK17041.1| Cathepsin L1 [Pteropus alecto]
Length = 334
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRQKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRSQGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I +G ++ PYL +++ C+ + A TGFV++ P E +L
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLAKESDTCNYKPEYSAANDTGFVDI-PQREKSL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP--- 313
A+A GP+SVAIDA SF FY G+YY+ C S LDH VL +GYG G P
Sbjct: 237 MKAVATVGPISVAIDAGHSSFQFYNKGIYYEPDC--SSKDLDHGVLVIGYGSEGGDPKSN 294
Query: 314 -YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGMNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 333
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F GA+EG +K KL LS Q L+DCS YGN GC GG +++Q++
Sbjct: 135 NQGGCGSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYV 194
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D Y PY D CH T +T + + P +ED LK AL GP+SVAI
Sbjct: 195 IDNKGIDSDSYYPYHAMDEKCHYDPTGKASTCAKYTEIVPGTEDNLKQALGSIGPISVAI 254
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
D ++ SF Y +GVY D C++ ++H VLAVGYG L+G+ +W +KNSW T +G+QGY
Sbjct: 255 DGTRPSFFLYRSGVYSDPTCSHE---VNHGVLAVGYGNLNGQDFWLLKNSWGTKYGDQGY 311
Query: 331 VLMSI-KDNNCGVMTAPTY 348
V ++ K N CGV + Y
Sbjct: 312 VRIARNKGNLCGVASYTCY 330
>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
Length = 335
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 133/221 (60%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F GA+EG + K KL LS+Q L+DCS GN G
Sbjct: 119 VDWRQKGYVTPVKNQGQCGSCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHSQGNQG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ + GL +++ Y PYLG+++ C+ + A TGFV++ P E
Sbjct: 179 CNGGLMDNAFQYVKDNKGLDSEESY-PYLGRESNTCNYRPEYSAANDTGFVDI-PQHERG 236
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GP+SVAIDA SF FY G+YY+ C S LDH VL VGYG + D
Sbjct: 237 LMKAVATVGPISVAIDAGHSSFQFYSEGIYYEPNC--SSKDLDHGVLVVGYGSEGAQSDS 294
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+W VKNSW T WG GYV M+ + N+CG+ TA +Y T+
Sbjct: 295 NKFWIVKNSWGTGWGMSGYVKMARDQSNHCGIATAASYPTV 335
>gi|41323856|gb|AAS00027.1| cathepsin L-like cysteine proteinase [Taenia solium]
Length = 339
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 124/200 (62%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EGA+ K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 141 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 200
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+H + + Y PY D C + T+T ++ +E AL A+A GP+S+AI
Sbjct: 201 EEHFIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 259
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLA+GYG+ DGKPYW VKNSW T WG +GY
Sbjct: 260 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGTRWGMKGY 317
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CGV + +
Sbjct: 318 IMMA-KDYHNMCGVASLADF 336
>gi|344257452|gb|EGW13556.1| Cathepsin L1 [Cricetulus griseus]
Length = 290
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ C SCW+F G++EG + K +L LS+Q L+DCSW YGN GC GG +++++
Sbjct: 91 DQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSWSYGNIGCFGGLMEYAFRYV 150
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T+ Y PY ++ C + A +T FV + P SEDAL A+A GP+SV
Sbjct: 151 KENRGLDTRVSY-PYEARNGPCRYDPKNSAANVTDFVKI-PISEDALMKAVATVGPISVG 208
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
+D+ SF FY G+YY+ C++S LDHAVL VGYG E DG YW VKNSW WG
Sbjct: 209 VDSHHHSFRFYKGGMYYEPHCSSS--NLDHAVLVVGYGEESDGNKYWMVKNSWGQGWGMN 266
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ ++NNCG+ T Y T+
Sbjct: 267 GYIKMARDRNNNCGIATYAIYPTV 290
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 124/211 (58%), Gaps = 7/211 (3%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E + DQ CGSCWSF TGA+EG ++ L LS+Q L+DCS GN GC GG
Sbjct: 132 EQGAVTPVKDQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGG 191
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
+++Q++ + L T+D Y PY D C A++ + FV+VTPN+ + LK AL
Sbjct: 192 WMDQAFQYVEQTALETEDQY-PYEAVDDTCR-ASSAGVVKVDSFVDVTPNNVNELKAALD 249
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
K GPVSVAI+A Q F FY GV D C + LDH VLAVGYG G+ Y+ VKNSW
Sbjct: 250 K-GPVSVAIEADQMVFQFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSW 305
Query: 322 STYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
WG +GYV ++ DN CG+++ +Y M
Sbjct: 306 GASWGEEGYVKIAASPDNICGILSQASYPIM 336
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 7/204 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ C SCW+F G++EG + K +L LS+Q L+DCSW YGN GC GG +++++
Sbjct: 131 DQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSWSYGNIGCFGGLMEYAFRYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL T+ Y PY ++ C + A +T FV + P SEDAL A+A GP+SV
Sbjct: 191 KENRGLDTRVSY-PYEARNGPCRYDPKNSAANVTDFVKI-PISEDALMKAVATVGPISVG 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
+D+ SF FY G+YY+ C++S LDHAVL VGYG E DG YW VKNSW WG
Sbjct: 249 VDSHHHSFRFYKGGMYYEPHCSSS--NLDHAVLVVGYGEESDGNKYWMVKNSWGQGWGMN 306
Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
GY+ M+ ++NNCG+ T Y T+
Sbjct: 307 GYIKMARDRNNNCGIATYAIYPTV 330
>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
Length = 374
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + L LS+Q LIDCS YGN GC+GG ++Q+I
Sbjct: 173 NQGMCGSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 232
Query: 211 MKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ G+ + Y PY + C AT +G+ ++ E+ LK+A+A GPVSV
Sbjct: 233 KDNKGIDKETAY-PYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQGPVSV 291
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDA +SF Y NGVY++++C+ P+ LDH VL VGYG + YW VKNSW T WG
Sbjct: 292 AIDAGHRSFQLYTNGVYFEKECD--PENLDHGVLVVGYGTDPTQGDYWIVKNSWGTRWGE 349
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ ++NNCG+ + ++
Sbjct: 350 QGYIRMARNRNNNCGIASHASF 371
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I ++G ++ PYL D + C+ + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDI-PQREKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHASFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
Length = 338
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 124/200 (62%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EGA+ K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+H + + Y PY D C + T+T ++ +E AL A+A GP+S+AI
Sbjct: 200 EEHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLA+GYG+ +GKPYW VKNSW T WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQEGKPYWLVKNSWGTRWGMKGY 316
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CGV + +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335
>gi|68399197|ref|XP_695425.1| PREDICTED: cathepsin L [Danio rerio]
Length = 349
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/226 (42%), Positives = 126/226 (55%), Gaps = 8/226 (3%)
Query: 127 VRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+R N + Y KGY + DQ CGSCWSF TTGA+EG Y +L LS+Q
Sbjct: 125 LRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQ 184
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTG 244
L+DCS YG GC G +Y +++ + L + D Y PY D C A A ++
Sbjct: 185 LVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY-PYTSVDTQPCFYEKNLAMAGISD 243
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+ V +E AL A+A GPVSVAIDA SF FY +G+Y + CN P+ L+HAVL V
Sbjct: 244 YRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCN--PNNLNHAVLVV 301
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
GYG +G YW +KNSW T WG GY+ M I++ N CG+ + Y
Sbjct: 302 GYGSEEGTDYWIIKNSWGTGWGEGGYMRM-IRNGKNTCGIASYALY 346
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 135/225 (60%), Gaps = 13/225 (5%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
+IP V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS
Sbjct: 113 SIPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQP 172
Query: 193 YGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
GN GC GG ++Q+++ GL +++ Y PY G C + A TGFV++ P
Sbjct: 173 EGNRGCHGGFIDNAFQYVLDVGGLDSEESY-PYTGLVGTCLYNPNNSAANETGFVDL-PK 230
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---- 307
E AL A+A GP+SVA+DA SF FY +G+YY+ C S + +DHAVL VGYG
Sbjct: 231 QEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPNC--SSESVDHAVLVVGYGFEGA 288
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+ D YW VKNSW +WG GY+ M+ ++N+CG+ T +Y T+
Sbjct: 289 DSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333
>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
Length = 334
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I +G ++ PYL +D C+ + A TGFV++ P E AL
Sbjct: 178 CNGGLMDFAFQYIKDNGGLDSEESYPYLARDTDSCNYKPEYSVANDTGFVDI-PQRERAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY +G+Y+D C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGCNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/202 (46%), Positives = 122/202 (60%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TGA+EG ++ L LS+Q L+DCS GN GC GG +++Q++
Sbjct: 141 DQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYV 200
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ L T+D Y PY D C A++ + FV+VTPN+ + LK AL K GPVSVAI
Sbjct: 201 EQTALETEDQY-PYEAVDDTCR-ASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVSVAI 257
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+A Q F FY GV D C + LDH VLAVGYG G+ Y+ VKNSW WG +GY
Sbjct: 258 EADQMVFQFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGY 314
Query: 331 VLMSIK-DNNCGVMTAPTYVTM 351
V ++ DN CG+++ +Y M
Sbjct: 315 VKIAASPDNICGILSQASYPIM 336
>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 339
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 131/220 (59%), Gaps = 12/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q +CGSCW+F TGA+EG + K KL LS+Q L+DCSW GN G
Sbjct: 118 VDWRKKGYVTPVKNQGLCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWAQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ + GL ++ Y PYL +D +C + A TGF+++ E L
Sbjct: 178 CSGGLMDYAFQYVKDNGGLDSEKSY-PYLAEDGFCKYKPEYSAANDTGFLDIQ-QQEKFL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK---- 312
A+A GP+S IDAS +SF FY G+YYD C S LDH VL VGYG +GK
Sbjct: 236 MEAVATVGPISAGIDASLESFQFYKEGIYYDPDC--SSKYLDHGVLVVGYG-FEGKDSRN 292
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ ++N+CG+ T +Y ++
Sbjct: 293 KYWLVKNSWGEDWGMNGYIKMAKDRENHCGIATMASYPSL 332
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQS CGSCW+F TTGA+E AY K LS+Q L+DC+ G+ N GC GG +++++I
Sbjct: 164 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYI 223
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY G + CH A + VN+T N+ED LK A+ PVSVA
Sbjct: 224 KYNGGIDTEESY-PYKGVNGVCHYKAENAVVQVLDSVNITLNAEDELKNAVGLVRPVSVA 282
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +PD ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 283 FEVIN-GFRQYKSGVYSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 341
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N C V T +Y
Sbjct: 342 YFKMEMGKNMCAVATCASY 360
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 137/237 (57%), Gaps = 8/237 (3%)
Query: 117 GEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYM 173
G K++K S+ + + +P V + KGY + +Q CGSCW+F +TG++EG +
Sbjct: 88 GYKMAKSSVGSSFLEPENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFR 147
Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCH 232
K +L +S+Q L+DCS GN GC GG ++ +I K+ G+ ++ Y PY D C
Sbjct: 148 KTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNMGIDSEKSY-PYEAVDGECR 206
Query: 233 IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNN 292
+ + T +GFV++ E AL+ A+A GPVSVAIDAS SF FY GVY + C++
Sbjct: 207 YKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSS 266
Query: 293 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
+ LDH VL VGYG +G+ YW VKNSW WG GY+ L N CG+ + +Y
Sbjct: 267 TQ--LDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLARNHGNQCGIASQASY 321
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 143 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 202
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY GQ+ C + + G VN+T +ED LK A+A PVSVA
Sbjct: 203 KYNGGLETEEVY-PYTGQNGLCKFTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVA 261
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y GVY C ++P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 262 FQVVD-DFRLYKKGVYTGTTCGSTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWGDHG 320
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 321 YFKMEMGKNMCGVATCSSY 339
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG K KL LS+Q L+DCS GN GC+GG+ ++++
Sbjct: 139 DQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYW 198
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
M++G ++ DY PY D C ++ ++ FV V ED LKL++A+ GPVSVAI
Sbjct: 199 MRNGAESESDY-PYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAI 257
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGKPYWQVKNSWSTYWGNQG 329
DA+ F Y G+Y D C S LDHAVL VGY + + YW VKNSW WG +G
Sbjct: 258 DATSSGFMLYKKGIYQDNTC--SQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWGQRG 315
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ K N CG+ T +Y
Sbjct: 316 YIWMARDKGNMCGIATMASY 335
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I +G ++ PYL D C+ + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|288764223|emb|CAQ03432.1| silcatein 1 [Spongilla lacustris]
gi|296168747|emb|CAQ54051.1| silicatein alpha 3 [Spongilla lacustris]
Length = 327
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 125/199 (62%), Gaps = 5/199 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ QS CGS ++F GA+EGA + KL LS+Q +IDCS YGN+GC GG+ + ++
Sbjct: 125 SVKTQSQCGSSYAFAAVGALEGASALATDKLVALSEQNIIDCSVPYGNHGCSGGDTYTAF 184
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T+ Y PY G+ + C + A AT TG V + SE L A+A GPV
Sbjct: 185 KYVVDNGGIDTESSY-PYKGKQSSCQYNSKNAGATATGVVKIASGSESDLMSAVASGGPV 243
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VA+DAS SF FY +GV+ C+N+ L+HA+L GYG ++GK YW VKNSW T WG
Sbjct: 244 AVAVDASVNSFMFYQSGVFDSSTCSNTK--LNHAMLVTGYGSVNGKDYWLVKNSWGTSWG 301
Query: 327 NQGYVLM-SIKDNNCGVMT 344
GY+ M K N CG+ +
Sbjct: 302 ESGYIRMVRNKYNQCGIAS 320
>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
Length = 361
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 161 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 220
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C ++ + VN+T +ED L+ A+ PVSVA
Sbjct: 221 KYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVA 279
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F FY +GVY KC N+P ++HAV+AVGYG DG PYW +KNSW WG+ G
Sbjct: 280 FEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHG 338
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y + + N CG+ T +Y
Sbjct: 339 YFKIKMGKNMCGIATCASY 357
>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
Length = 328
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 87/217 (40%), Positives = 128/217 (58%), Gaps = 5/217 (2%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + +Q CGSCWSF TTGAVEG + + L LS+Q L+DCS YGN
Sbjct: 116 AAEVDWRNSAVSEVKNQGQCGSCWSFSTTGAVEGQLAISGRGLTSLSEQNLVDCSSAYGN 175
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GC+GG ++ +I +G+ ++ Y PY + C + + ++ G+ ++ E+A
Sbjct: 176 AGCNGGWMDSAFDYIHDNGIMSESAY-PYTASEGSCRFNPSESVTSLQGYYDLPSGDENA 234
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
LK A+A +GP++VA+DA+ + FY GV YD C S L+H VL VGYG G+ YW
Sbjct: 235 LKSAVANNGPIAVALDATDE-LQFYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYW 291
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW + WG QGY + ++NNCG+ TA +Y +
Sbjct: 292 IVKNSWGSGWGEQGYWRQARNRNNNCGIATAASYPAL 328
>gi|94733563|emb|CAK11015.1| novel protein similar to vertebrate cathepsin L (CTSL) [Danio
rerio]
Length = 334
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/226 (42%), Positives = 126/226 (55%), Gaps = 8/226 (3%)
Query: 127 VRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
+R N + Y KGY + DQ CGSCWSF TTGA+EG Y +L LS+Q
Sbjct: 110 LRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQ 169
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTG 244
L+DCS YG GC G +Y +++ + L + D Y PY D C A A ++
Sbjct: 170 LVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY-PYTSVDTQPCFYEKNLAMAGISD 228
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+ V +E AL A+A GPVSVAIDA SF FY +G+Y + CN P+ L+HAVL V
Sbjct: 229 YRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCN--PNNLNHAVLVV 286
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
GYG +G YW +KNSW T WG GY+ M I++ N CG+ + Y
Sbjct: 287 GYGSEEGTDYWIIKNSWGTGWGEGGYMRM-IRNGKNTCGIASYALY 331
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 130/214 (60%), Gaps = 8/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + GY + +Q CGSCW+F G++EG + K KL LS+Q L+DCSW +GN G
Sbjct: 118 VDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG ++Q++ + GL T Y PY + C + A + GF+++ P SE+AL
Sbjct: 178 CDGGLPDFAFQYVKDNGGLDTSVSY-PYEALNGTCRYNPKYSAAKVVGFMSIPP-SENAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
A+A GP+SV ID KSF FY G+YY+ C+++ L+HAVL VGYG E DG+ YW
Sbjct: 236 MKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSST--NLNHAVLVVGYGEESDGRKYW 293
Query: 316 QVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
VKNSW WG GY+ M+ +NNCG+ + +Y
Sbjct: 294 LVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASY 327
>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
Length = 218
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 89/198 (44%), Positives = 119/198 (60%), Gaps = 5/198 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCW+F TGA+EG K KL LS+Q L+DCS GN GC+GG +++
Sbjct: 22 IKDQGDCGSCWAFSATGALEGQLKRKKGKLISLSEQQLVDCSTDMGNEGCNGGYMNDAFR 81
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ M++G ++ DY PY D C ++ ++ FV V ED LKL++A+ GPVSV
Sbjct: 82 YWMQNGAESESDY-PYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSV 140
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGKPYWQVKNSWSTYWGN 327
AIDA+ F Y G+Y D C S LDHAVL VGY ++ G+ YW VKNSW WG
Sbjct: 141 AIDAASSGFMLYKKGIYQDNTC--SQQYLDHAVLVVGYDADMAGQKYWIVKNSWGEDWGQ 198
Query: 328 QGYVLMSI-KDNNCGVMT 344
+GY+ M+ K N CG+ T
Sbjct: 199 RGYIWMARDKGNMCGIAT 216
>gi|410960470|ref|XP_003986812.1| PREDICTED: pro-cathepsin H [Felis catus]
Length = 321
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 125/210 (59%), Gaps = 3/210 (1%)
Query: 140 RYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
R + K + + +Q CGSCW+F TTGA+E A +K KL L++Q L+DC+ + N+GC
Sbjct: 109 RTKGKYVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQLVDCAQNFNNHGCQ 168
Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
GG +++++I G+ +D Y PY GQD C + A A + N+T N E+A+
Sbjct: 169 GGLPSQAFEYIRYNKGIMGEDTY-PYKGQDGDCKFQPSKAIAFVKDVANITINDEEAMVE 227
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
A+A + PVS A + + F Y GVY C+ +PD ++HAVLAVGYGE DG PYW VK
Sbjct: 228 AVALYNPVSFAFEVTD-DFMMYRKGVYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVK 286
Query: 319 NSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
NSW WG +GY L+ N CG+ +Y
Sbjct: 287 NSWGPQWGMKGYFLIERGKNMCGLAACASY 316
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 134/241 (55%), Gaps = 46/241 (19%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTG+ EGA+ +K KKL LS+Q L+DCS N GCDGG ++
Sbjct: 138 IKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFD 197
Query: 209 WIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+I+K+ G+ T+ Y PY + + C + AT+ G+VN+T SE +L+ A+HGPV
Sbjct: 198 YIIKNKGIDTESSY-PYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENG-AQHGPV 255
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP------------- 313
SVAIDAS SF Y +G+YY+ KC SP LDH VL VGYG + GK
Sbjct: 256 SVAIDASHNSFQLYTSGIYYEPKC--SPTELDHGVLVVGYG-VQGKDDEGPVLNRKQTIV 312
Query: 314 -------------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
YW VKNSW T WG +GY+LMS + NNCG+ + +
Sbjct: 313 IHKNEDNKVESSDDSSDSVRPKANNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372
Query: 348 Y 348
Y
Sbjct: 373 Y 373
>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
Length = 289
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 122/206 (59%), Gaps = 6/206 (2%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
G N L D CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG
Sbjct: 85 GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMT 144
Query: 205 RSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++Q+I+ + G+ ++ Y PY D C + AT + + + ED LK A+A
Sbjct: 145 EAFQYIIDNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 203
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
GPV V +DAS SF Y +GVYYD C ++H VL +GYG+L+GK YW VKNSW +
Sbjct: 204 GPVCVGVDASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGS 260
Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
+G QGY+ M+ K N+CG+ + P+Y
Sbjct: 261 NFGEQGYIRMARNKGNHCGIASYPSY 286
>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 118/192 (61%), Gaps = 3/192 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY+G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEEAY-PYIGKDGTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C ++P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCG 341
Y M + N CG
Sbjct: 336 YFKMEMGKNMCG 347
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 132/221 (59%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q+I + GL +++ Y PY D C + A TGFV++ P E A
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESY-PYTAMDKQDCRYNPKYSAANDTGFVDIPP-QEKA 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GP+SVA+DA +SF FY +G+YYD C S L+H VL VGYG +
Sbjct: 236 LMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNC--SSKDLNHGVLVVGYGFEGIDSAN 293
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW T WG GY+ M+ ++N+CG+ TA +Y T+
Sbjct: 294 NRYWLVKNSWGTGWGTDGYIKMAKDRNNHCGIATAASYPTV 334
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 121/200 (60%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 144 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 203
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + ++ + SE+ALK A+A GPVSV
Sbjct: 204 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVYYD C + ++H VL VGYG LDGK YW VKNSW ++G+QG
Sbjct: 263 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 319
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ + P+Y
Sbjct: 320 YIRMARNSGNHCGIASYPSY 339
>gi|339252572|ref|XP_003371509.1| cathepsin L1 [Trichinella spiralis]
gi|316968239|gb|EFV52542.1| cathepsin L1 [Trichinella spiralis]
Length = 448
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 129/238 (54%), Gaps = 43/238 (18%)
Query: 151 DQSVCGSCWSFGT---------------TGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
DQ CGSCW+F TGA+EG K KL LS+Q LIDCS YGN
Sbjct: 211 DQGHCGSCWAFSAVNSNALHVHSRAFQQTGALEGQNKRKTGKLVSLSEQNLIDCSRKYGN 270
Query: 196 NGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQ----DAYCHIANTTATATMTGFVNVTP 250
GC GG +++++ + HG+ T++ Y PY D C N+T AT GFV++ P
Sbjct: 271 KGCSGGLMDNAFEYVKENHGIDTEESY-PYEAAVRMLDKKCRFKNSTIGATDKGFVDIEP 329
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNG-------------------VYYDEKCN 291
+E L A+A GP+SVAIDAS +SF FY +G VY++ C
Sbjct: 330 GNETYLMHAVATIGPLSVAIDASHESFQFYSSGMLLMVDIFNTVEVMWTNLGVYFEPMC- 388
Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
S LDH VL VGYG L GK YW VKNSW T WGN GY+ M+ K+N+CG+ + +Y
Sbjct: 389 -SSQFLDHGVLVVGYGSLKGKDYWIVKNSWGTSWGNDGYIFMARNKNNSCGIASFASY 445
>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
Length = 281
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 125/211 (59%), Gaps = 8/211 (3%)
Query: 141 YEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCD 199
YE+ G N L D CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+
Sbjct: 73 YEL-GMNHLGDMGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCN 131
Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
GG ++Q+I+ +G+ ++ Y PY D C AT + ++ + SE+ALK
Sbjct: 132 GGFMTEAFQYIIDNNGIDSEASY-PYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALKE 190
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
A+A GPVSV IDA Q SF Y GVYYD C + ++H VL VGYG L+GK YW VK
Sbjct: 191 AVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGSLNGKDYWLVK 247
Query: 319 NSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
NSW +G+QGY+ M+ N+CG+ P+Y
Sbjct: 248 NSWGLNFGDQGYIRMARNSGNHCGIANFPSY 278
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 129/220 (58%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCSW GN G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWREGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ +G ++ PY D C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPP-QEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA Q SF FY +G+Y+D C + ++H VLAVGYG + D
Sbjct: 237 MKAVATVGPISVAIDAGQVSFQFYSSGIYFDPACRLT---VNHGVLAVGYGFEGTDPDKN 293
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ + ++N+CG+ A +Y T+
Sbjct: 294 KYWLVKNSWGKSWGADGYIKIAKDRNNHCGIARAASYPTV 333
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 90/202 (44%), Positives = 121/202 (59%), Gaps = 4/202 (1%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTGA+EG ++ + L LS+Q LIDCS YGNNGC+GG +++
Sbjct: 142 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 201
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+I +G + PY G D C A GFV++ E L A+A GPVSV
Sbjct: 202 YIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSV 261
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDAS F Y +GVY +E+C+++ LDH VL VGYG + G YW VKNSW WG
Sbjct: 262 AIDASHTHFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGE 319
Query: 328 QGYV-LMSIKDNNCGVMTAPTY 348
GY+ ++ K+N CG+ ++ +Y
Sbjct: 320 LGYIKMIRNKNNRCGIASSASY 341
>gi|334314327|ref|XP_001368532.2| PREDICTED: cathepsin H-like [Monodelphis domestica]
Length = 344
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 134/232 (57%), Gaps = 13/232 (5%)
Query: 118 EKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
+V Y ++ + K PV+ +Q CGSCW+F TTG +E A + K
Sbjct: 120 RRVGPYPDFMDWRKKGNYVSPVK----------NQGGCGSCWTFSTTGGLESAVAIATGK 169
Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANT 236
L L++Q L+DC+ + N+GC+GG +++++IM +G+ +D Y PY G+D C
Sbjct: 170 LLSLAEQQLVDCAQAFNNHGCNGGLPSQAFEYIMYNNGIMGEDTY-PYEGKDGTCRFKPD 228
Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
A A + VN+T E+A+ A+A H PVS A + ++ F Y +G+Y + +C+ SPD
Sbjct: 229 KAIAFVKDVVNITIYDEEAMTEAVAHHNPVSFAFEVTE-DFMSYRDGIYSNPRCDKSPDK 287
Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
++HAVLAVGYG+ +G YW VKNSW T WGN GY L+ N CG+ +Y
Sbjct: 288 VNHAVLAVGYGKNNGILYWIVKNSWGTSWGNNGYFLIERGKNMCGLADCASY 339
>gi|148688953|gb|EDL20900.1| cathepsin H, isoform CRA_a [Mus musculus]
Length = 291
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 80/197 (40%), Positives = 121/197 (61%), Gaps = 1/197 (0%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I+
Sbjct: 95 QGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYIL 154
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
+ ++D PY+G+D+ C A A + VN+T N E A+ A+A + PVS A +
Sbjct: 155 YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFE 214
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
++ F Y +GVY + C+ +PD ++HAVLAVGYGE +G YW VKNSW + WG GY
Sbjct: 215 VTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYF 273
Query: 332 LMSIKDNNCGVMTAPTY 348
L+ N CG+ +Y
Sbjct: 274 LIERGKNMCGLAACASY 290
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 134/221 (60%), Gaps = 13/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++++ + GL +++ Y PYLG+D C+ + A TGFV++ P E A
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESY-PYLGRDTETCNYKPECSAANDTGFVDL-PQREKA 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---- 311
L A+A GP+SVAIDA +SF FY +G+Y+D C S LDH VL VGYG +G
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYG-FEGTDSN 292
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
Length = 333
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 122/198 (61%), Gaps = 1/198 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYI 191
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + ++D PY+G+D+ C A A + VN+T N E A+ A+A + PVS A
Sbjct: 192 LYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+ ++ F Y +GVY + C+ +PD ++HAVLAVGYGE +G YW VKNSW + WG GY
Sbjct: 252 EVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGY 310
Query: 331 VLMSIKDNNCGVMTAPTY 348
L+ N CG+ +Y
Sbjct: 311 FLIERGKNMCGLAACASY 328
>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
Length = 278
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 88/193 (45%), Positives = 122/193 (63%), Gaps = 5/193 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG ++ K L LS+Q L+DCS GN G
Sbjct: 89 VDWRTKGYVTPVKNQKQCGSCWAFSATGSLEGQHFKKTGTLVSLSEQNLVDCSKKEGNEG 148
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++I ++ G+ T+ Y PY D C + AT TG+ ++ SE L
Sbjct: 149 CEGGLMDQAFEYIKRNKGIDTEQSY-PYRAVDEKCRFSRADVGATDTGYTDIHKGSEKDL 207
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GP+SVAIDAS+ SF Y +GVYY+ KC+++ LDH VLAVGYG D K YW
Sbjct: 208 QSAVATVGPISVAIDASRDSFQLYKSGVYYEPKCSSTM--LDHGVLAVGYGTTDSKDYWI 265
Query: 317 VKNSWSTYWGNQG 329
VKNSW T WG +G
Sbjct: 266 VKNSWGTQWGMKG 278
>gi|358334194|dbj|GAA34712.2| cathepsin L [Clonorchis sinensis]
Length = 401
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 88/185 (47%), Positives = 117/185 (63%), Gaps = 9/185 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TGA+EG ++M K+L LS+Q L+DCS +GN GC GG +++++
Sbjct: 182 DQGQCGSCWAFSATGAIEGQHFMATKQLVSLSEQQLVDCSSHFGNFGCSGGWMDNAFKYV 241
Query: 211 MK-HGLPTQDDYGPYLGQDA-----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
HG+ T+ Y PY+ + C AT+TG V++ ++E ALK A+ HG
Sbjct: 242 KHTHGITTETKY-PYISGETGTPNPRCEFHGQAIAATVTGIVDLPRSNEFALKQAVGLHG 300
Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
P+SVAI AS +SF Y +GVY DE+C S D LDHAVL VGYGE +G PYW +KNSW
Sbjct: 301 PISVAIHASLESFMGYKSGVYSDEEC--SSDQLDHAVLVVGYGEENGIPYWLIKNSWGFD 358
Query: 325 WGNQG 329
WG G
Sbjct: 359 WGEMG 363
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q+I +G ++ PYL D C+ + A TGFV++ P E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SCKDLDHGVLVVGYGFEGTDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
Length = 318
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 7/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EGA+++KH L LS+Q L+DCS N+GC+GG +Y +I
Sbjct: 121 DQGQCGSCWAFSTTGALEGAHFLKHGDLVSLSEQNLVDCS--TENSGCNGGVVQWAYDYI 178
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G+ T+ Y PY QD C AT+TG+ ++ E A+ GPVSV
Sbjct: 179 KSNNGIDTESSY-PYEAQDLTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGPVSVC 237
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA SF Y +GVYY+ CN P ++HAVL VGYG +G YW +KNSW T WG G
Sbjct: 238 IDAGHNSFQLYSSGVYYEPNCN--PSSINHAVLPVGYGTEEGSDYWLIKNSWGTGWGLSG 295
Query: 330 YV-LMSIKDNNCGVMTAPTY 348
Y+ L K N+CGV T Y
Sbjct: 296 YMKLTRNKSNHCGVATQSCY 315
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 120/201 (59%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG+ EGAY K KL LS+Q LIDC + GCDGG +++++
Sbjct: 129 DQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCCTDT-SAGCDGGSLDDNFKYV 187
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
MK GL +++ Y Y G+D C + ++ + ++ EDAL A+A GPVSV +
Sbjct: 188 MKDGLQSEESY-TYKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGM 246
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS S Y +G+Y D+ C SP GL+HA+LAVGYG +GK YW +KNSW WG QGY
Sbjct: 247 DASY--LSSYDSGIYEDQDC--SPAGLNHAILAVGYGTENGKDYWIIKNSWGASWGEQGY 302
Query: 331 VLMSIKDNNCGVMTAPTYVTM 351
++ N CG+ Y T+
Sbjct: 303 FRLARGKNQCGISEDTVYPTI 323
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ ++ GL ++ Y PY+ +D C + A TGFV V P E L
Sbjct: 178 CNGGFMNNAFQYVKENGGLDSEASY-PYVAKDGSCKYKPENSVANDTGFV-VIPAHEKEL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DAS SF FY +G+Y+++ C S LDH VL VGYG +
Sbjct: 236 MKAVATVGPISVAVDASHSSFQFYKSGIYFEQDC--SSKNLDHGVLVVGYGFEGTNSNNN 293
Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW +KNSW WG+ GY+ + ++N+CG+ TA +Y
Sbjct: 294 NYWLIKNSWGPEWGSNGYIKIAKDRNNHCGIATAASY 330
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 121/201 (60%), Gaps = 8/201 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++EG ++ K +L LS+Q LIDCS GNNGC G ++++I
Sbjct: 135 DQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS--PGNNGCKNGAVEYAFRYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY C T AT TGFV + P E L A+A GP+SV
Sbjct: 193 QSNKGIDTEISY-PYEAAQNQCRFRRDTIGATSTGFVKLNPGDEMELAQAVATVGPISVL 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQ 328
I++S SF FY +GVY D CN P+ L HAVL VGYG D G +W VKNSWST+WG Q
Sbjct: 252 INSSLDSFKFYHDGVYNDPSCN--PNKLTHAVLVVGYGTDDRGGDFWLVKNSWSTHWGEQ 309
Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
GYV + NN CG+ + Y
Sbjct: 310 GYVKIKRNANNLCGIASNALY 330
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQS CGSCW+F TTGA+E AY K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 162 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 221
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + CH A + VN+T N+ED L+ A+ PVSVA
Sbjct: 222 KYNGGLDTEESY-PYKGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVRPVSVA 280
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +PD ++HAVLAVGYG +G PYW +KNSW WG++G
Sbjct: 281 FEVI-NGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLIKNSWGESWGDKG 339
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M N C V T +Y
Sbjct: 340 YFKMERGKNMCAVATCASY 358
>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
Length = 330
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 98/235 (41%), Positives = 137/235 (58%), Gaps = 17/235 (7%)
Query: 130 NKASKDAI------PVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
NK K+ I + + KGY + +Q CGSCW+F TGA+EG + K KL LS
Sbjct: 100 NKKGKETIFASIPPSMDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLS 159
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DCS GN GC GG ++Q+++ GL +++ Y PY G C + A
Sbjct: 160 EQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESY-PYTGLVGTCLYNPNNSAAN 218
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
TGFV++ P E AL A+A GP+SVA+DA SF FY +G+YY+ C S + +DHAV
Sbjct: 219 ETGFVDL-PKQEKALMKAVATLGPISVAVDAHNPSFQFYKSGIYYEPNC--SSESVDHAV 275
Query: 302 LAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
L VGYG + D YW VKNSW +WG GY+ M+ ++N+CG+ T +Y T+
Sbjct: 276 LVVGYGFEGADSDDNKYWLVKNSWGEHWGMDGYIKMAKDRNNHCGIATMASYPTV 330
>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
Length = 358
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 158 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C ++ + VN+T +ED LK A+ PVSVA
Sbjct: 218 KYNGGLETEEAY-PYTGEDGACKFSSENVGIQVLDSVNITLGAEDELKEAVGLVRPVSVA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F FY +GVY + C ++P ++HAVLAVGYG DG PYW VKNSW WG+ G
Sbjct: 277 FEVVS-GFRFYKSGVYTSDTCGSTPMDVNHAVLAVGYGVEDGVPYWLVKNSWGENWGDHG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 336 YFKMEMGKNMCGVATCASY 354
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 90/223 (40%), Positives = 129/223 (57%), Gaps = 4/223 (1%)
Query: 128 RYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
+Y S+ V + KGY + DQ CGSC++F TGA+EG ++ K KL LS+Q +
Sbjct: 110 KYTNLSEYPRQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNI 169
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
+DCS+ GN GC GG +S+ +I + +++ PY +D C + AT G+V
Sbjct: 170 VDCSFKEGNKGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVGATDRGYV 229
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
++ N E AL+ A+A GP+SVAID +F FY +GV+ + C S ++H VL VGY
Sbjct: 230 DLPENDETALRHAVATIGPISVAIDGHHFNFRFYDHGVFDNPNC--SKTKINHGVLVVGY 287
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
G +G YW VKNSW WG +GY+LMS DN C + A +Y
Sbjct: 288 GTRNGLDYWMVKNSWGRGWGAKGYILMSRNNDNQCCIACAASY 330
>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
Length = 320
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 120/199 (60%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F GA+EG +K K+ LS Q L+DCS YGN GC GG R++Q++
Sbjct: 122 NQGGCGSCWAFSAVGALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYV 181
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D Y PY D CH ++ + + P +ED LK AL GP+SVAI
Sbjct: 182 IDNNGIDSDTYYPYHAMDEKCHYELAGKASSCVKYREIVPGTEDNLKQALGNIGPISVAI 241
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
D ++ +F Y +GVY D C+ ++H VLAVGYG L+G+ +W +KNSW T +G+QGY
Sbjct: 242 DGTRPTFFLYKSGVYSDPSCSQE---VNHGVLAVGYGTLNGQDFWLLKNSWGTKYGDQGY 298
Query: 331 VLMSI-KDNNCGVMTAPTY 348
V ++ K+N CGV + +Y
Sbjct: 299 VRIARNKENLCGVASYTSY 317
>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
Length = 345
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 126/205 (61%), Gaps = 6/205 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS-WGYGNNGCDGGEDFRSYQW 209
+Q CGSCW+F +TGA+EG + + ++L LS+Q L+DC+ YGNNGC+GG+ ++Q+
Sbjct: 143 NQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQY 202
Query: 210 IMKHG-LPTQDDYGPYLGQDAYCHIANT--TATATMTGFVNVTPNSEDALKLALAKHGPV 266
+ G L T+ Y G + C +N+ ++ G V P +E L+ A+A GP+
Sbjct: 203 VQDAGGLDTEARYPYRQGTNFQCQFSNSFEARRVSVNGHTRVPPRNERVLQDAVANVGPI 262
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
S+AI+AS ++F FY NG+Y + C+ P GL+HAVL VGYGE G PYW VKNSW WG
Sbjct: 263 SIAINASPQTFMFYKNGIYGEPNCD--PRGLNHAVLLVGYGEERGVPYWIVKNSWGPGWG 320
Query: 327 NQGYVLMSIKDNNCGVMTAPTYVTM 351
GY+ + N CG+ P++ +
Sbjct: 321 EGGYIKILRNRNVCGMSQDPSFPNL 345
>gi|323454730|gb|EGB10599.1| hypothetical protein AURANDRAFT_4398, partial [Aureococcus
anophagefferens]
Length = 221
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 128/218 (58%), Gaps = 10/218 (4%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
++ G + DQ CGSCW+FGT A+EGA ++K LS+QAL+DCSW GNNGCDGG
Sbjct: 6 KVGGVAPVKDQGTCGSCWAFGTVAALEGALFVKTGDQLTLSEQALVDCSWPEGNNGCDGG 65
Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT----MTGFVNVTPNSEDAL 256
ED +Y+W++ H G+P+ +YGPYL D CH+ A + + V ++ AL
Sbjct: 66 EDSMAYEWMLHHQGVPSTAEYGPYLNADGLCHVKAANPKAMPGLEIKAWTRVN-DTRAAL 124
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVY--YDEKCNNSPDGLDHAVLAVGYG-ELDGKP 313
K AL+ G + V + A+ SF FY +G+Y +C+ DH V VGYG E G
Sbjct: 125 KDALSTRGTLVVRVQATPLSFYFYGSGLYDVPHAECSRDAADSDHIVALVGYGYEPSGLG 184
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
YW ++NSWS +WG GY+ + + DN CGV TY T+
Sbjct: 185 YWILRNSWSVHWGEGGYMKIGM-DNVCGVENTATYPTV 221
>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 136/217 (62%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG + K +L LS+Q L+DCS GN G
Sbjct: 63 VDWRKKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGQLVSLSEQNLVDCSQPQGNQG 122
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG +++++ ++ GL ++ Y PY G+D C + A TGFV++ P E AL
Sbjct: 123 CNGGLMDFAFEYVKENKGLESEKSY-PYEGKDGSCRYKPELSAANDTGFVDI-PQREKAL 180
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGK-- 312
A+A+ GP+SVA+DA SF FY +G+Y+D +C S L+H VL VGYG E+D +
Sbjct: 181 MKAVAEKGPISVAVDAGLMSFQFYKDGIYFDPEC--SSKDLNHGVLVVGYGYEEVDTEKN 238
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG +GY+ ++ ++N+CG+ TA +Y
Sbjct: 239 EYWLVKNSWGPEWGAEGYIKIARNRNNHCGIATAASY 275
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + ++ + SE+ALK A+A GPVSV
Sbjct: 193 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVYYD C + ++H VL VGYG LDGK YW VKNSW ++G+QG
Sbjct: 252 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 308
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ P+Y
Sbjct: 309 YIRMARNSGNHCGIANYPSY 328
>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
Length = 358
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 116/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY +K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 158 DQGHCGSCWTFSTTGALEAAYVQAFRKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY+G D C + + VN+T E LK A+A PVSVA
Sbjct: 218 KYNGGLDTEAAY-PYVGTDGACKFSAENVGVQVLDSVNITLGDEQELKHAVAFVRPVSVA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
KSF Y +GVY + C +SP ++HAVLAVGYGE G P+W +KNSW WG+ G
Sbjct: 277 FQVV-KSFRIYKSGVYTSDTCGSSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M N CGV T +Y
Sbjct: 336 YFKMEFGKNMCGVATCASY 354
>gi|440293514|gb|ELP86618.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
Length = 319
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 124/204 (60%), Gaps = 7/204 (3%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGA--YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
+ DQ+ CG C+SFG ++EG + K LS++ I CS NNGC+GG ++
Sbjct: 116 IRDQNPCGGCYSFGALASLEGRLLFIKGDPKTLDLSEEQAIRCSE---NNGCNGGFGYKV 172
Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Y +I ++G+ + DY P+ D C + + AT++G+ V + DALK AL G V
Sbjct: 173 YDYIKENGIVQEKDY-PFTATDGECKVDLSKKFATVSGYTKVAKRNNDALKSALVD-GVV 230
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
VAIDAS F Y GVY+D KC+++ + L+H + AVGYG L+G+ YW V+NSW WG
Sbjct: 231 DVAIDASSVKFQLYKTGVYHDTKCSSAINRLNHEISAVGYGNLNGEEYWIVRNSWGRSWG 290
Query: 327 NQGYVLMSIKDNNCGVMTAPTYVT 350
++GY+LM+I+ N CGV T P Y T
Sbjct: 291 DKGYILMAIEGNTCGVTTDPVYPT 314
>gi|351700981|gb|EHB03900.1| Cathepsin H [Heterocephalus glaber]
Length = 334
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 124/203 (61%), Gaps = 3/203 (1%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
+++ +Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG ++
Sbjct: 129 SAVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 188
Query: 207 YQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+++I+ G+ +D Y PY G+D +C A A + VN+T N E+A+ A+A + P
Sbjct: 189 FEYILYNKGIMGEDTY-PYEGKDGHCRFQPQKAIAFVKDIVNITLNDEEAMVEAVALYNP 247
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VS A + ++ F Y G+Y C+ +PD ++HAVLAVGYG G PYW VKNSW T W
Sbjct: 248 VSFAYEVTE-DFMSYKRGIYSSTSCHKTPDKVNHAVLAVGYGVDHGVPYWIVKNSWGTQW 306
Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
GN GY L+ N CG+ +Y
Sbjct: 307 GNNGYFLIERGKNMCGLAACASY 329
>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
occidentalis]
Length = 642
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/215 (43%), Positives = 130/215 (60%), Gaps = 8/215 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ + +GY + +Q CGSCW+F TGAVEG ++ +L LS+Q L+DC + G
Sbjct: 432 IDWRQQGYVTPVKNQGNCGSCWAFSATGAVEGQHFKATGRLESLSEQNLVDCV--KESKG 489
Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG +++Q+I +G + T+D Y PY D C + AT++G+ + SE L
Sbjct: 490 CDGGFFEQAFQYIKDNGGINTEDSY-PYEAFDGSCRFREDSIGATVSGYQTIPKGSEADL 548
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A++ GP+SVAID S SF Y GVYY+ C++S LDHAVL VGYG G+ YW
Sbjct: 549 QKAVSTIGPISVAIDVSNPSFQNYREGVYYEPSCSSS--NLDHAVLVVGYGSDGGEDYWL 606
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
VKNSW T +G QGYV M+ K NNCG+ +A Y T
Sbjct: 607 VKNSWGTSFGEQGYVRMARNKGNNCGIASAAAYPT 641
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 79/216 (36%), Positives = 121/216 (56%), Gaps = 6/216 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CG+CW+F TGA+EG ++ L LS+Q ++DC +NG
Sbjct: 110 VDWTQQGYVTPVKDQGKCGACWTFAATGAIEGQHFKATGNLVSLSEQNILDCVKTATSNG 169
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG ++ ++ G ++ PY C + AT++G+ ++ +E L+
Sbjct: 170 CSGGLFVEAFDYLKNSGGIDAEESYPYEASGGTCRFRQDSVAATVSGYQAISAGNEAELQ 229
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GP+SV ID+ F Y G+YY+ +C + L HAVL VGYG +G+ YW V
Sbjct: 230 EAVATIGPISVGIDSGHPGFQHYTGGIYYEPECT---EHLSHAVLVVGYGTENGEDYWLV 286
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY-VTM 351
KNSW +G QGY+ M+ ++NNCG+ T Y +TM
Sbjct: 287 KNSWGASYGLQGYIKMARNRNNNCGIATGAAYPITM 322
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
CDGG ++Q++ ++ GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CDGGLMDYAFQYVQENGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKLEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ +C S + +DH VL VGYG D
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGIYFEPEC--SSEDMDHGVLVVGYGFERTGSDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E+AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEEALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 130/225 (57%), Gaps = 4/225 (1%)
Query: 127 VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
VR A D++ R + + +Q CG+CW+F TGA+EG +++ L LS+Q L
Sbjct: 105 VRSTFALADSVDWR-KSGAVTGVKNQGKCGACWAFSATGALEGQHFINTGTLISLSEQQL 163
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
+DCS +GNNGC GG +++++ ++ PYL + C ++ A T +
Sbjct: 164 MDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYPYLAEVGTCRYNSSEAKVKNTVYK 223
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
++ EDAL+ A+A GP+SV+I++ SF Y GVYY+ C++S LDH VL +GY
Sbjct: 224 DIPEGDEDALQEAVATIGPISVSINSEHSSFQLYDQGVYYEPTCSSSK--LDHGVLVIGY 281
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
G D YW VKNSW T WG GY++MS K+NNCG+ T +Y T
Sbjct: 282 GTSDNNDYWLVKNSWGTNWGMDGYIMMSRNKENNCGIATRASYPT 326
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 201 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 318 YIRMARNNKNHCGIASDCSY 337
>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
gi|227966|prf||1714236A cathepsin S
Length = 217
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 19 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 78
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + ++ + SE+ALK A+A GPVSV
Sbjct: 79 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 137
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y GVYYD C + ++H VL VGYG LDGK YW VKNSW ++G+QG
Sbjct: 138 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 194
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ P+Y
Sbjct: 195 YIRMARNSGNHCGIANYPSY 214
>gi|256535831|gb|ACU82390.1| cathepsin L 2 [Pheronema raphanus]
Length = 320
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 124/214 (57%), Gaps = 3/214 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F G++EG Y+ LS+Q L+DCS GN G
Sbjct: 109 VDWRTKGYVTKVKDQKQCGSCWAFSAAGSLEGQYFTGQTNY-FLSEQQLVDCSRKEGNEG 167
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CDGG+ ++++ K+G + Y YLG+DA C A A +TG+ + ++LK
Sbjct: 168 CDGGDMVLAFKYYEKYGAMNESVY-SYLGRDAKCKFDENKAVAKVTGYKKIPKMDCNSLK 226
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A A GP+SVA+DAS SF Y +G+Y ++C LDH VL VGYG G+ +W +
Sbjct: 227 NAAAVTGPISVAMDASHNSFQIYSSGIYDPKRCGKRNRQLDHGVLVVGYGTESGEDFWLI 286
Query: 318 KNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
KNSW WG QGY +++ + CG+ T+ ++ T+
Sbjct: 287 KNSWGEDWGMQGYFKIAVGKDECGICTSASFPTV 320
>gi|1498185|dbj|BAA06738.1| cysteine proteinase-1 precursor [Drosophila melanogaster]
Length = 254
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 128/214 (59%), Gaps = 7/214 (3%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNG
Sbjct: 41 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 100
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I + G+ T+ Y PY D CH AT GF ++ E +
Sbjct: 101 CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 159
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
+ GPVSVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW
Sbjct: 160 PEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 217
Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
VKNSW T WG++G++ ++ K+N CG+ + +Y
Sbjct: 218 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 251
>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
Length = 327
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 3/224 (1%)
Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
++R + +A+ R + + +Q CGSCW+F TTG +E A + KL L++Q
Sbjct: 101 FLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQ 160
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DC+ + N+GC GG +++++I+ GL +D Y PY Q+ C A A +
Sbjct: 161 LVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAY-PYRAQNGTCKFQPDKAIAFVKD 219
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+N+T E + A+ KH PVS A + + F Y GVY + +C ++PD ++HAVLAV
Sbjct: 220 VINITQYDEAGMVEAVGKHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 278
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
GYGE DG+PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 279 GYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 322
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 131 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 190
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 191 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 251 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 307
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 308 YIRMARNNKNHCGIASYCSY 327
>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
Length = 281
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 122/205 (59%), Gaps = 7/205 (3%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFR 205
N L D CG+CW+F GA+E +K KL LS Q L+DCS Y N GC+GG R
Sbjct: 78 NHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTR 137
Query: 206 SYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
++Q+I+ +G+ ++ Y PY D C AT + + + SEDALK A+A G
Sbjct: 138 AFQYIIDNNGIDSEASY-PYKATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKG 196
Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
PVSV IDAS+ SF Y +GVYYD C D ++H VL VGYG L+GK YW VKNSW
Sbjct: 197 PVSVGIDASRPSFFLYKSGVYYDPSCT---DNVNHGVLVVGYGNLNGKDYWLVKNSWGLN 253
Query: 325 WGNQGYVLMSIKD-NNCGVMTAPTY 348
+G QGY+ M+ N+CG+ + P+Y
Sbjct: 254 FGEQGYIRMARNSGNHCGIASFPSY 278
>gi|398258447|gb|AFO72231.1| cathepsin L-like protease, partial [Leucinodes orbonalis]
Length = 207
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 96/190 (50%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
A+P V + KG + DQ CGSCWSF +TGA+EG ++ K L LS+Q LIDCS
Sbjct: 20 ALPEEVDWRKKGAVTEVKDQGKCGSCWSFSSTGALEGQHFRKSGYLVSLSEQNLIDCSSA 79
Query: 193 YGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
YGNNGC+GG ++++I HG+ T+ Y PY D C + + A GFV+V
Sbjct: 80 YGNNGCNGGLMDNAFKYIRDNHGIDTEKTY-PYEAVDDKCRYNSKNSGADDVGFVDVPAG 138
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELD 310
ED LK A+A GPV+VAIDASQ+SF FY GVYYDE C++S LDH VL VGYG + D
Sbjct: 139 DEDKLKEAVATVGPVAVAIDASQESFQFYSTGVYYDENCSSS--NLDHGVLVVGYGTDED 196
Query: 311 GKPYWQVKNS 320
G YW VKNS
Sbjct: 197 GGDYWLVKNS 206
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 122/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 191
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY+G++ C A A + VN+T N E A+ A+A + PVS A
Sbjct: 192 LYNKGIMGEDSY-PYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFA 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y +GVY C+ +PD ++HAVLAVGYGE +G YW VKNSW + WGN G
Sbjct: 251 FEVTE-DFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNG 309
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 310 YFLIERGKNMCGLAACASY 328
>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
Length = 334
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 134/221 (60%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRAQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
C+GG ++Q++ + GL T++ Y PYL +++ C+ + A TGFV++ P E A
Sbjct: 178 CNGGLMDNAFQYVKDNKGLDTEESY-PYLARESNTCNYRPEYSAANDTGFVDI-PQREKA 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--- 312
L A+A GP+SVAIDA SF FY G+YY+ C S LDH VL VGYG G+
Sbjct: 236 LLKAVATVGPISVAIDAGHSSFQFYNAGIYYEPNC--SSKDLDHGVLVVGYGSEGGESKN 293
Query: 313 -PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+W VKNSW + WG GYV M+ + N+CG+ TA +Y T+
Sbjct: 294 NKFWIVKNSWGSGWGMNGYVKMARDQSNHCGIATAASYPTV 334
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/221 (42%), Positives = 126/221 (57%), Gaps = 10/221 (4%)
Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
AIPV + KGY + +Q CGSCW+F GA+EG ++ K KL LS+Q L+D
Sbjct: 74 AIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHFRKTGKLVSLSEQNLVD 133
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS YGNNGC+GG ++++I + + PY D C AT G+ ++
Sbjct: 134 CSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCRFKRECVGATCRGYTDL 193
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
+E +K A+A GPVSVAIDAS SF Y GVY +++C SP LDH VL VGYG
Sbjct: 194 PWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKEC--SPYQLDHGVLVVGYGT 251
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
G YW VKNSW T WG+QGY+ M+ N+CG+ + Y
Sbjct: 252 EQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASMACY 292
>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
Length = 350
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 150 DQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C + + G VN+T +ED LK A+A PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGSNGLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVA 268
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY C ++P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWGDHG 327
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 127 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 186
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 187 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 247 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 303
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 304 YIRMARNNKNHCGIASYCSY 323
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 123/200 (61%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+EG +K KL LS Q L+DCS Y N GC GG ++Q++
Sbjct: 133 QGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYV 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D CH + AT + + + SE+ALK A+A GPVSVA
Sbjct: 193 IDNNGIDSETSY-PYKATDEKCHYDSKNRAATCSRYTELPYGSEEALKEAVANKGPVSVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+DAS+ SF Y NGVY D C + + H VLAVGYG L+GK YW VKNSW Y+G+QG
Sbjct: 252 VDASRPSFFLYKNGVYDDPSCTQN---VTHGVLAVGYGNLNGKDYWLVKNSWGLYFGDQG 308
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ K N+CG+ + +Y
Sbjct: 309 YIRMARNKGNHCGIASYSSY 328
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 143 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 202
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 203 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 263 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 319
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 320 YIRMARNNKNHCGIASYCSY 339
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 150 DQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C + + G VN+T +ED LK A+A PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGSNGLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVA 268
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY C ++P ++HAVLAVGYG DG PYW +KNSW WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWGDHG 327
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 144 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 203
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 204 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 263
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 264 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 320
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 321 YIRMARNNKNHCGIASYCSY 340
>gi|203341|gb|AAA63484.1| cathepsin H [Rattus norvegicus]
Length = 298
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 122/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 97 NQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 156
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY+G++ C A A + VN+T N E A+ A+A + PVS A
Sbjct: 157 LYNKGIMGEDSY-PYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFA 215
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y +GVY C+ +PD ++HAVLAVGYGE +G YW VKNSW + WGN G
Sbjct: 216 FEVTE-DFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNG 274
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 275 YFLIERGKNMCGLAACASY 293
>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
Length = 354
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 119/203 (58%), Gaps = 3/203 (1%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
+S+ DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC GG ++
Sbjct: 150 SSVKDQGSCGSCWTFSTTGALEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQA 209
Query: 207 YQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
+++I + GL T++ Y PY G+D C + + VN+T +ED LK A+A P
Sbjct: 210 FEYIKYNGGLETEEAY-PYTGKDGVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRP 268
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSVA F FY NGV+ + C ++ ++HAVLAVGYG +G PYW +KNSW W
Sbjct: 269 VSVAFQVVN-GFHFYENGVFTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESW 327
Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
G GY M + N CGV T +Y
Sbjct: 328 GENGYFKMELGKNMCGVATCASY 350
>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
Length = 329
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 3/224 (1%)
Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
++R + +A+ R + + +Q CGSCW+F TTG +E A + KL L++Q
Sbjct: 103 FLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQL 162
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DC+ + N+GC GG +++++I+ GL +D Y PY Q+ C A A +
Sbjct: 163 LVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAY-PYRAQNGTCKFQPDKAIAFVKD 221
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+N+T E + A+ KH PVS A + + F Y GVY + +C ++PD ++HAVLAV
Sbjct: 222 VINITQYDEAGMVEAVGKHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 280
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
GYGE DG+PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 281 GYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 324
>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
Length = 335
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 123/199 (61%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC+GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCEGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D +C A A + VN+T N E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGHCRFQPQKAIAFVKDVVNITLNDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y +G+Y C+ +PD ++HAVLAVGYG +G PYW VKNSW T WG G
Sbjct: 253 FEVTE-DFISYQSGIYSSTSCHKTPDKVNHAVLAVGYGVQNGVPYWIVKNSWGTAWGQDG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ ++
Sbjct: 312 YFLIERGKNMCGLAACASF 330
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 201 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337
>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
Length = 360
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G++ C ++ + VN+T +ED LK A+A PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F Y +GVY +C N+P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356
>gi|146386366|gb|ABQ23971.1| cathepsin K [Oryctolagus cuniculus]
Length = 183
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 116/186 (62%), Gaps = 6/186 (3%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
CGSCW+F + GA+EG K KL LS Q L+DC N GC GG ++Q++ ++
Sbjct: 1 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYGCGGGYMTNAFQYVQRNR 58
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
G+ ++D Y PY+GQD C T A G+ + +E ALK A+A+ GPVSVAIDAS
Sbjct: 59 GIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 117
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
SF FY GVYYDE C S D ++HAVLAVGYG G +W +KNSW WGN+GY+LM
Sbjct: 118 LTSFQFYSKGVYYDENC--SSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILM 175
Query: 334 SIKDNN 339
+ NN
Sbjct: 176 ARNKNN 181
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 201 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CG+CW+F GA+EG +K KL LS Q L+DCS YGN GC GG ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY D CH + AT + ++ + EDALK A+A GPVSV
Sbjct: 201 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF FY +GVY D C + ++H VL VGYG LDGK YW VKNSW +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N+CG+ + +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G++ C ++ + VN+T +ED LK A+A PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F Y +GVY +C N+P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANGTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|2146900|pir||S67481 cathepsin L-like cysteine proteinase (EC 3.4.22.-) CP1 [similarity]
- fruit fly (Drosophila melanogaster) (fragment)
Length = 218
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 128/214 (59%), Gaps = 7/214 (3%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG ++ DQ CGSCW+F +TGA+EG ++ K L LS+Q L+DCS YGNNG
Sbjct: 5 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++ +I + G+ T+ Y PY D CH AT GF ++ E +
Sbjct: 65 CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
+ GPVSVAIDAS +SF FY GVY + +C+ LDH VL VG+G + G+ YW
Sbjct: 124 PEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 181
Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
VKNSW T WG++G++ ++ K+N CG+ + +Y
Sbjct: 182 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 215
>gi|333827692|gb|AEG19548.1| cathepsin L-like cysteine protease [Taenia pisiformis]
Length = 338
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 121/200 (60%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+E A K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYL 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
H + + Y PY D C + T+T + +E AL A+A GP+S+AI
Sbjct: 200 EDHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLAVGYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGY 316
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CG+ + +
Sbjct: 317 IMMA-KDYHNMCGIASLADF 335
>gi|209738038|gb|ACI69888.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
Length = 367
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 125/205 (60%), Gaps = 9/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTGA+E Y +K+ K+ + S+Q L+DC + GC+GG ++++I
Sbjct: 135 DQGNCGSCWSFSTTGAMESQYRLKYGKMKLFSEQQLVDCDRQNIDQGCNGGFPVAAFEYI 194
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIA---NTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+ GL T+++Y PY C N +T V +E+AL A+ K GP+S
Sbjct: 195 REFGLLTEEEY-PYSAHSNQCRFKPDENGHLNSTKVTGYTVIEMNENALTEAIYKRGPIS 253
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYW 325
VAIDAS F FY +GVY + C ++ LDHAVLAVG+G ++ PY+ VKNSWS+ W
Sbjct: 254 VAIDASSSDFQFYHSGVYQNPSCGSAVSELDHAVLAVGFGVDKVHKTPYYIVKNSWSSGW 313
Query: 326 GNQGYVLMSIKD--NNCGVMTAPTY 348
G+ GY+ M I++ NNCG+ T TY
Sbjct: 314 GDHGYIKM-IRNGKNNCGIATFATY 337
>gi|449270628|gb|EMC81287.1| Cathepsin H, partial [Columba livia]
Length = 261
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 128/224 (57%), Gaps = 3/224 (1%)
Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
++R + D+I R + + +Q CGSCW+F TTG +E A + KL L++Q
Sbjct: 35 FLRSSGPYPDSIDWRKKGNYVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQ 94
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DC+ + N+GC GG +++++I+ GL +D Y PY ++ C A A +
Sbjct: 95 LVDCAQAFNNHGCSGGLPSQAFEYILYNRGLMGEDTY-PYRAENGTCKFQPEKAIAFVRD 153
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+N+T ED + A+ KH PVS A + + +F Y GVY + +C ++PD ++HAVLAV
Sbjct: 154 VINITQYDEDGMVEAVGKHNPVSFAFEVT-SNFMHYRKGVYSNPRCEHTPDKVNHAVLAV 212
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
GYGE DG P+W VKNSW WG GY L+ N CG+ +Y
Sbjct: 213 GYGEEDGTPFWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIEIAKDRDNHCGLATAASY 330
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G++ C ++ + VN+T +ED LK A+A PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F Y +GVY +C N+P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYSSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVA+DA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
Length = 326
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 124/218 (56%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMK--GY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA+K A+P R + + GY + DQ CGSCW+F TTGA+EG Y + S+Q L+
Sbjct: 102 KANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GN GC+GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSRDFGNYGCNGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V E L+ + GP +VA+D + F Y +G+Y + C SPD L+H VLAVGYG
Sbjct: 221 VHSGDEVELQNLVGAEGPAAVALDV-ESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
DG YW VKNSW T+WG GY+ M K N CG+ +
Sbjct: 278 IQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
Length = 291
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 121/207 (58%), Gaps = 6/207 (2%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGE 202
G N + D CGSCW+F GA+EG +K KL LS Q L+DCS YGN GC GG
Sbjct: 85 GMNHMGDMGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGF 144
Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
++Q+I+ +G + PY D CH AT + ++ + E+ALK A+A
Sbjct: 145 MTEAFQYIIDNGGIDSEASYPYKAMDEKCHYDPKNRAATCSRYIELPFGDEEALKEAVAT 204
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
GPVSV IDAS SF Y +GVY D C + ++H VL VGYG LDGK YW VKNSW
Sbjct: 205 KGPVSVGIDASHSSFFLYQSGVYDDPSCT---ENVNHGVLVVGYGTLDGKDYWLVKNSWG 261
Query: 323 TYWGNQGYVLMSIKD-NNCGVMTAPTY 348
++G+QGY+ M+ + N+CG+ + +Y
Sbjct: 262 LHFGDQGYIRMARNNKNHCGIASYCSY 288
>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 334
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ +G ++ PYL ++ C+ + A TG+V++ P E AL
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYKPEYSAANDTGYVDI-PQKEKAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY +G+YYD C S LDH VL VGYG + +
Sbjct: 237 MKAVATVGPISVAIDAGHESFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGRDSNNN 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
+W VKNSW WG GYV M+ NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|332384364|gb|AEE69034.1| cysteine protease [Taenia pisiformis]
Length = 338
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 121/200 (60%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+E A K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYL 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
H + + Y PY D C + T+T + +E AL A+A GP+S+AI
Sbjct: 200 EDHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLAVGYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGY 316
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CG+ + +
Sbjct: 317 IMMA-KDYHNMCGIASLADF 335
>gi|111036376|dbj|BAF02517.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 123/200 (61%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EGA K KL LS+Q L+DC+ GN+GC+GG +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGALAKKTGKLISLSEQQLVDCTLENGNDGCNGGYMSNAFKYL 199
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
H + + Y PY D C + ++T ++ +E AL A+A GP+S+AI
Sbjct: 200 EGHSIEPESAY-PYRATDGPCRYNESLGVGSVTDIGDIPEGNETALMEAVATVGPISIAI 258
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS F FY +G+Y C S L+H VLA+GYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASTLGFMFYHHGIYKSHWC--SSKFLNHGVLAIGYGKLDGKPYWLVKNSWGSRWGMKGY 316
Query: 331 VLMSIKD--NNCGVMTAPTY 348
++M+ KD N CGV + +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 105 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 164
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 165 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 222
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 223 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 280
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 281 GMEGYIKIAKDRDNHCGLATAASY 304
>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
Length = 360
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G++ C ++ + VN+T +ED LK A+A PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F Y +GVY +C N+P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356
>gi|159464745|ref|XP_001690602.1| cystein endopsptidase [Chlamydomonas reinhardtii]
gi|158280102|gb|EDP05861.1| cystein endopsptidase [Chlamydomonas reinhardtii]
Length = 616
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 130/223 (58%), Gaps = 13/223 (5%)
Query: 133 SKDAIPVRYEMKGYNS---LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S+ +P + +G + + DQ +CGSC++F TG ++G +++ + S+Q +IDC
Sbjct: 374 SRADLPPSVDWRGTGADPGVKDQGMCGSCYTFAATGTMDGTWFVATGQRRSFSEQQIIDC 433
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-----GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
+W YG NGC GG YQ ++ + G+ + DY Y G+ YC +N T +G
Sbjct: 434 AWDYGPNGCFGG----YYQPVLNYVAEQGGMALEQDYT-YRGEPGYCRASNHTRVGLFSG 488
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
++NV +E AL A+AK+GP++V+++A ++FSFY GV+ + C LDH V
Sbjct: 489 YMNVESRNELALMEAVAKYGPIAVSVNADPEAFSFYSEGVFDEPACTTRMRDLDHTVTLF 548
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPT 347
GYG DGK YW V+NSWS +WG+ GY+ + ++CG+ T P
Sbjct: 549 GYGSQDGKDYWLVRNSWSHFWGDDGYIKIVRGKHDCGIATDPA 591
>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
Length = 333
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/219 (42%), Positives = 128/219 (58%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++Q++ +G ++ PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSVANDTGFVDI-PKQEKALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 237 KAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNSK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 295 YWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|301769891|ref|XP_002920367.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281346353|gb|EFB21937.1| hypothetical protein PANDA_009084 [Ailuropoda melanoleuca]
Length = 333
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 133/217 (61%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + + Q C SCW+F TGA+EG + K KL LS+Q L+DCSW N+G
Sbjct: 118 VDWREKGYVTPVKYQGHCQSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQNNDG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ + GL + + Y PYLG++ C + A +T F +V+ N ED L
Sbjct: 178 CRGGLMDNAFRYVKDNGGLDSAESY-PYLGRNESCKYRPEKSAANLTTFWSVS-NKEDGL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
+A GPVS A+D+S SF FY G+YYD C + + L+HAVL VGYG E + K
Sbjct: 236 MTTVATVGPVSAAVDSSLHSFQFYKKGIYYDPNCRS--NRLNHAVLVVGYGFEGEESENK 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW +KNSW T WG +GY+L++ +DN+CG+ T ++
Sbjct: 294 KYWIIKNSWGTNWGMKGYMLLAKDRDNHCGIATMASF 330
>gi|454101|gb|AAA82966.1| cathepsin H prepropeptide [Mus musculus]
Length = 333
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 121/198 (61%), Gaps = 1/198 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q C SCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 132 NQGACASCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYI 191
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + ++D PY+G+D+ C A A + VN+T N E A+ A+A + PVS A
Sbjct: 192 LYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+ ++ F Y +GVY + C+ +PD ++HAVLAVGYGE +G YW VKNSW + WG GY
Sbjct: 252 EVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGY 310
Query: 331 VLMSIKDNNCGVMTAPTY 348
L+ N CG+ +Y
Sbjct: 311 FLIERGKNMCGLAACASY 328
>gi|432853333|ref|XP_004067655.1| PREDICTED: cathepsin L2-like [Oryzias latipes]
Length = 352
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 132/236 (55%), Gaps = 8/236 (3%)
Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
G++VS L RYN +A V Y GY + DQ CGSCW+F TTGA+EG K
Sbjct: 119 RGKEVSARRL--RYNARKSEAGFVDYRNMGYVTEVKDQGYCGSCWAFSTTGAIEGQIVKK 176
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHI 233
+L LS+Q L+DCS YG +GC G +Y +++ +GL T D Y PY D C
Sbjct: 177 TGQLLSLSEQNLVDCSRPYGTHGCSGAWMASAYDYVLSNGLQTTDSY-PYTSVDTQPCFY 235
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
+ A A + + + E AL A+A GP++VAIDA SF FY +G+Y + C+
Sbjct: 236 DSRLAVAHIKDYRFIPQGDEQALADAVATIGPITVAIDADHASFLFYSSGIYDEPNCD-- 293
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
P+ L HAVL VGYG +G+ YW +KNSW + WG GY+ ++ N CG+ + Y
Sbjct: 294 PNRLSHAVLLVGYGSEEGQDYWIIKNSWGSSWGEGGYMRIIRNGSNTCGIASYALY 349
>gi|348542778|ref|XP_003458861.1| PREDICTED: digestive cysteine proteinase 3-like [Oreochromis
niloticus]
Length = 218
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/196 (45%), Positives = 126/196 (64%), Gaps = 6/196 (3%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
CGSCW+F TGA+EG ++ K L LS+Q L+DCS + N+GCDGG ++++I +
Sbjct: 24 CGSCWAFSATGALEGQHFKKTGNLVSLSEQQLVDCSRNFFNHGCDGGWMIPAFKYIKDNG 83
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
G+ T++ Y Y +D CH A +G+ V + E+ALK A+A GP+S+A+DAS
Sbjct: 84 GIQTEESY-TYEARDGRCHYNANFVGAQCSGYGTVKQD-EEALKQAVAAIGPISIAVDAS 141
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
+SF Y +GVY + C+N L+HAVLAVGYG +G YW VKNSW + WGN+GY+ M
Sbjct: 142 HESFQLYQSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKM 199
Query: 334 SI-KDNNCGVMTAPTY 348
+ KDN CG+ T +Y
Sbjct: 200 TRNKDNQCGIATEASY 215
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|301775254|ref|XP_002923050.1| PREDICTED: cathepsin H-like [Ailuropoda melanoleuca]
Length = 307
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 13/227 (5%)
Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
Y +V + K K PV+ +Q CGSCW+F TTGA+E A +K KL L+
Sbjct: 88 YPPFVDWRKKGKFVSPVK----------NQGGCGSCWTFSTTGALESAIAIKTGKLLSLA 137
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DC+ + N+GC GG +++++I G+ +D Y PY GQD C + A A
Sbjct: 138 EQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSY-PYKGQDGDCKFQPSKAIAF 196
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+ N+T N E A+ A+A PVS A + + F Y GVY C+ +PD ++HAV
Sbjct: 197 VKDVANITINDEQAMVEAVALFNPVSFAFEVT-GDFMMYRKGVYSSTSCHKTPDKVNHAV 255
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
LAVGYGE +G PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 256 LAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACASY 302
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|281350252|gb|EFB25836.1| hypothetical protein PANDA_012122 [Ailuropoda melanoleuca]
Length = 294
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 13/227 (5%)
Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
Y +V + K K PV+ +Q CGSCW+F TTGA+E A +K KL L+
Sbjct: 75 YPPFVDWRKKGKFVSPVK----------NQGGCGSCWTFSTTGALESAIAIKTGKLLSLA 124
Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
+Q L+DC+ + N+GC GG +++++I G+ +D Y PY GQD C + A A
Sbjct: 125 EQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSY-PYKGQDGDCKFQPSKAIAF 183
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+ N+T N E A+ A+A PVS A + + F Y GVY C+ +PD ++HAV
Sbjct: 184 VKDVANITINDEQAMVEAVALFNPVSFAFEVT-GDFMMYRKGVYSSTSCHKTPDKVNHAV 242
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
LAVGYGE +G PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 243 LAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACASY 289
>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
Length = 328
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 123/198 (62%), Gaps = 4/198 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TG++EG Y++ + KL S+ L+DCS YGNNGC GG ++++
Sbjct: 132 NQGQCGSCWAFSSTGSLEGQYFINNDKLLSFSESELVDCSRRYGNNGCKGGLMDNAFRYW 191
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + DY PY+ +D C + T++ + NV S+ +L+ A+ GP+SVA+
Sbjct: 192 EVYKEELESDY-PYVAKDGPCRYSQDKGVTTISSYKNVPHFSQISLQDAVRTIGPISVAM 250
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS KSF Y +GVY + +C+ + LDH VL VGYG +P+W VKNSW WG GY
Sbjct: 251 DASHKSFQLYHSGVYSESECSQTK--LDHGVLVVGYG-TSSEPFWLVKNSWGAGWGMDGY 307
Query: 331 VLMSIKDNNCGVMTAPTY 348
+++++N CG+ T P+Y
Sbjct: 308 FEIAMRNNMCGLETEPSY 325
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 126/217 (58%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KG + +Q CGSCW+F + GA+EG KL LS Q L+DC NNG
Sbjct: 120 IDYRRKGMVTPVKNQGSCGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCV--TENNG 177
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++ ++ G+ ++ Y PY+GQD C + TA+ G+ + +E AL
Sbjct: 178 CGGGYMTNAFNYVRDNQGIDSEAAY-PYIGQDETCAYNVSGMTASCRGYKEIPEGNERAL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
+A+AK GPVSV IDA+ +F FY GVYYD CN D ++HAVLAVGYG GK YW
Sbjct: 237 TVAVAKVGPVSVGIDATLSTFQFYQKGVYYDRNCN--KDDINHAVLAVGYGVTPKGKKYW 294
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSWS WGN+GY+LM+ N CG+ +Y M
Sbjct: 295 IVKNSWSESWGNKGYILMARNRGNLCGIANLASYPIM 331
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TG++EG + K L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL + Y PY+G+D C + A TGFV+V P E +
Sbjct: 178 CNGGLMDFAFQYVKDNKGLEAEKSY-PYVGKDGECKYKPELSAANDTGFVDV-PQREKVV 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
+ ALA GP+SVAIDA +SF FY G+YYD C S L+H VL VGYG E
Sbjct: 236 QKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGC--SSRDLNHGVLLVGYGTDASETGKG 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW +KNSW T WG GYV ++ ++N+CGV TA +Y
Sbjct: 294 DYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASY 330
>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
Length = 352
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/198 (41%), Positives = 113/198 (57%), Gaps = 1/198 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+ CGSCW+F TTGA+E AY H K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 152 DQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 211
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G + PY +D C + VN+T +ED LK A+A PVSVA
Sbjct: 212 KYNGGIALEKEYPYTAKDEACKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVAF 271
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
F Y GVY + C N+P ++HAVLAVGYG + PYW +KNSW + WG+ GY
Sbjct: 272 QVVD-GFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGY 330
Query: 331 VLMSIKDNNCGVMTAPTY 348
M + N CGV T +Y
Sbjct: 331 FKMELGKNMCGVATCASY 348
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 120/201 (59%), Gaps = 6/201 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F A+EGA+++K L LS+Q L+DCS YGN GC+GG +++YQ+I
Sbjct: 123 DQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYI 182
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D C AT++ +V E AL+ A+ GPVSV
Sbjct: 183 IANRGIDTESSY-PYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVC 241
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
IDA Q SF Y GVYY+ C++ +HAV AVGYG + +G YW VKNSW +WG
Sbjct: 242 IDAGQSSFGSYGGGVYYEPNCDSWY--ANHAVTAVGYGTDANGGDYWIVKNSWGAWWGES 299
Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DNNC + T Y
Sbjct: 300 GYIKMARNRDNNCAIATYSVY 320
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS Y N GC+GG R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + + + SEDALK A+A GPVSV
Sbjct: 193 IDNNGIDSEASY-PYKATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVG 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS+ SF Y +GVYYD C D ++H VL VGYG L+GK YW VKNSW +G QG
Sbjct: 252 IDASRPSFFLYKSGVYYDPSCT---DNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQG 308
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ + P+Y
Sbjct: 309 YIRMARNSGNHCGIASFPSY 328
>gi|410042826|ref|XP_003951516.1| PREDICTED: cathepsin L1 [Pan troglodytes]
Length = 278
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 63 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 122
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 123 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 180
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 181 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 238
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 239 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 278
>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
Length = 379
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 133/243 (54%), Gaps = 36/243 (14%)
Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG S + +Q CGSCWSF TTG+VEGA+Y+ K+ LS+Q L+DCS GN G
Sbjct: 137 VDWRAKGAVSPIKNQGQCGSCWSFSTTGSVEGAHYISTGKMVPLSEQNLVDCSGSEGNMG 196
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
C GG ++ +I+K+ G+ T+D Y PY + C T AT++ + N+T E
Sbjct: 197 CQGGLMNLAFDYIIKNEGIDTEDSY-PYSAETGKKCLFNKTNVGATISSYKNITSGDESN 255
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD----- 310
L A+ GPVSVAIDAS SF Y +G+YY++ C S LDH VL VGYG D
Sbjct: 256 LADAVKNAGPVSVAIDASHNSFQLYSHGIYYEKDC--SSVNLDHGVLVVGYGSGDPSSLA 313
Query: 311 ------------------------GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTA 345
YW VKNSW + WG+ G++ MS+ +DNNCG+ T+
Sbjct: 314 NNVGGRSGPKMVVFNNRMVKTPSSNGDYWIVKNSWGSTWGSHGFIFMSMNRDNNCGIATS 373
Query: 346 PTY 348
+Y
Sbjct: 374 ASY 376
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 121/212 (57%), Gaps = 4/212 (1%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCW+F TG++EG ++ K L LS+Q L+DC+ YGN G
Sbjct: 112 VDWRNQGYVTPVKDQGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYG 171
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +Y +I G + PY +D C + AT G+V + E AL
Sbjct: 172 CNGGLMESAYDYIKGVGGVELESAYPYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALM 231
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+ GPV+V+IDAS SF Y +GVY +C S LDH VLAVGYG G+ YW V
Sbjct: 232 QAVGTIGPVAVSIDASGYSFQLYESGVYDFRRC--SSTNLDHGVLAVGYGTEGGQNYWLV 289
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
KNSW WG+QGY+ MS K+N CG+ T Y
Sbjct: 290 KNSWGPGWGDQGYIKMSKDKNNQCGIATDSCY 321
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS G Y N GC+GG ++Q+I
Sbjct: 144 QGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYI 203
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C + AT + + + +SEDALK A+A GPVSVA
Sbjct: 204 IDNNGIESEASY-PYKAMDGKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVSVA 262
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYYD C ++H VL VGYG L+GK YW VKNSW ++G+QG
Sbjct: 263 IDASHPSFFLYRSGVYYDPACTLH---VNHGVLVVGYGNLNGKDYWLVKNSWGLHFGDQG 319
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ + +Y
Sbjct: 320 YIRMARNSGNHCGIASYASY 339
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 132/220 (60%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ ++ GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQENGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKLEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ +C S + +DH VL VGYG D
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGIYFEPEC--SSEDMDHGVLVVGYGFERTGSDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
Length = 232
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/195 (42%), Positives = 118/195 (60%), Gaps = 3/195 (1%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KH 213
CGSCW+F TTGA+E A +K K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 35 CGSCWTFSTTGALESAIAIKTGKMLSLAEQQLVDCAQNFNNHGCKGGLPSQAFEYIRYNK 94
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
G+ +D Y PY G+D C A A + N+T N E+A+ A+A + PVS A + +
Sbjct: 95 GIMGEDTY-PYQGKDGTCKFQPEKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFEVT 153
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
+ F Y G+Y C+ +PD ++HAVLAVGYGE +GKPYW VKNSW WG GY L+
Sbjct: 154 E-DFMLYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGKPYWIVKNSWGPQWGMNGYFLI 212
Query: 334 SIKDNNCGVMTAPTY 348
N CG+ +Y
Sbjct: 213 ERGKNMCGLAACASY 227
>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/223 (42%), Positives = 131/223 (58%), Gaps = 7/223 (3%)
Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S A+P V + KG S+ Q CGSCW+F GA+EG KL LS Q L+DC
Sbjct: 118 SGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDC 177
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
S YGN+GC+GG R++Q+++ + D PY GQ CH A + + +
Sbjct: 178 SSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQCHYNPAYRAANCSRYSFLP 237
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
E ALK ALA GP+SVAIDA++ SF+FY +GVY D+ C + ++H VLAVGYG L
Sbjct: 238 EGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTCTRN---VNHGVLAVGYGTL 294
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+GK YW VKNSW + +G++G++ M+ K++ CG+ Y M
Sbjct: 295 NGKDYWLVKNSWGSTFGDKGFIRMARNKNDQCGIALYGCYPIM 337
>gi|410303012|gb|JAA30106.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|294889035|ref|XP_002772673.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239877094|gb|EER04489.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 358
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 92/211 (43%), Positives = 126/211 (59%), Gaps = 19/211 (9%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+Q CGSCW+F TGA+EG Y + + K L S+Q L+DCS GN GC+GG ++
Sbjct: 134 NQQSCGSCWAFSATGALEGRYEIANPKHELVEFSEQQLVDCSSEEGNMGCNGGLMDNAFA 193
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATM-----TGFVNVTPNSEDALKLALAKH 263
++M+HGL T++DY Y D C + A + TGFV+V +A+K AL +
Sbjct: 194 YVMQHGLCTEEDYA-YEAIDEPCRNSTVKEKARLHPHDVTGFVDVHSKDGEAMKEAL-QS 251
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
GPVSVAI+A F FY GV E C +S LDH VL VGYGELDGK YW+VKNSW
Sbjct: 252 GPVSVAIEADMPDFQFYHEGVLTGE-CGDS---LDHGVLLVGYGELDGKKYWKVKNSWGA 307
Query: 324 YWGNQGYVLMS------IKDNNCGVMTAPTY 348
WG++GY+L+ +++ CG++ +Y
Sbjct: 308 EWGHEGYILLERERADGTEEDECGILLQGSY 338
>gi|410990002|ref|XP_004001239.1| PREDICTED: cathepsin L1-like isoform 1 [Felis catus]
Length = 333
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY +Q C S W+F TGA+EG + K KL LS+Q L+DC+W GN G
Sbjct: 118 VNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQNLLDCAWSEGNGG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q++ + GL +++ Y PY Q C ++ A +TGF+ + P +E L
Sbjct: 178 CSGGSMGNAFQYVKDNGGLDSEESY-PYHAQLQSCKYNPESSAADITGFLTI-PKTEYKL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
++A GPVS AIDAS +F FY G+YYD C S + L H VL VGYG ELD K
Sbjct: 236 MRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNC--SHEILHHGVLIVGYGFEGAELDNK 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+LM+ +DN+CG+ + ++ T+
Sbjct: 294 KYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 333
>gi|46251290|gb|AAS84611.1| cathepsin L-like cysteine proteinase I variant form precursor
[Heterodera glycines]
Length = 374
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 123/202 (60%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q +CGSCW+F TGA+EG + L LS+Q LIDCS YGN GC+GG ++Q+I
Sbjct: 173 NQGMCGSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 232
Query: 211 MKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ G+ + Y PY + C AT +G+ ++ E+ L++A+A GPVSV
Sbjct: 233 KDNKGIDKETAY-PYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLRMAVATQGPVSV 291
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
AIDA +SF Y NGVY++++C+ P LDH VL GYG + YW VKNSW T WG
Sbjct: 292 AIDAGHRSFQLYTNGVYFEKECD--PQNLDHGVLVEGYGTDPTQGDYWIVKNSWGTRWGE 349
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
QGY+ M+ ++NNCG+ + ++
Sbjct: 350 QGYIRMARNRNNNCGIASHASF 371
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 121/200 (60%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K L LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D CH + AT + + + SE+ALK A+A GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAMDQKCHYDSKHRAATCSKYTELPFGSEEALKEAVANKGPVSVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ C + ++H VLAVGYG L GK YW VKNSW ++G QG
Sbjct: 252 IDASHSSFFLYRSGVYYEPSCTQN---VNHGVLAVGYGNLKGKDYWLVKNSWGIHFGEQG 308
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ P+Y
Sbjct: 309 YIRMARNSKNHCGIANYPSY 328
>gi|193786743|dbj|BAG52066.1| unnamed protein product [Homo sapiens]
Length = 333
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
Length = 350
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 127/228 (55%), Gaps = 14/228 (6%)
Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
Y +V + K K PV+ +Q CGSCW+F TTGA+E A +K KL L+
Sbjct: 130 YPPFVDWRKKGKFVSPVK----------NQGSCGSCWTFSTTGALESAIAIKSGKLLSLA 179
Query: 183 QQALIDCSWGYGNNGCDG-GEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATA 240
+Q L+DC+ + N+GC G G +++++I G+ +D Y PY GQD C + A A
Sbjct: 180 EQQLVDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSY-PYKGQDGDCKYQPSKAIA 238
Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
+ N+T N E A+ A+A + PVS A + + F Y G+Y C+ +PD ++HA
Sbjct: 239 FVKDVANITINDEQAMVEAVALYNPVSFAFEVTS-DFMMYRKGIYSSTSCHKTPDKVNHA 297
Query: 301 VLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
VLAVGYGE +G PYW VKNSW WG GY LM N CG+ +Y
Sbjct: 298 VLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 345
>gi|302831502|ref|XP_002947316.1| hypothetical protein VOLCADRAFT_79608 [Volvox carteri f.
nagariensis]
gi|300267180|gb|EFJ51364.1| hypothetical protein VOLCADRAFT_79608 [Volvox carteri f.
nagariensis]
Length = 326
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/212 (38%), Positives = 124/212 (58%), Gaps = 12/212 (5%)
Query: 130 NKASKDAIPVRYEMKGYNS----LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
N + + +P + +G + + DQ CGSCW+F GA++GA++ + S+Q
Sbjct: 115 NGSLRGRLPTFVDWRGTGADGPGVKDQVSCGSCWAFSAAGAMQGAWFKATGQALSFSEQQ 174
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCH-------IANTTA 238
L+DC+W YGN+GC GG + Q+++ G Q+ PYL Q+++C+ A
Sbjct: 175 LVDCAWDYGNDGCAGGFVEPTLQYVVDAGGIAQESEYPYLAQNSFCNDVTQRGGPATVPV 234
Query: 239 TATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLD 298
+A +G++N+ E AL A+A HGPV+V ++A+ F FY GVYY+E+C PD +D
Sbjct: 235 SARFSGYLNIPSRDEAALMEAVALHGPVAVLMNAALAPFRFYSEGVYYNEECGQDPDSMD 294
Query: 299 HAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQG 329
HA+L VGYG +G YW +KNSWS YWG G
Sbjct: 295 HAILLVGYGTTPEGVDYWLIKNSWSKYWGMDG 326
>gi|162815|gb|AAA30435.1| cathepsin S, partial [Bos taurus]
gi|312895|emb|CAA43971.1| cathepsin S [Bos taurus]
Length = 196
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/197 (45%), Positives = 119/197 (60%), Gaps = 7/197 (3%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWIM-K 212
CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I+
Sbjct: 1 CGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDN 60
Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
+G+ ++ Y PY D C AT + ++ + SE+ALK A+A GPVSV IDA
Sbjct: 61 NGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDA 119
Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVL 332
S SF Y GVYYD C + ++H VL VGYG LDGK YW VKNSW ++G+QGY+
Sbjct: 120 SHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIR 176
Query: 333 MSIKD-NNCGVMTAPTY 348
M+ N+CG+ P+Y
Sbjct: 177 MARNSGNHCGIANYPSY 193
>gi|148745204|gb|AAI42984.1| Cathepsin L1 [Homo sapiens]
Length = 333
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
Length = 334
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 92/219 (42%), Positives = 128/219 (58%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++Q++ +G ++ PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEKALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 295 YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|15214962|gb|AAH12612.1| Cathepsin L1 [Homo sapiens]
gi|61363426|gb|AAX42388.1| cathepsin L [synthetic construct]
gi|123988681|gb|ABM83856.1| cathepsin L [synthetic construct]
gi|123999196|gb|ABM87178.1| cathepsin L [synthetic construct]
Length = 333
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|344284284|ref|XP_003413898.1| PREDICTED: pro-cathepsin H-like [Loxodonta africana]
Length = 335
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 125/213 (58%), Gaps = 5/213 (2%)
Query: 139 VRYEMKGY--NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
V + KG+ + + +Q CGSCW+F TTGA+E A + KL L++Q L+DC+ + N+
Sbjct: 120 VDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNH 179
Query: 197 GCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GC GG +++++I+ G+ +D Y PY GQD C A A + N+T N E+A
Sbjct: 180 GCQGGLPSQAFEYILYNKGIMGEDTY-PYKGQDDVCKFQPKKAIAFVKDVANITLNDEEA 238
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
+ A+A + PVS A + + F Y G+Y C+ +PD ++HAVLAVGYGE G PYW
Sbjct: 239 MVEAVALYNPVSFAFEVTD-DFMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYW 297
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
VKNSW YWG GY L+ N CG+ +Y
Sbjct: 298 IVKNSWGPYWGMDGYFLIERGKNMCGLAACASY 330
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/298 (37%), Positives = 163/298 (54%), Gaps = 20/298 (6%)
Query: 68 SDFKVNIYRLFFLR-PRFHENEKIRYNWTYIGEELVNGIIL------EKWRLV---TSEG 117
S+ K ++ F R F +N +NW G + V G+ E++RL T
Sbjct: 40 SNNKAYTHKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAH 99
Query: 118 EKVSKY---SLWVRYNKAS-KDAIPVRY-EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYY 172
K++ Y +L +R N+ K + V + E + DQ CGSC+SF TTG+VEG
Sbjct: 100 IKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTA 159
Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYC 231
+K KL LS+Q ++DCS +GN GC+GG ++++I+K +GL +++ Y + + C
Sbjct: 160 IKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDEC 219
Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
+ A +T + + E+ L+ AL + PVSVAIDAS SF Y GVYY+ C
Sbjct: 220 KFQEGSVAAKITSYKEIEAGDENDLQNALLLN-PVSVAIDASHNSFQLYTAGVYYEPAC- 277
Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
S + LDH VLAVG G +G+ Y+ VKNSW WG GY+ M+ KDNNCG+ T +Y
Sbjct: 278 -SSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334
>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|440910969|gb|ELR60703.1| Cathepsin H, partial [Bos grunniens mutus]
Length = 329
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + KL L++Q L+DC+ + N+GC GG +++++I
Sbjct: 128 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 187
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD C + A A + N+T N E+A+ A+A H PVS A
Sbjct: 188 RYNKGIMGEDTY-PYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFA 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE G PYW VKNSW WG +G
Sbjct: 247 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKG 305
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ ++
Sbjct: 306 YFLIERGKNMCGLAACASF 324
>gi|328875652|gb|EGG24016.1| counting factor associated protein [Dictyostelium fasciculatum]
Length = 529
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 132/202 (65%), Gaps = 5/202 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
DQ VCGSCW+FG+ ++E +KH K L LS+Q L+DC++ + GC+GG +YQ+
Sbjct: 325 DQGVCGSCWTFGSLASLETVACLKHNKDLISLSEQELVDCAYVGQSMGCNGGFASNAYQY 384
Query: 210 IMKHG-LPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVS 267
IM G + T+ DY PYL Q+AYC + + + +VNVT SE AL+ A+A G V+
Sbjct: 385 IMNAGGIATESDY-PYLMQNAYCKASTVQNSGVRVQSYVNVTAFSEAALQNAVATVGVVA 443
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDAS F +Y +GVYY C + D LDH V +GYG +G+ YW VKNSWSTY+GN
Sbjct: 444 VAIDASAPDFRYYSSGVYYSTVCQSGLDYLDHEVAVLGYGTDNGQQYWIVKNSWSTYYGN 503
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
+G V M++ +DNNCG+ + TY
Sbjct: 504 EGIVWMAMNRDNNCGIASQATY 525
>gi|77735725|ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]
gi|115312126|sp|Q3T0I2.1|CATH_BOVIN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74267711|gb|AAI02387.1| Cathepsin H [Bos taurus]
gi|296475480|tpg|DAA17595.1| TPA: cathepsin H precursor [Bos taurus]
Length = 335
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + KL L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD C + A A + N+T N E+A+ A+A H PVS A
Sbjct: 194 RYNKGIMGEDTY-PYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE G PYW VKNSW WG +G
Sbjct: 253 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ ++
Sbjct: 312 YFLIERGKNMCGLAACASF 330
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 119/200 (59%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
Q CGSCW+F GA+E +K KL LS Q L+DCS YGN GCDGG R++Q+
Sbjct: 142 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQY 201
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ +G D PY CH + + AT + ++ + E+ALK A+A GPVSV
Sbjct: 202 IIDNGGIDSDASYPYKAVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVG 261
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVY + C + ++H VL VGYG LDGK YW VKNSW ++G+QG
Sbjct: 262 IDASHPSFFLYKSGVYDEPSCT---ENVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 318
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ + N CG+ + +Y
Sbjct: 319 YIRMARNNKNQCGIASYGSY 338
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 6/199 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I+
Sbjct: 142 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 201
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ ++ Y PY D C + AT + + + ED LK A+A GPV V +
Sbjct: 202 DNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 260
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYYD C ++H VL +GYG+L+GK YW VKNSW + +G QGY
Sbjct: 261 DASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGY 317
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 318 IRMARNKGNHCGIASYPSY 336
>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
Length = 362
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q+ CGSCW+F TTGA+E AY K LS+Q L+DC+ G+ N GC+GG +++++I
Sbjct: 161 NQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYI 220
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY G + CH A + VN+T N+ED LK A+ PVSVA
Sbjct: 221 KYNGGIDTEESY-PYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVA 279
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C +PD ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 280 FQVID-GFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 338
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N C + T +Y
Sbjct: 339 YFKMEMGKNMCAIATCASY 357
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS Y N GC+GG ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY QD C + AT + + + SE+ALK A+A GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAQDGKCQYDSKFRAATCSKYTELPFGSEEALKEAVANKGPVSVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYYD+ C ++H VL VGYG LDGK YW VKNSW +G++G
Sbjct: 252 IDASHPSFFLYRSGVYYDQSCTLK---VNHGVLVVGYGNLDGKDYWLVKNSWGLNFGDKG 308
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ + P+Y
Sbjct: 309 YIRMARNSGNHCGIASYPSY 328
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 7/202 (3%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ +Q CGSCWSF TGA+E ++ L LS+Q L+DCS YGN GC+GG +++Q
Sbjct: 125 IKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQ 184
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
+I +G + Y PY + CH + + AT +G+ +VTP SE AL+ +A GP+S
Sbjct: 185 YIQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
+AIDAS + Y +GV+ D C+ + DHAVL VGYG +G+ YW VKNSW T+WG
Sbjct: 245 IAIDAS--GWQSYQSGVFNDPSCSQTA---DHAVLLVGYGTYNGQDYWLVKNSWGTWWGE 299
Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
QGY++M+ NN CG+ +Y
Sbjct: 300 QGYIMMTRNANNQCGIANHASY 321
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 7/225 (3%)
Query: 129 YNKASKDAIPVRYEMKGYNSLLD---QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
++ + ++A+P + + + D Q CGSCW+F TTG++EG + +K L LS+Q
Sbjct: 103 FSSSLRNAMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTTGSLEGLHALKTGHLVSLSEQQ 162
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DCS YGNNGCDGG ++Q+I G ++ PY ++ C AT G+
Sbjct: 163 LMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPYTAKNESCRFDPKKVGATDEGY 222
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V + E +L AL + GP+SVA+DA K+F FY G+Y D C+N+ L+H V +G
Sbjct: 223 VRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGIYSDYLCSNTH--LNHGVTLIG 280
Query: 306 YGE-LDGKPYWQVKNSWSTYWGNQGYVLMS-IKDNNCGVMTAPTY 348
YGE DG PYW VKNSW WG GY +++ N CGV T +Y
Sbjct: 281 YGESSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGVATDASY 325
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 6/199 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I+
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 192
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ ++ Y PY D C + AT + + + ED LK A+A GPV V +
Sbjct: 193 DNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 251
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYYD C ++H VL +GYG+L+GK YW VKNSW + +G QGY
Sbjct: 252 DASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGY 308
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 309 IRMARNKGNHCGIASYPSY 327
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 125/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSLGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 157 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYI 216
Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
GL T++ Y PY G+D C ++ + VN+T +ED LK A+A PVSVA
Sbjct: 217 KFNGGLDTEEAY-PYTGKDDACKFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVA 275
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y GVY C ++P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 276 FEVV-GSFRLYKEGVYTTSTCGSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNG 334
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 335 YFKMEMGKNMCGIATCASY 353
>gi|381283083|gb|AFG19440.1| cathepsin L, partial [Larimichthys crocea]
Length = 257
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/177 (50%), Positives = 115/177 (64%), Gaps = 9/177 (5%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EG ++ K KL LS+Q L+DCS GN GC+GG +++Q++
Sbjct: 84 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 143
Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
HGL ++D Y PYLG D CH +A TGFV+V E AL A+A GPVSV
Sbjct: 144 KDNHGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDVPSGKEHALMKAVAAVGPVSV 202
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSW 321
AIDA +SF FY +G+YY++ C S + LDH VL VGYG ++DGK YW V NSW
Sbjct: 203 AIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVNNSW 257
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q LIDC + + N GC+GG +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C N + VN+T +ED LK A+ PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356
>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q+ CGSCW+F TTGA+E AY K LS+Q L+DC+ G+ N GC+GG +++++I
Sbjct: 161 NQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYI 220
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY G + CH A + VN+T N+ED LK A+ PVSVA
Sbjct: 221 KYNGGIDTEESY-PYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVA 279
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C +PD ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 280 FQVID-GFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 338
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N C + T +Y
Sbjct: 339 YFKMEMGKNMCAIATCASY 357
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 126/204 (61%), Gaps = 10/204 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
++ CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ GL +++ Y PY +D C A A TGFV++ P E AL A+A GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
G +GY+ + +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 123/200 (61%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS G Y N GC+GG ++Q+I
Sbjct: 142 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYI 201
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + +V + +E+ALK A+A GPVSVA
Sbjct: 202 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVSVA 260
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYYD+ C + ++H VLAVGYG +GK YW VKNSW ++G QG
Sbjct: 261 IDASHPSFFLYRSGVYYDKACTLN---VNHGVLAVGYGNYNGKDYWLVKNSWGLHFGEQG 317
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ + P+Y
Sbjct: 318 YIRMARNSGNHCGIASYPSY 337
>gi|410256886|gb|JAA16410.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 131/209 (62%), Gaps = 14/209 (6%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
G + +Q +CGSCW+F TTGAVEGA ++ KL LS+Q L+DC G+ GC+GG
Sbjct: 129 GVTPVKNQGMCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMD 187
Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++ WI +G + ++DDY Y + C + ++GF +V P E ALK+A+A+
Sbjct: 188 HAFAWIEDNGGICSEDDY-EYKAKAQVCR--DCEKVVKISGFQDVNPQDEHALKVAVAQQ 244
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
PVSVAI+A QK+F FY +GV+ + C LDH VLAVGYG +G+ +W+VKNSW +
Sbjct: 245 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGSENGQKFWKVKNSWGS 299
Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
WG +GY+ ++ ++N CG+ + P+Y
Sbjct: 300 SWGEKGYIRLAREENGPAGQCGIASVPSY 328
>gi|410256882|gb|JAA16408.1| cathepsin L1 [Pan troglodytes]
gi|410256884|gb|JAA16409.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
Length = 326
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 123/218 (56%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +YQ++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + GP +VA+D + F Y G+Y + C SP GL+HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
G YW VKNSW +YWG +GY+ M+ + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIAS 315
>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 361
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 116/192 (60%), Gaps = 3/192 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C ++P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCG 341
Y M + N CG
Sbjct: 336 YFKMEMGKNMCG 347
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 93/217 (42%), Positives = 126/217 (58%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +++++ ++G + PY +D C + A TGFV V P E L
Sbjct: 178 CNGGFMNYAFRYVKENGGLDSEASYPYEAKDGICKYKPENSVANDTGFV-VIPTHEKELM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDGK 312
A+A GP+SVA+DAS SF FY +G+Y+++KC S LDH VL VGYG D K
Sbjct: 237 KAVATVGPISVAVDASHSSFQFYKSGIYFEKKC--SSKNLDHGVLVVGYGFEGANSKDNK 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
YW +KNSW WG GY+ ++ NN CG+ TA +Y
Sbjct: 295 -YWLIKNSWGPEWGLNGYIKIAKDQNNHCGIATAASY 330
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 131/209 (62%), Gaps = 14/209 (6%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
G + +Q +CGSCW+F TTGAVEGA ++ KL LS+Q L+DC G+ GC+GG
Sbjct: 129 GVTPVKNQGMCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMD 187
Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++ WI +G + ++DDY Y + C + ++GF +V P E ALK+A+A+
Sbjct: 188 HAFAWIEDNGGICSEDDY-EYKAKAQVCR--DCEKVVKISGFQDVNPQDEHALKVAVAQQ 244
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
PVSVAI+A QK+F FY +GV+ + C LDH VLAVGYG +G+ +W+VKNSW +
Sbjct: 245 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGSENGQKFWKVKNSWGS 299
Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
WG +GY+ ++ ++N CG+ + P+Y
Sbjct: 300 SWGEKGYIRLAREENGPAGQCGIASVPSY 328
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 131/228 (57%), Gaps = 10/228 (4%)
Query: 127 VRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V Y S +P V + KG + + Q CG+CW+F GA+E +K KL LS
Sbjct: 114 VTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSA 173
Query: 184 QALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
Q L+DCS YGN GC+GG ++Q+I+ +G+ ++ Y PY D C + AT
Sbjct: 174 QNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASY-PYKATDGKCRYDSKNRAAT 232
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+ + + SED LK A+A GPVSVAIDA SF Y +GVYYD C + ++H V
Sbjct: 233 CSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQN---VNHGV 289
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
L VGYG L+GK YW VKNSW +G+QGY+ M+ N+CG+ + P+Y
Sbjct: 290 LVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 337
>gi|86279349|gb|ABC88770.1| putative cathepsin L-like proteinase [Tenebrio molitor]
Length = 416
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 124/203 (61%), Gaps = 7/203 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTGA+EG ++ +L LS+Q LIDCS YGN GCDGG ++ +I
Sbjct: 219 DQGQCGSCWSFSTTGAIEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYI 278
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G+ ++ Y PY Q YC ++ T++G+ ++ E++L A+ + GPV+VAI
Sbjct: 279 PDYGIMSEFAY-PYEAQGDYCRFDSSQFVTTLSGYYDLPSGGENSLADAVGQAGPVAVAI 337
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA + FY G++YD+ CN S L+H V VGYG +G+ YW +KNSW WG GY
Sbjct: 338 DAPDE-LQFYSGGLFYDQTCNQSD--LNHGVFVVGYGSDNGQDYWILKNSWGFGWGESGY 394
Query: 331 VLMSIKD--NNCGVMTAPTYVTM 351
+++ NNCG+ TA +Y +
Sbjct: 395 -WRQVRNYGNNCGIATAASYPAL 416
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 4/176 (2%)
Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
A V + + + DQ CGSCWSF TTGA+EG ++ +L LS+Q LIDCS YGN
Sbjct: 36 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAIEGQLALQRGRLTSLSEQNLIDCSSSYGN 95
Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GCDGG ++ +I +G+ ++ Y PY Q YC ++ + T++G+ ++ E++
Sbjct: 96 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGGENS 154
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
L A+ + GPV+VAIDA+ + FY G++YD+ CN S L+H VL VGYG +G
Sbjct: 155 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNG 207
>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
Length = 333
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 119/199 (59%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F GA+EG +K KL LS Q L+DC+ YGN GC GG ++Q++
Sbjct: 135 NQGGCGSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYV 194
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D Y PY D CH ++ + + P +ED LK AL GP+SVAI
Sbjct: 195 IDNNGIDSDSYYPYHAMDEKCHYELAGKASSCVKYTEIVPGTEDNLKQALGTIGPISVAI 254
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
D ++ +F Y +GVY D C+ ++H VLA+GYG L+G+ +W +KNSW TY+G++G+
Sbjct: 255 DGTRPTFFLYKSGVYSDPSCSQE---VNHGVLAIGYGTLNGQDFWLLKNSWGTYYGDKGF 311
Query: 331 VLMSI-KDNNCGVMTAPTY 348
V ++ K N CGV + +Y
Sbjct: 312 VRIARNKGNLCGVASYTSY 330
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 131/228 (57%), Gaps = 10/228 (4%)
Query: 127 VRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
V Y S +P V + KG + + Q CG+CW+F GA+E +K KL LS
Sbjct: 102 VTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSA 161
Query: 184 QALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
Q L+DCS YGN GC+GG ++Q+I+ +G+ ++ Y PY D C + AT
Sbjct: 162 QNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASY-PYKATDGKCRYDSKNRAAT 220
Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
+ + + SED LK A+A GPVSVAIDA SF Y +GVYYD C + ++H V
Sbjct: 221 CSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQN---VNHGV 277
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
L VGYG L+GK YW VKNSW +G+QGY+ M+ N+CG+ + P+Y
Sbjct: 278 LVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 325
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C AT + ++ + SE+ALK A+A GPVSV
Sbjct: 193 IDNNGIDSEASY-PYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA Q SF Y GVYYD C + ++H VL VGYG L+GK YW VKNSW +G+QG
Sbjct: 252 IDAKQTSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQG 308
Query: 330 YVLMSIKD-NNCGVMTAPTY 348
Y+ M+ N+CG+ P+Y
Sbjct: 309 YIRMARNSGNHCGIANFPSY 328
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC + + N GC+GG +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C N + VN+T +ED LK A+ PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356
>gi|410990004|ref|XP_004001240.1| PREDICTED: cathepsin L1-like isoform 2 [Felis catus]
Length = 334
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 95/221 (42%), Positives = 131/221 (59%), Gaps = 12/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY +Q C S W+F TGA+EG + K KL LS+Q L+DC+W GN G
Sbjct: 118 VNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQNLLDCAWSEGNGG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDA 255
C GG ++Q++ + GL +++ Y PY QD C ++ A +TGF+ + P +E
Sbjct: 178 CSGGSMGNAFQYVKDNGGLDSEESY-PYHAQDLQSCKYNPESSAADITGFLTI-PKTEYK 235
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L ++A GPVS AIDAS +F FY G+YYD C S + L H VL VGYG ELD
Sbjct: 236 LMRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNC--SHEILHHGVLIVGYGFEGAELDN 293
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
K YW VKNSW WG GY+LM+ +DN+CG+ + ++ T+
Sbjct: 294 KKYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 334
>gi|399764976|gb|AFP50672.1| cathepsin L-like protease, partial [Earias vitella]
Length = 222
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 90/174 (51%), Positives = 113/174 (64%), Gaps = 5/174 (2%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTGA+EG +Y K L LS+Q LIDCS YGNNGC+GG +++
Sbjct: 52 VKDQGKCGSCWSFSTTGALEGQHYRKTGVLVSLSEQNLIDCSTAYGNNGCNGGLMDNAFK 111
Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I HGL T+ Y PY D C + A GFV++ ED LK A+A GPVS
Sbjct: 112 YIKDNHGLDTEKSY-PYEAVDDKCRCNPKNSGADDVGFVDIPSGDEDKLKAAVATVGPVS 170
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNS 320
VAIDASQ+SF FY +GVY +++C+++ LDH VL VGYG DG+ YW VKNS
Sbjct: 171 VAIDASQQSFQFYSDGVYDEDQCSSTE--LDHGVLVVGYGTSEDGQDYWLVKNS 222
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 159 NQGHCGSCWTFSTTGALEAAYVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYI 218
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G D C ++ + VN+T +ED LK A+A PVSVA
Sbjct: 219 KANGGLDTEEAY-PYTGVDGVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVA 277
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C N+P ++HAV+AVGYG + PYW +KNSW WG+ G
Sbjct: 278 FEVVS-GFRLYKSGVYTSDTCGNTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWGDNG 336
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 337 YFKMEMGKNMCGVATCASY 355
>gi|426365294|ref|XP_004049712.1| PREDICTED: LOW QUALITY PROTEIN: putative cathepsin L-like protein
6-like [Gorilla gorilla gorilla]
Length = 333
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 134/220 (60%), Gaps = 12/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TGA+EG + K KL L++Q L+DCS GN G
Sbjct: 119 VDWREKGYMTPVKDQGQCGSCWAFSATGALEGQVFWKTGKLISLNEQNLVDCSGPQGNEG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++ ++ GL ++ Y PY G+ C + A TGFV++ P+ E L
Sbjct: 179 CNGGFMDNPFRYVQENRGLDSEASY-PYEGKVKTCGYNPKYSAANDTGFVDI-PSREKDL 236
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY----GELDGK 312
A+A GP+SVA+ AS SF FY G+Y++ +C+ P+GLDHA+L VGY + D K
Sbjct: 237 AKAVATVGPISVAVGASHVSFQFYKKGIYFEPRCD--PEGLDHAMLVVGYSYEGADWDNK 294
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ M+ + NNCG+ TA +Y T+
Sbjct: 295 -YWLVKNSWGKNWGMDGYIKMAKDRRNNCGITTAASYPTV 333
>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
Length = 353
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A+ K+ VLS+Q L+DC+ GY N GC+GG +++++I
Sbjct: 149 NQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCNGGLPSQAFEYI 208
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+D Y PY G D C + A + VN+T +ED L A+A + PVS+A
Sbjct: 209 RYNGGLDTEDSY-PYTGHDGKCTYNQNSIGAKVYDVVNITEGAEDELIHAVAFNRPVSIA 267
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F FY +GVY C PD ++HAVLAVGY PYW +KNSW +G G
Sbjct: 268 YEV-LKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDG 326
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 327 YFYMEMGKNMCGIATCASY 345
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 123/201 (61%), Gaps = 9/201 (4%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ DQ CGSCW+F TGA+EG Y K KL LSQQ L+DCS GN GCDGG ++
Sbjct: 190 SVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAF 249
Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
++++++G + PY+ +D C + T+ AT+TG+ +V SE ++K ALA PVS
Sbjct: 250 EYVVENGGICSGENYPYMRKDGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVS 309
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGK-PYWQVKNSWSTYW 325
VAI A+Q +F FY +G+ +D C + LDH VL VGY E G+ YW +KNSW W
Sbjct: 310 VAIQANQAAFQFYYDGI-FDAPCGTN---LDHGVLLVGYSAETAGQGDYWIMKNSWGAAW 365
Query: 326 GNQGYVLMSIKD---NNCGVM 343
G GY+LM++ CGV+
Sbjct: 366 GKGGYMLMAMHKGPAGQCGVL 386
>gi|473159|emb|CAA83538.1| cathepsin L [Schistosoma mansoni]
Length = 317
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 133/220 (60%), Gaps = 8/220 (3%)
Query: 133 SKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S D +P +++ + + ++ +Q +CGSCW+F GAVEG KHKKL LS+Q L+DC
Sbjct: 99 SNDPLPSKWDWRDHGAVTPVKNQGLCGSCWAFSAAGAVEGQLVKKHKKLISLSEQQLVDC 158
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
S+ YGN+GC GG +S+ ++ K+ + ++ DY Y+G D+ CH + + FV++
Sbjct: 159 SYKYGNDGCQGGTMDQSFAYLEKYPIESEKDY-KYIGHDSSCHFRKSKGVVKVKKFVDLP 217
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
E+ L+ AL +GP+SVAIDA Y +G+Y ++C++ L+H VLAVGYG
Sbjct: 218 ARDEEKLQKALYHYGPISVAIDALD-DLILYKSGIYESKQCSSFL--LNHGVLAVGYGRE 274
Query: 310 DGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
+ K YW +KNSW T WG GY L K N CG+ T ++
Sbjct: 275 NRKDYWLIKNSWGTTWGMNGYFKLRRNKHNMCGIATNASF 314
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 125/216 (57%), Gaps = 7/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V Y KG S+ +Q CGSCW+F + GA+EG K L LS Q L+DC N+G
Sbjct: 120 VDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCV--TENDG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++++ ++G + PY+G+D C + A + G+ V E AL
Sbjct: 178 CGGGYMTNAFKYVQENGGIDSEAAYPYMGEDQPCRYNVSGLAAQIKGYKEVPEGDEHALA 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
+AL K GPVSV IDASQ SF +Y G+Y+D CN + ++HAVLAVGYG GK +W
Sbjct: 238 VALFKAGPVSVGIDASQNSFLYYQKGIYFDRNCNK--EDINHAVLAVGYGVNAKGKKFWI 295
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VKNSW WGN+GYVLM+ + N CG+ +Y M
Sbjct: 296 VKNSWGETWGNKGYVLMARNRGNVCGIANLASYPVM 331
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 132/217 (60%), Gaps = 11/217 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS N+G
Sbjct: 118 VDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNSG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++Q++ + GL +++ Y PY ++ C + + AT+TGF ++ P E AL
Sbjct: 178 CKGGLVIKAFQYVKDNGGLDSEESY-PYEEMESTCRYSPGNSAATVTGFKHI-PAEEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL----DGK 312
+ A+A GP+SVAIDA SF FY G+ ++ C SP L+HAVL VGYG + +
Sbjct: 236 EKAVASVGPISVAIDAHHHSFQFYTGGILHEPNC--SPKWLNHAVLVVGYGVMQEGSNNN 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNSW WG GY++M+ K+N+CG+ + Y
Sbjct: 294 TYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALY 330
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q LIDC + + N GC+GG +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C N + VN+T +ED LK A+ PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGFKVLDSVNITLGAEDELKDAVGLVRPVSVA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356
>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
Length = 307
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 99/223 (44%), Positives = 134/223 (60%), Gaps = 19/223 (8%)
Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
A V + KG + +Q CGSCWSF TTG+ EGA+++K L LS+Q L+DCS G
Sbjct: 83 AANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNLVSLSEQNLMDCSKPEG 142
Query: 195 NNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNS 252
N GC+GG ++++I+K +G+ T+ Y PY +D C + AT++ +VNVT S
Sbjct: 143 NQGCNGGLMTAAFEYIIKNNGIDTESSY-PYKAEDGKKCLYNPANSAATLSSYVNVTTGS 201
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
E L + + GPVSVAIDAS SF Y +GVYY+ KC+ + LDH VL VGYG D
Sbjct: 202 ESDLAVK-SGLGPVSVAIDASHNSFQLYSSGVYYEPKCSQTQ--LDHGVLVVGYGS-DAL 257
Query: 313 P----------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
P +W VKNSW T WG +GY+ MS ++NNCG+ T
Sbjct: 258 PSAGVSAGSGDWWIVKNSWGTTWGVEGYIYMSRNRNNNCGIAT 300
>gi|261824899|pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824900|pdb|3H89|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824901|pdb|3H89|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824902|pdb|3H89|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824903|pdb|3H89|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824904|pdb|3H89|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824905|pdb|3H8B|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824906|pdb|3H8B|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824907|pdb|3H8B|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824908|pdb|3H8B|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824909|pdb|3H8B|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824910|pdb|3H8B|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|317455049|pdb|2XU3|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455050|pdb|2XU4|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455051|pdb|2XU5|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009432|pdb|2YJ2|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009433|pdb|2YJ8|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009434|pdb|2YJ9|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009435|pdb|2YJB|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009436|pdb|2YJC|A Chain A, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 65 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
Length = 358
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 90/226 (39%), Positives = 130/226 (57%), Gaps = 12/226 (5%)
Query: 130 NKASKDAIPVRYEMKGY------NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
+K +++A+P EMK + + + DQ CGSCW+F TTGA+E AY K LS+
Sbjct: 134 HKLTEEALP---EMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKGISLSE 190
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
Q L+DC+ + N GC GG ++++++ + GL T++ Y PY G++ C ++ +
Sbjct: 191 QQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAY-PYTGKNGECKFSSENVGVQV 249
Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
VN+T +ED LK A+A PVSVA F Y GVY + C +P ++HAVL
Sbjct: 250 LDSVNITLGAEDELKHAVAFVRPVSVAFQVVN-GFRLYKEGVYTSDTCGRTPMDVNHAVL 308
Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
AVGYG +G PYW +KNSW WG+ GY M + N CGV T +Y
Sbjct: 309 AVGYGVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCASY 354
>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
Length = 359
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+ CGSCW+F TTGA+E AY K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 158 DQASCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY G + C A + VN+T N+ED LK A+ PVSVA
Sbjct: 218 KYNGGIDTEESY-PYKGVNGVCKYRPENAAVQVADSVNITLNAEDELKNAVGLVRPVSVA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +PD ++HAVLAVGYG +G PYW +KNSW WG G
Sbjct: 277 FEVID-GFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N C V T +Y
Sbjct: 336 YFKMEMGKNMCAVATCASY 354
>gi|313754424|pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible
Inhibition Of Human Cathepsin L By Their Respective
Dipeptidyl Glyoxal And Diazomethylketone Inhibitors
gi|313754425|pdb|3OF9|A Chain A, Structural Basis For Irreversible Inhibition Of Human
Cathepsin L By A Diazomethylketone Inhibitor
Length = 221
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 6 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 65
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 66 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 124 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 181
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
Length = 299
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 157/296 (53%), Gaps = 36/296 (12%)
Query: 86 ENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEM 143
E+ + R N+T + + E++RLV + G + K+ + + + IP V +
Sbjct: 10 EHSQGRRNFT-LAMNAFGDMTNEEFRLVMN-GFQNQKHKKGDMFQEPALAEIPPSVDWRK 67
Query: 144 KG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KG + DQ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN GC GG
Sbjct: 68 KGCVTPVKDQGGCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCSGGL 127
Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++Q++ + GL T++ Y PY G D C + A TGFV++ + E +L A+A
Sbjct: 128 MDNAFQYVKDNEGLDTEESY-PYYGTDDTCKYKPEFSAANDTGFVDIHKD-ERSLMKAVA 185
Query: 262 KHGPVSVAIDASQKSFSFYVN---------------------GVYYDEKCNNSPDGLDHA 300
GP+SVA+DAS +SF FY G+YYD C S + L+H
Sbjct: 186 SVGPISVALDASLESFQFYEKGKVTVSSYLEIFTPAMTSVFLGIYYDPDC--SSEDLNHG 243
Query: 301 VLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
VL VGYG E+D YW VKNSW T WG GY+ M+ DN+CG+ + +Y T+
Sbjct: 244 VLVVGYGFEGVEMDNNKYWIVKNSWGTKWGMDGYIKMAKDLDNHCGIASMASYPTV 299
>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
Length = 331
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 125/216 (57%), Gaps = 8/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KGY + DQ CGSCW+F T GA+EG K KL +S Q L+DC N G
Sbjct: 121 IDYRKKGYVTPIRDQGECGSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCV--KDNFG 178
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ K+ G+ +++ Y PY+G D C + A + GF V SE AL
Sbjct: 179 CGGGYMTTAFKYVKKNKGIDSEEAY-PYVGMDQKCKYNVSGRAAEIKGFKEVKKGSETAL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
K A+ GP+SV IDA +F Y G+YYD+ C+ D ++HAVLAVGYG+ YW
Sbjct: 238 KKAVGLVGPISVGIDAGLDTFFLYKKGIYYDKSCDG--DSINHAVLAVGYGKQKKGKYWI 295
Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+KNSW WGN+GY+LM+ K N CG+ +Y M
Sbjct: 296 IKNSWGEDWGNKGYILMAREKGNACGIANLASYPVM 331
>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
Length = 357
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY G+D C + + VN+T +ED LK A+ PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ SF Y +GVY D C ++P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N C + T +Y
Sbjct: 336 YFKMEMGKNMC-IATCASY 353
>gi|28194643|gb|AAO33583.1|AF479265_1 cathepsin P [Meriones unguiculatus]
Length = 334
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/233 (40%), Positives = 138/233 (59%), Gaps = 13/233 (5%)
Query: 128 RYNKASKDAIPV--RYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
R +K D +P + +GY + +Q CGSCW+F TGA+EG + K KL LS Q
Sbjct: 105 RKDKKIVDGLPKFKNWINEGYVTPVRNQGTCGSCWAFAATGAIEGQMFWKTGKLTPLSVQ 164
Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMT 243
L+DCS GN GC G FR++ ++ + GL + Y PY G+ C ++ + A +T
Sbjct: 165 NLVDCSEKQGNKGCAQGSAFRAFMYVNETKGLQDEISY-PYEGKQGTCRYNSSNSRAYVT 223
Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
F + P +E L +A+A GPV+ A+DASQ SF FY G+YY+ KC S ++HAVL
Sbjct: 224 DF-RLLPQNEIYLLVAVASIGPVAAAVDASQDSFRFYRGGIYYEPKC--SQYSVNHAVLV 280
Query: 304 VGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VGYG E DGK YW +KNSW WG +GY+ ++ ++N+CG+ + ++V +
Sbjct: 281 VGYGYEGNETDGKDYWLIKNSWGENWGMRGYMKIARDRNNHCGIASQASFVDI 333
>gi|440291372|gb|ELP84641.1| cysteine proteinase 3 precursor, putative [Entamoeba invadens IP1]
Length = 247
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 124/222 (55%), Gaps = 17/222 (7%)
Query: 131 KASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL--SQQALID 188
+ S PVR DQ+ CGSC+SFG+ ++EG M + L +++ ++
Sbjct: 36 RTSGKVTPVR----------DQAQCGSCYSFGSLASLEGRLLMIKGSASTLDLAEEQIVR 85
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS NNGC+GG Y +I +G+ + DY PY D C + + AT+TG+ V
Sbjct: 86 CS---DNNGCNGGTGSVVYDYIKANGIVQEKDY-PYTATDGSCTVDTSKKYATITGYKKV 141
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T +ALK AL G V V+IDAS + Y GVY+D KC L+H V +VGYG
Sbjct: 142 TKKDNNALKTALV-DGVVDVSIDASSVALQLYKTGVYHDTKCKTGLLDLNHEVASVGYGA 200
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
L+G+ YW VKNSW T WG++GYVLM++ N CG+ T P Y T
Sbjct: 201 LNGEEYWIVKNSWGTSWGDKGYVLMAMAGNTCGITTDPIYPT 242
>gi|10946820|ref|NP_067420.1| cathepsin 6 precursor [Mus musculus]
gi|9931384|gb|AAG02172.1|AF223401_1 cathepsin-6 [Mus musculus]
gi|12838129|dbj|BAB24093.1| unnamed protein product [Mus musculus]
gi|16445021|gb|AAK00510.1| cathepsin 6 precursor [Mus musculus]
gi|68534635|gb|AAH99455.1| Cathepsin 6 [Mus musculus]
gi|148709368|gb|EDL41314.1| cathepsin 6 [Mus musculus]
Length = 334
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/226 (39%), Positives = 132/226 (58%), Gaps = 11/226 (4%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A D +P V + KGY + Q C SCW+F GA+EG + K KL LS Q L+
Sbjct: 109 RAVGDVLPKFVDWRKKGYVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKLTPLSVQNLV 168
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DC+ GN+GC G+ + +Y++++ +G + PY G++ C + A +TGFV+
Sbjct: 169 DCTKTQGNDGCQWGDPYIAYEYVLNNGGLEAEATYPYEGKEGPCRYNPKNSKAEITGFVS 228
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ P SED L A+A GP+S A+DAS FSFY G+Y+ C+N + ++HAVL VGYG
Sbjct: 229 L-PESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYHQPNCSN--NTVNHAVLVVGYG 285
Query: 308 ----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
E DG YW +KNSW WG GY+ ++ ++N+CG+ T Y
Sbjct: 286 TEGNETDGNKYWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATYAHY 331
>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
Length = 358
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY+ K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G+D C + + VN+T +ED LK A+ PVSVA
Sbjct: 218 KYNGGLDTEEAY-PYTGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F FY GV+ C N+P ++HAVLAVGYG D PYW +KNSW WG+ G
Sbjct: 277 FEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNG 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 336 YFKMEMGKNMCGVATCSSY 354
>gi|45550334|gb|AAS67923.1| cathepsin L [Artemia franciscana]
Length = 226
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q C SC +F TGA+E + K KL LS+Q LIDCS YGN GC GG +++++I
Sbjct: 26 QGQCASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIK 85
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T++ Y Y ++ +C A GFVN+ ED LK A+A GPVS I
Sbjct: 86 DNKGIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVI 144
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
D S + F FY GVYY+ C S + L+HAVL +G G +G+ YW VKNSWS +WG++GY
Sbjct: 145 DVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGCGSDNGEDYWLVKNSWSKHWGDEGY 204
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ ++ + N+CGV TA Y
Sbjct: 205 LKIARNRKNHCGVATAALY 223
>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
Length = 311
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 88/218 (40%), Positives = 124/218 (56%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 87 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 146
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +YQ++ + GL T+ Y PY + C A +TG+
Sbjct: 147 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 205
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + GP +VA+D + F Y +G+Y + C SP ++HAVLAVGYG
Sbjct: 206 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 262
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
DG YW VKNSW +YWG +GY+ M+ + N CG+ +
Sbjct: 263 TQDGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIAS 300
>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW KNSW WG GY+ M+ + N+CG+ +A +Y T+
Sbjct: 294 KYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 116/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC GG +++++I
Sbjct: 162 DQGHCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 221
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G D C ++ + VN+T +ED LK A+A PVSVA
Sbjct: 222 KYNGGLDTEEAY-PYTGLDGTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVA 280
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F FY GVY C ++P ++HAVLAVGYG DG YW +KNSW WG+ G
Sbjct: 281 FEVVH-DFRFYKKGVYTSGTCGSTPMDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNG 339
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 340 YFKMELGKNMCGVATCSSY 358
>gi|338717354|ref|XP_001492337.3| PREDICTED: pro-cathepsin H-like [Equus caballus]
Length = 323
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + KL L++Q L+DC+ + N+GC GG +++++I
Sbjct: 122 NQGGCGSCWTFSTTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 181
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD C A A + N+T N E A+ A+A + PVS A
Sbjct: 182 RYNKGIMGEDTY-PYKGQDGDCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFA 240
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW +WG G
Sbjct: 241 FEVTE-DFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNG 299
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318
>gi|258588539|pdb|3HWN|A Chain A, Cathepsin L With Az13010160
gi|258588540|pdb|3HWN|B Chain B, Cathepsin L With Az13010160
gi|258588541|pdb|3HWN|C Chain C, Cathepsin L With Az13010160
gi|258588542|pdb|3HWN|D Chain D, Cathepsin L With Az13010160
Length = 258
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 43 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 102
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A GFV++ P E AL
Sbjct: 103 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 160
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 161 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 218
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 219 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 258
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/223 (40%), Positives = 131/223 (58%), Gaps = 7/223 (3%)
Query: 128 RYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
R + DA+ R E + +Q CG+CW+F GA+E +K KL LS Q L+
Sbjct: 108 RRRGGAPDAMDWR-EKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLV 166
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
DCS YGN GC GG R++Q+I+ +G+ +++ Y PY+ Q+ C +T AT + +V
Sbjct: 167 DCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESY-PYMAQNGTCQYNVSTRAATCSKYV 225
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
+ E ALK A+A GPVSVAIDA+Q +F Y +GVY D +C ++H VL VGY
Sbjct: 226 ELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQE---VNHGVLVVGY 282
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
G L+ K +W VKNSW +G+ GY+ MS N+CG+ + +Y
Sbjct: 283 GTLNEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASY 325
>gi|139002720|dbj|BAF51966.1| cathepsin K [Carassius auratus]
gi|139002725|dbj|BAF51967.1| tartrate-resistant acid phosphatase [Carassius auratus]
Length = 332
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/217 (42%), Positives = 128/217 (58%), Gaps = 9/217 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y GY S+ +Q CGSCW+F + GA+EG +L LS Q L+DC N+G
Sbjct: 121 IDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLKKTKGQLVDLSPQNLVDCV--TDNDG 178
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG +++++ G+ +++ Y PY+G D C ++ AT GF + +E AL
Sbjct: 179 CGGGYMTNAFRYVKDNQGIDSEEGY-PYVGTDQQCAYNSSARAATCKGFKEIPQGNEKAL 237
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
A+AK GPVSV IDA Q +F +Y +GVYYD CN D ++HAVLAVGYG GK YW
Sbjct: 238 TAAVAKVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK--DDVNHAVLAVGYGATPKGKKYW 295
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSW WG +GYVLM+ NN CG+ + ++ M
Sbjct: 296 IVKNSWGEDWGKKGYVLMARNRNNACGIASLASFPVM 332
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 124/202 (61%), Gaps = 7/202 (3%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ +Q CGSCWSF TGA+E ++ L LS+Q L+DCS YGN GC+GG ++Q
Sbjct: 125 IKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQ 184
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
++ +G + Y PY + CH + + AT +G+ +VTP SE AL+ +A GP+S
Sbjct: 185 YVQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
+AIDAS + Y +GV+ D C+ + DHAVL VGYG +G+ YW VKNSW T+WG
Sbjct: 245 IAIDAS--GWQSYQSGVFNDPSCSQT---ADHAVLLVGYGTYNGQDYWLVKNSWGTWWGE 299
Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
QGY++M+ NN CG+ +Y
Sbjct: 300 QGYIMMARNANNQCGIANHASY 321
>gi|226821419|gb|ACO82385.1| cathepsin K [Lutjanus argentimaculatus]
Length = 330
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 123/216 (56%), Gaps = 7/216 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ Y KG S+ +Q CGSCW+F + GA+EG K +L LS Q L+DC N+G
Sbjct: 119 IDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLAKKTGQLVDLSPQNLVDCV--TENDG 176
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG +++Q++ +G ++ PY+G+D C T A G+ + +E AL
Sbjct: 177 CGGGYMTKAFQYVADNGGIDSEEAYPYIGEDQPCRYNATGMAAQCKGYKEIPEGNEHALA 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQ 316
+AL K GPVSV IDA+ SF FY GVYYD CN + ++HAVLAVGYG GK YW
Sbjct: 237 VALFKAGPVSVGIDATLSSFQFYSKGVYYDPSCNK--EDINHAVLAVGYGVTGKGKKYWI 294
Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
VKNSW WG GY+LM+ N CG+ +Y M
Sbjct: 295 VKNSWGESWGKGGYILMARNRGNLCGIANLASYPIM 330
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 120/199 (60%), Gaps = 6/199 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I+
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 192
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ ++ Y PY D C + AT + + + ED LK A+A GPV V +
Sbjct: 193 DNKGIDSEASY-PYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 251
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYYD C + ++H VL +GYG+L+G+ YW VKNSW + +G +GY
Sbjct: 252 DASHSSFFLYRSGVYYDPACTQN---VNHGVLVIGYGDLNGEEYWLVKNSWGSNFGERGY 308
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 309 IRMARNKGNHCGIASYPSY 327
>gi|47169476|tpe|CAE48375.1| TPA: cathepsin Q-like 2 [Rattus norvegicus]
Length = 342
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/223 (40%), Positives = 130/223 (58%), Gaps = 13/223 (5%)
Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
+DA+P + + +GY + +Q C SCW+F GA+EG + K KL LS Q L+DCS
Sbjct: 122 RDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCS 181
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
GN GC GG + ++Q+++++G + PY G++ C A A +T FV + P
Sbjct: 182 KPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVAL-P 240
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--- 307
ED L ALA GPV+ I AS SF F V+G+Y++ KCNN ++HAVL VGYG
Sbjct: 241 EDEDVLMDALATKGPVAAGIHASHGSFHF-VSGIYHEPKCNNR---VNHAVLVVGYGFEG 296
Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
E DG YW +KNSW WG +GY+ + ++N+CG+ T Y
Sbjct: 297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQY 339
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 128/214 (59%), Gaps = 6/214 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + +Q C SCW+F GA+EG ++ K +L LS Q L+DC+ +
Sbjct: 121 VDWREEGYVTPVKNQGQCLSCWAFSAVGAIEGQWFRKTGELVSLSIQNLVDCTTSDSISS 180
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG R++Q++ +G ++ PY+G+ C + A + GFV++ E AL
Sbjct: 181 CHGGFMDRAFQYVQDNGGIDTEECYPYVGEVNECKYQPECSGANVVGFVDIPSMDERALM 240
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYW 315
A+A GP+SVAID SF FY +GVYYD +C++S L+HA L VGYG +DG+ YW
Sbjct: 241 EAVATVGPISVAIDGGNPSFKFYESGVYYDPQCSSSQ--LNHAGLVVGYGSEGIDGRKYW 298
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW WGN GY+LM+ +DN+CG+ T +Y
Sbjct: 299 IVKNSWGELWGNNGYILMAKDEDNHCGIATEASY 332
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/217 (42%), Positives = 129/217 (59%), Gaps = 8/217 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F +TG++EG ++ +L LS+Q L+DC+ YGN G
Sbjct: 128 VDWRDKGYVTPVKNQGACGSCWAFSSTGSLEGQHFRLTGQLVSLSEQNLVDCTKKYGNAG 187
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTAT--ATMTGFVNVTPNSED 254
C+GG ++ ++ +G+ T+ Y PY G D +C + A TG V+V E
Sbjct: 188 CNGGWMDNAFNYVKANNGIDTEAFY-PYEGHDDWCGYDGSPGHKGANCTGHVDVQQGDEL 246
Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
ALK A+A GPVSV IDA+ +SF Y +G+Y + C+NS DHAVL VGYG G Y
Sbjct: 247 ALKQAVATVGPVSVGIDATHRSFQLYKSGIYDEVACSNSS--TDHAVLVVGYGSQGGHDY 304
Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
W VKNSW T WG GY++MS K N C + + +Y T
Sbjct: 305 WLVKNSWGTSWGMDGYIMMSRNKGNQCAIASYASYPT 341
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K S+Q L+DC+ + N GC GG +++++I
Sbjct: 157 DQGHCGSCWTFSTTGALEAAYVQAFGKQISPSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 216
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+ Y PY D C ++ + VN+T N E+ LK A+A PVSVA
Sbjct: 217 KYNGGLDTEQAY-PYTAVDGACKFSSENVGVRVLDSVNITLNDEEELKHAVAFVRPVSVA 275
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
Q F Y +GVY E C N+P ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 276 FQVVQ-DFRLYKSGVYTSETCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGQSWGDNG 334
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M N CGV T +Y
Sbjct: 335 YFKMEYGKNMCGVATCASY 353
>gi|432910514|ref|XP_004078393.1| PREDICTED: cathepsin S-like [Oryzias latipes]
Length = 339
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 122/201 (60%), Gaps = 4/201 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG K KL LS Q L+DCS YGN+GC+GG +++Q+++
Sbjct: 142 QGSCGSCWAFSAVGALEGQLCRKTGKLVDLSPQNLVDCSSKYGNHGCNGGFMHQAFQYVI 201
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
+ D PY+G CH ++ A + + + E ALK A+A GP+SVAID
Sbjct: 202 DNQGIDSDAGYPYVGVTQNCHYSSEYRAANCSQYSFLPEGDEGALKEAIATIGPISVAID 261
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
A++ F+FY +GVY D C+ + ++H VLAVGYG L+G+ YW VKNSW T +G QGY+
Sbjct: 262 ATRPRFAFYRSGVYDDSSCSQN---VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGEQGYI 318
Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
M+ K++ CG+ Y M
Sbjct: 319 RMARNKNDQCGIAMYGCYPIM 339
>gi|208972988|dbj|BAG74343.1| silicatein-M2 [Ephydatia fluviatilis]
Length = 326
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 127/199 (63%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+ ++F TGA+EGA + + K LS+Q +IDCS YGN+GC GG+ + ++++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 187
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T+ Y + G+ + C N T+ A+ TG V++ SE+ L A+A GPV+VAI
Sbjct: 188 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIAYGSENDLLAAVATVGPVAVAI 246
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA+ +F FY +GV+ C+++ L+HA+L GYG +GK YW VKNSWS WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGY 304
Query: 331 VLM-SIKDNNCGVMTAPTY 348
+LM K N CG+ + Y
Sbjct: 305 ILMVRNKYNQCGIASDALY 323
>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
Length = 335
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 121/201 (60%), Gaps = 4/201 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG KL LS Q L+DCS YGN+GC+GG +++Q+++
Sbjct: 140 QGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVI 199
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
+ D PY G++ C + A + + + +E ALK ALA GP+SVAID
Sbjct: 200 DNQGIDSDASYPYTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAID 259
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
A++ +F+FY +GVY D C+ ++H VLAVGYG LDG+ YW VKNSW +G+QGY+
Sbjct: 260 ATRPTFTFYRSGVYNDPNCSQK---VNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYI 316
Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
MS K++ CG+ Y M
Sbjct: 317 RMSRNKNDQCGIALYGCYPIM 337
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/233 (39%), Positives = 128/233 (54%), Gaps = 7/233 (3%)
Query: 122 KYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
+ + W+ N K + Y KG + +Q CGSCW+F + GA+EG KL
Sbjct: 103 RSNTWIPDNNVVKIPRSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLID 162
Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATA 240
LS Q L+DC NNGC GG +++++ ++G ++ PYLGQD C + A
Sbjct: 163 LSPQNLVDCV--TENNGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAYNASGMGA 220
Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
GF + E AL A+ K GPV+V IDA+ +F FY GVYYD CN D ++HA
Sbjct: 221 QCRGFKEIPEGDEWALTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNK--DDINHA 278
Query: 301 VLAVGYGE-LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
VLAVGYG+ G +W VKNSWS WG QGY++M+ + N CG+ +Y M
Sbjct: 279 VLAVGYGQTAKGMKFWIVKNSWSESWGKQGYIMMARNRGNACGIANLASYPIM 331
>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
Length = 336
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
Length = 336
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|47522632|ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]
gi|5915886|sp|O46427.1|CATH_PIG RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|2735659|gb|AAB93957.1| preprocathepsin H [Sus scrofa]
gi|172050733|gb|ACB70168.1| cathepsin H [Sus scrofa]
Length = 335
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD +C A A + N+T N E+A+ A+A + PVS A
Sbjct: 194 RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|410921048|ref|XP_003973995.1| PREDICTED: digestive cysteine proteinase 2-like [Takifugu rubripes]
Length = 290
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/223 (41%), Positives = 126/223 (56%), Gaps = 8/223 (3%)
Query: 130 NKASK-DAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
N A K D + + Y GY + DQ CGSCW+F TTGA+EG + K +L LS+Q L+
Sbjct: 69 NSAKKFDGMVIDYRQMGYVTEVKDQGYCGSCWAFSTTGAIEGQIFKKTGQLMSLSEQNLV 128
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS YG GC G +Y +++ +GL + Y PY C+ + A A + +
Sbjct: 129 DCSKSYGTYGCSGAWMANAYDYVVNNGLESTITY-PYTSDTQPCYYDSRLAVAHIKDYRF 187
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ E AL A+A GP++VAIDAS SF FY +G+Y + CN P+ L HAVL VGYG
Sbjct: 188 IPKGDEQALADAVATIGPITVAIDASHSSFLFYSSGIYEESNCN--PNNLSHAVLLVGYG 245
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
G+ YW +KNSW WG GY+ + I+D N CG+ + Y
Sbjct: 246 SEGGQDYWLIKNSWGPSWGEGGYMRL-IRDGKNPCGIASYALY 287
>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
Length = 335
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|4139678|pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1
Angstrom Resolution: Location Of The Mini-Chain
C-Terminal Carboxyl Group Defines Cathepsin H
Aminopeptidase Function
gi|28948781|pdb|1NB3|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948784|pdb|1NB3|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948787|pdb|1NB3|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948790|pdb|1NB3|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H:
N-Terminal Residues Of Inhibitors Can Adapt To The
Active Sites Of Endo-And Exopeptidases
gi|28948793|pdb|1NB5|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948796|pdb|1NB5|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948799|pdb|1NB5|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H
gi|28948802|pdb|1NB5|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H
Length = 220
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 19 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 78
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD +C A A + N+T N E+A+ A+A + PVS A
Sbjct: 79 RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 137
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 138 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 196
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 197 YFLIERGKNMCGLAACASY 215
>gi|218478069|dbj|BAH03395.1| cathepsin L-like cysteine peptidase [Taenia solium]
Length = 346
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +TGA+EGA+ K KL LS+Q L+DCS GN+GC+GG +++++
Sbjct: 141 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 200
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+H + + Y PY D C + T+T ++ +E AL A+A GP+S+AI
Sbjct: 201 EEHFIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 259
Query: 271 DASQKSFSFYV-------NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
DAS F FY +G+Y C S L+H VLA+GYG+ DGKPYW VKNSW T
Sbjct: 260 DASSLGFMFYRQVATNPHHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGT 317
Query: 324 YWGNQGYVLMSIKD--NNCGVMT 344
WG +GY++M+ KD N CGV +
Sbjct: 318 RWGMKGYIMMA-KDYHNMCGVAS 339
>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 119/210 (56%), Gaps = 5/210 (2%)
Query: 144 KGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + + Q CGSCW+F GA+EG KL +S Q L+DCS YGN GC+GG
Sbjct: 124 KGYVTEVKMQGSCGSCWAFSAVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGF 183
Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
+++Q+++ + D PY G C A + + + E LK ALA
Sbjct: 184 MSQAFQYVIDNQGIDSDQSYPYKGVQQQCSYNPAQRAANCSKYSFLPEGDEGVLKEALAT 243
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
GP+SVAIDA++ F+FY +GVY D C ++HAVLAVGYG L G+ YW VKNSWS
Sbjct: 244 IGPISVAIDATRPLFTFYRSGVYNDPTCTKK---INHAVLAVGYGTLGGQDYWLVKNSWS 300
Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
WG+QGY+ MS KDN CG+ Y M
Sbjct: 301 LSWGDQGYIRMSRNKDNQCGIALYGCYPVM 330
>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
Length = 335
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 158 NQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 217
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G++ C ++ +T VN+T +ED LK A+A PVSVA
Sbjct: 218 KSNGGLETEEAY-PYTGKNGLCKFSSQNVGVKVTDSVNITLGAEDELKYAVALVRPVSVA 276
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F Y +GVY +C +P ++HAVLAVGYG G P+W +KNSW WG+
Sbjct: 277 FEVV-KGFKQYKSGVYTSTECGTTPMDVNHAVLAVGYGVEYGVPFWLIKNSWGADWGDNA 335
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + ++ CG+ T +Y
Sbjct: 336 YFKMEMGNDMCGIATCASY 354
>gi|326926970|ref|XP_003209669.1| PREDICTED: cathepsin H-like [Meleagris gallopavo]
Length = 323
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 117/198 (59%), Gaps = 3/198 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F TTG +E A + KL L++Q L+DC+ + N+GC GG +++++I+
Sbjct: 123 QGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYIL 182
Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
GL +D Y PY Q+ C A A + +N+T E ++ A+ KH PVS A
Sbjct: 183 YNKGLMGEDAY-PYRAQNGTCKFQPDKAVAFVRDVINITQYDEASMVEAVGKHNPVSFAF 241
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+ + F Y GVY + +C ++PD ++HAVLAVGYGE DG PYW VKNSW + WG GY
Sbjct: 242 EVTND-FMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGLPYWIVKNSWGSLWGMDGY 300
Query: 331 VLMSIKDNNCGVMTAPTY 348
L+ N CG+ +Y
Sbjct: 301 FLIERGKNMCGLAACASY 318
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 127/207 (61%), Gaps = 10/207 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F GA+EG +K L LS+Q L+DCS G GN GC+GG ++Q++
Sbjct: 131 NQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + GL +++ Y PY +D C A A TG+V++ P E AL A+A GP++VA
Sbjct: 191 LNNKGLDSEESY-PYEAKDGTCKYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
IDAS SF FY +G+Y++ C S LDH VL +GYG + + K YW VKNSW T W
Sbjct: 249 IDASHPSFQFYSSGIYFEPNC--SSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTYVTM 351
G G+ + K+N+CG+ TA +Y T+
Sbjct: 307 GMGGFFHIAKDKNNHCGIATAASYPTV 333
>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 123/218 (56%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMK--GY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA+K A+P R + + GY + DQ CGSCW+F TTGA+EG Y K S+Q L+
Sbjct: 102 KANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNEKTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GN GC+GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSGPFGNYGCNGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V E L+ + P +VA+D + F Y +G+Y + C SPD L+H VLAVGYG
Sbjct: 221 VHSGDEVELQNLVGCRRPAAVALDV-ESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
DG YW VKNSW T+WG GY+ M K N CG+ +
Sbjct: 278 IQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315
>gi|402856107|ref|XP_003892641.1| PREDICTED: cathepsin S isoform 2 [Papio anubis]
Length = 281
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 5/206 (2%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
G N L D CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG
Sbjct: 76 GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135
Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
R++Q+I+ + D PY D C + AT + + + ED LK +A
Sbjct: 136 TRAFQYIIDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANK 195
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
GPVSV +DAS SF Y +GVYY+ C + ++H VL VGYG L+GK YW VKNSW
Sbjct: 196 GPVSVGVDASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGR 252
Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
+G +GY+ M+ K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278
>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
Length = 358
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 115/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+E AY K LS+Q L+DC++ + N GC+GG +++++I
Sbjct: 157 DQGSCGSCWTFSTTGALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYI 216
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + +CH + VN+T +ED L A+ PVS+A
Sbjct: 217 KYNGGLDTEESY-PYAGVNGFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIA 275
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F FY GVY + C + ++HAVLAVGYG +G PYW +KNSW WG G
Sbjct: 276 FEVVS-GFRFYKGGVYTSDTCGRTQMDVNHAVLAVGYGVENGVPYWLIKNSWGEEWGVDG 334
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 335 YFKMELGKNMCGIATCASY 353
>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
Length = 335
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|86285730|gb|ABC94586.1| silicatein alpha [Hymeniacidon perlevis]
Length = 331
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 123/206 (59%), Gaps = 4/206 (1%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
++ DQ CG+ ++F GA+EGAY + H S+Q +IDCS YGN GC GG + +
Sbjct: 128 AVKDQGDCGASYAFSAMGALEGAYALAHNGNQESFSEQNIIDCSIPYGNYGCHGGNMYDA 187
Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+ +++ +G ++ P+LG+ + C+ T M+G V + SED L+ A+A GPV
Sbjct: 188 FLYVIANGGVAKESAYPFLGKQSSCNYNRNTRGTGMSGSVAIKSESEDDLQTAVANVGPV 247
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VAID + +F FY +GVY +C S L+HA++ GYG GK YW KNSWST WG
Sbjct: 248 AVAIDGANNAFRFYYSGVYDSSRC--SSTSLNHAMVVTGYGTYAGKKYWLAKNSWSTNWG 305
Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVTM 351
GYV+M+ K N CG+ T +Y T+
Sbjct: 306 QSGYVMMARGKYNQCGIATDASYPTL 331
>gi|48145879|emb|CAG33162.1| CTSH [Homo sapiens]
Length = 335
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|172050735|gb|ACB70169.1| cathepsin H transcript variant 3 [Sus scrofa]
Length = 251
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 50 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 109
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY GQD +C A A + N+T N E+A+ A+A + PVS A
Sbjct: 110 RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 168
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 169 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 227
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 228 YFLIERGKNMCGLAACASY 246
>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
Length = 355
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + KL L++Q L+DC+ + N+GC GG +++++I
Sbjct: 154 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 213
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
G+ +D Y PY G+D C + A A + N+T N E+A+ A+A + PVS A
Sbjct: 214 RYNKGIMGEDTY-PYRGEDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 272
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y G+Y C+ +PD ++HAVLAVGYGE G PYW VKNSW +WG +G
Sbjct: 273 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPHWGMKG 331
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ ++
Sbjct: 332 YFLIERGKNMCGLAACASF 350
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 120/201 (59%), Gaps = 4/201 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG KL LS Q L+DCS YGN+GC+GG ++Q+++
Sbjct: 140 QGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVI 199
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
+ D PY G++ C + A + + + +E ALK ALA GP+SVAID
Sbjct: 200 DNQGIDSDASYPYTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAID 259
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
A++ +F+FY +GVY D C+ ++H VLAVGYG LDG+ YW VKNSW +G+QGY+
Sbjct: 260 ATRPTFTFYRSGVYNDPNCSQK---VNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYI 316
Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
MS K++ CG+ Y M
Sbjct: 317 RMSRNKNDQCGIALYGCYPIM 337
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 10/234 (4%)
Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
S++ V Y S +P V + KG + + Q CG+CW+F GA+E +K K
Sbjct: 99 SQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGK 158
Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
L LS Q L+DCS YGN GC+GG ++Q+I+ +G+ ++ Y PY + C +
Sbjct: 159 LVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASY-PYKAMNGKCRYDS 217
Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
AT + + + SEDALK A+A GPVSVAIDAS SF Y +GVYY+ C +
Sbjct: 218 KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQN-- 275
Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
++H VL VGYG L+GK YW VKNSW +G+QGY+ M+ N+CG+ + P+Y
Sbjct: 276 -VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 328
>gi|253722774|pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform
Length = 312
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 97 VDWREKGYVTPVKNQGQCGSSWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPEGNEG 156
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A GFV++ P E AL
Sbjct: 157 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 214
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E DG
Sbjct: 215 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDGN 272
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 273 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 312
>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
Length = 336
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 126/213 (59%), Gaps = 5/213 (2%)
Query: 139 VRYEMKGY--NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
V + KG+ + + +Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+
Sbjct: 121 VDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNH 180
Query: 197 GCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GC GG +++++I+ +G+ +D Y PY G+D+ C A + N+T EDA
Sbjct: 181 GCQGGLPSQAFEYILYNNGIMGEDTY-PYQGKDSDCKFQPGKAIGFVKDVANITIYDEDA 239
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
+ A+A + PVS A + +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW
Sbjct: 240 MVEAVALYNPVSFAFEVTQ-DFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYW 298
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
VKNSW WG GY L+ N CG+ +Y
Sbjct: 299 IVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331
>gi|241913450|pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913451|pdb|3HHA|B Chain B, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913452|pdb|3HHA|C Chain C, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913453|pdb|3HHA|D Chain D, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|317455045|pdb|2XU1|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455046|pdb|2XU1|B Chain B, Cathepsin L With A Nitrile Inhibitor
gi|317455047|pdb|2XU1|C Chain C, Cathepsin L With A Nitrile Inhibitor
gi|317455048|pdb|2XU1|D Chain D, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A GFV++ P E AL
Sbjct: 65 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 134/234 (57%), Gaps = 10/234 (4%)
Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
S++ V Y S +P V + KG + + Q CG+CW+F GA+E +K
Sbjct: 107 SQWQANVTYKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWAFSAVGALEAQLKLKTGN 166
Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
L LS Q L+DCS YGN GC+GG +++Q+I+ +G+ ++ Y PY D C +
Sbjct: 167 LVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSY-PYKAMDGNCRYDS 225
Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
AT + + + SEDALK A+A GPVSVAIDA SF Y +GVYYD C +
Sbjct: 226 KHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSGVYYDPSCTQN-- 283
Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
++H VL VGYG L+G+ YW VKNSW +G QGY+ M+ N+CG+ + P+Y
Sbjct: 284 -VNHGVLVVGYGNLNGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASYPSY 336
>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
Length = 327
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 128/218 (58%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A + +P + + Y + DQ CGSCW+F +TGA+EG Y K + S+Q L+
Sbjct: 103 QAKGNDVPESIDWRDYGYVTEVKDQGQCGSCWAFSSTGAMEGQYIKKFRTTVSFSEQQLV 162
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DC+ YGN+GC+GG R+++++ ++GL T+ Y PY D +C + A +TG+
Sbjct: 163 DCTRNYGNSGCNGGWMERAFEYLRRNGLETESSY-PYRAVDDHCRYESQLGVAKVTGYYT 221
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+E +L + GPV+VA+D Q FS Y +G+Y E C S ++HAVLAVGYG
Sbjct: 222 EHSGNEVSLMNMVGGEGPVAVAVDV-QSDFSMYKSGIYQSETC--STYYVNHAVLAVGYG 278
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
G YW +KNSW ++WG+QGY+ + NN CG+ +
Sbjct: 279 TESGTDYWILKNSWGSWWGDQGYIRFARNRNNMCGIAS 316
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EGA+ K KL LS+Q L+DCS GN C GGE ++Q++
Sbjct: 222 DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 281
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G +D PYL +D C + + GF +V SE A+K ALAK PVS+AI
Sbjct: 282 LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAI 340
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
+A Q F FY GV +D C LDH VL VGYG + K +W +KNSW T WG
Sbjct: 341 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 396
Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
GY+ M++ ++ CG++ ++ M
Sbjct: 397 GYMYMAMHKGEEGQCGLLLDASFPVM 422
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 10/234 (4%)
Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
S++ V Y S +P V + KG + + Q CG+CW+F GA+E +K K
Sbjct: 107 SQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGK 166
Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
L LS Q L+DCS YGN GC+GG ++Q+I+ +G+ ++ Y PY + C +
Sbjct: 167 LVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASY-PYKAMNGKCRYDS 225
Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
AT + + + SEDALK A+A GPVSVAIDAS SF Y +GVYY+ C +
Sbjct: 226 KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQN-- 283
Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
++H VL VGYG L+GK YW VKNSW +G+QGY+ M+ N+CG+ + P+Y
Sbjct: 284 -VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 336
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 112/198 (56%), Gaps = 1/198 (0%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ+ CGSCW+F TTGA+E AY H K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 152 DQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 211
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G + PY +D + VN+T +ED LK A+A PVSVA
Sbjct: 212 KYNGGIALEKEYPYTAKDEASKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVAF 271
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
F Y GVY + C N+P ++HAVLAVGYG + PYW +KNSW + WG+ GY
Sbjct: 272 QVVD-GFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGY 330
Query: 331 VLMSIKDNNCGVMTAPTY 348
M + N CGV T +Y
Sbjct: 331 FKMELGKNMCGVATCASY 348
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/240 (39%), Positives = 139/240 (57%), Gaps = 26/240 (10%)
Query: 134 KDAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
KD V + +G S + +Q CGSCWSF +TGA+EGA+ +K ++ LS+Q L+DCS
Sbjct: 134 KDNPNVDWRKQGAVSHVKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKR 193
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPN 251
YGNNGC+GG ++ +++ G ++ PY D + C +T A +++ N+
Sbjct: 194 YGNNGCNGGLMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAG 253
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
+E L+ L GPVSVAIDAS +SF FY +G++Y +C++S LDH VLAVG+G+ G
Sbjct: 254 NEKHLETVLRNVGPVSVAIDASPRSFRFYKSGIFYAPECSSSQ--LDHGVLAVGFGK--G 309
Query: 312 KP-------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
P Y+ VKNSW + WG+ G++ MS + NNCG+ T TY T+
Sbjct: 310 NPESNFENKVSFIHDDTKNNEYYIVKNSWGSDWGSNGFIYMSKNRKNNCGIATMATYPTI 369
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 132/228 (57%), Gaps = 6/228 (2%)
Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
W R ++ D + R E ++ +Q CGSCW+F GA+E +K L LS Q
Sbjct: 110 WARQGASAPDTVDWR-EKGCVTNVKNQGSCGSCWAFSAVGALECQLKLKTGNLVSLSPQN 168
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
L+DCS +GN+GC+GG ++Q+++ +G+ ++ Y PY GQ C AT +
Sbjct: 169 LVDCSSAFGNHGCNGGYISAAFQYVIYNNGIDSEASY-PYTGQSGTCRYNLQGRAATCSR 227
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+V++ +E ALK A+A GPVSVAIDAS+ SF + GVY D C ++ ++H VL V
Sbjct: 228 YVDLPSGNEAALKDAVANFGPVSVAIDASRPSFFLFRKGVYDDPSCTSAH--INHGVLVV 285
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
GYG DG YW VKNSW +G+QGY+ ++ DN CG+ + TY M
Sbjct: 286 GYGTEDGIDYWLVKNSWGVSFGDQGYIKIARNHDNRCGIASQCTYPLM 333
>gi|395755765|ref|XP_002833453.2| PREDICTED: putative cathepsin L-like protein 6-like, partial [Pongo
abelii]
Length = 213
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/215 (43%), Positives = 130/215 (60%), Gaps = 11/215 (5%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + DQ+ CGSCW+F TGA+EG + K KL LS+Q L+DCS GN GC+GG
Sbjct: 3 KGYVTPVKDQAQCGSCWAFSATGALEGQMFWKTGKLTSLSEQNLVDCSGPQGNEGCNGGF 62
Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
S+Q++ ++ GL ++ Y Y G+ C + A TGF ++ P+ E L A+A
Sbjct: 63 MDNSFQYVQENGGLDSEASY-SYEGKVKTCRYNPKYSAANDTGFADI-PSWEKDLAKAVA 120
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY----GELDGKPYWQV 317
GP+SVA+DAS SF FY G+Y++ C+ P+GLDHA+L V Y + D YW V
Sbjct: 121 TVGPISVAVDASHVSFQFYKKGIYFEPCCD--PEGLDHAMLVVDYSYEGADSDNNKYWLV 178
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
KNSW WG GY+ M+ + NNCG+ TA +Y T+
Sbjct: 179 KNSWGKNWGMDGYIKMAKDRKNNCGIATAASYPTV 213
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 88/205 (42%), Positives = 120/205 (58%), Gaps = 4/205 (1%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGSCW+F GA+EG K KL LS Q L+DCS YGN+GC+GG ++
Sbjct: 136 SVKTQGSCGSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAF 195
Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
Q+++ + D PY G+ CH + A + + + E ALK ALA GP+S
Sbjct: 196 QYVIDNQGIDSDASYPYTGRSDQCHYNPSYRAANCSSYNFLPEGDEGALKQALATIGPIS 255
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VAIDA++ F FY +GVY D C+ ++H VLAVGYG L+G+ YW VKNSW T +G+
Sbjct: 256 VAIDATRPRFIFYRSGVYNDPSCSQE---VNHGVLAVGYGTLNGQDYWLVKNSWGTKFGD 312
Query: 328 QGYVLMSIKDNN-CGVMTAPTYVTM 351
QGY+ M+ N+ CG+ Y M
Sbjct: 313 QGYIRMARNQNDQCGIAMYGCYPIM 337
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 121/203 (59%), Gaps = 8/203 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 190
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
++G ++ PY +D C A A TGFV++ P E AL A+A GP+SVA+
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAM 249
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW WG
Sbjct: 250 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 307
Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
GY+ + ++N+CG+ TA +Y
Sbjct: 308 MDGYIKIAKDRNNHCGLATAASY 330
>gi|318844127|ref|NP_001187181.1| cathspsin H precursor [Ictalurus punctatus]
gi|196475594|gb|ACG76366.1| cathspsin H [Ictalurus punctatus]
Length = 326
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/215 (39%), Positives = 126/215 (58%), Gaps = 3/215 (1%)
Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
D+I R + + +Q CGSCW+F TTG +E + KL +L++Q L+DC+ +
Sbjct: 111 DSIDWRKKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFN 170
Query: 195 NNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
N+GC+GG +++++IM GL T+DDY PY+G+D C A A + VN+T E
Sbjct: 171 NHGCNGGLPSQAFEYIMYNKGLMTEDDY-PYVGRDGPCKFDPKLAAAFVKDVVNITKYDE 229
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
+ A+A+ PVS+A + + F Y +GVY +C+N+ + ++HAVLAVGY E +G P
Sbjct: 230 MGIVDAVARLNPVSIAFEVLPE-FMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTP 288
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
YW VKNSW WG GY + N CG+ +Y
Sbjct: 289 YWIVKNSWGPQWGIDGYFYIERGQNMCGLAACASY 323
>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 398
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 119/202 (58%), Gaps = 7/202 (3%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCW+F G++EG +++ L LS+Q L+DCS N+GC+GG +++
Sbjct: 198 IKDQGQCGSCWAFSAIGSLEGQHFINTGNLVSLSEQQLVDCS--LKNDGCNGGMLSTAFK 255
Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I G ++ DY PY ++ C + A A +TG+ + ED+L A+ GP+S
Sbjct: 256 YIESVAGEESETDY-PYTAKNGTCQYDPSKAVAKVTGYTALPSGDEDSLNDAVTSKGPIS 314
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
V IDAS KSF Y GVYY++ C S LDH VL VGYG D YW VKNSW T WG
Sbjct: 315 VCIDASHKSFQLYSEGVYYEKSC--SYFLLDHCVLVVGYGTEDTADYWLVKNSWGTSWGM 372
Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
+GY+ MS + NNCG+ T Y
Sbjct: 373 KGYIRMSRNRKNNCGIATNAAY 394
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 127/220 (57%), Gaps = 8/220 (3%)
Query: 137 IPVRY---EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
+P ++ E + DQ CGSCWSF TTG VEGAY++K KL LS+Q L+DC+
Sbjct: 110 LPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCA-KE 168
Query: 194 GNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
GC GG ++ ++I G+ +++DY PY G D C ++ A ++ F + N
Sbjct: 169 DCYGCSGGYMDKALEYIETAGGIMSENDY-PYEGIDDKCRFDSSKVAAKISNFTYIKKND 227
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
ED LK A+ GP+SVAIDAS +F Y +G+ D C + + L+H VL VGYG +
Sbjct: 228 EDDLKNAVIAKGPISVAIDASF-NFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQ 286
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ MS K+N CG+ T TY T+
Sbjct: 287 DYWIVKNSWGADWGMDGYIWMSRNKNNQCGIATDATYPTI 326
>gi|157128512|ref|XP_001661463.1| cathepsin l [Aedes aegypti]
gi|91992510|gb|ABE72971.1| cathepsin L [Aedes aegypti]
gi|108872552|gb|EAT36777.1| AAEL011167-PA [Aedes aegypti]
Length = 327
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 134/224 (59%), Gaps = 6/224 (2%)
Query: 131 KASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
+A+ + + KG + DQ CGSC++F GA+EGA + K KL LS+Q ++DC
Sbjct: 107 QATTTVTSIDWRTKGAVTPVKDQGRCGSCYAFSALGALEGATFTKTGKLVNLSEQNIVDC 166
Query: 190 SWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
+ YGN GC+GG +++I +G+ T Y A C AT TG+V +
Sbjct: 167 TSTYGNYGCNGGSMTSVFKYIKTNNGVDTGAFYPYKAAVAATCGFNPAYVGATDTGYV-L 225
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
P +E AL+ A+A GPVSVAIDAS SF Y +G+YY+ C++S L+H VL VGYG
Sbjct: 226 LPANETALQTAVANIGPVSVAIDASNPSFQQYKSGIYYEPLCSSSK--LNHGVLVVGYGT 283
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+G YWQVKNSW T WG +GY+ M+ K+N+CG+ + +Y T+
Sbjct: 284 ENGTDYWQVKNSWGTTWGEKGYIKMARNKNNHCGIASFASYPTV 327
>gi|308322047|gb|ADO28161.1| cathepsin H [Ictalurus furcatus]
Length = 326
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/215 (39%), Positives = 126/215 (58%), Gaps = 3/215 (1%)
Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
D+I R + + +Q CGSCW+F TTG +E + KL +L++Q L+DC+ +
Sbjct: 111 DSIDWRKKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFN 170
Query: 195 NNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
N+GC+GG +++++IM GL T+DDY PY+G+D C A A + VN+T E
Sbjct: 171 NHGCNGGLPSQAFEYIMYNKGLMTEDDY-PYVGRDGPCKFDPKLAAAFVKDVVNITKYDE 229
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
+ A+A+ PVS+A + + F Y +GVY +C+N+ + ++HAVLAVGY E +G P
Sbjct: 230 MGIVDAVARLNPVSIAFEVLPE-FMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTP 288
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
YW VKNSW WG GY + N CG+ +Y
Sbjct: 289 YWIVKNSWGPQWGIDGYFYIERGQNMCGLAACASY 323
>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
Length = 353
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 116/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A+ K+ VLS+Q L+DC+ GY N GC GG +++++I
Sbjct: 149 NQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCSGGLPSQAFEYI 208
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T+D Y PY D C + A + VN+T +ED L A+A + PVS+A
Sbjct: 209 RYNGGLDTEDSY-PYTAHDGKCMYNQNSIGAKVYDVVNITEGAEDELIHAVAFNRPVSIA 267
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ K F FY +GVY C PD ++HAVLAVGY PYW +KNSW +G G
Sbjct: 268 YEV-LKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDG 326
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 327 YFYMEMGKNMCGIATCASY 345
>gi|16506815|gb|AAL23962.1|AF426248_1 truncated cathepsin H [Homo sapiens]
Length = 323
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N GC GG +++++I
Sbjct: 122 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 181
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 182 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 240
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 241 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 299
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318
>gi|27532972|ref|NP_083912.2| cathepsin Q precursor [Mus musculus]
gi|27960482|gb|AAO27845.1|AF456461_1 cathepsin Q [Mus musculus]
gi|16445011|gb|AAK00505.1| cathepsin Q precursor [Mus musculus]
gi|71050990|gb|AAH99415.1| Cathepsin Q [Mus musculus]
gi|148709365|gb|EDL41311.1| cathepsin Q, isoform CRA_a [Mus musculus]
Length = 343
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/227 (40%), Positives = 130/227 (57%), Gaps = 14/227 (6%)
Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
KDA+P V + +GY + +Q C SCW+F TGA+EG + K KL LS Q L+DCS
Sbjct: 122 KDALPKFVDWRNEGYVTRVRNQRNCNSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLVDCS 181
Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
GN GC G + +Q+++ + GL Q Y PY G++ C + A +TGFV V
Sbjct: 182 RPQGNRGCRWGNTYNGFQYVLHNGGLEAQATY-PYEGKEGLCRYNPKNSAAKITGFV-VL 239
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-- 307
P SED L A+A GP++ I SF FY GVYY+ C +S ++HAVL +GYG
Sbjct: 240 PESEDVLMDAVATKGPIATGIHVVSSSFRFYDGGVYYEPNCTSS---VNHAVLIIGYGYV 296
Query: 308 --ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
E DG YW +KNSW WG GY++++ ++N+C + + Y T+
Sbjct: 297 GNETDGNNYWLIKNSWGRRWGLSGYMMIAKDRNNHCAIASLAQYPTV 343
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EGA+ K KL LS+Q L+DCS GN C GGE ++Q++
Sbjct: 221 DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 280
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G +D PYL +D C + + GF +V SE A+K ALAK PVS+AI
Sbjct: 281 LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAI 339
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
+A Q F FY GV +D C LDH VL VGYG + K +W +KNSW T WG
Sbjct: 340 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 395
Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
GY+ M++ ++ CG++ ++ M
Sbjct: 396 GYMYMAMHKGEEGQCGLLLDASFPVM 421
>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 131/224 (58%), Gaps = 7/224 (3%)
Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
+S DA+P + + KGY +S+ Q CGSCW+F + GA+EG KL LS Q L+D
Sbjct: 110 SSGDAVPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVD 169
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS YGN GC+GG ++Q+++ +G D PY G C +++ A T + V
Sbjct: 170 CSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGVQQQCSYSSSQRAANCTKYYFV 229
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
E+ALK A+A GP+SVAIDA++ F Y +GVY D C+ ++HAVL VGYG
Sbjct: 230 RQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYNDPTCSKR---VNHAVLVVGYGT 286
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
L G+ YW VKNSW T +G+ GY+ M+ NN CG+ + Y M
Sbjct: 287 LSGQDYWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACYPVM 330
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 125/222 (56%), Gaps = 12/222 (5%)
Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
AIP V + +KG + DQ CGSCW+F T A EG + KL LS+Q L+DC
Sbjct: 123 AIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTK 182
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
+ GC+GG +++I+K+G T + PY D C+ A TT A +TG+ V NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCNTATTTPVAKITGYEKVPVNS 242
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
E +L A+A P+SV+IDAS SF FY +G+Y E C LDH V AVGYG +G
Sbjct: 243 EKSLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297
Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
YW VKNSW T WG +GY+ M + K+ CG+ +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIAAKEGLCGIAMDSSYPT 339
>gi|444522624|gb|ELV13407.1| Cathepsin L1 [Tupaia chinensis]
Length = 307
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 91/219 (41%), Positives = 128/219 (58%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F +TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 92 VDWREKGYVTPVKNQGDCGSCWAFSSTGALEGQMFRKTGKLVSLSEQNLVDCSISEGNFG 151
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++ ++ +G ++ PY D C + A TGFV++ P E AL+
Sbjct: 152 CNGGIMDNAFLYVKDNGGLDSEESYPYEAVDDSCKYNPKNSAANDTGFVHL-PVEEKALE 210
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD----GKP 313
A+A GP+SV IDAS SF FY G+Y++ C S LDHAVL VGYG ++
Sbjct: 211 KAVATVGPISVGIDASADSFQFYKEGIYFEPNC--SSVELDHAVLVVGYGVMEEASTNNK 268
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+W VKNSW WG GY++M+ ++NNCG+ + Y T+
Sbjct: 269 FWLVKNSWGKNWGMDGYIMMAKDRNNNCGIASYAMYPTV 307
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/198 (42%), Positives = 116/198 (58%), Gaps = 3/198 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI- 210
Q CGSCW+F TTGA+E AY K LS+Q L+DC+ + N GC+GG +++++I
Sbjct: 157 QGKCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 216
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
GL T++ Y PY G++ C + + VN+T +E LK A+A PVSVA
Sbjct: 217 FNGGLDTEEAY-PYTGKNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAF 275
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
+ K F Y +GVY +C ++P ++HAVLAVGYG +G PYW +KNSW WG GY
Sbjct: 276 EVV-KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGY 334
Query: 331 VLMSIKDNNCGVMTAPTY 348
M + N CGV T +Y
Sbjct: 335 FKMEMGKNMCGVATCASY 352
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/292 (36%), Positives = 150/292 (51%), Gaps = 19/292 (6%)
Query: 67 VSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIIL------EKWRLVTSEGEKV 120
V D + + Y F +N + W + +E G + E++R V G V
Sbjct: 35 VQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATVFADLTNEEYRAVY-LGMNV 93
Query: 121 SKYSLWVRYNKASKDAIPVRYEMKGYNS-----LLDQSVCGSCWSFGTTGAVEGAYYMKH 175
+ + + PVR + N+ + DQ CGSCW+F TTGAVEGA+ +
Sbjct: 94 DASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKDQGQCGSCWAFSTTGAVEGAHQIAT 153
Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHI 233
LS+Q L+DCS YGN+GC GG + +I+K G+ T++ Y PY +D+Y C
Sbjct: 154 GNFVSLSEQQLMDCSRSYGNHGCQGGLMDSAMSYIVKQGGINTEESY-PYEMRDSYTCKY 212
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
A ++G+ N+ SE L L GPV++A+DAS SF Y +GV+YD C S
Sbjct: 213 NPANNGAKLSGYSNIKRGSEADLAAKL-NIGPVAIALDASHSSFQLYKSGVFYDPAC--S 269
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMT 344
L H VLAVGYG YW VKNSW T WG+ GY+ + ++N+CGV T
Sbjct: 270 STSLSHGVLAVGYGTEGSSAYWIVKNSWGTRWGDAGYIWIAKDRNNHCGVAT 321
>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 335
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 92/221 (41%), Positives = 132/221 (59%), Gaps = 13/221 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ C SCW+F TGA+EG + K KL LS+Q L+DCS NNG
Sbjct: 120 VNWTQRGYVTPVKDQGSCHSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPESNNG 179
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C GG +++Q++ + GL +++ Y PY +++ C + A TGFVN+ P E A
Sbjct: 180 CSGGLMDKAFQYVKNNGGLDSEESY-PYTAKESRNCLYKPEFSAANNTGFVNIPP-QEKA 237
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GP+SVA+DAS KSF FY +G+Y+D C + ++H VL VGYG + D
Sbjct: 238 LMNAVASVGPISVAVDASLKSFRFYKSGIYFDPACRLA---VNHGVLVVGYGFEGTDPDK 294
Query: 312 KPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW VKNSW WG GY+ + ++N+CG+ A +Y T+
Sbjct: 295 NKYWLVKNSWGKSWGADGYIKIAKDRNNHCGIARAASYPTV 335
>gi|164519063|ref|NP_001002813.2| cathepsin Q-like 2 precursor [Rattus norvegicus]
gi|67678196|gb|AAH97257.1| Ctsql2 protein [Rattus norvegicus]
gi|149039735|gb|EDL93851.1| rCG24202 [Rattus norvegicus]
Length = 343
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 126/223 (56%), Gaps = 12/223 (5%)
Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
+DA+P + + +GY + +Q C SCW+F GA+EG + K KL LS Q L+DCS
Sbjct: 122 RDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCS 181
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
GN GC GG + ++Q+++++G + PY G++ C A A +T FV + P
Sbjct: 182 KPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVAL-P 240
Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--- 307
ED L ALA GPV+ I S FY G+Y++ KCNN ++HAVL VGYG
Sbjct: 241 EDEDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNR---VNHAVLVVGYGFEG 297
Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
E DG YW +KNSW WG +GY+ + ++N+CG+ T Y
Sbjct: 298 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQY 340
>gi|54400500|ref|NP_001005999.1| cathepsin L, like [Danio rerio]
gi|53734591|gb|AAH83200.1| Cathepsin L, like [Danio rerio]
Length = 261
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 141/222 (63%), Gaps = 13/222 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ + KGY + DQ CGSCW+F +TGA+EG + K KL LS+Q L+DCS GNNG
Sbjct: 44 IDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNG 103
Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
CDGG +++Q++ +GL +++ Y PYL D CH + A +TGFV++ E A
Sbjct: 104 CDGGLMDQAFQYVQDNNGLDSEESY-PYLATDDQPCHYDPRYSAANVTGFVDIPSGKEHA 162
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPV+VAIDA +SF FY +G+YY + C S + LDH VL VGYG ++ G
Sbjct: 163 LMKAVAAVGPVAVAIDAGHESFQFYQSGIYYGKAC--STEELDHGVLVVGYGYEGVDVAG 220
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
+ YW VKNSW+ WG++GY+ M+ KD N+ G+ T+ +Y M
Sbjct: 221 RRYWIVKNSWTDRWGDKGYIYMA-KDLKNHRGIATSASYPLM 261
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 120/207 (57%), Gaps = 5/207 (2%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY S+ +Q CGSCW+F + GA+EG KL LS Q L+DCS YGN GC+GG
Sbjct: 124 KGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGY 183
Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
+++Q+++ +G + PY G C + A T + V+ E ALK ALA
Sbjct: 184 MSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALKEALAN 243
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
GPVSVAIDA++ F FY +GVY D C ++H VLAVGYG L G+ YW VKNSW
Sbjct: 244 IGPVSVAIDATRPQFIFYRSGVYDDPSCTQK---VNHGVLAVGYGTLSGQDYWLVKNSWG 300
Query: 323 TYWGNQGYVLMSIKDNN-CGVMTAPTY 348
+G+ GY+ ++ NN CG+ + Y
Sbjct: 301 AGFGDGGYIRIARNKNNMCGIASEACY 327
>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
Length = 334
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 128/200 (64%), Gaps = 6/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG++EG ++ K +KL LS+Q L+DC GNNGC GG ++++I
Sbjct: 136 NQGQCGSCWAFSTTGSLEGQHFRKMRKLVSLSEQNLVDCMQKLGNNGCGGGLMDNAFKYI 195
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+ Y PY D CH + AT TGF ++ E++ A+A GPVSVA
Sbjct: 196 KANKGIDTELSY-PYNATDGVCHFKKSGVGATATGFEDIPARDENSWD-AVAPVGPVSVA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS +SF FY GV + +C S D LDH VL VGYG DG+ YW VKNSW T WG++G
Sbjct: 254 IDASHESFQFYSEGVLDEPEC--SSDQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEG 311
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ KDN CG+ ++ +Y
Sbjct: 312 YIYMTRNKDNQCGIASSASY 331
>gi|39930363|ref|NP_058817.1| cathepsin J precursor [Rattus norvegicus]
gi|84028185|sp|Q63088.2|CATJ_RAT RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|28196048|gb|AAL26793.2| cathepsin P [Rattus norvegicus]
gi|66910531|gb|AAH97263.1| Cathepsin J [Rattus norvegicus]
gi|149039736|gb|EDL93852.1| cathepsin J [Rattus norvegicus]
Length = 334
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 129/215 (60%), Gaps = 11/215 (5%)
Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
+ +GY + +Q CGSCW+F GA+EG + K L LS Q L+DCS GNNGC
Sbjct: 120 WRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCR 179
Query: 200 GGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
G +++ +++K+ GL + Y PY G+D C + A+A +TGFVN+ PN E L +
Sbjct: 180 WGTAHQAFNYVLKNKGLEAEATY-PYEGKDGPCRYHSENASANITGFVNLPPN-ELYLWV 237
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPY 314
A+A GPVS AIDAS SF FY GVY++ C S ++HAVL VGYG E DG Y
Sbjct: 238 AVASIGPVSAAIDASHDSFRFYSGGVYHEPNC--SSYVVNHAVLVVGYGFEGNETDGNNY 295
Query: 315 WQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
W +KNSW WG G++ + ++N+CG+ + ++
Sbjct: 296 WLIKNSWGEEWGINGFMKIAKDRNNHCGIASQASF 330
>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
Length = 247
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 121/203 (59%), Gaps = 8/203 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 44 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 103
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
++G ++ PY +D C A A TGFV++ P E AL A+A GP+SVA+
Sbjct: 104 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAM 162
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW WG
Sbjct: 163 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 220
Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
GY+ + ++N+CG+ TA +Y
Sbjct: 221 MDGYIKIAKDRNNHCGLATAASY 243
>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
Length = 335
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
Length = 336
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 135 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 194
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D+ C A + N+T EDA+ A+A + PVS A
Sbjct: 195 LYNKGIMGEDTY-PYQGKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFA 253
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 254 FEVTQ-DFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 312
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 313 YFLIERGKNMCGLAACASY 331
>gi|94448670|emb|CAI91573.1| silicatein a4 [Lubomirskia baicalensis]
gi|312386085|gb|ADQ74587.1| silicatein alpha 4 [Lubomirskia baicalensis]
Length = 326
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 128/202 (63%), Gaps = 5/202 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+ ++F TGA+EGA + + K +LS+Q +IDCS YGN+GC GG+ + + ++++
Sbjct: 128 QGQCGASYAFAATGALEGASALSNDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVI 187
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T+ Y + G+ + C ++ + A+ TG +++ SE L A+A GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFQGKQSSCQYSSKNSGASATGVISIASGSETDLFAAVATVGPVAVAV 246
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA+ +F FY +GV+ C+N+ L+HA+L GYG +GK YW VKNSWS WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSNTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGY 304
Query: 331 VLM-SIKDNNCGVMTAPTYVTM 351
++M K N CG+ T Y T+
Sbjct: 305 IMMVRNKYNQCGIATDALYPTL 326
>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
Length = 334
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 126/219 (57%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + Q C +CW+F TGA+E + KL LS Q L+DCS GNNG
Sbjct: 119 VDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNG 178
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG+ + ++Q+++ +G + PY G+D C + A +TGFV++ P SED L
Sbjct: 179 CLGGDTYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKAEITGFVSL-PQSEDILM 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP++ IDAS +SF Y G+Y++ C S D + H VL VGYG E DG
Sbjct: 238 AAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHGVLVVGYGFKGIETDGNH 295
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW +KNSW WG +GY+ L K+N+CG+ + Y T+
Sbjct: 296 YWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 120/207 (57%), Gaps = 5/207 (2%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY S+ +Q CGSCW+F + GA+EG KL LS Q L+DCS YGN GC+GG
Sbjct: 124 KGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGY 183
Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
+++Q+++ +G + PY G C + A T + V+ E ALK ALA
Sbjct: 184 MSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALKEALAN 243
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
GPVSVAIDA++ F FY +GVY D C ++H VLAVGYG L G+ YW VKNSW
Sbjct: 244 IGPVSVAIDATRPQFIFYRSGVYDDPSCTQK---VNHGVLAVGYGTLSGQDYWLVKNSWG 300
Query: 323 TYWGNQGYVLMSIKDNN-CGVMTAPTY 348
+G+ GY+ ++ NN CG+ + Y
Sbjct: 301 AGFGDGGYIRIARNKNNMCGIASEACY 327
>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
Length = 335
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|354504284|ref|XP_003514207.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
Length = 334
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 127/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY S+ Q+ C SCW+F GA+EG + K L LS Q L+DCS +GNNG
Sbjct: 119 VNWRKKGYVTSVKKQAYCNSCWAFAVNGAIEGQMFKKTGNLTRLSVQNLVDCSKPHGNNG 178
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CD G+ + +Y++++ +G + PY G++ C + A +TGFV++ P SE++L
Sbjct: 179 CDWGDPYIAYEYVLHNGGVEAEATYPYEGKEGPCRYNPKYSAANITGFVSL-PKSEESLM 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+S ID + F FY G++YD KC+N D ++H VL VGYG E DG
Sbjct: 238 AAVATIGPISAGIDIASDFFMFYKKGIFYDPKCHN--DTVNHVVLVVGYGFEGNETDGNN 295
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
YW VKNS+ WG +GY+ ++ NN C + + Y
Sbjct: 296 YWLVKNSYGKKWGLRGYMKIAKDQNNHCAIASYAHY 331
>gi|29708|emb|CAA30428.1| cathepsin H [Homo sapiens]
Length = 248
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N GC GG +++++I
Sbjct: 47 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 106
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D YC A + N+T E+A+ A+A + PVS A
Sbjct: 107 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 165
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 166 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 224
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 225 YFLIERGKNMCGLAACASY 243
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 126/207 (60%), Gaps = 10/207 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F GA+EG +K L LS+Q L+DCS GN GC+GG ++Q++
Sbjct: 131 NQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + GL +++ Y PY +D C A A TG+V++ P E AL A+A GP+++A
Sbjct: 191 LNNKGLDSEESY-PYEAKDGTCKYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAIA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
IDAS SF FY +G+YY+ C S LDH VL VGYG + + K YW VKNSW + W
Sbjct: 249 IDASHPSFQFYSSGIYYEPNC--SSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSW 306
Query: 326 GNQGYV-LMSIKDNNCGVMTAPTYVTM 351
G G+ + K+N+CGV TA +Y T+
Sbjct: 307 GMGGFFHIAKDKNNHCGVATAASYPTV 333
>gi|463046|gb|AAA49207.1| cysteine proteinase [Cyprinus carpio]
Length = 331
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 127/220 (57%), Gaps = 7/220 (3%)
Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
+S AIP + + KGY +S+ +Q CGSCW+F + GA+EG KL LS Q L+D
Sbjct: 110 SSGAAIPDSLDWREKGYVSSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVD 169
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS YGN GC GG ++Q+++ +G + PY G C + A T + V
Sbjct: 170 CSSSYGNYGCGGGLMSAAFQYVIDNGGIDSESSYPYEGVQGQCRYNPSQLAANCTKYYYV 229
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
E+ALK A+A GP+SVAIDA+ F Y +GVY D C + ++HAVLAVGYG
Sbjct: 230 RQGDEEALKQAVANIGPISVAIDATHPQFILYRSGVYNDPSCTTN---INHAVLAVGYGA 286
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPT 347
+ G+ +W VKNSW T +G+ GY+ M+ NN CG+ P+
Sbjct: 287 IAGQDFWLVKNSWGTGFGDGGYIRMARNQNNMCGIAHTPS 326
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 116/200 (58%), Gaps = 6/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG--YGNNGCDGGEDFRSYQW 209
Q CGSCW+F GA+E +K KL LS Q L+DCS Y N+GC+GG ++Q+
Sbjct: 148 QGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQY 207
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
I+ + D PY +D C AT + + + SEDALK A+A GPVSV
Sbjct: 208 IIDNNGIDSDASYPYKAKDGKCQYNPANRAATCSRYTELPYGSEDALKEAVANKGPVSVG 267
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYYD C + ++H VL GYG LDGK YW VKNSW +G++G
Sbjct: 268 IDASLPSFFLYKSGVYYDPSCTQN---VNHGVLVTGYGNLDGKDYWLVKNSWGLSFGDKG 324
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ ++ + N+CG+ P+Y
Sbjct: 325 YIRIARNRGNHCGIANFPSY 344
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 118/200 (59%), Gaps = 4/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q+ CGSCW+F TTGA+E A+ K+ +LS+Q L+DC+ + N GC GG +++++I
Sbjct: 128 NQASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYI 187
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T+D Y PY +D+ C T A + VN+T +E LK A+A PVSVA
Sbjct: 188 RYNGGIDTEDSY-PYNAKDSQCRFHKNTIGAQVWDVVNITEGAETQLKHAIATMRPVSVA 246
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE-LDGKPYWQVKNSWSTYWGNQ 328
+ F Y GVY C+ P ++HAVLAVGYGE +G PYW +KNSW WG
Sbjct: 247 FEVVH-DFRLYNGGVYTSLNCHTGPQTVNHAVLAVGYGEDENGVPYWIIKNSWGADWGMN 305
Query: 329 GYVLMSIKDNNCGVMTAPTY 348
GY M + N CGV T +Y
Sbjct: 306 GYFNMEMGKNMCGVATCASY 325
>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
Length = 335
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 86/197 (43%), Positives = 123/197 (62%), Gaps = 6/197 (3%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ +Q CGSCW+F GA+EG KL LS Q L+DCS YGN+GCDGG ++
Sbjct: 134 SVKNQGSCGSCWAFSAVGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAF 193
Query: 208 QWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Q++++ G+ ++ Y PY+G + CH + A + + + E+ALK A+A GP+
Sbjct: 194 QYVIENQGIESEASY-PYIGLEQQCHYNPEESAANCSQYHFLPEKDEEALKEAIATIGPI 252
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
SVAIDAS+ +F+FY +GVY D C+ + ++H VLAVGYG + W VKNSW TY+G
Sbjct: 253 SVAIDASKPTFTFYSSGVYDDPTCS---EVINHGVLAVGYGTQSTQDSWLVKNSWGTYFG 309
Query: 327 NQGYVLMSI-KDNNCGV 342
+ GY+ MS K N CG+
Sbjct: 310 DSGYIRMSRNKGNQCGI 326
>gi|215261455|pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex
With Its Propeptide
Length = 224
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTGA+EGA+ K KL LS+Q L+DCS GN C GGE ++Q++
Sbjct: 24 DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 83
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G +D PYL +D C + + GF +V SE A+K ALAK PVS+AI
Sbjct: 84 LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKS-PVSIAI 142
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
+A Q F FY GV +D C LDH VL VGYG + K +W +KNSW T WG
Sbjct: 143 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 198
Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
GY+ M++ ++ CG++ ++ M
Sbjct: 199 GYMYMAMHKGEEGQCGLLLDASFPVM 224
>gi|148575301|gb|ABQ95351.1| secreted cathepsin L2 [Fasciola hepatica]
Length = 326
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA+K A+P + + Y + DQ CGSCW+F TTGAVEG + + A S+Q L+
Sbjct: 102 KANKLAVPESIDWRDYYYVTEVKDQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DC+ +GN GC GG +Y+++ +GL T+ Y PY + C A A +TG+
Sbjct: 162 DCTRDFGNYGCGGGYMENAYEYLKHNGLETESYY-PYQAVEGPCQYDGRLAYAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V E LK + GP +VA+DA F Y +G+Y + C PD L HAVLAVGYG
Sbjct: 221 VHSGDEIELKNLVGTEGPAAVALDAD-SDFMMYQSGIYQSQTC--LPDRLTHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
DG YW VKNSW T+WG GY+ + + N CG+ +
Sbjct: 278 SQDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIAS 315
>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
Length = 347
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 132/237 (55%), Gaps = 28/237 (11%)
Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
A + + KG + +Q CG CWSF TTG+ EGA + L LS+Q LIDCS YG
Sbjct: 112 AASIDWRTKGAVTPIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYG 171
Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNS 252
NNGC+GG ++++I+ + G+ T+ Y PY +D C AT++ + NVT S
Sbjct: 172 NNGCNGGLMTLAFEYIINNKGIDTESSY-PYTAKDGKTCKYNPANIGATLSSYSNVTSGS 230
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----- 307
E +L+ A A GPVSVAIDAS SF Y +G+YY+ C S LDH VL VGY
Sbjct: 231 EPSLESA-ANIGPVSVAIDASHNSFQLYSSGIYYEPAC--STTSLDHGVLVVGYASGSGS 287
Query: 308 ------------ELDGKP---YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
++G YW VKNSW T WG +GY+LMS ++NNCG+ T ++
Sbjct: 288 GSGSGSGSGSGLAVEGASSGNYWIVKNSWGTSWGIEGYILMSKDRNNNCGIATMASF 344
>gi|440290877|gb|ELP84181.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
Length = 315
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 120/202 (59%), Gaps = 7/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGA--YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
DQ CGSC+SFG ++EG + ++ LS++ ++ CS NNGC+GG Y
Sbjct: 114 DQKSCGSCYSFGALASLEGRLLFLGGNENTLDLSEEQIMRCS---ANNGCNGGTGSIVYD 170
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+I ++G+ + DY PY + C + T AT+ G+ +T DALK AL G V V
Sbjct: 171 YIKENGVVQEKDY-PYTATEGNCGVDLTKKFATIEGYKYITRKDNDALKSALVD-GVVDV 228
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
+IDAS FS Y +GVY+D KC S L+H + AVGYG+L+G+ YW VKNSW T WG
Sbjct: 229 SIDASSPMFSLYKSGVYHDTKCKTSLFSLNHEIAAVGYGKLNGEEYWIVKNSWGTNWGES 288
Query: 329 GYVLMSIKDNNCGVMTAPTYVT 350
GY+LM+++ N CGV + P T
Sbjct: 289 GYILMAMEGNTCGVTSEPIVPT 310
>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
Length = 367
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++E AY K LS+Q L+DC+ Y N GC GG +++++I
Sbjct: 167 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 226
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + CH + VN+T +ED LK A+ PVSVA
Sbjct: 227 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 285
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C SP ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 286 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 344
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 345 YFKMEMGKNMCGIATCASY 363
>gi|315075311|ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens]
gi|194376464|dbj|BAG62991.1| unnamed protein product [Homo sapiens]
Length = 281
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 5/206 (2%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
G N L D CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG
Sbjct: 76 GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135
Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++Q+I+ + D PY D C + AT + + + ED LK A+A
Sbjct: 136 TTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 195
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
GPVSV +DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW
Sbjct: 196 GPVSVGVDARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGH 252
Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
+G +GY+ M+ K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278
>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
Length = 363
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++E AY K LS+Q L+DC+ Y N GC GG +++++I
Sbjct: 163 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 222
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + CH + VN+T +ED LK A+ PVSVA
Sbjct: 223 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 281
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C SP ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 282 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 340
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 341 YFKMEMGKNMCGIATCASY 359
>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
Length = 328
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 119/202 (58%), Gaps = 3/202 (1%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
++ +Q CGSCW+F TTG +E + KL LS+Q L+DC+ + N+GC+GG +++
Sbjct: 124 AVKNQGSCGSCWTFSTTGCLESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAF 183
Query: 208 QWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I G+ T+DDY PY D C A A + VN+T E + A+A+ PV
Sbjct: 184 EYIKFNKGIMTEDDY-PYTAHDDTCKFKTDLAAAFVKDVVNITKYDEMGMVDAVARFNPV 242
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
S+A + + F Y GVY ++C+N+ D ++HAVLAVGYGE G PYW VKNSW + WG
Sbjct: 243 SLAYEVTS-DFMHYDGGVYTSKECHNTTDTVNHAVLAVGYGEEKGTPYWIVKNSWGSSWG 301
Query: 327 NQGYVLMSIKDNNCGVMTAPTY 348
+GY + N CG+ +Y
Sbjct: 302 MKGYFFIERGKNMCGLAACSSY 323
>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
Length = 344
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 99/267 (37%), Positives = 147/267 (55%), Gaps = 38/267 (14%)
Query: 104 GIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFG 162
G + L+ ++ EKV S + S+ A+ PV+ +Q CG CWSF
Sbjct: 91 GTKFDASSLIGTQEEKVHTNSSAASKDWRSEGAVTPVK----------NQGQCGGCWSFS 140
Query: 163 TTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDY 221
TTG+ EGA++ +L LS+Q LIDCS N+GCDGG ++++I+ +G+ T+ Y
Sbjct: 141 TTGSTEGAHFQSKGELVSLSEQNLIDCS--TENSGCDGGLMTYAFEYIINNNGIDTESSY 198
Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
PY ++ C + + AT++ + VT SE +L+ A+ + PVSVAIDAS +SF Y
Sbjct: 199 -PYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVNVN-PVSVAIDASHQSFQLYT 256
Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGY-------------------GELDGKPYWQVKNSWS 322
+G+YY+ +C S + LDH VLAVGY YW VKNSW
Sbjct: 257 SGIYYEPEC--SSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWG 314
Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTY 348
T WG +GY+LMS +DNNCG+ ++ ++
Sbjct: 315 TSWGIEGYILMSRNRDNNCGIASSASF 341
>gi|344250850|gb|EGW06954.1| Cathepsin R [Cricetulus griseus]
Length = 279
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 127/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY S+ Q+ C SCW+F GA+EG + K L LS Q L+DCS +GNNG
Sbjct: 64 VNWRKKGYVTSVKKQAYCNSCWAFAVNGAIEGQMFKKTGNLTRLSVQNLVDCSKPHGNNG 123
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CD G+ + +Y++++ +G + PY G++ C + A +TGFV++ P SE++L
Sbjct: 124 CDWGDPYIAYEYVLHNGGVEAEATYPYEGKEGPCRYNPKYSAANITGFVSL-PKSEESLM 182
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+S ID + F FY G++YD KC+N D ++H VL VGYG E DG
Sbjct: 183 AAVATIGPISAGIDIASDFFMFYKKGIFYDPKCHN--DTVNHVVLVVGYGFEGNETDGNN 240
Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
YW VKNS+ WG +GY+ ++ NN C + + Y
Sbjct: 241 YWLVKNSYGKKWGLRGYMKIAKDQNNHCAIASYAHY 276
>gi|291463491|pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463492|pdb|3IV2|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463519|pdb|3K24|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
gi|291463520|pdb|3K24|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
Length = 220
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 65 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|118388791|ref|XP_001027491.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89309261|gb|EAS07249.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 356
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 119/200 (59%), Gaps = 3/200 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYM-KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
DQ CGSCW+F TTGA+E Y + + + LS+Q LIDC+ + NNGC GG +++++
Sbjct: 144 DQQNCGSCWTFSTTGAIESHYAIFEDVEPTSLSEQQLIDCAGAFNNNGCSGGLPSQAFEY 203
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG-FVNVTPNSEDALKLALAKHGPVSV 268
I +G + ++ Y+ QD C + T A + G N+T ED LK A+ GPVS+
Sbjct: 204 IKYNGGISYENSYYYIAQDQECQFSPETVGARVRGGSFNITQGDEDQLKQAVGTVGPVSI 263
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
A F Y +GVY + C++SP ++HAVLAVGYG +G YW VKNSWS +WG++
Sbjct: 264 AFQV-MGDFKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGSENGVDYWYVKNSWSEFWGDE 322
Query: 329 GYVLMSIKDNNCGVMTAPTY 348
GY + N CGV T +Y
Sbjct: 323 GYFKIQRGVNMCGVATCASY 342
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 128/222 (57%), Gaps = 11/222 (4%)
Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCS 190
+A P + + N++ DQ CGSCW+F TG++EGA+ ++ K L LS+Q L+DCS
Sbjct: 123 EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS 182
Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
YGN GC+GG ++++I+ + G+ + Y PY G C + T T++G+ +V
Sbjct: 183 TSYGNAGCNGGLMDYAFEYIIANKGICAESAY-PYKGVGGLCQ-KSCTKVVTISGYKDVA 240
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
E +L A+ GPVSVAI+A Q F FY +GV+ +N LDH VLAVGYG
Sbjct: 241 SGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHN----LDHGVLAVGYGTT 296
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ YW VKNSW T WG GY+ M N CG+ P+Y T+
Sbjct: 297 GSQDYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYPTV 338
>gi|157835400|pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor,
Chagasin, In Complex With Human Cathepsin L
Length = 221
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 6 VDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 65
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 66 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 123
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 124 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 181
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
Group]
gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 362
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++E AY K LS+Q L+DC+ Y N GC GG +++++I
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + CH + VN+T +ED LK A+ PVSVA
Sbjct: 222 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 280
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C SP ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 281 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 339
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 340 YFKMEMGKNMCGIATCASY 358
>gi|45550332|gb|AAS67922.1| cathepsin L [Artemia franciscana]
Length = 226
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 118/196 (60%), Gaps = 3/196 (1%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
C SC +F TGA+E + K KL LS+Q LIDCS YGN GC GG +++++I +
Sbjct: 29 CASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIKDNK 88
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
G+ T++ Y Y ++ +C A GFVN+ ED LK A+A GPVS ID S
Sbjct: 89 GIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVIDVS 147
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
+ F FY GVYY+ C S + L+H VL +G G +G+ YW VKNSWS +WG++GY+ +
Sbjct: 148 HEGFQFYSKGVYYEPSCKTSFEHLNHEVLVIGCGSDNGEDYWLVKNSWSKHWGDEGYLKI 207
Query: 334 SI-KDNNCGVMTAPTY 348
+ + N+CGV TA Y
Sbjct: 208 ARNRKNHCGVATAALY 223
>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
Length = 330
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG +E A +K KL L++Q L+DC+ Y N+GC+GG +++++I
Sbjct: 129 NQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYI 188
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL + DY PY QD +C A A + VN+T E+ + A+A+ PVS+A
Sbjct: 189 KYNGGLEAEKDY-PYTAQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIA 247
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y GVY + C+++PD ++HAVLAVGYG +G YW VKNSW WG G
Sbjct: 248 FEVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNG 306
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y + N CG+ P+Y
Sbjct: 307 YFYIIRGKNMCGLAACPSY 325
>gi|327322928|gb|AEA48885.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/223 (41%), Positives = 130/223 (58%), Gaps = 7/223 (3%)
Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S A+P V + KG S+ Q CGSCW+F GA+EG KL LS Q L+DC
Sbjct: 118 SGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDC 177
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
S YGN+GC+GG R++Q+++ + D PY GQ CH A + + ++
Sbjct: 178 SSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQCHYNPAYRAANCSRYSSLP 237
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
E ALK ALA GP+SVAIDA++ SF+FY +GVY D+ + ++H VLAVGYG L
Sbjct: 238 EGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTYTRN---VNHGVLAVGYGTL 294
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
+GK YW VKN W + +G++G++ M+ K++ CG+ Y M
Sbjct: 295 NGKDYWLVKNGWGSTFGDKGFIRMARNKNDQCGIALYGCYPIM 337
>gi|84660244|emb|CAI43319.1| silicatein alpha [Lubomirskia baicalensis]
gi|85677148|emb|CAI46306.1| silicatein alpha [Lubomirskia baicalensis]
gi|220675708|emb|CAP69653.1| silcatein [Lubomirskia baicalensis]
Length = 326
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 126/199 (63%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+ ++F TGA+EGA + + K LS+Q +IDCS YGN+GC GG+ + ++++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 187
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T+ Y + G+ + C N T+ A+ TG V++ SE L A+A GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAV 246
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA+ +F FY +GV+ C+++ L+HA+L GYG +GK YW VKNSWS WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGY 304
Query: 331 VLM-SIKDNNCGVMTAPTY 348
+LM K N CG+ + Y
Sbjct: 305 ILMVRNKYNQCGIASDALY 323
>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
Length = 334
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 122/206 (59%), Gaps = 6/206 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGSCW+F GA+EG +L LS Q L+DCS YGN+GC+GG R++
Sbjct: 133 SVKKQGSCGSCWAFSAAGALEGQLAKTTGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAF 192
Query: 208 QWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
Q+++ G+ ++ Y PY GQ CH A + + +T E L+ A+A GP+
Sbjct: 193 QYVIDNQGIDSEASY-PYRGQVQQCHYNPAFRAANCSQYRFLTQGDEGNLQAAVASIGPI 251
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
SVAIDA Q F FY +GVY D C+ + ++HAVLAVGYG L+G+ YW VKNSW +G
Sbjct: 252 SVAIDAKQPKFYFYKSGVYDDPSCSQT---INHAVLAVGYGTLNGQDYWLVKNSWGVKFG 308
Query: 327 NQGYVLM-SIKDNNCGVMTAPTYVTM 351
++GY+ M K++ CG+ + M
Sbjct: 309 DKGYIRMVRNKNDQCGIAQFACFPIM 334
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 123/216 (56%), Gaps = 16/216 (7%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG+ + +Q CGSCW+F TGA+EG + K KL LS+Q L+DCS GN G
Sbjct: 242 VDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGNLG 301
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q+I + GL +++ Y PY G D C A A TGF E AL
Sbjct: 302 CQGGLMDNAFQYIKDNGGLDSEESY-PYKGMDGTCQYKAEWAVANDTGF-------EKAL 353
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---ELDGKP 313
A+A GP+SVAIDA SF FY +G+YY+ C S + LDH VL VGYG
Sbjct: 354 MKAVASVGPISVAIDAGHASFQFYKDGIYYEPDC--SSENLDHGVLVVGYGVEKRNSNDK 411
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
YW +KNSW WG GYV + ++N+CGV +A +Y
Sbjct: 412 YWLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASY 447
>gi|449471885|ref|XP_004186123.1| PREDICTED: LOW QUALITY PROTEIN: pro-cathepsin H [Taeniopygia
guttata]
Length = 334
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 125/224 (55%), Gaps = 9/224 (4%)
Query: 132 ASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
A D+I R + + Q CGSCW+F TTG +E A + KL L++Q L+DC+
Sbjct: 108 AVPDSIDWRKKGNFVTPVKIQGACGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQ 167
Query: 192 GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCH------IANTTATATMTG 244
+ N+GC GG +++++I+ GL +D Y PY ++ C I A A +
Sbjct: 168 AFNNHGCSGGLPSQAFEYILYNRGLMGEDSY-PYRAKNGTCRFQPDNDIRVGKAIAFVKD 226
Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
+N+T ED + A+ +H PVS A + + F Y GVY + +C ++PD ++HAVLAV
Sbjct: 227 VINITQYDEDGMVEAVGRHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 285
Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
GYG+ DG PYW VKNSW WG QGY L+ N CG+ +Y
Sbjct: 286 GYGQEDGTPYWIVKNSWGRLWGMQGYFLIERGKNMCGLAACASY 329
>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 93/224 (41%), Positives = 131/224 (58%), Gaps = 7/224 (3%)
Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
+S DA+P + + KGY +S+ Q CGSCW+F + GA+EG KL LS Q L+D
Sbjct: 110 SSGDAVPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVD 169
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS YGN GC+GG ++Q+++ +G D PY G C +++ A T + V
Sbjct: 170 CSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGVQQQCSYSSSQRAANCTKYYFV 229
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
E+ALK A+A GP+SVAIDA++ F Y +GVY D C+ ++HAVL VGYG
Sbjct: 230 RQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYNDPTCSKR---VNHAVLVVGYGT 286
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
L G+ +W VKNSW T +G+ GY+ M+ NN CG+ + Y M
Sbjct: 287 LSGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACYPVM 330
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/227 (41%), Positives = 129/227 (56%), Gaps = 15/227 (6%)
Query: 135 DAIP---VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
DA P V + KG + +Q CGSCW+F TG +EG +Y+ KL LS+Q L+DCS
Sbjct: 150 DAAPPAEVDWRTKGAVTPVKNQGDCGSCWAFSATGGIEGQHYLATGKLVSLSEQQLVDCS 209
Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYL-GQDAY---CHIANTTATATMTGF 245
N+GCDGG +++++ +H G+ T+ Y PY+ G Y C A +TG+
Sbjct: 210 --SSNDGCDGGLMDLAFEYVKEHKGIDTEVHY-PYVSGNTGYARQCSFDPKYAAVNVTGY 266
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V++ E L+ A+ HGP+SV I+A SF Y +G+Y D +CN P LDH VL VG
Sbjct: 267 VDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAYESGIYSDHRCN--PHDLDHGVLVVG 324
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
YG +G PYW +KNSW WG GYV + NN CGV T +Y M
Sbjct: 325 YGVDNGVPYWLIKNSWGEDWGENGYVRILRNHNNLCGVATMASYPLM 371
>gi|395844675|ref|XP_003795081.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 128/219 (58%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F +TGA+EG + K KL LS+Q L+DCS GN+G
Sbjct: 118 VDWRKKGYVTPVKNQGQCGSCWAFSSTGALEGQMFRKTGKLISLSEQNLVDCSQRQGNHG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG ++ ++ ++G + PY+ +D C + A TGFVN+ P E AL
Sbjct: 178 CSGGLMNFAFNYVKENGGLDSEVSYPYVARDEKCKYKPEYSVANDTGFVNI-PTQEKALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+S+AIDAS S FY +G+YY+ C S LDH VL +GYG + D
Sbjct: 237 KAVAIIGPISIAIDASHISIQFYKSGIYYEPNC--SSKNLDHGVLLIGYGFEGTDSDDNK 294
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
+W +KNSW WG G + + K+N+CG+ +A +Y T+
Sbjct: 295 FWFIKNSWGIEWGLDGCIKIAKDKNNHCGIASAASYPTV 333
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/267 (37%), Positives = 147/267 (55%), Gaps = 38/267 (14%)
Query: 104 GIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFG 162
G + L+ ++ EKV S + S+ A+ PV+ +Q CG CWSF
Sbjct: 91 GTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVK----------NQGQCGGCWSFS 140
Query: 163 TTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDY 221
TTG+ EGA++ +L LS+Q LIDCS N+GCDGG ++++I+ +G+ T+ Y
Sbjct: 141 TTGSTEGAHFQSKGELVSLSEQNLIDCS--TENSGCDGGLMTYAFEYIINNNGIDTESSY 198
Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
PY ++ C + + AT++ + VT SE +L+ A+ + PVSVAIDAS +SF Y
Sbjct: 199 -PYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVNVN-PVSVAIDASHQSFQLYT 256
Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGY-------------------GELDGKPYWQVKNSWS 322
+G+YY+ +C S + LDH VLAVGY YW VKNSW
Sbjct: 257 SGIYYEPEC--SSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWG 314
Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTY 348
T WG +GY+LMS +DNNCG+ ++ ++
Sbjct: 315 TSWGIEGYILMSRNRDNNCGIASSASF 341
>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 112/201 (55%), Gaps = 4/201 (1%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F + GA+EG KL LS Q L+DCS YGN GC GG +++Q+++
Sbjct: 133 QGSCGSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSKYGNKGCHGGFMTKAFQYVI 192
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
+ D PY G C A + + + E LK ALA GP+SV ID
Sbjct: 193 DNQGIASDQSYPYKGVQQQCIYNPAQRAANCSRYSFLPEGDEGVLKEALATIGPISVGID 252
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
A++ SF+FY +GVY D C +HAVLAVGYG L G+ YW VKNSW WG+QGY+
Sbjct: 253 ATRPSFAFYRSGVYNDPTCTKK---TNHAVLAVGYGTLGGQDYWLVKNSWGLSWGDQGYI 309
Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
MS KDN CG+ Y M
Sbjct: 310 RMSRNKDNQCGIALYGCYPVM 330
>gi|337255596|gb|AEI61876.1| cathepsin K [Gadus morhua]
Length = 331
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 125/225 (55%), Gaps = 7/225 (3%)
Query: 130 NKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
+K K V Y KG S+ +Q CGSCW+F + GA+EG L LS Q L+D
Sbjct: 111 DKLEKLPKSVDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLAKTTGTLRDLSPQNLVD 170
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
C N+GC GG ++ ++M++G D+ PY+GQD C + A G+ +
Sbjct: 171 CV--TENSGCGGGYMTNAFSYVMQNGGIDSDESYPYVGQDQQCGFNVSGVAAECKGYKQI 228
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG- 307
E AL +AL K GPVSV IDA +F FY +GVYYD CN + ++HAVLAVG+G
Sbjct: 229 PVGDERALAVALFKAGPVSVGIDAGLGTFQFYQHGVYYDRNCN--AEDINHAVLAVGFGV 286
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
GK YW +KNSW WG++GY+LM+ N CG+ +Y M
Sbjct: 287 TAKGKKYWIIKNSWGEDWGHKGYILMARNRGNLCGIANMASYPIM 331
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 120/203 (59%), Gaps = 8/203 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F +G +EG ++K KL LS+Q L+DCS GN GC+GG ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 190
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
++G ++ PY +D C A A TGFV++ P E AL +A GP+SVA+
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKPVATVGPISVAM 249
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
DAS S FY +G+YY+ C S LDH VL VGYG + + YW VKNSW WG
Sbjct: 250 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 307
Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
GY+ + ++N+CG+ TA +Y
Sbjct: 308 MDGYIKIAKDRNNHCGLATAASY 330
>gi|27960480|gb|AAO27844.1|AF456460_1 cathepsin Q2 [Rattus norvegicus]
Length = 343
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/243 (37%), Positives = 135/243 (55%), Gaps = 19/243 (7%)
Query: 124 SLWVRYNKAS-------KDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYM 173
SLW R +S +DA+P V + +GY + + Q C SCW+F GA+EG +
Sbjct: 105 SLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWAFPVVGAIEGQMFK 164
Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI 233
K KL LS Q L+DCS GN GC GG + ++Q+++++G + PY G++ C
Sbjct: 165 KTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKEGLCRY 224
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
++A +T FV + P +ED L A+A GPV+ I S FY G+Y++ KCNN
Sbjct: 225 NPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFYKKGIYHEPKCNNY 283
Query: 294 PDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
++HAVL VGYG E DG YW ++NSW WG GY+ + ++N+CG+ T Y
Sbjct: 284 ---VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQY 340
Query: 349 VTM 351
T+
Sbjct: 341 PTV 343
>gi|170292465|pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|170292466|pdb|3BC3|B Chain B, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|261824911|pdb|3H8C|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
gi|261824912|pdb|3H8C|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
Length = 220
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A TGFV++ P E AL
Sbjct: 65 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 125/226 (55%), Gaps = 33/226 (14%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TG+ EGA+ + L LS+Q LIDCS YGN+GC+GG ++++I
Sbjct: 133 NQGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYI 192
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + G+ T+ Y PY+ + C + AT++ +V+VT SE AL+ K GPVSVA
Sbjct: 193 IANGGIDTEASY-PYVAKVQKCKYNPANSGATLSSYVDVTSGSESALQSQTVK-GPVSVA 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---------------------- 307
IDAS +SF Y +GVYY+ C S LDH VL VGYG
Sbjct: 251 IDASHQSFQLYDSGVYYEPAC--SSTNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSE 308
Query: 308 -----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
G +W+VKNSW WG GY+ M+ +DNNCG+ T +
Sbjct: 309 SSDDQATQGAQFWKVKNSWGPEWGLSGYIQMARNRDNNCGIATTAS 354
>gi|74765984|sp|Q24940.1|CATLL_FASHE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|497700|gb|AAA29136.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +YQ++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + P +VA+D + F Y +G+Y + C SP ++HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGARRPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
G YW VKNSW TYWG +GY+ M+ + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIAS 315
>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
Length = 335
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 124/214 (57%), Gaps = 9/214 (4%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KGY + Q C +CW+F TGA+E + KL LS Q L+DCS GNNGC GG+
Sbjct: 125 KGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGD 184
Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
+ ++Q+++ +G + PY G+D C + A +TGFV++ P SED L A+A
Sbjct: 185 TYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKAEITGFVSL-PQSEDILMAAVAT 243
Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVK 318
GP++ IDAS +SF Y G+Y++ C S D + H VL VGYG E DG YW +K
Sbjct: 244 IGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHGVLVVGYGFKGIETDGNHYWLIK 301
Query: 319 NSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
NSW WG +GY+ L K+N+CG+ + Y T+
Sbjct: 302 NSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 335
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 130/217 (59%), Gaps = 9/217 (4%)
Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
AIP V + KG + +Q CGSCW+F TTG++EGA+ K KL LS+Q L+DC
Sbjct: 113 AIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCD-- 170
Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
++GC GG ++++I ++ G+ T++ Y PY ++ C AT+ V++
Sbjct: 171 KKDHGCQGGLMTTAFKYIEENKGIDTEESY-PYKAKNGRCEFKKDDIGATVERHVSILTT 229
Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
+ALK A+A+ GP+SVA+DAS SF Y +G+Y + C S LDH VL VGYG+ DG
Sbjct: 230 DCEALKKAVAEIGPISVAMDASHSSFQLYKSGIYDPKIC--SSRKLDHGVLVVGYGKEDG 287
Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
+ YW VKNSW WG +GY ++ K N CG+ T+ Y
Sbjct: 288 EEYWLVKNSWGKNWGMEGYFKIASKKNLCGICTSACY 324
>gi|296168728|emb|CAQ54044.1| silicatein alpha 4 [Baikalospongia intermedia]
Length = 197
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 127/199 (63%), Gaps = 5/199 (2%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
CG+ ++F TGA+EGA + + K +LS+Q +IDCS YGN+GC GG+ + + ++++ +
Sbjct: 2 CGASYAFAATGALEGASALSNDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVIDNG 61
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
G+ T+ Y + G+ + C ++ + A+ TG +++ SE L A+A GPV+VA+DA+
Sbjct: 62 GIDTESSY-SFQGKQSSCQYSSKNSGASATGVISIASGSETDLFAAVATVGPVAVAVDAN 120
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
+F FY +GV+ C+N+ L+HA+L GYG +GK YW VKNSWS WG+ GY++M
Sbjct: 121 TNAFRFYQSGVFDSSSCSNTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGYIMM 178
Query: 334 -SIKDNNCGVMTAPTYVTM 351
K N CG+ T Y T+
Sbjct: 179 VRNKYNQCGIATDALYPTL 197
>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
Length = 333
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 139/252 (55%), Gaps = 12/252 (4%)
Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTT 164
E++R V + G K+ K W + IP V + KGY + DQ C CW+F T
Sbjct: 86 EEFRQVMN-GLKIQKRKKWKVFQAPFFVEIPSSVDWREKGYVTPVKDQGYCLCCWAFSAT 144
Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPY 224
GA+EG + K KL LS+Q L+DCS GN G GG ++Q++ +G ++ PY
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGYSGGLIDDAFQYVKDNGGLDSEESYPY 204
Query: 225 LGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGV 284
Q C + A +T + ++ P+ E+ L + LA GP+S AIDAS +F FY G+
Sbjct: 205 HAQGDSCKYRPENSVANVTDYWDI-PSKENELMITLAAVGPISAAIDASLDTFRFYKEGI 263
Query: 285 YYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNN 339
YYD C S + +DH VL VGYG E + K YW +KNSW T WG GY+ M+ +DN+
Sbjct: 264 YYDPSC--SSEDVDHGVLVVGYGADGTETENKKYWIIKNSWGTDWGMDGYIKMAKDRDNH 321
Query: 340 CGVMTAPTYVTM 351
CG+ + ++ T+
Sbjct: 322 CGIASLASFPTV 333
>gi|346574373|gb|AEO36958.1| silicatein-alpha 1 [Baikalospongia fungiformis]
Length = 324
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 126/199 (63%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+ ++F TGA+EGA + + K LS+Q +IDCS YGN+GC GG+ + ++++++
Sbjct: 126 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 185
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T+ Y + G+ + C N T+ A+ TG V++ SE L A+A GPV+VA+
Sbjct: 186 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAV 244
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA+ +F FY +GV+ C+++ L+HA+L GYG +GK YW +KNSWS WG+ GY
Sbjct: 245 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLIKNSWSKNWGDSGY 302
Query: 331 VLM-SIKDNNCGVMTAPTY 348
+LM K N CG+ + Y
Sbjct: 303 ILMVRNKYNQCGIASDALY 321
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 122/208 (58%), Gaps = 10/208 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TGA+EG ++ L LS+Q L+DCS YGNN C+GG +++++
Sbjct: 181 NQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYV 240
Query: 211 M-KHGLPTQDDYGPYLG-----QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
+G+ T+ Y PY+ + C A +TG++++ LK A+ +G
Sbjct: 241 KDSNGIDTEASY-PYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYG 299
Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
P+SVAI+A SF Y +GVY D++C S D LDH VL VGYGE +G PYW +KNSW +
Sbjct: 300 PISVAINAGLPSFMSYKSGVYSDDQC--SSDDLDHGVLLVGYGEENGIPYWLIKNSWGPH 357
Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
WG GYV + NN CGV + +Y M
Sbjct: 358 WGENGYVKILRDHNNLCGVASMASYPLM 385
>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
Length = 333
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC+GG +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYI 191
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY + C A A + N+T N E+A+ A+A + PVS A
Sbjct: 192 LYNKGIMGEDSY-PYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW ++WG G
Sbjct: 251 FEVTE-DFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNG 309
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y + N CG+ +Y
Sbjct: 310 YFYIERGKNMCGLAACASY 328
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 119/199 (59%), Gaps = 2/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG +E +++K +L LS+Q L+DC+ + NNGC+GG +++++I
Sbjct: 135 NQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYI 194
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G ++ PY D CH + +AT++ VN+T E L A+ GPVS+A
Sbjct: 195 HYNGGLDSEESYPYRAHDEKCHFVPSEVSATVSNVVNITSKDEMQLYNAVGTVGPVSIAY 254
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQG 329
D S F FY GVY ++C P+ ++HAVLAVGY + G+ YW VKNSW T +G G
Sbjct: 255 DVS-ADFRFYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESGEDYWIVKNSWGTKFGING 313
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y ++ +N CG+ +Y
Sbjct: 314 YFWIARGENMCGLADCASY 332
>gi|146386356|gb|ABQ23966.1| cathepsin H [Oryctolagus cuniculus]
Length = 215
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC+GG +++++I
Sbjct: 15 NQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYI 74
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY + C A A + N+T N E+A+ A+A + PVS A
Sbjct: 75 LYNKGIMGEDSY-PYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 133
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ ++ F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW ++WG G
Sbjct: 134 FEVTE-DFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNG 192
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y + N CG+ +Y
Sbjct: 193 YFYIERGKNMCGLAACASY 211
>gi|222641669|gb|EEE69801.1| hypothetical protein OsJ_29533 [Oryza sativa Japonica Group]
Length = 314
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TTG++E AY K LS+Q L+DC+ Y N GC GG +++++I
Sbjct: 114 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 173
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + CH + VN+T +ED LK A+ PVSVA
Sbjct: 174 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 232
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C SP ++HAVLAVGYG +G PYW +KNSW WG+ G
Sbjct: 233 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 291
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CG+ T +Y
Sbjct: 292 YFKMEMGKNMCGIATCASY 310
>gi|5822035|pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L
Length = 316
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 101 VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 160
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A GFV++ P E AL
Sbjct: 161 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 218
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 219 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 276
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 277 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316
>gi|348504496|ref|XP_003439797.1| PREDICTED: digestive cysteine proteinase 2-like [Oreochromis
niloticus]
Length = 352
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 130/236 (55%), Gaps = 8/236 (3%)
Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
G++VS L R + D V Y G+ + DQ CGSCW+F TTGA+E Y K
Sbjct: 119 RGKEVSARRL--RNSARHLDGFAVDYRSMGFVTEVKDQGFCGSCWAFSTTGAIEAQLYKK 176
Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHI 233
+L LS+Q L+DCS +G GC G +Y +++ +GL + + Y PY D C
Sbjct: 177 TGQLISLSEQNLVDCSKSFGTYGCSGAWMANAYDYVVSNGLESSNTY-PYTSVDTQPCFY 235
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
++ A A + + + E A+ ALA GP++V IDA SF FY +G+Y + CN
Sbjct: 236 DSSLAVAHIRDYRFIPRGDEQAMADALATIGPITVTIDADHASFLFYSSGIYDEPNCN-- 293
Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
P+ L+HAVL VGYG +G+ YW +KNSW T WG GY+ ++ N CG+ + Y
Sbjct: 294 PNNLNHAVLLVGYGSQEGQDYWIIKNSWGTGWGEGGYMRIVRNGQNACGLASYALY 349
>gi|211909240|gb|ACJ12893.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+AS A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EASNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSGPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNRQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ +E LK + GP +VA+D + F Y +G+Y + C SP GL+HAVLAVGYG
Sbjct: 221 LHSGNEAGLKSLVGSEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
G YW VKNSW WG +GY+ M+ + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 315
>gi|440297812|gb|ELP90453.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
Length = 304
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/238 (37%), Positives = 133/238 (55%), Gaps = 14/238 (5%)
Query: 123 YSLWVRYN-------KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYY 172
+SL R+N K+ K P +++ N + +Q CG C++FG+ ++EG
Sbjct: 61 FSLLKRFNSDPPTNVKSLKMGFPASLDLRADNKVTPVRNQGDCGGCYAFGSLSSLEGRLL 120
Query: 173 M--KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY 230
M + K LS++ ++ CS GNNGC GG RS+++I K+G+ + DY PY D
Sbjct: 121 MIGGNYKTLDLSEEQIVQCSLIEGNNGCWGGLGKRSFEYIKKNGVVQESDY-PYTAVDGK 179
Query: 231 CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC 290
C + T AT+ G++NV + LK AL G V V+IDA Y +GVY D+ C
Sbjct: 180 CLVDLTKKFATIGGYINVPSKDVNELKSALV-GGVVDVSIDARTVKLQLYKSGVYTDKNC 238
Query: 291 NNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
+ L+H V AVGYG +G+ YW VKNSW WG++G++LM+I++N CGV T Y
Sbjct: 239 KSEMSDLNHEVSAVGYGNENGQEYWIVKNSWGEEWGDKGFILMAIENNTCGVATDALY 296
>gi|313213752|emb|CBY40632.1| unnamed protein product [Oikopleura dioica]
Length = 440
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F T ++E + + L LS+Q L+DCS +GN GC GG + + +I
Sbjct: 241 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 300
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G+ T+ Y PY QD C ++T N+ E AL A+ GP+SVA
Sbjct: 301 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 359
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ C S LDH V AVGYG G ++ VKNSW+ WG+ G
Sbjct: 360 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNG 417
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y++MS K+NNCG+ T+ +Y
Sbjct: 418 YIMMSRNKNNNCGIATSASY 437
>gi|148669360|gb|EDL01307.1| mCG12220, isoform CRA_b [Mus musculus]
Length = 335
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/216 (41%), Positives = 127/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ ++ +GY + Q C SCW+F TGA+EG + K +L LS Q L+DCS GN G
Sbjct: 120 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 179
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C G + + ++M++G + PY +D C + +TA +TGF P +EDAL
Sbjct: 180 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 238
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA SF FY G+YY+ C++S + H++L VGYG E DG+
Sbjct: 239 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFTGRESDGRK 296
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNS T WGN+GY+ +S K N+CG+ T Y
Sbjct: 297 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 332
>gi|385298943|gb|AFI60244.1| cysteine protease/senescence-enhanced 1, partial [Panicum virgatum]
Length = 282
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 118/200 (59%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC+ Y N GC+GG +++++I
Sbjct: 82 NQGHCGSCWTFSTTGALEAAYTQATGKPVSLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 141
Query: 211 MKH--GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
KH GL T++ Y PY G + C + + VN+T +E+ LK A+ PVSV
Sbjct: 142 -KHNGGLDTEESY-PYKGVNGLCQFKASNVGVKVLDSVNITLGAENELKDAVGLVRPVSV 199
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
A + F Y +GVY + C +P ++HAVLAVGYG +G PYW +KNSW WG++
Sbjct: 200 AFEVIN-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDE 258
Query: 329 GYVLMSIKDNNCGVMTAPTY 348
GY M + N CGV T +Y
Sbjct: 259 GYFKMEMGKNMCGVATCASY 278
>gi|308322193|gb|ADO28234.1| cathepsin K [Ictalurus furcatus]
Length = 331
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 125/228 (54%), Gaps = 9/228 (3%)
Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
Y S D +P + Y GY + +Q CGSCW+F + GA+EG KL LS Q
Sbjct: 108 YYPDSLDKLPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQLMKTTGKLVNLSPQN 167
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DC N+GC GG ++ ++ +G ++ PY+GQD C + A F
Sbjct: 168 LVDCV--TENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQCAYNKSGKAAECRRF 225
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V SE AL A+AK GPVSV IDA Q +F FY GVYYD C+ + ++HAVLAVG
Sbjct: 226 KEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCDK--ESINHAVLAVG 283
Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
YG GK +W VKNSW WG +GYVLM+ NN CG+ ++ M
Sbjct: 284 YGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 117/199 (58%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYSREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGRNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
Length = 326
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 124/225 (55%), Gaps = 8/225 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSRPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNKQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + GP +VA+D + F Y G+Y + C SP GL+HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
G YW VKNSW WG +GY+ M+ + N CG+ + + + +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLLMV 322
>gi|114559420|ref|XP_001171183.1| PREDICTED: cathepsin S isoform 1 [Pan troglodytes]
gi|397492868|ref|XP_003817342.1| PREDICTED: cathepsin S isoform 2 [Pan paniscus]
Length = 281
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 5/206 (2%)
Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
G N L D CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG
Sbjct: 76 GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135
Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
++Q+I+ + D PY D C + AT + + + ED LK A+A
Sbjct: 136 TTAFQYIIDNKGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 195
Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
GPVSV +DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW
Sbjct: 196 GPVSVGVDALHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGH 252
Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
+G +GY+ M+ K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278
>gi|31981229|ref|NP_071721.2| cathepsin M precursor [Mus musculus]
gi|12837968|dbj|BAB24022.1| unnamed protein product [Mus musculus]
gi|12838184|dbj|BAB24116.1| unnamed protein product [Mus musculus]
gi|16445013|gb|AAK00506.1| cathepsin M precursor [Mus musculus]
gi|148669359|gb|EDL01306.1| mCG12220, isoform CRA_a [Mus musculus]
gi|148921940|gb|AAI46421.1| Cathepsin M [synthetic construct]
gi|151555373|gb|AAI48858.1| Cathepsin M [synthetic construct]
Length = 333
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/216 (41%), Positives = 127/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ ++ +GY + Q C SCW+F TGA+EG + K +L LS Q L+DCS GN G
Sbjct: 118 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C G + + ++M++G + PY +D C + +TA +TGF P +EDAL
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA SF FY G+YY+ C++S + H++L VGYG E DG+
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFTGRESDGRK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNS T WGN+GY+ +S K N+CG+ T Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/215 (41%), Positives = 120/215 (55%), Gaps = 5/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F + GA+EG KL LS Q L+DCS YGN G
Sbjct: 122 VDWRQKGYVTQVKNQGSCGSCWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKG 181
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG ++Q+++ + D PY G CH + +A T + + E LK
Sbjct: 182 CNGGFMSEAFQYVIDNKGIDSDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLK 241
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GP+SVAIDA++ SF + +GVY D C ++HAVL VGYG LDG+ YW V
Sbjct: 242 QAVAMIGPISVAIDATRPSFILWRSGVYNDLTCTQK---INHAVLVVGYGTLDGQDYWLV 298
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
KNSW T +G GY+ MS NN CG+ Y M
Sbjct: 299 KNSWGTRFGENGYIRMSRNRNNQCGIALYGCYPIM 333
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC + N GC+GG +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGLAFNNFGCNGGLPSQAFEYI 219
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + N + VN+T +ED LK A+ PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGISKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y +GVY + C +P ++HAVLAVGYG DG PYW +KNSW WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356
>gi|208972992|dbj|BAG74345.1| silicatein-M4 [Ephydatia fluviatilis]
gi|296168739|emb|CAQ54047.1| silicatein alpha 3 [Ephydatia muelleri]
Length = 327
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 118/202 (58%), Gaps = 3/202 (1%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGS ++F GA+EGA + KL LS+Q +IDCS YGN+GC GG+ + ++
Sbjct: 125 SVQSQGSCGSSYAFAAAGALEGANALAADKLVALSEQNIIDCSVAYGNHGCSGGDVYTAF 184
Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
++++ +G D PY G+ C + A TG V +T SE L A+A GP++
Sbjct: 185 KYVVDNGGIDTDSSYPYKGKQYSCQYNSKNLGAVATGVVKITSGSETDLLSAVASVGPIA 244
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
VA+DA+ SF FY +GV+ C+ + L+HA+L GYG +GK YW VKNSW T WG
Sbjct: 245 VAVDATVNSFMFYQSGVFDSSSCSTTK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWGE 302
Query: 328 QGYVLM-SIKDNNCGVMTAPTY 348
GY+ M K N CG+ + Y
Sbjct: 303 SGYIKMVRNKYNQCGIASDALY 324
>gi|318065049|ref|NP_001187379.1| cathepsin K precursor [Ictalurus punctatus]
gi|308322859|gb|ADO28567.1| cathepsin K [Ictalurus punctatus]
Length = 331
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 125/228 (54%), Gaps = 9/228 (3%)
Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
Y S D +P + Y GY + +Q CGSCW+F + GA+EG KL LS Q
Sbjct: 108 YYPDSLDKLPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQLMKTTGKLVNLSPQN 167
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DC N+GC GG ++ ++ +G ++ PY+GQD C + A F
Sbjct: 168 LVDCV--TENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQCAYNKSGKAAECRRF 225
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
V SE AL A+AK GPVSV IDA Q +F FY GVYYD C+ + ++HAVLAVG
Sbjct: 226 KEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCDK--ESINHAVLAVG 283
Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
YG GK +W VKNSW WG +GYVLM+ NN CG+ ++ M
Sbjct: 284 YGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331
>gi|355692920|gb|EHH27523.1| Cathepsin H, partial [Macaca mulatta]
Length = 305
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 104 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 163
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T +E+A+ A+A + PVS A
Sbjct: 164 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYAEEAMVEAVALYNPVSFA 222
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 223 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 281
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 282 YFLIERGKNMCGLAACASY 300
>gi|313246319|emb|CBY35240.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F T ++E + + L LS+Q L+DCS +GN GC GG + + +I
Sbjct: 127 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 186
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G+ T+ Y PY QD C ++T N+ E AL A+ GP+SVA
Sbjct: 187 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 245
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ C S LDH V AVGYG G ++ VKNSW+ WG+ G
Sbjct: 246 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNG 303
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y++MS K+NNCG+ T+ +Y
Sbjct: 304 YIMMSRNKNNNCGIATSASY 323
>gi|313235127|emb|CBY24999.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F T ++E + + L LS+Q L+DCS +GN GC GG + + +I
Sbjct: 127 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 186
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+G+ T+ Y PY QD C ++T N+ E AL A+ GP+SVA
Sbjct: 187 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 245
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ C S LDH V AVGYG +G ++ VKNSW+ WG+ G
Sbjct: 246 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSNGNDFFIVKNSWAATWGDNG 303
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y++MS K NNCG+ T+ +Y
Sbjct: 304 YIMMSRNKSNNCGIATSASY 323
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK +A GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYY+ C + ++H VL VGYG L+GK YW VKNSW +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK +A GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYY+ C + ++H VL VGYG L+GK YW VKNSW +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|297297049|ref|XP_002804951.1| PREDICTED: cathepsin H [Macaca mulatta]
Length = 323
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 122 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 181
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T E+A+ A+A + PVS A
Sbjct: 182 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 240
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 241 FEVTQ-DFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 299
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK +A GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVYY+ C + ++H VL VGYG L+GK YW VKNSW +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 124/200 (62%), Gaps = 8/200 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCWSF TGA+E ++ K KL LS+Q L+DCS YGN+GC GG ++ +I
Sbjct: 152 NQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYI 211
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ G+ T+ Y PY +D C AT++ + V P E+ L ++ GP+S+A
Sbjct: 212 KENGGIDTEQSY-PYTAKDGRCAYKPGNKAATVSQVIMV-PRGENQLAAKVSSVGPISIA 269
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ S K F FY +GVY + +C +S L+HA+LAVGYG + GK +W VKNSW T WG+QG
Sbjct: 270 AEVSHK-FQFYHSGVYDEPQCGHS---LNHAMLAVGYGSMGGKNFWLVKNSWGTGWGDQG 325
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ K+N CG+ +Y
Sbjct: 326 YIRMAKDKNNQCGIALMASY 345
>gi|405958752|gb|EKC24846.1| Cathepsin L1 [Crassostrea gigas]
Length = 290
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 123/199 (61%), Gaps = 4/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TGA+EG K KL VLS Q ++DCS +GN+GC+GG ++++I
Sbjct: 91 NQFACGSCWAFTATGALEGQVRNKTGKLIVLSVQQMMDCSEKWGNHGCEGGLMDAAFKYI 150
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
G + PY + C + A + G+ ++ P SE++L +A+A GP+S A+
Sbjct: 151 HDVGGIESNASYPYKPAEEKCKFNKSAVVAKVKGYKDL-PKSEESLMVAVATVGPISAAL 209
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF Y +GVY + C++ +DH+++ VGYG +DGK YW KNSW T WG++GY
Sbjct: 210 DASHSSFQLYKSGVYDEPNCSSGQ--VDHSLVVVGYGLMDGKKYWIAKNSWGTSWGDKGY 267
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+L+S K+N CG+ +Y
Sbjct: 268 ILLSKDKNNQCGIANTLSY 286
>gi|301609080|ref|XP_002934105.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 334
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/198 (41%), Positives = 118/198 (59%), Gaps = 4/198 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSC++FG GA+E + K +L S Q L+DCS+ GNNGC GG S+ ++
Sbjct: 137 QGSCGSCYAFGAVGALECQWKRKTGRLVTFSPQELVDCSYTVGNNGCKGGGSNASFTYMK 196
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
K+G+ + Y PY G++A C + + F + +E LK A+ GPV VAID
Sbjct: 197 KYGVMEESAY-PYTGKEAQCKKEKPSNVGVVKQFYRLPTGNEVLLKKAVGTVGPVYVAID 255
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
+S++ F Y +GVYYD C+ + L HAVL VGY + +G+ YW VKNSW Y+G++GY+
Sbjct: 256 SSRQGFRMYKSGVYYDPYCSTT--SLSHAVLIVGYSKENGQYYWLVKNSWGEYFGDKGYI 313
Query: 332 LMSIKDNN-CGVMTAPTY 348
M+ K NN CG+ T Y
Sbjct: 314 KMARKRNNHCGIATRAAY 331
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 129/218 (59%), Gaps = 7/218 (3%)
Query: 137 IPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
+ V + ++G S+ Q CGSCW+F GA+EG KL LS Q L+DCS YGN
Sbjct: 124 VSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLKYGN 183
Query: 196 NGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
GC+GG R++Q+++ + G+ ++ Y PY GQ C + A + + + E
Sbjct: 184 KGCNGGFMDRAFQYVIDNKGIDSEASY-PYRGQLQQCSYNPSYRAANCSRYSFLPEGDEG 242
Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
ALK ALA GP+SVAIDA++ +F+FY +GVY D C ++H VLAVGYG G+ Y
Sbjct: 243 ALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQR---VNHGVLAVGYGTESGQDY 299
Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
W VKNSW T +G++GY+ MS K++ CG+ +Y M
Sbjct: 300 WLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYPIM 337
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 93/222 (41%), Positives = 125/222 (56%), Gaps = 12/222 (5%)
Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
AIP V + +KG + DQ CGSCW+F T A+EG + KL LS+Q L+DC
Sbjct: 123 AIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTK 182
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
+ GC+GG +++I+K+G T + PY D C+ A T A +TG+ V NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCNTATTAPVAKITGYEKVPVNS 242
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
E +L A+A P+SV+IDAS SF FY +G+Y E C LDH V AVGYG +G
Sbjct: 243 EISLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297
Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
YW VKNSW T WG +GY+ M + K+ CG+ +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCGIAMDSSYPT 339
>gi|440291600|gb|ELP84863.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
Length = 315
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 127/222 (57%), Gaps = 8/222 (3%)
Query: 131 KASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL--SQQALID 188
K+ +++ +R + K + DQ+ CGSC+SFG ++EG + L S++ ++
Sbjct: 95 KSFPESLDLRQQGK-VTKVKDQNPCGSCYSFGALASLEGRLLFVGGNVDTLDLSEEQIVR 153
Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
CS NNGC+GG Y +I +G+ + DY PY D C T AT+ G+ +
Sbjct: 154 CS---PNNGCNGGTGSVVYDYIKGNGILQEKDY-PYTASDGKCEADLTYIFATVEGYKYI 209
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
T DALK AL G V V+IDAS FS Y +GVY+D KC S L+H + AVGYG+
Sbjct: 210 TRKDNDALKSALVD-GVVDVSIDASSPKFSLYKSGVYHDTKCKTSLLSLNHEIAAVGYGK 268
Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
L+G+ YW VKNSW T WG GY+LM+++ N CGV + P T
Sbjct: 269 LNGEEYWIVKNSWGTNWGENGYILMAMEGNTCGVTSEPIIPT 310
>gi|60649669|gb|AAH90560.1| LOC594890 protein, partial [Xenopus (Silurana) tropicalis]
Length = 355
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)
Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
+ SK + P + + N S+ DQ C + W+F + GA+E MK + KL LS Q
Sbjct: 133 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 191
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DCS YGNNGC GG S+++I+ +G+ + +Y PY G+D C + T +
Sbjct: 192 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 250
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
+ E LK + GPVSVAIDAS+K+F Y NGVYYD C++S DH+VL VG
Sbjct: 251 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 308
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
YG DG YW VKNSW T +G++GY+ M+ NNCG+
Sbjct: 309 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 346
>gi|62945374|ref|NP_001017509.1| uncharacterized protein LOC498688 precursor [Rattus norvegicus]
gi|60552853|gb|AAH91563.1| Similar to cathepsin R [Rattus norvegicus]
gi|149039732|gb|EDL93848.1| similar to cathepsin R [Rattus norvegicus]
Length = 334
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/219 (39%), Positives = 127/219 (57%), Gaps = 9/219 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + Q C +CW+F TGA+E + KL LS Q L+DCS GNNG
Sbjct: 119 VDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNG 178
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG+ + ++Q+++ +G + PY G+D C ++A +TGFV++ P SED L
Sbjct: 179 CLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKNSSAEITGFVSL-PESEDILM 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
+A+A GP+S IDAS +SF FY G+Y++ C S + + H VL VGYG + G
Sbjct: 238 VAVATIGPISAGIDASHESFKFYKKGIYHEPNC--SSNSVTHGVLVVGYGFKGNDTGGDH 295
Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
YW +KNSW WG +GY+ + K+N+C + + Y T+
Sbjct: 296 YWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYPTI 334
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 127/214 (59%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG+ ++ +Q CGSCW+F +EG ++ L LS+Q L+DCS GN G
Sbjct: 112 VDWRTKGFVTAIKNQGQCGSCWAFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQG 171
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN-SEDA 255
C+GG ++Q+++K+ G+ T+ Y PY D C +T +GF ++ P+ SE A
Sbjct: 172 CNGGLMDNAFQYVIKNGGIDTEASY-PYKAVDQKCKFNAANVGSTCSGFSDILPHKSEAA 230
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L++A+A GP+SVAIDAS SF Y +GVY + C S LDH V AVGY G YW
Sbjct: 231 LQVAVAVVGPISVAIDASHTSFQLYKSGVYSESAC--SQTSLDHGVTAVGYDSSSGVAYW 288
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T WG GY+ MS K+N CG+ TA +Y
Sbjct: 289 IVKNSWGTTWGQAGYIWMSRNKNNQCGIATAASY 322
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/230 (43%), Positives = 128/230 (55%), Gaps = 14/230 (6%)
Query: 128 RYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
RY A V + KG ++ DQ CGSCW+F T AVEG ++ KL LS+Q L
Sbjct: 124 RYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQEL 183
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATA-TMTGF 245
+DC N GCDGG ++Q+I K+G+ T+ +Y PY G+ C +A A A T+ G+
Sbjct: 184 MDCD-NVNNQGCDGGLMDYAFQFIHKNGITTESNY-PYQGEQGSCDLAKEKAHAVTIDGY 241
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
+V N E AL+ A+A PVSVAIDAS F FY GV+ E C+ LDH V AVG
Sbjct: 242 EDVPANDESALQKAVAGQ-PVSVAIDASGNDFQFYSEGVFTGE-CSTD---LDHGVAAVG 296
Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
YG DG YW VKNSW WG +GY+ M S + CG+ +Y T
Sbjct: 297 YGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEGQCGIAMQASYPT 346
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 128/222 (57%), Gaps = 11/222 (4%)
Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCS 190
+A P + + N++ DQ CGSCW+F TG++EGA+ ++ K L LS+Q L+DCS
Sbjct: 123 EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS 182
Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
YG+ GC+GG ++++I+ + G+ + Y PY G C + T T++G+ +V
Sbjct: 183 TSYGDAGCNGGLMDYAFEYIIANKGICAESAY-PYKGVGGLCQ-KSCTKVVTISGYKDVA 240
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
E +L A+ GPVSVAI+A Q F FY +GV+ +N LDH VLAVGYG
Sbjct: 241 SGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHN----LDHGVLAVGYGTT 296
Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
+ YW VKNSW T WG GY+ M N CG+ P+Y T+
Sbjct: 297 GSQDYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYPTV 338
>gi|387915678|gb|AFK11448.1| cathepsin L1 [Callorhinchus milii]
Length = 336
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/213 (41%), Positives = 128/213 (60%), Gaps = 6/213 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F +TGA+EG + K KL LS+Q L+DCS GN+G
Sbjct: 124 VDWRDKGYVTPVKNQGACGSCWAFSSTGALEGQTFKKTGKLIPLSEQNLVDCSQKQGNHG 183
Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG R++ +I + +G+ T+ Y PY ++ C+ AT G+ E AL
Sbjct: 184 CNGGMMDRAFTYIQQNNGIDTEASY-PYTAKEHPCNYDPRHNAATCHGYRYSEQYDEMAL 242
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+A GP+SVAIDA SF FY +G+Y + +C + ++HAVL VGY G YW
Sbjct: 243 AETVATIGPISVAIDAKHISFQFYKSGIYQEPRCQSY--NINHAVLVVGYNSQGGNNYWI 300
Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTY 348
VKNS+ + WGN+GY+ M K+N+CG+ + PTY
Sbjct: 301 VKNSFGSRWGNKGYIWMPKDKNNHCGIASYPTY 333
>gi|28932708|gb|AAO60048.1| midgut cysteine proteinase 5 [Rhipicephalus appendiculatus]
Length = 329
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 118/200 (59%), Gaps = 7/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F TG++EG + +K +L LS+Q L+DCS +GNNGC+GG ++ +I
Sbjct: 132 DQGQCGSCWAFSATGSLEGQHLLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFNYI 191
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ T++ Y PY D C AT TGFV++ ED LK A P +
Sbjct: 192 KANDGIDTEEGY-PYEAVDGECRFKKEDVGATDTGFVDIPGGIEDDLKKASFCWPPPWLW 250
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
S SF Y GVY + C S + LDH VL VGYG GK YW VKNSW+ WG+QG
Sbjct: 251 --RSPSSFQLYSEGVYDESDC--SSEQLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 306
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+LMS K+N CG+ +A +Y
Sbjct: 307 YILMSRDKNNQCGIASAASY 326
>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 121/208 (58%), Gaps = 3/208 (1%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
E G + DQ CGSCW+F T G +E + +K+++ LS+Q L+DC+ Y N GC+GG
Sbjct: 143 EHNGVTPVKDQGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAGAYDNYGCNGG 202
Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVNVTPNSEDALKLAL 260
++Q+I +G + PY +D C I + + + G VN+T SED L +A+
Sbjct: 203 LPSHAFQYISDNGGIATEAAYPYFAKDRPCTIQQSQKSVGVVGGSVNLT-KSEDELAIAI 261
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
+HGPVS+A + F Y +GVY + C N PD ++HAV+AVG+G +G YW VKNS
Sbjct: 262 FQHGPVSIAYEVI-DDFMDYHSGVYTTKDCKNGPDDVNHAVVAVGFGTENGVDYWLVKNS 320
Query: 321 WSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
WST WG+ GY + N CG+ +Y
Sbjct: 321 WSTKWGDNGYFKIQRGVNMCGINNCNSY 348
>gi|208972990|dbj|BAG74344.1| silicatein-M3 [Ephydatia fluviatilis]
Length = 326
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 127/202 (62%), Gaps = 5/202 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CG+ ++F TGA+EGA + + K +LS+Q +IDCS YGN+GC GG+ + + ++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVI 187
Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ G+ T+ Y + G+ + C ++ + A+ TG +++T SE L A+A GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFQGKQSSCQYSSKNSGASATGVISITSGSETDLLAAVATVGPVAVAV 246
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA+ +F FY +GV+ C+N+ +HA+L GYG +GK YW VKNSWS WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSNTKP--NHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGY 304
Query: 331 VLM-SIKDNNCGVMTAPTYVTM 351
+LM K N C + T Y T+
Sbjct: 305 ILMVRNKYNQCAIATDALYPTL 326
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 125/214 (58%), Gaps = 7/214 (3%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCWSF TG++EG ++ KL LS+Q LIDCS GN+G
Sbjct: 118 VDWRDKGYVTPVKNQGHCGSCWSFSATGSLEGQHFKSTGKLVSLSEQNLIDCSKKEGNHG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
C GG ++++I K+ G+ T+ Y PY +D C AT G V++ SE A
Sbjct: 178 CKGGLMDFAFEYIQKNDGIDTEQSY-PYTAKDGIECRFKKADVGATDKGKVDLPRQSEKA 236
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L+ A+A GP+SVA+DA +SF Y G+Y + C+++ LDH VLAVGYG YW
Sbjct: 237 LQEAVATVGPISVAMDAGHRSFQLYKRGIYTEPMCSSTK--LDHGVLAVGYGSEGEGDYW 294
Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
VKNSW WG +G+ +++ N CG+ T +Y
Sbjct: 295 LVKNSWGATWGMEGFFMLARNHRNECGIATQASY 328
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 121/205 (59%), Gaps = 10/205 (4%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TGA+EG ++ L LS+Q L+DCS YGNN C+GG +++++
Sbjct: 169 NQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYV 228
Query: 211 M-KHGLPTQDDYGPYLG-----QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
+G+ T+ Y PY+ + C A +TG++++ LK A+ +G
Sbjct: 229 KDSNGIDTEASY-PYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYG 287
Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
P+SVAI+A SF Y +GVY D++C S D LDH VL VGYGE +G PYW +KNSW +
Sbjct: 288 PISVAINAGLPSFMSYKSGVYSDDQC--SSDDLDHGVLLVGYGEENGIPYWLIKNSWGPH 345
Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
WG GYV + NN CGV + +Y
Sbjct: 346 WGENGYVKILRDHNNLCGVASMASY 370
>gi|195984441|gb|ACG63793.1| silicatein A1 [Latrunculia oparinae]
Length = 329
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 125/206 (60%), Gaps = 5/206 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ DQS CG+ ++F A+EGA + L LS+Q LIDCS YGN+GC GG ++
Sbjct: 127 SVKDQSQCGASYAFSAMAALEGANALATDTLVNLSEQNLIDCSVPYGNHGCKGGNMLYAF 186
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T + Y P+ G+ + C A ++G V ++ SE L A+A GPV
Sbjct: 187 KYVIANEGVDTANSY-PFYGKQSSCVYNEKYAAVKISGMVRISQGSESDLLGAVANVGPV 245
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VAID S +F FY +GVY +C++S L+HA++ GYG GK YW VKNSW WG
Sbjct: 246 AVAIDGSSNAFRFYSSGVYDSSRCSSSK--LNHAIVVTGYGSYSGKKYWLVKNSWGKNWG 303
Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVTM 351
N GY++M+ K N CG+ + +Y T+
Sbjct: 304 NYGYIMMARGKYNQCGIASDASYPTL 329
>gi|334324657|ref|XP_003340546.1| PREDICTED: cathepsin S-like isoform 2 [Monodelphis domestica]
Length = 281
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 118/204 (57%), Gaps = 5/204 (2%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFR 205
N L D CGSCW+F GA+E +K KL LS Q L+DCS Y N+GC+GG
Sbjct: 78 NHLGDMGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTS 137
Query: 206 SYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
++Q+++ + D PY D C + AT + + + SE+ALK A+A GP
Sbjct: 138 AFQYVIDNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGP 197
Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
VSV IDA SF Y +GVYYD C ++H VL +GYG LDG+ YW VKNSW ++
Sbjct: 198 VSVGIDAKTPSFFLYKSGVYYDPSCTQK---VNHGVLVIGYGNLDGQDYWLVKNSWGLHF 254
Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
G++GYV ++ + N+CG+ P+Y
Sbjct: 255 GDKGYVRIARNRGNHCGIANFPSY 278
>gi|392873946|gb|AFM85805.1| cathepsin H [Callorhinchus milii]
Length = 259
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 119/199 (59%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTG +E A +K KL L++Q L+DC+ Y N+GC+GG +++++I
Sbjct: 58 NQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYI 117
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL + DY PY QD +C A A + VN+T E+ + A+A+ PVS+A
Sbjct: 118 KYNGGLEAEKDY-PYTAQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIA 176
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ + F Y GVY + C+++PD ++HAVLAVGYG +G YW VKNSW WG G
Sbjct: 177 FEVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNG 235
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y + N CG+ P+Y
Sbjct: 236 YFYIIRGKNMCGLAACPSY 254
>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
Length = 335
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|402875039|ref|XP_003901328.1| PREDICTED: pro-cathepsin H [Papio anubis]
Length = 335
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T E+A+ A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 253 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330
>gi|293342574|ref|XP_002725265.1| PREDICTED: cathepsin Q-like isoform 2 [Rattus norvegicus]
gi|79152841|gb|AAI07914.1| Ctsq protein [Rattus norvegicus]
gi|149039734|gb|EDL93850.1| rCG24269 [Rattus norvegicus]
Length = 343
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 133/240 (55%), Gaps = 19/240 (7%)
Query: 124 SLWVRYNKAS-------KDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYM 173
SLW R +S +DA+P V + +GY + + Q C SCW+F GA+EG +
Sbjct: 105 SLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWAFPVVGAIEGQMFK 164
Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI 233
K KL LS Q L+DCS GN GC GG + ++Q+++++G + PY G++ C
Sbjct: 165 KTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKEGLCRY 224
Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
++A +T FV + P +ED L A+A GPV+ I S FY G+Y++ KCNN
Sbjct: 225 NPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFYKKGIYHEPKCNNY 283
Query: 294 PDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
++HAVL VGYG E DG YW ++NSW WG GY+ + ++N+CG+ T Y
Sbjct: 284 ---VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQY 340
>gi|355778231|gb|EHH63267.1| Cathepsin H, partial [Macaca fascicularis]
Length = 305
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 104 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 163
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T E+A+ A+A + PVS A
Sbjct: 164 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 222
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 223 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 281
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 282 YFLIERGKNMCGLAACASY 300
>gi|291234299|ref|XP_002737086.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
Length = 808
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/186 (45%), Positives = 118/186 (63%), Gaps = 5/186 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY + DQ CGSCWSF TG++EG + K KL LS+Q L+DCS GN G
Sbjct: 594 VDWRAQGYVTEVKDQGQCGSCWSFSATGSLEGQTFKKTGKLPDLSEQNLVDCSTTQGNAG 653
Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++++++ + +G+ +++ Y PY G D C + AT TG+ +V SE AL
Sbjct: 654 CNGGLMDQAFEYVKVNNGIDSEESY-PYEGVDGDCRFKASDVAATCTGYTDVESGSESAL 712
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
+ A+A GPVSVAIDAS +SF FY +GVY + +C+++ LDH VL VGYG D YW
Sbjct: 713 QQAVASVGPVSVAIDASHQSFQFYSSGVYIEPECSSTE--LDHGVLTVGYGTEDSSDYWL 770
Query: 317 VKNSWS 322
VKNS S
Sbjct: 771 VKNSES 776
>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
Length = 311
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y K S+Q L+
Sbjct: 87 EANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNEKTSISFSEQQLV 146
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 147 DCSGPWGNNGCSGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 205
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + GP ++A++A + F Y +G+Y + C P L+HAVLAVGYG
Sbjct: 206 VHSGSEVELKNLVGSEGPAAIAVEA-ESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYG 262
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
DG YW VKNSW WG +GY+ M+ + N CG+ +
Sbjct: 263 TQDGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 300
>gi|384251069|gb|EIE24547.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
Length = 548
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/211 (38%), Positives = 127/211 (60%), Gaps = 11/211 (5%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG-NNGCDGGEDFRSYQW 209
DQ+ CGSCW+FG T A++GAY+ K S+Q L+DCSW +G N+ CDGG+ + +
Sbjct: 322 DQATCGSCWAFGATAALQGAYWAATGKATSFSEQQLMDCSWKFGANHACDGGDYDNAIDY 381
Query: 210 IMK-HGLPTQDDYGPYLGQDAYC---HIANT----TATATMTGFVNVTPNSEDALKLALA 261
++ G+ + DY YLGQD +C +A+T ++ + G+ V P+ ++AL A+
Sbjct: 382 LVNVGGIAKEKDY-EYLGQDDFCGNPFMADTKRPSSSLVKVKGYSYVPPHDDEALMEAVY 440
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNS 320
G ++V++DASQ +F FY +G Y + C D LDHAV VGYG + G YW +KNS
Sbjct: 441 SRGTIAVSLDASQPTFRFYASGTYDEPNCMWKSDDLDHAVALVGYGTDEAGVDYWIIKNS 500
Query: 321 WSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
WS +WG+ G++ ++ + CG+ + P Y
Sbjct: 501 WSNHWGDGGFIKIARGHHGCGITSDPVYAVF 531
>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 330
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 118/200 (59%), Gaps = 7/200 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCWSF TTG++EGA + A LS+Q L+DCS YGN GC+GG + +++
Sbjct: 131 DQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNEGCNGGLMDAAMKYV 190
Query: 211 MKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
+ GL T++ Y PY D+Y C A ++ +++V SE L L K GPVSV
Sbjct: 191 IAQGGLDTEESY-PYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDLAAKLNK-GPVSV 248
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AIDAS SF Y +GVYY+ C S LDH VLAVGYG YW VKNSW WG
Sbjct: 249 AIDASHSSFQLYKSGVYYEPAC--SSYNLDHGVLAVGYGTEGSSNYWIVKNSWGPNWGLS 306
Query: 329 GYVLMSI-KDNNCGVMTAPT 347
GY+ M+ K N+CG+ + +
Sbjct: 307 GYIWMAKDKSNHCGISSMAS 326
>gi|211909242|gb|ACJ12894.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +Y+++ + GL T+ Y PY + C A +TG+
Sbjct: 162 DCSGPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNRQLGVAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
+ +E LK + GP +VA+D + F Y +G+Y + C SP GL+HAVLAVGYG
Sbjct: 221 LHSGNEAGLKSLVGSEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
G YW VKNSW WG +GY+ M+ + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 315
>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
Length = 363
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 116/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC + N GC+GG +++++I
Sbjct: 163 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYI 222
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C + VN+T +ED LK A+A PVSVA
Sbjct: 223 KYNGGLDTEESY-PYKGVNGICDFKAENVGVKVLDSVNITLGAEDELKDAVALVRPVSVA 281
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
F Y +GVY + C N+P ++HAVLAVGYG +G PYW +KNSW WG++G
Sbjct: 282 FQVVN-GFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKG 340
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 341 YFKMEMGKNMCGVATCASY 359
>gi|380798253|gb|AFE71002.1| pro-cathepsin H preproprotein, partial [Macaca mulatta]
Length = 242
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++++I
Sbjct: 41 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 100
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ G+ +D Y PY G+D C A + N+T E+A+ A+A + PVS A
Sbjct: 101 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 159
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ +Q F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG G
Sbjct: 160 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 218
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y L+ N CG+ +Y
Sbjct: 219 YFLIERGKNMCGLAACASY 237
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 93/222 (41%), Positives = 124/222 (55%), Gaps = 12/222 (5%)
Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
AIP V + +KG + DQ CGSCW+F T A+EG + KL LS+Q L+DC
Sbjct: 123 AIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTK 182
Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
+ GC+GG +++I+K+G T + PY D C A T A +TG+ V NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCSAATTAPVAKITGYEKVPVNS 242
Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
E +L A+A P+SV+IDAS SF FY +G+Y E C LDH V AVGYG +G
Sbjct: 243 EISLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297
Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
YW VKNSW T WG +GY+ M + K+ CG+ +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCGIAMDSSYPT 339
>gi|259089092|ref|NP_001158584.1| Cathepsin L precursor [Oncorhynchus mykiss]
gi|225705034|gb|ACO08363.1| Cathepsin L precursor [Oncorhynchus mykiss]
Length = 330
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 121/224 (54%), Gaps = 12/224 (5%)
Query: 133 SKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
S A P ++ + Y + +Q CGSC++F GA EG + + L LS+Q L+DC
Sbjct: 114 SLSAAPETWDRRQYGYVTPVKNQGSCGSCYAFAAVGAPEGQLFKQTGTLLPLSEQNLVDC 173
Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
S Y NNGCDGG R + ++ HG+ ++ Y PY + C N T A GF V
Sbjct: 174 SGDYHNNGCDGGLAMRCFSYVSDHGIMSERKY-PYTAEVGPCQYQNATKEAWCKGFNRVP 232
Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE- 308
E + L + GP++V+++A+ SF FY +GV Y+ C+ +HAVLAVGYG
Sbjct: 233 SLDEKVFRDTLYEVGPIAVSVNATHPSFKFYKDGVLYEPDCSTR---TNHAVLAVGYGSS 289
Query: 309 -LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
LD YW VKNSW T WG GY+LM+ N CG+ P Y M
Sbjct: 290 YLD---YWIVKNSWGTGWGRDGYILMARGYNRCGIARRPVYPIM 330
>gi|7271893|gb|AAF44677.1|AF239266_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 121/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA+K A+P + + Y + DQ CGSCW+F TTGA+EG + + A S+Q L+
Sbjct: 102 KANKPAVPESIDWRDYYYVTEVKDQGQCGSCWAFSTTGAMEGQFRKNERASASFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DC+ +GN+GC GG +Y+++ GL T D Y PY + C A A +T +
Sbjct: 162 DCTRNFGNHGCGGGYMENAYEYLKHSGLET-DSYYPYQAVEGPCQYDGRLAYAKVTDYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V E LK + GP +VA+D F Y +G+Y+ E C PD L HAVLAVGYG
Sbjct: 221 VHSGDEVELKNLVGTEGPAAVALDVDY-DFMMYESGIYHSETC--LPDRLTHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
DG YW VKNSW + WG +GY+ + + N CG+ +
Sbjct: 278 AQDGTDYWIVKNSWGSSWGEKGYIRFARNRGNMCGIAS 315
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 120/203 (59%), Gaps = 7/203 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q CGSCW+F GA+EG KL LS Q L+DCS YGN+GC+GG +++Q+++
Sbjct: 140 QGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVI 199
Query: 212 -KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
HG+ + Y PY G Q CH + A + + + E ALK ALA GP+SVA
Sbjct: 200 DNHGIDSDAAY-PYTGRQSQECHYSPKFRAANCSQYSFLPEGDEGALKQALATIGPISVA 258
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA + F+FY +GVY D C+ ++H VLAVGYG L+G+ YW VKNSW +G+ G
Sbjct: 259 IDARRPRFAFYSSGVYDDPSCSQD---VNHGVLAVGYGTLNGQDYWLVKNSWGQTFGDNG 315
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y+ M+ K++ CG+ Y M
Sbjct: 316 YIRMARNKNDQCGIARYGCYPIM 338
>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
Length = 330
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 132/248 (53%), Gaps = 16/248 (6%)
Query: 105 IILEKWRLVTSEGEKVSKYSLW---VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
++ E ++G +S + + V + + PV+Y Q CGSCW+F
Sbjct: 90 LLTEPQNCSATKGSHISSHGPYPGSVDWREKGNYVSPVKY----------QGHCGSCWTF 139
Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDD 220
TTG +E + KL +LS+Q L+DC+ + N+GC GG ++++++ +GL T+DD
Sbjct: 140 STTGCLESVTAIATGKLPLLSEQQLVDCAQDFNNHGCMGGLPSQAFEYVKYNNGLMTEDD 199
Query: 221 YGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFY 280
Y PY G D C+ A A + VN+T E + A+A+ PVS + + F Y
Sbjct: 200 Y-PYTGHDGSCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVTD-DFLHY 257
Query: 281 VNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNC 340
+GVY C N+ D ++HAVLAVGYGE + PYW VKNSW T WG GY L+ N C
Sbjct: 258 KDGVYSSTTCKNTTDNVNHAVLAVGYGEKNSTPYWIVKNSWGTNWGMDGYFLIERGRNMC 317
Query: 341 GVMTAPTY 348
G+ +Y
Sbjct: 318 GLAACSSY 325
>gi|134025544|gb|AAI35768.1| LOC594890 protein [Xenopus (Silurana) tropicalis]
Length = 333
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)
Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
+ SK + P + + N S+ DQ C + W+F + GA+E MK + KL LS Q
Sbjct: 111 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 169
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DCS YGNNGC GG S+++I+ +G+ + +Y PY G+D C + T +
Sbjct: 170 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 228
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
+ E LK + GPVSVAIDAS+K+F Y NGVYYD C++S DH+VL VG
Sbjct: 229 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 286
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
YG DG YW VKNSW T +G++GY+ M+ NNCG+
Sbjct: 287 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 324
>gi|410990006|ref|XP_004001241.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 328
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 119/200 (59%), Gaps = 10/200 (5%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
C S W+F GA+EG + K+ K LS Q L+DCS GN GC+GG ++Q++ +
Sbjct: 130 CASGWAFSAAGAIEGQIFRKYGKRVSLSVQNLLDCSQAEGNEGCNGGLMSNAFQYVRNNR 189
Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
GL T++ Y PY+ +D C + A +T F + P E+AL +A+ G +S AIDAS
Sbjct: 190 GLDTEESY-PYVARDGPCKYRPEYSAANVTAFQTI-PRREEALLVAMKNMGSISAAIDAS 247
Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQG 329
+F FY G+YYD KC S + L+H VL VGYG E D + YW VKNSW T WG G
Sbjct: 248 LDTFRFYKGGIYYDPKC--SSEDLNHGVLVVGYGFQGKESDNQKYWFVKNSWGTDWGMGG 305
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ ++NNCG+ T ++
Sbjct: 306 YIKMARERNNNCGIATRASF 325
>gi|197205900|gb|ACH48003.1| cathepsin [Latrunculia oparinae]
Length = 351
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 113/206 (54%), Gaps = 3/206 (1%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
+++ DQ C SC++F TT A+EG + + +L LS+Q +IDCS YGN GC GG +
Sbjct: 148 SAVKDQGRCKSCYAFATTAALEGMHALATGRLVPLSEQNVIDCSVPYGNRGCSGGSRMAT 207
Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
+ + +G PYLG+ C + TG V + E LK A+A GPV
Sbjct: 208 IMYAVDNGGIDGTSSYPYLGRQYLCKFTEESIATGCTGMVRIKRGKEQDLKKAVAVVGPV 267
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VA+D+ SF FY +G+Y + C+ + L H ++ +GYG G YW +KNSW T WG
Sbjct: 268 TVAVDSRHTSFQFYASGIYSEPSCSRTK--LTHTLIIIGYGSKSGHDYWLLKNSWGTSWG 325
Query: 327 NQGYVLMSIK-DNNCGVMTAPTYVTM 351
GY++MS N CG+ T Y T+
Sbjct: 326 EDGYIMMSRNYANQCGIATKAMYTTI 351
>gi|350606375|ref|NP_001076821.2| uncharacterized protein LOC594890 precursor [Xenopus (Silurana)
tropicalis]
Length = 333
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)
Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
+ SK + P + + N S+ DQ C + W+F + GA+E MK + KL LS Q
Sbjct: 111 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 169
Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
L+DCS YGNNGC GG S+++I+ +G+ + +Y PY G+D C + T +
Sbjct: 170 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 228
Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
+ E LK + GPVSVAIDAS+K+F Y NGVYYD C++S DH+VL VG
Sbjct: 229 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 286
Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
YG DG YW VKNSW T +G++GY+ M+ NNCG+
Sbjct: 287 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 324
>gi|94448668|emb|CAI91572.1| silicatein a3 [Lubomirskia baicalensis]
Length = 344
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGS ++F GA+EGA + KL LS+Q +IDCS YGN+GC GG+ + ++
Sbjct: 142 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 201
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T+ Y PY G+ + C + A TG V + SE L A+A GP+
Sbjct: 202 KYVVDNGGIDTESSY-PYKGKKSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 260
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VA+DAS +F FY +GV+ C+ S L+HA+L GYG +GK YW VKNSW T WG
Sbjct: 261 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 318
Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
GY+ M K N CG+ + Y
Sbjct: 319 ESGYIKMVRNKYNQCGIASDALY 341
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|346574377|gb|AEO36960.1| silicatein-alpha 3 [Baikalospongia fungiformis]
Length = 324
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGS ++F GA+EGA + KL LS+Q +IDCS YGN+GC GG+ + ++
Sbjct: 122 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 181
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T+ Y PY G+ + C + A TG V + SE L A+A GP+
Sbjct: 182 KYVVDNGGIDTESSY-PYKGKQSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 240
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VA+DAS +F FY +GV+ C+ S L+HA+L GYG +GK YW VKNSW T WG
Sbjct: 241 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 298
Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
GY+ M K N CG+ + Y
Sbjct: 299 ESGYIKMVRNKYNQCGIASDALY 321
>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 344
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 128/212 (60%), Gaps = 11/212 (5%)
Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
+GY + +Q +CGS W+F TGA+EG + K +L LS+Q L+DCSW GN GC GG
Sbjct: 123 RGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWPQGNQGCSGGL 182
Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++Q++ GL +++ Y PY + C + A +TGFV+V+ + E AL A+A
Sbjct: 183 MDYAFQYVKDNRGLDSEESY-PYEQRKGSCKYNPRFSAANVTGFVDVSKD-EKALMEAVA 240
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGK--PYWQV 317
GPVSV I + +SF FY G+YYD KC S + ++HAVL VGYG E+ K YW +
Sbjct: 241 TVGPVSVGIATTPESFLFYEGGIYYDPKC--SSENVNHAVLVVGYGFEEVGSKNNKYWLI 298
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
KNSW WG GY+ M+ NN CG+ TA +Y
Sbjct: 299 KNSWGKDWGMGGYMKMAKDQNNHCGIATAASY 330
>gi|312386083|gb|ADQ74586.1| silicatein alpha 3 [Lubomirskia baicalensis]
Length = 330
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)
Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
S+ Q CGS ++F GA+EGA + KL LS+Q +IDCS YGN+GC GG+ + ++
Sbjct: 128 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 187
Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++++ + G+ T+ Y PY G+ + C + A TG V + SE L A+A GP+
Sbjct: 188 KYVVDNGGIDTESSY-PYKGKKSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 246
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
+VA+DAS +F FY +GV+ C+ S L+HA+L GYG +GK YW VKNSW T WG
Sbjct: 247 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 304
Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
GY+ M K N CG+ + Y
Sbjct: 305 ESGYIKMVRNKYNQCGIASDALY 327
>gi|70912393|ref|NP_783171.2| cathepsin R precursor [Rattus norvegicus]
gi|66911479|gb|AAH97484.1| Cathepsin R [Rattus norvegicus]
gi|149039731|gb|EDL93847.1| cathepsin R [Rattus norvegicus]
Length = 334
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/267 (35%), Positives = 149/267 (55%), Gaps = 21/267 (7%)
Query: 103 NGIILE--KWRLVTSEGEKVSKYSLWVRYNKASK--------DAIP--VRYEMKGY-NSL 149
NG I+E ++ +T+E + ++ +R ++ K + +P V + KGY +
Sbjct: 71 NGFIMEMNEFGDLTAEEFRKMMVNIPIRSHRKGKIIRKRDVGNVLPKFVDWRKKGYVTRV 130
Query: 150 LDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
+Q C SCW+F TGA+EG + K +L LS Q L+DC+ GN GC G+ +Y++
Sbjct: 131 QNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEY 190
Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
++ +G + PY G++ C + A +TGFV++ P SED L A+A GP+SVA
Sbjct: 191 VLNNGGLEAEATYPYKGKEGVCRYNPKHSKAEITGFVSL-PESEDILMEAVATIGPISVA 249
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
+DAS SF FY G+Y + C+N+ ++H+VL VGYG E DG YW +KNSW W
Sbjct: 250 VDASFNSFGFYKKGLYDEPNCSNNT--VNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKW 307
Query: 326 GNQGYVLMSIKDNN-CGVMTAPTYVTM 351
G +GY+ + NN C + + Y T+
Sbjct: 308 GLRGYMKIPKDQNNFCAIASYAHYPTV 334
>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
Length = 328
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 134/245 (54%), Gaps = 10/245 (4%)
Query: 105 IILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTT 164
++ E ++G VS L+ D++ R + ++ +Q CGSCW+F TT
Sbjct: 90 LLTEPQNCSATKGTHVSSKGLY-------PDSVDWRKKGNYVTNVKNQGPCGSCWTFSTT 142
Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGP 223
G +E + KL LS+Q L+DC+ + N+GC+GG +++++I GL T+DDY P
Sbjct: 143 GCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYNKGLMTEDDY-P 201
Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
Y QD C A A + VN+T E + A+A+ PVS+A + + F Y +G
Sbjct: 202 YTAQDGTCKFKPERAAAFVKDVVNITMYDEMGMVDAVARLNPVSMAYEVT-SDFMHYHSG 260
Query: 284 VYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
VY +C+N+ D ++HAVLAVGY E + PYW VKNSW +WG +GY + N CG+
Sbjct: 261 VYSSSECHNTTDTVNHAVLAVGYDEENVTPYWIVKNSWGPFWGMKGYFFIERGKNMCGLS 320
Query: 344 TAPTY 348
+Y
Sbjct: 321 ACSSY 325
>gi|313103779|pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A
gi|313103780|pdb|3KSE|B Chain B, Unreduced Cathepsin L In Complex With Stefin A
gi|313103781|pdb|3KSE|C Chain C, Unreduced Cathepsin L In Complex With Stefin A
Length = 220
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 11/220 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGS W+F TGA+EG + K +L LS+Q L+DCS GN G
Sbjct: 5 VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C+GG ++Q++ + GL +++ Y PY + C + A GFV++ P E AL
Sbjct: 65 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 122
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
A+A GP+SVAIDA +SF FY G+Y++ C S + +DH VL VGYG E D
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDDN 180
Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
YW VKNSW WG GYV M+ + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|116488416|gb|AAB41670.2| secreted cathepsin L 1 [Fasciola hepatica]
Length = 326
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 120/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
+A+ A+P + + GY + DQ CGSCW+F TTG +EG Y + S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
DCS +GNNGC GG +YQ++ + GL T+ Y PY + C A +TGF
Sbjct: 162 DCSRPWGNNGCGGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGFYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V SE LK + GP +VA+D + F Y +G+Y + C SP ++HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLM-SIKDNNCGVMT 344
G YW VKNSW WG +GY+ M + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIAS 315
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 85/215 (39%), Positives = 122/215 (56%), Gaps = 5/215 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + +Q CGSCW+F GA+EG ++ LS Q L+DCS YGN G
Sbjct: 118 VDWREKGYVTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C+GG +++Q+++ G D+ PY D C + A + + V+ E+ALK
Sbjct: 178 CNGGFMTQAFQYVIDDGGIDSDEAYPYTAMDGQCRYDQSQRAANCSSYNYVSEGDEEALK 237
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
A+A GP+SVAIDA++ F Y +GVY D C + ++H VL VGYG L+G+ YW V
Sbjct: 238 QAVATIGPISVAIDATRPMFILYHSGVYSDPTCTQN---VNHGVLVVGYGSLNGEDYWLV 294
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
KNSW T +G+ GY+ ++ K N CG+ Y M
Sbjct: 295 KNSWGTRFGDGGYIRIARNKGNMCGIANYACYPLM 329
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 121/216 (56%), Gaps = 13/216 (6%)
Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KG + DQ CGSCW+F T AVEG +K K L LS+Q L+DC GN G
Sbjct: 140 VDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCD-TKGNAG 198
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
CDGG ++Q+I KHG +D PY + A C + A T+ G+ +V N E ALK
Sbjct: 199 CDGGLMDYAFQYIAKHGGVAAEDAYPYKARQASCKKSPAPAV-TIDGYEDVPANDESALK 257
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
A+A H PVSVAI+AS F FY GV+ +C LDH V AVGYG DG YW
Sbjct: 258 KAVA-HQPVSVAIEASGSHFQFYSEGVFAG-RCGTE---LDHGVTAVGYGVAADGTKYWV 312
Query: 317 VKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTY 348
VKNSW WG +GY+ M + K+ +CG+ +Y
Sbjct: 313 VKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASY 348
>gi|22653680|sp|Q9JL96.1|CATM_MOUSE RecName: Full=Cathepsin M; Flags: Precursor
gi|7715970|gb|AAF68224.1|AF202528_1 cathepsin M [Mus musculus]
Length = 333
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 126/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ ++ +GY + Q C SCW+F TGA+EG + K +L LS Q L+DCS GN G
Sbjct: 118 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C G + + ++M++G + PY +D C + +TA +TGF P +EDAL
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA SF FY G+YY+ C++ + H++L VGYG E DG+
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSCV--VTHSMLLVGYGFTGRESDGRK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNS T WGN+GY+ +S K N+CG+ T Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 331
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 89/212 (41%), Positives = 121/212 (57%), Gaps = 9/212 (4%)
Query: 141 YEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
Y KG ++ DQ CGSCW+F + GA+EG K KL LS Q L+DC N+GC
Sbjct: 122 YRKKGIVTAVKDQGQCGSCWAFSSAGALEGMQAKKTGKLVDLSPQNLVDCV--KENDGCG 179
Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
GG +++++ G+ ++ Y PY+ Q+ C + A + + V +E L
Sbjct: 180 GGYMTNAFRYVATNRGIDSEASY-PYVAQEQSCQYKESGKAAECSSYEEVPQGNEKQLAY 238
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQV 317
AL KHGP++V IDA+ +F Y GVYYD CN P+ ++HAVL VGYG G+ YW V
Sbjct: 239 ALFKHGPIAVGIDATLSTFQLYSKGVYYDPNCN--PENINHAVLLVGYGVNSRGQHYWIV 296
Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
KNSWST WGN GYVLM+ N CG+ +Y
Sbjct: 297 KNSWSTNWGNGGYVLMARNRGNLCGIANLASY 328
>gi|410990014|ref|XP_004001245.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 334
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 122/203 (60%), Gaps = 13/203 (6%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q C S W+F TGA+EG + ++ V Q L+DCSW GN GC+GG ++Q++
Sbjct: 136 QGRCASGWAFSATGALEGQMFPENWPNVV---QNLLDCSWPQGNEGCNGGLMSNAFQYVK 192
Query: 212 KHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+G L T++ Y PY+ +D C + A +T F N+ P E+AL +A+A GP+S AI
Sbjct: 193 NNGGLDTEESY-PYVARDGPCKYRPEHSAANVTAFENI-PQQEEALMMAVANMGPISAAI 250
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
DAS +F FY G+YYD KC S + L+H VL VGYG E D + YW VKNSW WG
Sbjct: 251 DASLDTFRFYKEGIYYDPKC--SSEDLNHGVLVVGYGFQGKESDNQKYWFVKNSWGADWG 308
Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
GY+ M+ +DN+CG+ T ++
Sbjct: 309 MDGYIKMAKDRDNHCGIATMASF 331
>gi|47199802|emb|CAF88807.1| unnamed protein product [Tetraodon nigroviridis]
Length = 261
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 121/188 (64%), Gaps = 10/188 (5%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + DQ CGSCW+F TTGA+EG ++ + KL LS+Q L+DCS GN G
Sbjct: 77 VDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRQTGKLVSLSEQNLVDCSRPEGNEG 136
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
C+GG +++Q+I + GL ++ Y PYL D CH + +A TGFV+V SE A
Sbjct: 137 CNGGLMDQAFQYIKDNGGLDSEASY-PYLATDDQPCHYDPSNNSANETGFVDVPSGSERA 195
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
L A+A GPVSVAIDA +SF FY +G+YY+++C S + LDH VL VGYG ++DG
Sbjct: 196 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFQGEDVDG 253
Query: 312 KPYWQVKN 319
K +W VKN
Sbjct: 254 KKFWIVKN 261
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 123/230 (53%), Gaps = 39/230 (16%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ DQ CGSCWSF TTG+VEGA+ +K + LS+Q L+DCS GN GC+GG ++
Sbjct: 129 IKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVELSEQNLVDCSSAEGNMGCNGGLMNNAFD 188
Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+I+ HG+ T+ Y + C T AT++ + ++TP SE L A+ GPVS
Sbjct: 189 YIISNHGIDTEQSYPYTANTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVS 248
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-------ELD---------- 310
VAIDAS +SF Y +G+YY+ C S LDH VL VGYG ++D
Sbjct: 249 VAIDASHRSFQLYSHGIYYEWLC--SSTRLDHGVLVVGYGSGNPPNSDMDHMILKKTAKT 306
Query: 311 ------------------GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCG 341
K YW VKNSWS WG++GY+ MS + NNCG
Sbjct: 307 DHYHGKKSLKVEKVDTTSSKNYWIVKNSWSDTWGDKGYIYMSKDRKNNCG 356
>gi|387915178|gb|AFK11198.1| cathepsin L1 [Callorhinchus milii]
Length = 331
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 127/212 (59%), Gaps = 6/212 (2%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + KGY + Q CGSC+ F TG++E + K+ KL LS+Q+++DCS YGN G
Sbjct: 122 VDWRKKGYVTPVYYQGTCGSCYGFAATGSLEAMIFKKYGKLIKLSEQSIVDCSSYYGNLG 181
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C GG R+Y+++M++G+ + D Y PY + C ++ ATM+ ++ V N E+AL
Sbjct: 182 CSGGSIIRAYKFVMENGIQSADTY-PYTAKAGTCRHNKSSTVATMSHYIKVHSN-EEALA 239
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
++ GPV+V + +S+ FY +G+ YD C N DH +L VG+G + YW V
Sbjct: 240 QTVSTVGPVAVCVHTKTRSWQFYKSGILYDPLCKNYT--YDHGILVVGFGIQKDEHYWIV 297
Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
KNSW + WG +GY+ ++ ++N+CG+ + TY
Sbjct: 298 KNSWGSRWGMKGYIWIAKDRNNHCGISSYATY 329
>gi|392354135|ref|XP_225128.6| PREDICTED: LOW QUALITY PROTEIN: cathepsin M [Rattus norvegicus]
Length = 333
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 126/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + +GY S+ +Q C SCW+F GA+EG + K +L LS Q L+DCS GN G
Sbjct: 118 VDWRRRGYVTSVKNQGRCNSCWAFSVAGAIEGQMFRKTGRLVSLSAQNLVDCSRPEGNRG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C G F +++++ +G + PY G++ +C + A + GF ++ ++E+AL
Sbjct: 178 CISGHTFYTFKYVWNNGGLEAESTYPYEGREGHCRYLPERSAARIKGF-SIISSTEEALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SV IDAS +SF+FY G+YY+ KC N ++HAVL VGYG E DG+
Sbjct: 237 NAVATIGPISVGIDASHESFTFYSGGIYYEPKCRNKT--VNHAVLLVGYGYEGRESDGRK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
YW +KNS WG GY+ ++ + +CG+ T Y
Sbjct: 295 YWLIKNSHGVGWGMNGYMKLARGWNKHCGIATCAFY 330
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|414589597|tpg|DAA40168.1| TPA: hypothetical protein ZEAMMB73_868349 [Zea mays]
Length = 252
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 114/199 (57%), Gaps = 3/199 (1%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
+Q CGSCW+F TTGA+E AY K LS+Q L+DC + + N GC GG +++++I
Sbjct: 52 NQGHCGSCWTFSTTGALEAAYTQATGKAISLSEQQLVDCGFAFNNFGCKGGLPSQAFEYI 111
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ GL T++ Y PY G + C + VN+T +ED LK A+ PVSVA
Sbjct: 112 KYNGGLDTEESY-PYQGVNGICQFKAENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 170
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
+ F Y GVY + C +P ++HAVLAVGYG +G PYW +KNSW WG++G
Sbjct: 171 FEVI-SGFRLYKTGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDEG 229
Query: 330 YVLMSIKDNNCGVMTAPTY 348
Y M + N CGV T +Y
Sbjct: 230 YFKMEMGKNMCGVATCASY 248
>gi|308160643|gb|EFO63118.1| Cathepsin L precursor [Giardia lamblia P15]
Length = 577
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 129/219 (58%), Gaps = 17/219 (7%)
Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYY--------MKHKKLAVLSQQALIDCSWGYGNNGC 198
N DQ CGSCW+FG G +EG ++H+ L S+Q+++DC WG+G+ GC
Sbjct: 357 NMAKDQVACGSCWTFGAIGTIEGRINKLRVVEEGLRHEPLKAYSEQSIVDCYWGFGSFGC 416
Query: 199 DGGEDFRSYQWIMKHG---LPTQDDYGPYLGQDAYCHIA--NTTATATMTGFVNVTPNSE 253
DGG+ + +W++++ + + +Y PYLGQ+ C A + + ++TG+ V S
Sbjct: 417 DGGDTLAALKWLVENNRGRVAFESEY-PYLGQNDLCKEALFDQEPSYSVTGYSAVKQYSI 475
Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG-K 312
+LK AL + GPV+V+I ++ S FY GVY D C + D L HAVLAVGYG D
Sbjct: 476 PSLKAAL-QDGPVAVSIGITE-SLLFYSGGVYNDPACPHEYDDLAHAVLAVGYGTDDTYG 533
Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
YW V+NSWS WG GY +S+KDN CG++T +Y +
Sbjct: 534 DYWIVRNSWSPLWGMDGYFYLSMKDNICGILTDASYAVV 572
>gi|171948778|gb|ACB59246.1| cathepsin H [Sus scrofa]
Length = 297
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 82/202 (40%), Positives = 118/202 (58%), Gaps = 6/202 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGED---FRSY 207
+Q CGSCW+F TTGA+E A + K+ L++Q L+DC+ + N+GC GG +++
Sbjct: 93 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPGLPSQAF 152
Query: 208 QWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
++I G+ +D Y PY GQD +C A A + N+T N E+A+ A+A + PV
Sbjct: 153 EYIRYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPV 211
Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
S A + + F Y G+Y C+ +PD ++HAVLAVGYGE +G PYW VKNSW WG
Sbjct: 212 SFAFEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG 270
Query: 327 NQGYVLMSIKDNNCGVMTAPTY 348
GY L+ N CG+ +Y
Sbjct: 271 MNGYFLIERGKNMCGLAACASY 292
>gi|27960487|gb|AAO27847.1|AF456463_1 cathepsin M [Mus musculus]
gi|16323039|gb|AAL15416.1| cathepsin M [Mus musculus]
Length = 333
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 126/216 (58%), Gaps = 9/216 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
+ ++ +GY + Q C SCW+F TGA+EG + K L LS Q L+DCS GN G
Sbjct: 118 INWKKRGYVTPVRTQGRCNSCWAFSVTGAIEGQMFRKTGPLIPLSVQNLVDCSRPQGNWG 177
Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
C G + + ++M++G + PY ++ C + +TA +TGF P +EDAL
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKEGSCRYSPENSTANITGF-EFVPKNEDALM 236
Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
A+A GP+SVAIDA SF FY G+YY+ C++S + H++L VGYG E DG+
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFAGRESDGRK 294
Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
YW VKNS T WGN+GY+ +S K N+CG+ T Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
Length = 330
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 90/245 (36%), Positives = 131/245 (53%), Gaps = 10/245 (4%)
Query: 105 IILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTT 164
++ E + G VS L+ DAI R + + +Q CGSCW+F TT
Sbjct: 90 LLTEPQNCSATRGNHVSSNGLY-------PDAIDWRTKGHYITDVKNQGPCGSCWTFSTT 142
Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGP 223
G +E + KL L++Q LIDC+ + N+GC+GG ++++IM GL T+DDY P
Sbjct: 143 GCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDY-P 201
Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
Y + C A A + VN+T E + A+A+ PVS A + + F Y +G
Sbjct: 202 YQAKGGQCRFKPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVT-SDFMHYKDG 260
Query: 284 VYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
+Y +C+N+ D ++HAVLAVGY E +G PYW VKNSW T WG +GY + N CG+
Sbjct: 261 IYTSTECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLA 320
Query: 344 TAPTY 348
+Y
Sbjct: 321 ACSSY 325
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 116/199 (58%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E +K KL LS Q L+DCS Y N+GC+GG ++Q++
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYV 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + SE+ALK A+A GPVSV I
Sbjct: 193 IDNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGI 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYYD C ++H VL +GYG LDG+ YW VKNSW ++G++GY
Sbjct: 253 DAKTPSFFLYKSGVYYDPSCTQK---VNHGVLVIGYGNLDGQDYWLVKNSWGLHFGDKGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
V ++ + N+CG+ P+Y
Sbjct: 310 VRIARNRGNHCGIANFPSY 328
>gi|354502591|ref|XP_003513367.1| PREDICTED: cathepsin L1-like isoform 1 [Cricetulus griseus]
Length = 330
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 126/218 (57%), Gaps = 10/218 (4%)
Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
V + GY + DQ C SCW+F G++EG + K KL LS+Q L+DCS NNG
Sbjct: 118 VDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHNNG 177
Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
C GG ++Q+I + GL T + Y PY QD C + A +TGFV V P++E+AL
Sbjct: 178 CHGGLFTSAFQYIKDNGGLDTSESY-PYEAQDGPCRYDPKHSAANITGFV-VVPSNEEAL 235
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC-NNSPDGLDHAVLAVGYG-ELDGKPY 314
A+A GP+S+ I +S FY +G YYD C N+ P +H+VL VGYG E DG+ Y
Sbjct: 236 MKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYP---NHSVLLVGYGEESDGQKY 292
Query: 315 WQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
W VKNSW WG GY+ + ++N+C + T Y T+
Sbjct: 293 WLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 330
>gi|391333246|ref|XP_003741030.1| PREDICTED: digestive cysteine proteinase 2-like [Metaseiulus
occidentalis]
Length = 327
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 125/214 (58%), Gaps = 9/214 (4%)
Query: 138 PVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
V + GY + DQ CGSCW+F TGAVEG Y+ K +L LS+Q L+DC ++
Sbjct: 117 AVDWRQNGYVTPVKDQGKCGSCWAFAATGAVEGQYFKKTGQLVSLSEQNLVDCD--RSSD 174
Query: 197 GCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
GC+GG + S+++I + G+ T+ YG Y C + AT++G +V E+A
Sbjct: 175 GCEGGYFYESFEYIRSNGGIATESSYG-YEATAGSCRFTADSIGATVSGRDSVASGDEEA 233
Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
L A+A GP+SV ID +F Y +GVYYD +C++S +HAVL VGYG G YW
Sbjct: 234 LLKAVASIGPISVTIDVID-TFRHYSSGVYYDAECSSSS--RNHAVLVVGYGTEAGGDYW 290
Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
VKNSW T +G QGY+ M+ K NNCG+ + Y
Sbjct: 291 LVKNSWGTSFGEQGYIKMARNKGNNCGIASEAGY 324
>gi|197205894|gb|ACH48000.1| silicatein A2 [Latrunculia oparinae]
Length = 329
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQS CG+ ++F GA+EGA + L+VLS+Q +IDCS YGN+GC GG ++++I
Sbjct: 130 DQSQCGASYAFSAVGALEGANALATGSLSVLSEQNIIDCSVPYGNHGCKGGNMLYAFKYI 189
Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ + GL Y P+ G+ C + ++G V + SE L A+A GPVSVA
Sbjct: 190 IANDGLDVAKSY-PFQGKQQSCVYDDQDTGGKISGMVRIKQGSESDLIGAVANVGPVSVA 248
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
ID S +F FY +GVY +C++S L+HA++ GYG GK YW VKNSW T WG G
Sbjct: 249 IDGSSNAFRFYASGVYDSSRCSSSK--LNHAMVVTGYGTYGGKDYWLVKNSWGTNWGQSG 306
Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
Y++M+ K N CG+ + Y T+
Sbjct: 307 YIMMARGKYNQCGIASDACYPTL 329
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDLKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 119/200 (59%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E + KL LS Q L+DCS Y N GC GG ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C + AT + + + SE+ALK A+A GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAMDEKCQYDSKNRAATCSKYTELPFGSEEALKEAVASKGPVSVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDAS SF Y +GVYY+ C ++H VL VGYG L+G YW VKNSW Y+G++G
Sbjct: 252 IDASHSSFFLYRSGVYYEPACTQV---VNHGVLVVGYGNLNGNDYWLVKNSWGLYFGDKG 308
Query: 330 YVLMSI-KDNNCGVMTAPTY 348
Y+ M+ ++N+CG+ + +Y
Sbjct: 309 YIRMARNRENHCGIASYSSY 328
>gi|205689966|sp|A6NFJ7.3|CATL6_HUMAN RecName: Full=Putative cathepsin L-like protein 6
Length = 277
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/254 (39%), Positives = 146/254 (57%), Gaps = 15/254 (5%)
Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTT 164
E++R V + G + K+ ++ + IP V + KGY + DQ CGSCW+F T
Sbjct: 29 EEFRQVMN-GFQYQKHRKGKQFQERLLPEIPTSVDWREKGYMTPVKDQGQCGSCWAFSAT 87
Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGP 223
GA+EG + K KL L++Q L+DCS GN GC+GG ++++ ++ GL ++ Y P
Sbjct: 88 GALEGQMFWKTGKLISLNEQNLVDCSGPQGNEGCNGGFMDNPFRYVQENGGLDSEASY-P 146
Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
Y G+ C + A TGFV++ P+ E L A+A GP+SVA AS SF FY G
Sbjct: 147 YEGKVKTCRYNPKYSAANDTGFVDI-PSQEKDLAKAVATVGPISVAAGASHVSFQFYKKG 205
Query: 284 VYYDEKCNNSPDGLDHAVLAVGY----GELDGKPYWQVKN-SWSTYWGNQGYVLMSI-KD 337
+Y++ +C+ P+GLDHA+L VGY + D YW VKN S WG GY+ M+ +
Sbjct: 206 IYFEPRCD--PEGLDHAMLLVGYSYEGADSDNNKYWLVKNSSEGKNWGMDGYIKMAKDRR 263
Query: 338 NNCGVMTAPTYVTM 351
NNCG+ TA +Y T+
Sbjct: 264 NNCGIATAASYPTV 277
>gi|391341652|ref|XP_003745141.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
occidentalis]
Length = 751
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 111/195 (56%), Gaps = 6/195 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+Q CGSC+SF + +E Y +++ K S+Q ++DCSW N GC GG +++
Sbjct: 551 NQGTCGSCYSFASVAYLESQYIIRNGKGNTTRFSEQQIVDCSWDSLNIGCKGGFPHGAFE 610
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
++ K+GL T+D YGPYL + C A + + T + L + HGP++
Sbjct: 611 YVQKYGLFTEDQYGPYLDDEGKCRDAEMKGEPIIPTLKSFTMMEGAECLLRHVGLHGPIA 670
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
V I S SF Y G+Y D C++S L HAVL VGYG L G+PYW VKNSW WG
Sbjct: 671 VGIHGSSDSFRAYSRGIYNDPTCDHS---LTHAVLVVGYGSLRGEPYWLVKNSWGPKWGA 727
Query: 328 QGYVLMSIKDNNCGV 342
+GY+L+S K+N CG+
Sbjct: 728 EGYILVSRKENYCGI 742
>gi|1185459|gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]
Length = 331
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 118/198 (59%), Gaps = 5/198 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
Q +CGSCW+F TGA+EG KHKKL LS+Q L+DC + YGN+GC+GG ++ ++
Sbjct: 134 QGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNYLE 193
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
KH + +++DY YLG DA CH + + F ++ E L+ A+ ++GP+SV I
Sbjct: 194 KHYIESENDY-KYLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEKAVYQYGPISVGI- 251
Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
+ S Y +G+Y + C + ++H VLAVGYG +GK YW +KNSW WG GY
Sbjct: 252 VALDSLILYKSGIYESKDCKYAD--INHGVLAVGYGRENGKDYWLIKNSWGDLWGMNGYF 309
Query: 332 -LMSIKDNNCGVMTAPTY 348
L K + CG+ + ++
Sbjct: 310 KLRRNKPHMCGISSNSSF 327
>gi|148283737|gb|ABN50361.2| cathepsin L [Fasciola hepatica]
Length = 326
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)
Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
KA+K A+P + + Y + +Q CGSCW+F TTGAVEG + + A S+Q L+
Sbjct: 102 KANKLAVPESIDWRDYYYVTEVKNQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLV 161
Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
+C+ +GN GC GG +Y+++ +GL T+ Y PY + C A A +TG+
Sbjct: 162 NCTRDFGNYGCGGGYVENAYEYLKHNGLETESYY-PYQAVEGPCQYDGRLAYAKVTGYYT 220
Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
V E LK + GP +VA+DA F Y +G+Y + C PD L HAVLAVGYG
Sbjct: 221 VHSGDEIELKNLVGTEGPAAVALDAD-SDFMMYQSGIYQSQTC--LPDRLTHAVLAVGYG 277
Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
DG YW VKNSW T+WG GY+ + + N CG+ +
Sbjct: 278 SQDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIAS 315
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 118/200 (59%), Gaps = 7/200 (3%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CGSCW+F GA+E MK +L LS Q L+DCS Y N GC+GG ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYI 192
Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
+ +G+ ++ Y PY D C + AT + + + E ALK A+A GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVA 251
Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
IDA SF FY +GVYYD C + ++H VL VGYG L+GK YW VKNSW +G+ G
Sbjct: 252 IDAKHSSFFFYRSGVYYDPSCTQN---VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGG 308
Query: 330 YVLMSIK-DNNCGVMTAPTY 348
Y+ M+ +N+CG+ P+Y
Sbjct: 309 YIRMARNSENHCGIANYPSY 328
>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
Length = 251
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 126/219 (57%), Gaps = 10/219 (4%)
Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQAL 186
+A A+P + + GY + DQ CGSCW+F TTGA+EG YMK +++ + S+Q L
Sbjct: 27 RAKNRAVPTSIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQ-YMKSQRINISFSEQQL 85
Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
+DCS +GN+GC GG ++Y+++ GL T+ Y PY + C A ++ +
Sbjct: 86 VDCSGDFGNHGCSGGLMEKAYEYLRHFGLETESSY-PYRADEGPCQYDKQLGVAQLSDYY 144
Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
V E ALK + GP +VA+D + F Y +G+Y DE C S L+HA+LAVGY
Sbjct: 145 IVHSQDEVALKNLIGVEGPAAVALDVN-IDFMMYKSGIYQDEIC--SSRYLNHALLAVGY 201
Query: 307 GELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMT 344
G DG YW VKNSW + WG GY+ L +DN CG+ T
Sbjct: 202 GTEDGTEYWIVKNSWGSRWGEHGYIRLARNRDNMCGIAT 240
>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
Length = 333
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 122/206 (59%), Gaps = 10/206 (4%)
Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
+ +Q CGSCW+F GA+EG + K L LS Q L+DCS GN GC G ++++
Sbjct: 128 VRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFE 187
Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
+++K+ GL + Y PY G+D C + A+A +T +VN+ PN E L +A+A GPVS
Sbjct: 188 YVLKNKGLEAEATY-PYEGKDGPCRYRSENASANITDYVNLPPN-ELYLWVAVASIGPVS 245
Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE----LDGKPYWQVKNSWST 323
AIDAS SF FY G+YY+ C S ++HAVL VGYG DG YW +KNSW
Sbjct: 246 AAIDASHDSFRFYNGGIYYEPNC--SSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGE 303
Query: 324 YWGNQGYVLMSIKDNN-CGVMTAPTY 348
WG GY+ ++ NN CG+ + +Y
Sbjct: 304 EWGMNGYMQIAKDHNNHCGIASLASY 329
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)
Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
Q CG+CW+F GA+E +K KL LS Q L+DCS YGN GC+GG ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
+ + D PY D C + AT + + + ED LK A+A GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DA SF Y +GVYY+ C + ++H VL VGYG+L+GK YW VKNSW +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309
Query: 331 VLMSI-KDNNCGVMTAPTY 348
+ M+ K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,976,626,717
Number of Sequences: 23463169
Number of extensions: 256321855
Number of successful extensions: 525954
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5628
Number of HSP's successfully gapped in prelim test: 1683
Number of HSP's that attempted gapping in prelim test: 505102
Number of HSP's gapped (non-prelim): 8766
length of query: 351
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 208
effective length of database: 9,003,962,200
effective search space: 1872824137600
effective search space used: 1872824137600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)