BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy3960
         (351 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350425511|ref|XP_003494144.1| PREDICTED: counting factor associated protein D-like [Bombus
           impatiens]
          Length = 549

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 159/202 (78%), Positives = 175/202 (86%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTGAVEGAYYMK+ KL  LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 407

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LPT+DDYG YLGQD YCHI N T TA +TG+VNVT    +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDDYGGYLGQDGYCHINNATVTAKITGYVNVTSGDANALKVAIAKHGPISVA 467

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +GVYYDE C N+ + LDHAVLAVGYG L+GK YW VKNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDG 527

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549



 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QLS  G+ G S+K+AP+TTE+ LN+ +C    G  +  ++PQ+++PD ++ +  
Sbjct: 65  GMVKTFQLSHKGSYGKSIKIAPITTESVLNEETCLEVNGTKEIKVQPQAIIPDTTEMEC- 123

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE++NG++ EKWRL      K++KY+ W+RY K+ 
Sbjct: 124 -----------------------IGEEVINGLLCEKWRLEQEIDGKLNKYTFWIRYKKSP 160

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPGLKEPIPVRYEMKGFNSLL 182


>gi|340728972|ref|XP_003402785.1| PREDICTED: counting factor associated protein D-like [Bombus
           terrestris]
          Length = 549

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 157/202 (77%), Positives = 175/202 (86%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTGAVEGAYYMK+ KL  LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 407

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LPT+D+YG YLGQD YCH+ N T TA +TG+VNVT    +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDEYGGYLGQDGYCHVNNVTLTAKITGYVNVTSGDANALKVAIAKHGPISVA 467

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +GVYYDE C N+ + LDHAVLAVGYG L+GK YW VKNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDG 527

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549



 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QLS  G+ G S+K+AP+TTE+ LN+ +C    G  +  ++PQ+++PD ++ +  
Sbjct: 65  GMVKTFQLSHKGSYGKSIKIAPITTESVLNEETCLEVNGTKEIKVQPQAIIPDTTEMEC- 123

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE++NG++ EKWRL      K++KY+ W+RY K+ 
Sbjct: 124 -----------------------IGEEVINGLLCEKWRLEQEIDGKLNKYTFWIRYKKSP 160

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPGFKEPIPVRYEMKGFNSLL 182


>gi|322787487|gb|EFZ13575.1| hypothetical protein SINV_11743 [Solenopsis invicta]
          Length = 552

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 156/202 (77%), Positives = 173/202 (85%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTGAVEGAYYMKH KL  LSQQALIDCSWG+GNNGCDGGEDFRSYQW+
Sbjct: 351 DQSVCGSCWSFGTTGAVEGAYYMKHNKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWM 410

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LPT++DYG YLGQD YCHI N T TA +TGFVNV P S  ALK+A+AKHGP+SVA
Sbjct: 411 MKHGGLPTEEDYGGYLGQDGYCHINNVTLTANITGFVNVPPRSVQALKIAIAKHGPISVA 470

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +GVYYD  C N+ D LDHAVLAVGYG ++GK YW VKNSWS YWGN G
Sbjct: 471 IDASHKTFSFYSHGVYYDPACGNTEDKLDHAVLAVGYGTMNGKGYWLVKNSWSNYWGNDG 530

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LM+ KD+NCGV+  PTYVTM
Sbjct: 531 YILMAQKDDNCGVLLTPTYVTM 552



 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/142 (42%), Positives = 84/142 (59%), Gaps = 31/142 (21%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QL+  G  G S+K+AP+TTET  N+ +C    G  D  I+PQS+LPD S+ +  
Sbjct: 70  GMVKTFQLANKGLYGASIKLAPITTETVSNEETCLQVNGTDDVKIQPQSILPDASEMEA- 128

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                    +E++NG++ EKWRLV   GEK +KY+LW+RY K+ 
Sbjct: 129 -------------------------KEMINGLLCEKWRLVQEIGEKTNKYTLWIRYKKSP 163

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IP+RYEMKG+NSLL
Sbjct: 164 YLPKVKEPIPMRYEMKGFNSLL 185


>gi|383860620|ref|XP_003705787.1| PREDICTED: counting factor associated protein D-like [Megachile
           rotundata]
          Length = 549

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 155/202 (76%), Positives = 174/202 (86%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTGAVEGAYYMK+ KL  LSQQALIDCSWG+GNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYYMKYGKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWI 407

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LP +D+YG YLGQD YCH  N T  A +TGFVNVTP   +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPAEDEYGGYLGQDGYCHANNVTKVAKITGFVNVTPGDPNALKVAIAKHGPISVA 467

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+ K+FSFY +GVYYDE C N+ + LDHAVLAVGYG+L+GK YW VKNSWS YWGN G
Sbjct: 468 IDAAHKTFSFYSHGVYYDESCGNTEESLDHAVLAVGYGKLNGKDYWLVKNSWSNYWGNDG 527

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS + NNCGV+TAPTYVTM
Sbjct: 528 YILMSQEKNNCGVLTAPTYVTM 549



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/142 (45%), Positives = 85/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQLS  G+ G S+K+AP+TTE  LN  +C    G  D  IEPQ+++PD +  +  
Sbjct: 65  GMVKTYQLSHKGSYGKSIKIAPITTENILNAETCLEVNGTRDLKIEPQTIIPDTTGMEC- 123

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE+VNG++ EKWRLV +  EK +KY+LW+RY K+ 
Sbjct: 124 -----------------------IGEEVVNGLLCEKWRLVETVNEKTNKYTLWIRYKKSP 160

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IPVRYEMKG+NSLL
Sbjct: 161 RVPQFKEPIPVRYEMKGFNSLL 182


>gi|307202924|gb|EFN82144.1| Digestive cysteine proteinase 1 [Harpegnathos saltator]
          Length = 484

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 155/202 (76%), Positives = 174/202 (86%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTG +EGAYYMK+ KL  LSQQALIDCSWGYGNNGCDGGEDFR+YQWI
Sbjct: 283 DQSVCGSCWSFGTTGTIEGAYYMKYNKLVRLSQQALIDCSWGYGNNGCDGGEDFRAYQWI 342

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKH GLPT++DYG YLGQD YCHI N T TA +TG+VNVTP S  ALK+A+AKHGP+SVA
Sbjct: 343 MKHDGLPTEEDYGGYLGQDGYCHINNVTKTAKITGYVNVTPLSVAALKIAIAKHGPISVA 402

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +GVYYD  C N+ D LDHAVLAVGYG ++GK YW VKNSWS YWGN G
Sbjct: 403 IDASHKTFSFYSHGVYYDPACGNTEDKLDHAVLAVGYGNMNGKDYWLVKNSWSNYWGNDG 462

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LM+ KD+NCGV+ APTYVTM
Sbjct: 463 YILMAQKDDNCGVLLAPTYVTM 484



 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 85/141 (60%), Gaps = 29/141 (20%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           MVKTYQLS  G  G S+K+AP+TTET  NK +C    G  D  I+PQ++LPD S+ +   
Sbjct: 1   MVKTYQLSHKGQYGASIKIAPVTTETASNKDTCLQVNGTSDLKIQPQTILPDTSEMEC-- 58

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS- 133
                                 IGEE++NG++ EKWRLV    EK +KY+LW+RY K+  
Sbjct: 59  ----------------------IGEEMINGLMCEKWRLVQEIEEKTNKYTLWIRYKKSPY 96

Query: 134 ----KDAIPVRYEMKGYNSLL 150
               K+AIPVRYEM+G+NSLL
Sbjct: 97  VPQLKEAIPVRYEMRGFNSLL 117


>gi|307180251|gb|EFN68284.1| Cathepsin L [Camponotus floridanus]
          Length = 555

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 152/202 (75%), Positives = 174/202 (86%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTG +EGAYY+K+ KL  LSQQALIDCSWGYGNNGCDGGEDFRSYQWI
Sbjct: 354 DQSVCGSCWSFGTTGTIEGAYYLKYNKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 413

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LPT+DDYG YLGQD +CH+ N T TA + GFVNVTP + DALK+A+AKHGP+SVA
Sbjct: 414 MKHGGLPTEDDYGSYLGQDGFCHVNNVTLTAKIKGFVNVTPRNVDALKIAIAKHGPISVA 473

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +GVYYD  C N+ D LDHAVLAVGYG ++GK YW +KNSWS YWGN G
Sbjct: 474 IDASHKTFSFYSHGVYYDPTCGNTEDTLDHAVLAVGYGTMNGKGYWLIKNSWSNYWGNDG 533

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LM+ +D+NCGV+ APTYVTM
Sbjct: 534 YILMAQQDDNCGVLLAPTYVTM 555



 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/142 (44%), Positives = 84/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QLS  G  G S+K+AP+TTET  NK +C    G  +  I+PQS++PD S+ +  
Sbjct: 71  GMVKTFQLSHKGEFGASIKIAPVTTETVNNKETCLQVNGTINLKIQPQSIVPDASEMEC- 129

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE++NG+  EKWRLV   G K +KY+LW+RY K+ 
Sbjct: 130 -----------------------IGEEMINGLSCEKWRLVQEIGGKTNKYTLWIRYKKSP 166

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IPVRYEM+G+NSLL
Sbjct: 167 SLPQVKEPIPVRYEMRGFNSLL 188


>gi|156554010|ref|XP_001605879.1| PREDICTED: counting factor associated protein D-like [Nasonia
           vitripennis]
          Length = 553

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 152/223 (68%), Positives = 181/223 (81%), Gaps = 4/223 (1%)

Query: 131 KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +  K  +P  ++ + Y ++    DQSVCGSCWSFGTTGAVEGAY+MK+KKL  LSQQALI
Sbjct: 328 EKEKADVPDSFDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFMKYKKLVRLSQQALI 387

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCSWG+GNNGCDGGEDFRSYQWI+KHG LPT+++YG YLGQD YCHI N T  A + GFV
Sbjct: 388 DCSWGFGNNGCDGGEDFRSYQWIIKHGGLPTEEEYGGYLGQDGYCHIKNVTQIAKLKGFV 447

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           NV  N+ DA+KLAL KHGP+SVAIDAS K+FSFY NGVYY+  C N+ + LDHAVLAVGY
Sbjct: 448 NVDTNNVDAMKLALFKHGPISVAIDASHKTFSFYSNGVYYEPACGNTENSLDHAVLAVGY 507

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
           G ++GK +W +KNSWS YWGN GY+LM+ K+NNCGVMTAPTY 
Sbjct: 508 GTINGKGFWLIKNSWSNYWGNDGYILMAQKNNNCGVMTAPTYA 550



 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/142 (44%), Positives = 85/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQLSK G  G S+K+AP+T E   NK +C    G  DA I+ Q+++PD +  +  
Sbjct: 67  GMVKTYQLSKEGPYGSSIKIAPVTDEDNFNKETCLQVNGTSDARIDLQTIIPDTTGMEC- 125

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE++NG+  EKWRL+ S GEK +KY+LW+RY K+ 
Sbjct: 126 -----------------------IGEEMINGLACEKWRLIDSFGEKTNKYTLWIRYKKSP 162

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+AIPVRYEM+G+N+LL
Sbjct: 163 STPQMKEAIPVRYEMRGFNTLL 184


>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
          Length = 552

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 154/224 (68%), Positives = 178/224 (79%), Gaps = 4/224 (1%)

Query: 132 ASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           A  D IP   + + Y ++    DQSVCGSCWSFGTTG +EGAY++K+  L  LSQQALID
Sbjct: 329 AIMDQIPDDLDWRIYGAVTPVKDQSVCGSCWSFGTTGTIEGAYFLKYGHLVRLSQQALID 388

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           CSWGYGNNGCDGGEDFRSY+W+MKHG +P +D+YG YLGQD YCH+ N T TA +TG+VN
Sbjct: 389 CSWGYGNNGCDGGEDFRSYEWMMKHGGIPLEDEYGGYLGQDGYCHVENVTLTAKITGYVN 448

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           VT    DALK+ALAKHGP+SVAIDAS K+FSFY NG+YYD +C N  D LDHAVL VGYG
Sbjct: 449 VTSGDIDALKVALAKHGPISVAIDASHKTFSFYSNGIYYDPECGNKLDQLDHAVLLVGYG 508

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
            ++G PYW VKNSWS YWGN GY+LMS KDNNCGV T PTYVTM
Sbjct: 509 IINGNPYWLVKNSWSNYWGNDGYILMSPKDNNCGVATDPTYVTM 552



 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 71/142 (50%), Positives = 88/142 (61%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQ+SK G  G SLKVAP+TTE   NK +C    G  D  IEPQS+LPD++ FK+ 
Sbjct: 67  GMVKTYQISKYGNFGTSLKVAPVTTEKVENKETCLQVNGTKDNKIEPQSILPDLTGFKL- 125

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IG E +NG+  EKWRLV ++G KV+KY++W+RY K++
Sbjct: 126 -----------------------IGTEPINGLNCEKWRLVDTKGAKVNKYTMWIRYKKSA 162

Query: 134 KD-----AIPVRYEMKGYNSLL 150
            D     AIPVRYEMKGYNSLL
Sbjct: 163 NDPGVKIAIPVRYEMKGYNSLL 184


>gi|380013206|ref|XP_003690657.1| PREDICTED: counting factor associated protein D-like [Apis florea]
          Length = 549

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 147/202 (72%), Positives = 172/202 (85%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGTTGAVEGAY+MK+ KL  LSQQALIDCSWG+GNNGCDGGEDFRSYQWI
Sbjct: 348 DQSVCGSCWSFGTTGAVEGAYFMKYGKLVRLSQQALIDCSWGFGNNGCDGGEDFRSYQWI 407

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG LPT+D+YG YLGQD YCH+ N +  A +TG+VNVT    +ALK+A+AKHGP+SVA
Sbjct: 408 MKHGGLPTEDEYGGYLGQDGYCHVNNISMIAKITGYVNVTSGDANALKIAIAKHGPISVA 467

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY +G+YY+  C N  + LDHAVLAVGYG+++GK YW +KNSWS YWGN G
Sbjct: 468 IDASHKTFSFYSHGIYYESTCGNIEESLDHAVLAVGYGKINGKDYWLIKNSWSNYWGNDG 527

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS + NNCGV+T PTYVTM
Sbjct: 528 YILMSQEKNNCGVLTTPTYVTM 549



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/142 (42%), Positives = 83/142 (58%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QLS  G  G S+K+AP+TTE+  N+ +C    G  +  I+PQ +LPD +  +  
Sbjct: 65  GMVKTFQLSHKGEYGKSIKIAPVTTESVTNEETCLEVNGTKEIKIQPQVILPDTTSMQC- 123

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE+VNG++ EKWRL     EK++KY+LW+RY K+ 
Sbjct: 124 -----------------------IGEEIVNGLLCEKWRLEQQIDEKINKYTLWIRYKKSP 160

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IPVRYEMKG+NSLL
Sbjct: 161 HLPNFKEPIPVRYEMKGFNSLL 182


>gi|195128649|ref|XP_002008774.1| GI11630 [Drosophila mojavensis]
 gi|193920383|gb|EDW19250.1| GI11630 [Drosophila mojavensis]
          Length = 547

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 149/223 (66%), Positives = 178/223 (79%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSL---LDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + + ++    DQSVCGSCWSFGT G +EGAY++K+   L  LSQQALIDC
Sbjct: 325 KDEIPSQYDWRLFGAVSPVKDQSVCGSCWSFGTVGHLEGAYFLKNGGNLVRLSQQALIDC 384

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SWGYGNNGCDGGEDFR YQW+M+  G+PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 385 SWGYGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 444

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLA+ KHGP+SVAIDAS K+FSFY NGVYY+ +C N  DGLDHAVLAVGYG 
Sbjct: 445 TSNDPNAFKLAMLKHGPLSVAIDASPKTFSFYSNGVYYEPECKNDVDGLDHAVLAVGYGT 504

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 505 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 547



 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G +LK+AP++T+ ELNK+SC    G  D  +E QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGEGQYGTALKLAPISTKEELNKLSCLQVNGTADQVVEIQSILPDAKPFAL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
                                  IG E   G   +K+RL  + G+K + Y+LWVRY K+ 
Sbjct: 123 -----------------------IGTESFLGYTCDKFRLEQTIGQKKNVYTLWVRYKKSP 159

Query: 133 ----SKDAIPVRYEMKGYNSLL 150
               S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPGSRMPIPVRYEMRGYNTLL 181


>gi|195020395|ref|XP_001985188.1| GH14645 [Drosophila grimshawi]
 gi|193898670|gb|EDV97536.1| GH14645 [Drosophila grimshawi]
          Length = 549

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 149/223 (66%), Positives = 178/223 (79%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSL---LDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 KDEIPSQYDWRLYGAVSPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SWGYGNNGCDGGEDFR YQW+M+  G+PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWGYGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N  DGLDHAVLAVGYG 
Sbjct: 447 TANDPNAFKLALLKHGPISVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G +LK+ P+TT+ ELNK+SC    G  +  ++ QS+LPD   F++ 
Sbjct: 64  GMVKTYQLAGEGQFGTALKLTPITTKDELNKLSCLLVNGTAEQPVQIQSILPDAKPFEL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  IG E   G   +K+RL +  G+K + Y+LWVRY K  
Sbjct: 123 -----------------------IGTETFLGYTCDKFRLESVIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              AS+  IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181


>gi|282158089|ref|NP_001164088.1| cathepsin L precursor [Tribolium castaneum]
          Length = 552

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 148/228 (64%), Positives = 179/228 (78%), Gaps = 5/228 (2%)

Query: 129 YNKASKDAIPVRYE---MKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQ 184
           Y   +K+ +P +++   +     + DQSVCGSCWSFGT G VEGA ++ +  +L  LSQQ
Sbjct: 325 YENINKEDLPDQWDWRLLGAVTPVKDQSVCGSCWSFGTVGTVEGALFLHNGGRLFRLSQQ 384

Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMT 243
           AL+DCSWGYGNNGCDGGEDFR+YQW++KHG +PT++ YGPYLGQD YCH       A +T
Sbjct: 385 ALVDCSWGYGNNGCDGGEDFRAYQWMLKHGGIPTEEAYGPYLGQDGYCHADKVQKVAKIT 444

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
           G+VNVT N E+AL+LAL KHGP+SVAIDASQ++FSFY NGVYY+ KC N  D LDHAVLA
Sbjct: 445 GYVNVTTNDENALRLALFKHGPISVAIDASQRTFSFYSNGVYYEPKCGNKIDELDHAVLA 504

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           VGYG ++G+ YW VKNSWS YWGN GY+LMS KDNNCGVMT PTYVTM
Sbjct: 505 VGYGTINGENYWLVKNSWSNYWGNDGYILMSSKDNNCGVMTTPTYVTM 552



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 83/141 (58%), Gaps = 29/141 (20%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           MVKTYQLS  G  G SLK+AP+TTETE N  +C    G  + TIEPQS+LPD++ F+   
Sbjct: 68  MVKTYQLSTEGPYGSSLKIAPVTTETETNSRTCLQVNGTEENTIEPQSILPDLTGFEC-- 125

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRY----- 129
                                  GEE ++G++ EKW L  + G+KV+KY++W+ Y     
Sbjct: 126 ----------------------TGEETISGVLTEKWALSQTIGQKVNKYAMWLYYKDDPD 163

Query: 130 NKASKDAIPVRYEMKGYNSLL 150
           N   K A+PV+YEM+G+NSLL
Sbjct: 164 NTGVKIAVPVKYEMRGFNSLL 184


>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
 gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
          Length = 549

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 147/222 (66%), Positives = 178/222 (80%), Gaps = 5/222 (2%)

Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCS 190
           D IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDCS
Sbjct: 328 DDIPSQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCS 387

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
           WG+GNNGCDGGEDFR YQW++K G +PT+++YGPYLGQD YCH+ N T  A +TGFVNVT
Sbjct: 388 WGFGNNGCDGGEDFRVYQWMLKSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPITGFVNVT 447

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
            N  +A K+AL KHGP+SVAIDAS K+FSFY +GVYY+ +C N  DGLDHAVLAVGYG +
Sbjct: 448 SNDPNAFKIALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGTI 507

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           +G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 508 NGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/142 (42%), Positives = 77/142 (54%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT  +LNK++C    G  D  +E QS+LPD   F   
Sbjct: 64  GMVKTYQLAGEGDFGTMLKLAPITTSDDLNKLTCLQVNGTTDQKVEIQSILPDAKPF--- 120

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                T IG E   G   +K+RL  S G+K + Y+LWVRY K  
Sbjct: 121 ---------------------TLIGTETFLGYTCDKFRLEQSIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|209693437|ref|NP_001129411.1| cathepsin K-1 precursor [Acyrthosiphon pisum]
          Length = 558

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 143/228 (62%), Positives = 175/228 (76%), Gaps = 5/228 (2%)

Query: 129 YNKASKDAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           YNK     +P   + +       + DQS+CGSCWSFGTTG +EGAY+MKH++    S+QA
Sbjct: 331 YNKTDFKNLPSEIDWRIAGAVTPVKDQSICGSCWSFGTTGTIEGAYFMKHRERKSFSEQA 390

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTT--ATATMT 243
           L+DCSWG+GNNGCDGGEDFR+Y WI KHGLPT+++YGPYL QD +CHIAN +      +T
Sbjct: 391 LVDCSWGFGNNGCDGGEDFRAYSWIEKHGLPTEEEYGPYLSQDGFCHIANISNGDLTKIT 450

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
           GFVNVT   E+ALK+AL   GP+S+AI AS K+FSFY NGVYYD+ C NSP  L+HAVL 
Sbjct: 451 GFVNVTTYDEEALKIALVNEGPISIAIYASLKTFSFYSNGVYYDKDCRNSPLELNHAVLL 510

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           VGYG LDG PYW VKNSWST+WGN GYVL+S KDNNCGV+++PTYV +
Sbjct: 511 VGYGTLDGHPYWLVKNSWSTHWGNAGYVLISRKDNNCGVLSSPTYVIL 558



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 72/137 (52%), Gaps = 26/137 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GM KTYQLS  G+ G SLK+ P+TTETE N + C    G  +  I PQS LPD+SDFK+ 
Sbjct: 79  GMAKTYQLSNKGSFGSSLKIVPVTTETETNTLKCLEVNGTDEFHISPQSTLPDLSDFKL- 137

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  + E L NGI++EKW  +    + V KY +W+   +  
Sbjct: 138 -----------------------VSESLENGIVVEKWVYIKRVDKTVGKYIMWLY--RKD 172

Query: 134 KDAIPVRYEMKGYNSLL 150
             A P+R++M G+NS+ 
Sbjct: 173 DIAYPMRFQMNGFNSVF 189


>gi|332374780|gb|AEE62531.1| unknown [Dendroctonus ponderosae]
          Length = 544

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 145/220 (65%), Positives = 178/220 (80%), Gaps = 5/220 (2%)

Query: 137 IPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWG 192
           +P +++ + Y ++    DQSVCGSCWSFGT G VEGA+++K+   L  LSQQALIDCSWG
Sbjct: 324 LPDQWDWRLYGAVTPVKDQSVCGSCWSFGTIGTVEGAFFLKNGGHLVRLSQQALIDCSWG 383

Query: 193 YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           +GNNGCDGGEDFR+YQWI KHG +PT++DYGPYLGQD YCH       A ++ + NVT N
Sbjct: 384 FGNNGCDGGEDFRAYQWIKKHGGIPTEEDYGPYLGQDGYCHADKLPKVAKLSSWTNVTTN 443

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
            E+AL+LA+ KHGP+SVAIDASQ++FSFY NGVY++EKC+N  D LDHAVLAVGYG ++G
Sbjct: 444 DENALRLAIFKHGPISVAIDASQRTFSFYSNGVYFEEKCHNKVDELDHAVLAVGYGSING 503

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
             YW VKNSWS YWGN GYVLMS K+NNCGVM+APTYVT+
Sbjct: 504 NHYWLVKNSWSNYWGNDGYVLMSSKNNNCGVMSAPTYVTL 543



 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 83/138 (60%), Gaps = 27/138 (19%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           MVKTYQL K+G+ G SLK+AP+TTE E+N  +C    G  +  I PQ+VLP+++ F+   
Sbjct: 63  MVKTYQLGKDGS-GTSLKIAPVTTEEEVNIRTCLQVNGSQEQPIRPQTVLPELTGFE--- 118

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--A 132
                                Y+G+E++ GI  +KW L T+ GEK +KY LWV + +   
Sbjct: 119 ---------------------YLGDEVIEGIQTQKWNLTTTVGEKENKYFLWVTFKEDGD 157

Query: 133 SKDAIPVRYEMKGYNSLL 150
            K A+PVRYEM+G+NSLL
Sbjct: 158 QKIAVPVRYEMRGFNSLL 175


>gi|195379510|ref|XP_002048521.1| GJ11312 [Drosophila virilis]
 gi|194155679|gb|EDW70863.1| GJ11312 [Drosophila virilis]
          Length = 549

 Score =  313 bits (801), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + + ++    DQSVCGSCWSFGT G +EGAY++K+   L  LS QALIDC
Sbjct: 327 KDEIPPQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAYFLKNGGNLVRLSTQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SWGYGNNGCDGGEDFR YQW+M+  G+PT++ YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWGYGNNGCDGGEDFRVYQWVMQVGGVPTEEQYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G +LK+AP++T  ELNK+SC    G  +  +E QS+LPD   F++ 
Sbjct: 64  GMVKTYQLAGEGQYGTALKLAPVSTRDELNKLSCLQVNGTAEQPVEIQSILPDAKPFEL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  IG E   G   +K+RL  + G+K + Y+LWVRY K  
Sbjct: 123 -----------------------IGTETFMGYTCDKFRLEQTIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|389611850|dbj|BAM19484.1| cathepsin L [Papilio xuthus]
          Length = 342

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 161/289 (55%), Positives = 199/289 (68%), Gaps = 10/289 (3%)

Query: 71  KVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYN 130
           ++NI+R   LR   H N + R  +T     L +    E   L        S   L   Y 
Sbjct: 56  RLNIFRQN-LR-YIHSNNRARRGFTLAVNHLADWAEDELAALRGRRYSGPSPLGLQFPYG 113

Query: 131 KASKDA----IPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLS 182
           +A  +     +P  ++ + + ++    DQSVCGSCWSFGT GAVEGA ++ +   L  LS
Sbjct: 114 EAELEEMSVKLPPEFDWRLFGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHLVRLS 173

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATM 242
           QQ LIDCSWG+GNNGCDGGEDFR+YQWIMKHGLPT++DYG YLGQD YCH+ N T   ++
Sbjct: 174 QQTLIDCSWGFGNNGCDGGEDFRAYQWIMKHGLPTEEDYGDYLGQDGYCHVDNVTLVTSI 233

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            G+VNVT N+E+AL+LA+ KHGP+SVAIDAS KSFSFY +GVYY+ KC N  D LDHAVL
Sbjct: 234 KGWVNVTTNNENALRLAIFKHGPISVAIDASHKSFSFYSHGVYYEPKCKNKIDELDHAVL 293

Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           AVGYG L+G+ YW VKNSWS  WGN GYVLMS +DNNCGV +APTYV +
Sbjct: 294 AVGYGVLNGQKYWLVKNSWSNLWGNDGYVLMSSRDNNCGVESAPTYVLI 342


>gi|195590156|ref|XP_002084812.1| GD14469 [Drosophila simulans]
 gi|194196821|gb|EDX10397.1| GD14469 [Drosophila simulans]
          Length = 549

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+  C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549



 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+TE NK++C    G  D T++ QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTADQTVDIQSILPDAKPFSL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL ++ G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|195327474|ref|XP_002030443.1| GM25442 [Drosophila sechellia]
 gi|194119386|gb|EDW41429.1| GM25442 [Drosophila sechellia]
          Length = 549

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+  C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549



 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+TE NK++C    G  + TI+ QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTANQTIDIQSILPDAKPFSL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL ++ G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|20428641|ref|NP_620470.1| 26-29kD-proteinase [Drosophila melanogaster]
 gi|6448467|dbj|BAA86910.1| homologue of Sarcophaga 26,29kDa proteinase [Drosophila
           melanogaster]
 gi|7294432|gb|AAF49777.1| 26-29kD-proteinase [Drosophila melanogaster]
 gi|21483518|gb|AAM52734.1| RE18380p [Drosophila melanogaster]
          Length = 549

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 148/223 (66%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 KDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+  C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+TE NK++C    G  D  ++ QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGEGQYGTLLKLAPITTKTENNKLTCLQVNGTADQAVDIQSILPDAKPFSL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL ++ G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|194749983|ref|XP_001957411.1| GF24054 [Drosophila ananassae]
 gi|190624693|gb|EDV40217.1| GF24054 [Drosophila ananassae]
          Length = 549

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 146/223 (65%), Positives = 177/223 (79%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + + ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 KDEIPTQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++  G+PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQVGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGYGT 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549



 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+TE NK++C    G  +  ++ QS+LPD   F++ 
Sbjct: 64  GMVKTYQLAGEGDFGTMLKLAPITTKTETNKLTCLQVNGTKNEAVQIQSILPDAKPFQL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL +  G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTETFLGYTCDKFRLESVIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              AS+  IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181


>gi|83944666|gb|ABC48937.1| cathepsin L like protease [Glossina morsitans morsitans]
 gi|289739541|gb|ADD18518.1| cathepsin L-like protease [Glossina morsitans morsitans]
          Length = 550

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           KD IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGAY++K+   L  LSQQALIDC
Sbjct: 328 KDQIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHIEGAYFLKNGGNLVRLSQQALIDC 387

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFRSYQW+M+ G +PT+++YGPYLGQD YCH  N T  A + GFVNV
Sbjct: 388 SWQYGNNGCDGGEDFRSYQWMMQMGGVPTEEEYGPYLGQDGYCHAQNATLVAPIKGFVNV 447

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  DA K+AL  HGP+SVAIDAS K+FSFY +GVYY+  C NS D LDHAVLAVGYG+
Sbjct: 448 TSNDPDAFKIALLNHGPLSVAIDASPKTFSFYSHGVYYEPSCKNSMDSLDHAVLAVGYGK 507

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++ + YW +KNSWSTYWGN GYVLMS ++NNCGVMT PT+V M
Sbjct: 508 MNDEHYWLIKNSWSTYWGNDGYVLMSARENNCGVMTTPTFVEM 550



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+     G SLK+AP+T + +LNK++C    G  D  I+ QSVLP+ SDF++ 
Sbjct: 64  GMVKTYQLAGEDQYGTSLKLAPVTNKNQLNKVTCLQLNGTADNRIQIQSVLPNASDFQL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
                                  +G E   G++ +K+RL  +  +K + Y+LWVRY K+ 
Sbjct: 123 -----------------------VGTETFLGLVCDKFRLEETIEKKKNVYTLWVRYKKSP 159

Query: 133 ----SKDAIPVRYEMKGYNSLL 150
               S+  +PVRYEM+GYN+LL
Sbjct: 160 HYPGSRMPMPVRYEMRGYNTLL 181


>gi|194870649|ref|XP_001972693.1| GG15663 [Drosophila erecta]
 gi|190654476|gb|EDV51719.1| GG15663 [Drosophila erecta]
          Length = 549

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           +D IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 QDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+  C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+     G  LK+AP+TT+TE NK++C    G  D  +E QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGENQYGTLLKLAPITTKTENNKLTCLQVNGTADQAVEIQSILPDAKPFSL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL ++ G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNIYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|195494228|ref|XP_002094747.1| GE21992 [Drosophila yakuba]
 gi|194180848|gb|EDW94459.1| GE21992 [Drosophila yakuba]
          Length = 549

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 147/223 (65%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           +D IP +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 QDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR YQW+++ G +PT+++YGPYLGQD YCH+ N T  A + GFVNV
Sbjct: 387 SWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+  C N  DGLDHAVLAVGYG 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS K NNCGVMT PTYV M
Sbjct: 507 INGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTYVEM 549



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 80/142 (56%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+TE NK++C    G  D T++ QS+LPD   F + 
Sbjct: 64  GMVKTYQLAGEGQFGTLLKLAPITTKTENNKLTCLQVNGTADQTVDIQSILPDAKPFSL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+RL ++ G+K + Y+LWVRY K  
Sbjct: 123 -----------------------VGTESFLGYTCDKFRLESTIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              +S+  IPVRYEM+GYN+LL
Sbjct: 160 HYPSSRMPIPVRYEMRGYNTLL 181


>gi|321478980|gb|EFX89936.1| hypothetical protein DAPPUDRAFT_309603 [Daphnia pulex]
          Length = 584

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 142/228 (62%), Positives = 178/228 (78%), Gaps = 4/228 (1%)

Query: 128 RYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
            Y K+  + +P + + + Y ++    DQSVCGSCWSFGT G +EGA ++K  KL  LSQQ
Sbjct: 357 HYTKSDVENLPEQMDWRLYGAVTPVKDQSVCGSCWSFGTVGTLEGALFLKTGKLTPLSQQ 416

Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMT 243
           AL+DCSWG+GNNGCDGGEDFR YQW+MKHG +P+++ YGPYLG D YCH+ N T  A++ 
Sbjct: 417 ALVDCSWGFGNNGCDGGEDFRVYQWMMKHGGIPSEESYGPYLGADGYCHVDNATLVASIK 476

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
           G+VNVT    DAL++A+ K+GP+SVAIDA+ ++FSFY NGVYY+ +C +  D LDHAVLA
Sbjct: 477 GYVNVTSGDVDALRVAIFKYGPISVAIDAAHRAFSFYANGVYYNPECGSGEDSLDHAVLA 536

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           VGYG L G+PYW VKNSWSTYWGN GYVLMS K+NNCGV T+PTYV M
Sbjct: 537 VGYGILKGEPYWLVKNSWSTYWGNSGYVLMSQKENNCGVATSPTYVIM 584



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/140 (40%), Positives = 82/140 (58%), Gaps = 27/140 (19%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GM KTYQL+  G  G + K+APMTTET  N+++CF   G  D  +  Q VLPD+S+FK+ 
Sbjct: 102 GMEKTYQLANEGPYGTAKKIAPMTTETVRNQLNCFQTSGSADEPVTVQGVLPDISNFKL- 160

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA- 132
                                  IG E++NG+  EKW  + S G+K ++Y++W+++  A 
Sbjct: 161 -----------------------IGTEIMNGVSCEKWEYINSVGDKTNRYTMWLQHQFAG 197

Query: 133 --SKDAIPVRYEMKGYNSLL 150
              + A+PVRYEMKG+NSLL
Sbjct: 198 TRQQIAVPVRYEMKGFNSLL 217


>gi|357624871|gb|EHJ75484.1| putative 26,29kDa proteinase [Danaus plexippus]
          Length = 553

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 142/202 (70%), Positives = 165/202 (81%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           DQSVCGSCWSFGT GAVEGA ++ +   L  LSQQAL+DCSWG+GNNGCDGGED+R+YQW
Sbjct: 352 DQSVCGSCWSFGTVGAVEGALFLSNGGHLVRLSQQALVDCSWGFGNNGCDGGEDYRAYQW 411

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           IM+HGLPT+DDYG YLGQD YCH+ N T    M G+VNVT  +E+ALKLA+ KHGPVSVA
Sbjct: 412 IMRHGLPTEDDYGGYLGQDGYCHMENVTVATKMKGWVNVTAKNENALKLAIFKHGPVSVA 471

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY NGVY++ KC NS + LDHAVLAVG+G L+G  YW VKNSWS  WGN G
Sbjct: 472 IDASHKTFSFYSNGVYFEPKCKNSVEELDHAVLAVGFGVLNGHKYWLVKNSWSNMWGNDG 531

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           YVLMS +D+NCGV  APTYV +
Sbjct: 532 YVLMSARDDNCGVQAAPTYVII 553



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 81/141 (57%), Gaps = 29/141 (20%)

Query: 15  MVKTYQLSKN--GTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
           MVKTYQLS +     G S+K+AP+TTE  LN+ +C    G     I  Q+VLPD++DFK 
Sbjct: 67  MVKTYQLSASVYPQYGTSIKIAPVTTEHVLNQDTCLQVNGTEGENINIQTVLPDMTDFK- 125

Query: 73  NIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA 132
                                  ++G E +      KWR+VTS G+KV+KY++WV+Y K+
Sbjct: 126 -----------------------FVGTETMKDSDTFKWRMVTSVGDKVNKYTMWVKYRKS 162

Query: 133 SKD---AIPVRYEMKGYNSLL 150
            +    AIPVRYEMKG+NSLL
Sbjct: 163 LRGDNIAIPVRYEMKGFNSLL 183


>gi|4521167|dbj|BAA76272.1| 26,29kDa proteinase [Sarcophaga peregrina]
          Length = 550

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 144/223 (64%), Positives = 176/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           +D +P +Y+ + Y ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 328 RDTVPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC 387

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW YGNNGCDGGEDFR+Y+W+M+ G +PT+++YGPYLGQD YCH  N +  A +TGF NV
Sbjct: 388 SWEYGNNGCDGGEDFRAYKWMMEMGGVPTEEEYGPYLGQDGYCHAKNVSLVAPITGFFNV 447

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           TPN   ALK+AL KHGP+SVAIDAS K+FSFY +GVYY+  C N  D LDHAVLAVGYG 
Sbjct: 448 TPNDPMALKIALLKHGPISVAIDASPKTFSFYSHGVYYEPTCKNGLDELDHAVLAVGYGT 507

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           ++G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 508 INGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 550



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 81/142 (57%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+     G SLK+AP+TT++ELNK++C    G  +  ++ Q++LP+  DFK+ 
Sbjct: 64  GMVKTYQLANEHPFGTSLKLAPITTKSELNKVTCLQLNGTSEDPVQVQAILPNAKDFKL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  IG E   G   +K+RL  + G+K + Y+LWVRY K  
Sbjct: 123 -----------------------IGTETFLGFNCDKFRLEETIGDKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              AS+  IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181


>gi|125979159|ref|XP_001353612.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
 gi|54642377|gb|EAL31126.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 144/223 (64%), Positives = 175/223 (78%), Gaps = 5/223 (2%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           +D IP +Y+ + + ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 327 QDEIPAQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGDLVRLSQQALIDC 386

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW +GNNGCDGGEDFR YQW+M+  G+PT+++YGPYLGQD YC I N T  A + GFVNV
Sbjct: 387 SWPFGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCRINNVTLVAPIKGFVNV 446

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+SVAIDAS K+FSFY +GVYY+ +C N  DGLDHAVLAVG+G 
Sbjct: 447 TSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPECKNDVDGLDHAVLAVGFGT 506

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           + G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 507 IKGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 549



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 79/142 (55%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+  G  G  LK+AP+TT+ E+NK++C    G  +  +E QS+LPD   FK+ 
Sbjct: 64  GMVKTYQLAGEGDYGTMLKLAPITTQQEMNKLTCLQVNGTSEQKVETQSILPDAKPFKL- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  IG E   G   +K+RL  + G+K + Y+LWVRY K  
Sbjct: 123 -----------------------IGTETFLGFTCDKFRLEETIGQKKNVYTLWVRYKKSP 159

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              AS+  IPVRYEM+GYN+LL
Sbjct: 160 HYPASRMPIPVRYEMRGYNTLL 181


>gi|158284547|ref|XP_307325.4| Anopheles gambiae str. PEST AGAP012577-PA [Anopheles gambiae str.
           PEST]
 gi|157021017|gb|EAA03137.4| AGAP012577-PA [Anopheles gambiae str. PEST]
          Length = 547

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 139/202 (68%), Positives = 162/202 (80%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +EGAY+ K +KL   SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 405

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P +D+YG YLGQD YCH+ N T  A +TG+VNVT    DA KLAL KHGP+S+A
Sbjct: 406 MEVGGVPMEDEYGGYLGQDGYCHVQNMTLYAPITGWVNVTSGDPDAFKLALFKHGPLSIA 465

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFSFY NGVYY+ +C N  D LDHAVLAVGYG+L+G+ YW +KNSWS YWGN G
Sbjct: 466 IDAGHKSFSFYSNGVYYEPECKNKLDELDHAVLAVGYGQLNGEDYWLIKNSWSNYWGNDG 525

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++KDNNCG+ T  TYV M
Sbjct: 526 YALMAVKDNNCGLTTDATYVLM 547



 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 78/142 (54%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+K G  G+SLK+AP+TT+ ++NK +C    G     I+ Q +LP V DF++ 
Sbjct: 62  GMVKTYQLTKQGDYGVSLKLAPVTTQNQMNKETCLQVNGSASYKIDVQGILPYVRDFQL- 120

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                  +G E   G   +K+ L    G+K + Y+LWVRY K  
Sbjct: 121 -----------------------LGTEQCGGYQCDKFGLTEVIGQKRNVYTLWVRYVKSP 157

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              A++  IPVRYEMKGYN+LL
Sbjct: 158 KYPAARMPIPVRYEMKGYNTLL 179


>gi|170064305|ref|XP_001867470.1| cathepsin l [Culex quinquefasciatus]
 gi|167881732|gb|EDS45115.1| cathepsin l [Culex quinquefasciatus]
          Length = 547

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 136/202 (67%), Positives = 164/202 (81%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +E AY++K KKL   SQQAL+DCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTAGHIESAYFLKTKKLMRFSQQALVDCSWGYGNNGCDGGEDFRAYQWM 405

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P++++YG YLGQDAYC + N T  A +TG+VNVT    DA+KLAL KHGP+S+A
Sbjct: 406 MEVGGIPSEEEYGGYLGQDAYCRLENKTLYAPITGWVNVTSGDADAMKLALFKHGPLSIA 465

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG L+G+ YW +KNSWS  WGN G
Sbjct: 466 IDAGHKSFSYYANGVYYEPECKNNLDGLDHAVLAVGYGTLNGEDYWLIKNSWSNLWGNDG 525

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++KDNNCG+ T  TYV M
Sbjct: 526 YALMAMKDNNCGLATDATYVLM 547



 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 78/142 (54%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL+K G  G SLKVAP++T  +LNK +C    G  D  IE QS+LP+  +FK+ 
Sbjct: 62  GMVKTYQLTKAGEYGTSLKVAPVSTRQQLNKETCLQVNGSADNAIEVQSILPNAKEFKLE 121

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                                   G E  NG   +K+ L+   G+K + Y LWVRY K  
Sbjct: 122 ------------------------GTETRNGFTCDKFTLIVDVGQKRNYYYLWVRYIKSP 157

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
              AS+  IPVRYEMKGYN+LL
Sbjct: 158 KYPASRMPIPVRYEMKGYNTLL 179


>gi|443691408|gb|ELT93269.1| hypothetical protein CAPTEDRAFT_181131 [Capitella teleta]
          Length = 541

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 138/202 (68%), Positives = 163/202 (80%), Gaps = 2/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSFGTTG VEGA ++K  KL  LSQQAL+DCSWG GNN CDGGEDFR+YQWI
Sbjct: 341 DQAVCGSCWSFGTTGTVEGANFLKTGKLVRLSQQALMDCSWGEGNNACDGGEDFRAYQWI 400

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHGLP+++DYG YL QD+YCHIAN + T AT++ +VNVT  +E  L+ A+A  GP+SV 
Sbjct: 401 KKHGLPSEEDYGAYLAQDSYCHIANVSNTIATISSYVNVTSTTE-GLRAAIAFKGPISVG 459

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS KS SFY NGVYY+  C N PD LDHAVLAVGYG ++G+ YW +KNSWST+WGN G
Sbjct: 460 IDASHKSLSFYANGVYYEPACGNKPDDLDHAVLAVGYGYMNGQGYWLIKNSWSTHWGNDG 519

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           YVLM+ KDNNCGV TA TYV +
Sbjct: 520 YVLMAQKDNNCGVTTAATYVVL 541



 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 76/139 (54%), Gaps = 30/139 (21%)

Query: 14  GMVKTYQLSKNG--TNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
           G+VKTYQL  +G  T G+S KVAPMTTE  LN+I+CF   G  +A I  Q+VLPD+++F 
Sbjct: 64  GVVKTYQLPGDGVSTFGISRKVAPMTTEEVLNQINCFQVNGTKEAPITIQTVLPDLANF- 122

Query: 72  VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
                                  T  G E++N +    W+     G+K + Y++W+    
Sbjct: 123 -----------------------TKTGHEMMNSLNCTVWQYSVKIGQKKNVYTIWL---- 155

Query: 132 ASKDAIPVRYEMKGYNSLL 150
           A +  +PVRYEM GYNSLL
Sbjct: 156 ADETNLPVRYEMFGYNSLL 174


>gi|91992516|gb|ABE72974.1| cathepsin L [Ochlerotatus atropalpus]
          Length = 313

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 133/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +E AY++K+ KL   SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 111 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 170

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P++++YG YLGQD YC + N T  A + G+VNVT    DA+KLAL KHGP+S+A
Sbjct: 171 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDSDAMKLALFKHGPLSIA 230

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 231 IDAGHKSFSYYANGVYYEPECKNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 290

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++K+NNCG+ T  TYV +
Sbjct: 291 YALMAMKNNNCGLTTDATYVNL 312


>gi|157113282|ref|XP_001657758.1| cathepsin l [Aedes aegypti]
 gi|108877803|gb|EAT42028.1| AAEL006389-PA, partial [Aedes aegypti]
          Length = 538

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 132/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +E AY++K+ KL   SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 336 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 395

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P++++YG YLGQD YC + N T  A + G+VNVT    +A+K+AL KHGP+S+A
Sbjct: 396 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDAEAMKVALFKHGPLSIA 455

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 456 IDAGHKSFSYYANGVYYEPECRNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 515

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++KDNNCG+ T  TYV +
Sbjct: 516 YALMAMKDNNCGLSTDATYVNL 537



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 74/141 (52%), Gaps = 29/141 (20%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           MVKTYQL+K G  G SLKVAP+T    +NK +C    G  D  I  Q +LPD  +FK+  
Sbjct: 53  MVKTYQLTKEGDYGTSLKVAPVTNRESMNKQTCLQVNGSSDNPISVQGILPDTKNFKL-- 110

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--- 131
                                  G E  NG   +K+ LV   G+K + YSLWVRY K   
Sbjct: 111 ----------------------AGTEQRNGFNCDKFVLVDDVGQKRNYYSLWVRYIKSPK 148

Query: 132 --ASKDAIPVRYEMKGYNSLL 150
             AS+  IPVRYEMKGYN+LL
Sbjct: 149 YPASRMPIPVRYEMKGYNTLL 169


>gi|91992512|gb|ABE72972.1| cathepsin L [Aedes aegypti]
          Length = 548

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 132/202 (65%), Positives = 163/202 (80%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +E AY++K+ KL   SQQALIDCSWGYGNNGCDGGEDFR+YQW+
Sbjct: 346 DQSVCGSCWSFGTAGHIESAYFLKYNKLMRFSQQALIDCSWGYGNNGCDGGEDFRAYQWM 405

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P++++YG YLGQD YC + N T  A + G+VNVT    +A+K+AL KHGP+S+A
Sbjct: 406 MEVGGIPSEEEYGGYLGQDGYCRLENKTLYAAIDGWVNVTSGDAEAMKVALFKHGPLSIA 465

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFS+Y NGVYY+ +C N+ DGLDHAVLAVGYG+L G+ YW +KNSWS YWGN G
Sbjct: 466 IDAGHKSFSYYANGVYYEPECRNNLDGLDHAVLAVGYGKLKGEDYWLIKNSWSNYWGNDG 525

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++KDNNCG+ T  TYV +
Sbjct: 526 YALMAMKDNNCGLSTDATYVNL 547



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 74/141 (52%), Gaps = 29/141 (20%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           MVKTYQL+K G  G SLKVAP+T    +NK +C    G  D  I  Q +LPD  +FK+  
Sbjct: 63  MVKTYQLTKEGDYGTSLKVAPVTNRESMNKQTCLQVNGSSDNPISVQGILPDTKNFKL-- 120

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK--- 131
                                  G E  NG   +K+ LV   G+K + YSLWVRY K   
Sbjct: 121 ----------------------AGTEQRNGFNCDKFVLVDDVGQKRNYYSLWVRYVKSPK 158

Query: 132 --ASKDAIPVRYEMKGYNSLL 150
             AS+  IPVRYEMKGYN+LL
Sbjct: 159 YPASRMPIPVRYEMKGYNTLL 179


>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
          Length = 375

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 136/202 (67%), Positives = 159/202 (78%), Gaps = 2/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFGT G +EGAY+ K +KL   SQQAL+DCSWGYGNNGCDGGEDFR+YQWI
Sbjct: 175 DQSVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALVDCSWGYGNNGCDGGEDFRAYQWI 234

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M+  G+P +D+Y  YLGQD YC + N T  A +TG+VNVT    DA K+AL KHGP+S+A
Sbjct: 235 MQVGGVPMEDEY-EYLGQDGYCRVENVTLYAPITGWVNVTSGDPDAFKVALFKHGPLSIA 293

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  KSFSFY NGVYY+ +C N  D LDHAVLAVGYG L+G+ YW +KNSWS YWGN G
Sbjct: 294 IDAGHKSFSFYSNGVYYEPECKNKLDELDHAVLAVGYGNLNGEDYWLIKNSWSNYWGNDG 353

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y LM++KDNNCG+ T  TYV M
Sbjct: 354 YALMAVKDNNCGLTTDATYVLM 375


>gi|328722454|ref|XP_001951172.2| PREDICTED: counting factor associated protein D-like [Acyrthosiphon
           pisum]
          Length = 558

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 131/228 (57%), Positives = 169/228 (74%), Gaps = 5/228 (2%)

Query: 129 YNKASKDAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           YNK     +P   + +      ++ DQS+CGSCWSF T G +EGAY++K+++    S+QA
Sbjct: 331 YNKTDFQNLPSEIDWRISGAVTAVKDQSICGSCWSFSTIGTIEGAYFIKYRERKNFSEQA 390

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTA--TATMT 243
           L+DCS G+GN+GCDGGEDFR+Y WI KHGLPT+++YGPYL     CHIAN +      +T
Sbjct: 391 LVDCSLGFGNDGCDGGEDFRAYSWIEKHGLPTEEEYGPYLFSGGVCHIANISNGNLTKIT 450

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
            FVNVT   E+ALK+AL   GP+S+AI AS K+F FY NGVYYD++C N+P  L+HAVL 
Sbjct: 451 SFVNVTTYDEEALKIALVNEGPISIAIYASLKTFFFYSNGVYYDKECCNNPLELNHAVLL 510

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           VGYG LDG PYW VKNSWST+WGN GY+L+S +DNNCGV+T+PTYV +
Sbjct: 511 VGYGTLDGHPYWLVKNSWSTHWGNSGYILISRRDNNCGVLTSPTYVIL 558



 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/137 (34%), Positives = 71/137 (51%), Gaps = 26/137 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GM KTYQLS  G+ G  LK+ P+TTET+ N + C    G  +  I PQS LPD+SDF+  
Sbjct: 79  GMAKTYQLSNKGSFGSYLKIVPVTTETKTNTLKCLEVNGSDEFHILPQSTLPDLSDFR-- 136

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                 ++ E L +GI++EKW  +    + V KY +W+   +  
Sbjct: 137 ----------------------FVSESLEDGIVVEKWVYIKRVEKTVRKYIMWLY--RKD 172

Query: 134 KDAIPVRYEMKGYNSLL 150
             A P++++M G+NS+ 
Sbjct: 173 DIAYPMKFQMNGFNSVF 189


>gi|156384930|ref|XP_001633385.1| predicted protein [Nematostella vectensis]
 gi|156220454|gb|EDO41322.1| predicted protein [Nematostella vectensis]
          Length = 548

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 139/240 (57%), Positives = 166/240 (69%), Gaps = 16/240 (6%)

Query: 128 RYNKASKDAIPVRYEMKGYNSL---------------LDQSVCGSCWSFGTTGAVEGAYY 172
           RY K     +P + +M   N +                DQ+VCGSCWSFGTTG +EG  +
Sbjct: 308 RYTKGYNGGLPYKPDMASINDVPDEMNWVIRGAVTPVKDQAVCGSCWSFGTTGTIEGTLF 367

Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYC 231
           +K K L  LSQQ L+DCSWG GNN CDGGEDFRSYQ+IMK G + T++ YGPYLG D YC
Sbjct: 368 LKTKYLTRLSQQNLMDCSWGEGNNACDGGEDFRSYQYIMKSGGIATEESYGPYLGADGYC 427

Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
           H  +    AT+TG+VN+T     ALK A+A+ GP+SV+IDAS KS SFY  GVYY+ KC 
Sbjct: 428 HKKDAEIGATITGYVNITEGDLSALKTAIAQKGPISVSIDASHKSLSFYSYGVYYEPKCG 487

Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           N  + LDH+VLAVGYG +DGKPYW +KNSWST+WG  GYVLMS KDNNCGV TA TYV M
Sbjct: 488 NKNEDLDHSVLAVGYGTMDGKPYWMIKNSWSTHWGMNGYVLMSQKDNNCGVATAATYVLM 547



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 27/137 (19%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GM  T+Q     T G + ++ PM+TET+LN  SCF   G   + + PQSVLPD++ F   
Sbjct: 69  GMDSTFQRGDILTAGANFRICPMSTETKLNVRSCFQTNGTSASPVGPQSVLPDLAKF--- 125

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                +++G      +    W    + G K +KY++   Y    
Sbjct: 126 ---------------------SFVGNSTYGDVKCSIWEYKLTVGHKRNKYTM---YTSMH 161

Query: 134 KDAIPVRYEMKGYNSLL 150
               P+ YEM G++ L+
Sbjct: 162 DPPAPIHYEMLGFDDLI 178


>gi|348542138|ref|XP_003458543.1| PREDICTED: counting factor associated protein D-like [Oreochromis
           niloticus]
          Length = 551

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 128/202 (63%), Positives = 161/202 (79%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSF TTGAVEGA ++K   L VLSQQ LIDCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 349 DQAICGSCWSFATTGAVEGALFVKTGSLQVLSQQMLIDCSWGFGNNGCDGGEEWRAYEWI 408

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T + YG Y+G + +CH+ ++  TA +  + NVT   + ALK+AL K+GPV+V+
Sbjct: 409 MKHGGIATTETYGAYMGMNGFCHVDSSELTARIQSYTNVTSGDQLALKMALFKNGPVAVS 468

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY +GVYY+  C N+ D LDHAVLAVGYG L G+PYW +KNSWSTYWGN G
Sbjct: 469 IDASHRSFVFYSHGVYYEPACGNTVDDLDHAVLAVGYGTLSGEPYWLIKNSWSTYWGNDG 528

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  TYVT+
Sbjct: 529 YILMSMKDNNCGVATDATYVTL 550



 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V TYQL+     G + K++P TTE ELN + CF   G  D  + PQ+ LPD+ DFK  
Sbjct: 67  GQVLTYQLASEQQYGTAYKISPETTEEELNVMKCFQINGTKDEAVTPQAALPDLQDFK-- 124

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                FLR  ++                  G + + W+ VT+ G K + Y+LW+ +++  
Sbjct: 125 -----FLRVEYYR-----------------GSLCQVWQNVTTVGYKKNTYTLWMTHSERG 162

Query: 134 KDA-----IPVRYEMKGYNSLL 150
            D      IP+ YEM GYN+LL
Sbjct: 163 ADGGKDTPIPIHYEMMGYNTLL 184


>gi|209155876|gb|ACI34170.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
          Length = 551

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 126/202 (62%), Positives = 157/202 (77%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSF TTGAVEGA ++    L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 349 DQAICGSCWSFATTGAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 408

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T + YG Y+G +  CH   +  TA +  + NVT    +ALK+AL KHGPV+V+
Sbjct: 409 MKHGGIATTETYGSYMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVS 468

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  +SF FY +GVYY+ KC N+ D LDHAVLAVGYG ++ +PYW VKNSWSTYWGN G
Sbjct: 469 IDAGHRSFVFYSHGVYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDG 528

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  TYVT+
Sbjct: 529 YILMSMKDNNCGVTTDATYVTL 550



 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 65/142 (45%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V TYQ+      G S K+ P TTE ELN + CF   G  +  + PQ  LPD+  F   
Sbjct: 67  GQVSTYQMGMELPWGSSYKITPETTEVELNIMKCFQVNGTKEEPVTPQGSLPDLQGF--- 123

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK-- 131
                FLR  ++                  G++ E W+ VT  G K + Y+LWV   +  
Sbjct: 124 ----MFLRMEYY-----------------GGVLCEVWQNVTVVGHKKNTYTLWVTRPEGG 162

Query: 132 ---ASKDAIPVRYEMKGYNSLL 150
               ++   P+ YEM GYN+LL
Sbjct: 163 VMGGAEVTTPLHYEMMGYNTLL 184


>gi|432881828|ref|XP_004073923.1| PREDICTED: counting factor associated protein D-like [Oryzias
           latipes]
          Length = 563

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 126/202 (62%), Positives = 160/202 (79%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSF TTGA+EGA ++K   L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 361 DQAICGSCWSFATTGAIEGALFLKTGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 420

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T + YG Y+G + +CH+ ++   A +  + NVT    +ALKLAL K+GPV+V+
Sbjct: 421 MKHGGIATTETYGSYMGMNGFCHMNSSELVAPIKSYTNVTSGDAEALKLALFKNGPVAVS 480

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+  C N+ D LDHAVLAVGYG L+G+PYW +KNSWSTYWGN G
Sbjct: 481 IDASHRSFVFYGYGVYYEPACGNTTDDLDHAVLAVGYGTLNGEPYWLIKNSWSTYWGNDG 540

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  TYVT+
Sbjct: 541 YILMSMKDNNCGVTTDATYVTL 562



 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 31/142 (21%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V TYQL+ +   G++ K+ P TTETE N + CF   G  +  ++PQ  LPD+  F+  
Sbjct: 81  GQVSTYQLASHQEFGVAYKITPETTETESNVMKCFQVNGTKEEAVQPQGTLPDLQGFQ-- 138

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                FLR  ++                  G + E W+ VT+ G K + Y+LWV   K+ 
Sbjct: 139 -----FLRMEYY-----------------GGSLCEVWQNVTTVGHKKNTYTLWVA--KSE 174

Query: 134 KD-----AIPVRYEMKGYNSLL 150
           +D     A+P+ YEM GYN+LL
Sbjct: 175 RDGRKGTAVPLHYEMMGYNTLL 196


>gi|332022677|gb|EGI62958.1| Counting factor associated protein D [Acromyrmex echinatior]
          Length = 525

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 129/175 (73%), Positives = 149/175 (85%), Gaps = 1/175 (0%)

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
           + VL  QALIDCSWG+GNNGCDGGEDFRSYQWIMKHG LPT++DYG YLGQD YCHI N 
Sbjct: 351 IKVLILQALIDCSWGFGNNGCDGGEDFRSYQWIMKHGGLPTEEDYGDYLGQDGYCHINNV 410

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
           T TA +TGFVNVTP S +ALK+A+AKHGP+SVAIDAS K+FSFY +GVYYD  C N+ + 
Sbjct: 411 TLTAKITGFVNVTPRSVEALKIAIAKHGPISVAIDASHKTFSFYSHGVYYDPACGNTENK 470

Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           LDHAVLAVGYG ++GK YW +KNSWS YWGN GY+LM+ KD+NCGV+ APTYVTM
Sbjct: 471 LDHAVLAVGYGTMNGKGYWLIKNSWSNYWGNDGYILMAQKDDNCGVLLAPTYVTM 525



 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 85/142 (59%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKT+QL+  G  G S+K+AP+TTET  N+ +C    G  D  I+PQS+LPD  + +  
Sbjct: 70  GMVKTFQLAHKGQYGASIKIAPITTETVSNEETCLQVNGTSDFKIQPQSILPDTPEMEC- 128

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IGEE++NG++ EKWRLV   G+K +KY+LW+RY K+ 
Sbjct: 129 -----------------------IGEEMINGLLCEKWRLVQEIGDKTNKYTLWIRYKKSP 165

Query: 134 -----KDAIPVRYEMKGYNSLL 150
                K+ IP+RYEMKG+NSLL
Sbjct: 166 YLPKVKEPIPMRYEMKGFNSLL 187


>gi|198285481|gb|ACH85279.1| cathepsin l-like [Salmo salar]
          Length = 444

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 126/202 (62%), Positives = 157/202 (77%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSF TTGAVEGA ++    L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WI
Sbjct: 242 DQAICGSCWSFATTGAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWI 301

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T + YG Y+G +  CH   +  TA +  + NVT    +ALK+AL KHGPV+V+
Sbjct: 302 MKHGGIATTETYGSYMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVS 361

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA  +SF FY +GVYY+ KC N+ D LDHAVLAVGYG ++ +PYW VKNSWSTYWGN G
Sbjct: 362 IDAGHRSFVFYSHGVYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDG 421

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  TYVT+
Sbjct: 422 YILMSMKDNNCGVTTDATYVTL 443



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 29/98 (29%)

Query: 58  IEPQSVLPDVSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEG 117
           + PQ  LPD+  F        FLR  ++                  G++ E W+ VT  G
Sbjct: 4   VTPQGSLPDLQGF-------MFLRMEYY-----------------GGVLCEVWQNVTVVG 39

Query: 118 EKVSKYSLWVRYNK-----ASKDAIPVRYEMKGYNSLL 150
            K + Y+LWV   +      ++   P+ YEM GYN+LL
Sbjct: 40  HKKNTYTLWVTRPEGGVMGGAEVTTPLHYEMMGYNTLL 77


>gi|410911058|ref|XP_003969007.1| PREDICTED: counting factor associated protein D-like [Takifugu
           rubripes]
          Length = 549

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 124/202 (61%), Positives = 158/202 (78%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSF TTG +EGA ++K   L VLSQQ L+DCSWG+GNN CDGGE++RSY+WI
Sbjct: 347 DQAICGSCWSFATTGTIEGALFLKTGFLQVLSQQILMDCSWGFGNNACDGGEEWRSYEWI 406

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG +   + YGPY+G + +CH+ ++   A +  + NVT     ALKLAL KHGPV+V+
Sbjct: 407 MKHGGIALAETYGPYMGMNGFCHVNSSELVAQIQSYTNVTSGDAMALKLALFKHGPVAVS 466

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY +GVYY+  C ++ D LDHAVLAVGYG L+G+PYW +KNSWSTYWGN G
Sbjct: 467 IDASHRSFVFYSHGVYYEPACGSTIDDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDG 526

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  T+VT+
Sbjct: 527 YILMSMKDNNCGVATDATFVTL 548



 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 68/142 (47%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V TYQL      G S K++P T E E N + CF   G  +  + PQ+ LP +  F+  
Sbjct: 65  GQVSTYQLGGEKQWGTSYKISPETNEVEQNVMKCFQVNGTKEEAVAPQAALPQLQGFQ-- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKA 132
                FLR  ++                 +G + E W+ VT+ G K + Y+LWV R  +A
Sbjct: 123 -----FLRVEYY-----------------SGSLCEVWQNVTNVGYKKNTYTLWVTRAERA 160

Query: 133 S----KDAIPVRYEMKGYNSLL 150
           +       +PV YEM GYN+LL
Sbjct: 161 TNGTESPVLPVHYEMMGYNTLL 182


>gi|161408099|dbj|BAF94153.1| 26,29 kDa proteinase-like cysteine protease [Plautia stali]
          Length = 521

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 132/187 (70%), Positives = 153/187 (81%), Gaps = 4/187 (2%)

Query: 136 AIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P +Y+ + Y ++    DQSVCGSCWSFGTTGAVEGAY++K   L  LSQQALIDCSWG
Sbjct: 333 AVPDQYDWRLYGAVTPVKDQSVCGSCWSFGTTGAVEGAYFVKTGTLVRLSQQALIDCSWG 392

Query: 193 YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           YGNNGCDGGEDFRSYQWIMKHG LP + DYG YLGQD YCH+  T  TA +TG+VNVT  
Sbjct: 393 YGNNGCDGGEDFRSYQWIMKHGGLPLESDYGGYLGQDGYCHVDKTNLTAKITGYVNVTTG 452

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
            E+ALKLA+ KHGP+SVAIDASQK+FSFY NGVYYD +C N+P+GLDHAVLAVGYG++ G
Sbjct: 453 DENALKLAIFKHGPISVAIDASQKTFSFYSNGVYYDPECKNTPEGLDHAVLAVGYGKIHG 512

Query: 312 KPYWQVK 318
             YW VK
Sbjct: 513 NSYWLVK 519



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 79/139 (56%), Gaps = 26/139 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQ+S  G  G SLK+APMTTE +LN  +CF   G  +  I+ Q++LPD++ F+  
Sbjct: 70  GMVKTYQISDKGKYGTSLKLAPMTTEDQLNVETCFKVDGTIEDKIQAQAILPDLTGFQC- 128

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  IG+E VNG   EKW   T +GEK +KY +++R  K  
Sbjct: 129 -----------------------IGKEWVNGFNCEKWHNETIKGEKSNKYIMYIRRKKTE 165

Query: 134 KDA--IPVRYEMKGYNSLL 150
             +  IPVRYEM+GYNSLL
Sbjct: 166 TGSIIIPVRYEMRGYNSLL 184


>gi|346469497|gb|AEO34593.1| hypothetical protein [Amblyomma maculatum]
          Length = 557

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 143/289 (49%), Positives = 190/289 (65%), Gaps = 12/289 (4%)

Query: 69  DFKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
           D + +I+R  L F+      N   +    ++ +     I + + RL + +G   ++   +
Sbjct: 275 DRRRDIFRQNLRFIDSTNRANLGYQVAVNHLADRTPEEISVMRGRLQSRDGSSTAE--PF 332

Query: 127 VRYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
            RY   +K  +P + + + Y ++    DQ+VCGSCWSFGTTG +EGAY+ K  KL  LS+
Sbjct: 333 PRYKFTAK--LPEQIDWRLYGAVTPVKDQAVCGSCWSFGTTGELEGAYFRKTGKLVRLSE 390

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATM 242
           Q L+DCSW  GNNGCDGGEDFR+Y++I KHGL + +DYG Y+GQD  CH     AT +T+
Sbjct: 391 QQLVDCSWNSGNNGCDGGEDFRAYEYIRKHGLASDEDYGAYIGQDGVCHDTKVNATISTI 450

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
             ++N+T  + D L  ALA  GPVSV+IDA+ +SFSFY NGV+YD KC N  D LDHAVL
Sbjct: 451 KSYINIT--NRDDLLTALANVGPVSVSIDAALRSFSFYSNGVFYDPKCRNDTDSLDHAVL 508

Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           AVGYG L  +PYW +KNSWSTYWGN GYVL+S KDNNCGV T  T V +
Sbjct: 509 AVGYGTLQEQPYWLIKNSWSTYWGNDGYVLISQKDNNCGVATQGTIVEL 557



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 25/146 (17%)

Query: 14  GMVKTYQLSKNGTN------GLSLKVAPMTTETELNKI-SCFHAYGIPDATIEPQSVLPD 66
           G+V+T QL+    +      G++ K+A M        + +CF   G  + T+  Q+VLP+
Sbjct: 62  GIVQTVQLAPGSESAEGYPYGVNYKIAYMPDPKTWESVRTCFQVNGTENVTVSLQNVLPN 121

Query: 67  VSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVN--GIILEKWRLVTSEGEKVSKYS 124
           V+ F             F   E     W+   E L        E+++   S   +VSKY+
Sbjct: 122 VTGFT------------FVRKESC---WSGNDELLAQYGRKQCERFQETVSTEGRVSKYT 166

Query: 125 LWVRYNKASKDAIPVRYEMKGYNSLL 150
           LW+  ++    AIP RY M+GYN+LL
Sbjct: 167 LWLSRDRDGS-AIPRRYLMRGYNTLL 191


>gi|449676370|ref|XP_002156627.2| PREDICTED: counting factor associated protein D-like [Hydra
           magnipapillata]
          Length = 551

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 131/246 (53%), Positives = 175/246 (71%), Gaps = 3/246 (1%)

Query: 109 KWRLVTSEGEK-VSKYSLWVRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGA 166
           + R  + + EK VS    +++  K S     + + + G    + DQ+VCGSCWSFGTTGA
Sbjct: 305 RGRRTSKKSEKNVSNAKPFIKTMKVSDLPTDINWRLFGAVTPVKDQAVCGSCWSFGTTGA 364

Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYL 225
           +EGA ++K  +L  LS+Q L+DCSWG+GNNGCDGGE+FR+Y++IMKHG + T D YG YL
Sbjct: 365 IEGALFLKTGRLVRLSEQNLMDCSWGFGNNGCDGGEEFRAYEYIMKHGGIATDDSYGNYL 424

Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
           G D YCH  ++   A +  +VNVT    DALK+A+ +HGP++V IDA+  +F FY +GVY
Sbjct: 425 GIDGYCHQKSSVIGAKIASYVNVTSGDMDALKMAIVQHGPIAVGIDAAHLAFVFYSHGVY 484

Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTA 345
           Y+ +C N P+ LDHAVLAVGYG  +G+PY  VKNSWST+WGN GYVLMS +DNNCGV T 
Sbjct: 485 YNPECGNKPENLDHAVLAVGYGVQNGEPYTLVKNSWSTHWGNDGYVLMSQRDNNCGVATD 544

Query: 346 PTYVTM 351
            T+V +
Sbjct: 545 ATFVNL 550



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 65/137 (47%), Gaps = 28/137 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G+ KT Q    G  G    VAPM+ E +LN+I+CF   G  +    PQS+LPD+ +F   
Sbjct: 77  GISKTIQRGDKGEFGAHFLVAPMSNEVQLNQINCFELSGTSEEPALPQSLLPDMENF--- 133

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                TY GE+      ++ W+    +  K + Y+L+V+ +   
Sbjct: 134 ---------------------TYTGEKAFMDFKVDIWQYKYGDDNKYNVYTLYVKKDS-- 170

Query: 134 KDAIPVRYEMKGYNSLL 150
             +IPV YEM GY+SLL
Sbjct: 171 --SIPVYYEMIGYDSLL 185


>gi|148970410|gb|ABR20116.1| venom cathepsin cysteine peptidase, partial [Mesobuthus eupeus]
          Length = 225

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 128/202 (63%), Positives = 154/202 (76%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSFGT G +EGAY++K KKL  LSQQ LID SW Y NNGCDGGED+R+Y++I
Sbjct: 19  DQAICGSCWSFGTAGTIEGAYFLKTKKLVRLSQQQLIDGSWKYENNGCDGGEDYRAYRYI 78

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + G L T+DDYGP LGQD  C   N + TA +  +VNVT    DALK A+A HGP+S++
Sbjct: 79  KESGGLATEDDYGPILGQDGICRERNVSKTAKIGSYVNVTSGDLDALKFAIAYHGPISIS 138

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS KS SFY NGVYY+  C N+ D LDH+VLAVGYGEL G+ YW VKNSWSTYWGN G
Sbjct: 139 IDASHKSLSFYANGVYYEPACGNTNDDLDHSVLAVGYGELFGQKYWLVKNSWSTYWGNDG 198

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y L++ KDNNCG  T PT+V +
Sbjct: 199 YFLLNQKDNNCGAATEPTFVVL 220


>gi|405976506|gb|EKC41011.1| Counting factor associated protein D [Crassostrea gigas]
          Length = 349

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 139/242 (57%), Positives = 166/242 (68%), Gaps = 21/242 (8%)

Query: 120 VSKYSL--------WVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
            SKYSL        W  +   S+ A+ PV+          DQ+VCGSCWSFGTTG +EGA
Sbjct: 119 TSKYSLRDLPDSMDWRLHGYLSQRAVTPVK----------DQAVCGSCWSFGTTGTIEGA 168

Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDA 229
           Y++K   L  LSQQ L+DCSWG GNN CDGGEDFR+YQW+MK+G L +++ YGPY  QD 
Sbjct: 169 YFLKTGDLVRLSQQQLMDCSWGEGNNACDGGEDFRAYQWMMKNGGLTSEELYGPYKAQDG 228

Query: 230 YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEK 289
            C+    T    +  +VNVT     ALK A+A  GPVSVAIDAS  S SFY NGVYY+ +
Sbjct: 229 KCN-KTITPIVQLKNYVNVTSGDLQALKFAIAHQGPVSVAIDASHLSLSFYANGVYYEPQ 287

Query: 290 CNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
           C N PD LDHAVLAVGYG ++G+ YW +KNSWSTYWGN GYVLMS KDNNCGV T PT+V
Sbjct: 288 CGNKPDDLDHAVLAVGYGVMNGQAYWLIKNSWSTYWGNDGYVLMSQKDNNCGVATDPTFV 347

Query: 350 TM 351
            +
Sbjct: 348 IV 349


>gi|224081608|ref|XP_002191568.1| PREDICTED: counting factor associated protein D-like [Taeniopygia
           guttata]
          Length = 546

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 125/199 (62%), Positives = 154/199 (77%), Gaps = 1/199 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF TTGA+EGA ++K   L  LSQQ LIDCSWG+GN  CDGGE++R+Y+WI
Sbjct: 344 DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 403

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHG + + + YG Y GQ+  CH   +   A +TG+VNVT  +  A+K A+ KHGPV+V+
Sbjct: 404 KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 463

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS KSFSFY NGVYY+ KC+N+P  LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 464 IDASHKSFSFYSNGVYYEPKCDNTPGSLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 523

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y+LM++KDNNCGV T  TY
Sbjct: 524 YILMAMKDNNCGVATEATY 542



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/137 (34%), Positives = 65/137 (47%), Gaps = 26/137 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V TYQL+     G+  K+ P TTE E+N   CF   G  +  +  QSV P +  FK  
Sbjct: 69  GQVITYQLAAVKPYGMRYKITPETTEKEVNTRKCFQLPGSKEDVVTAQSVFPSMRGFK-- 126

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                FLR  ++++   RY                 W+ +T   +K + Y+LWV    +S
Sbjct: 127 -----FLREEYYKD---RY--------------CAVWQNITRWAQKKNVYTLWV--TNSS 162

Query: 134 KDAIPVRYEMKGYNSLL 150
               PV YEM+GYNSLL
Sbjct: 163 CGVAPVHYEMRGYNSLL 179


>gi|194246075|gb|ACF35529.1| midgut cysteine proteinase 2 [Dermacentor variabilis]
          Length = 235

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 129/202 (63%), Positives = 153/202 (75%), Gaps = 3/202 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSFGT G ++GAY+ K  KL  LS+Q L+DCSW  GNNGCDGGEDFR+Y++I
Sbjct: 36  DQAVCGSCWSFGTVGELKGAYFRKTGKLVRLSEQQLVDCSWNSGNNGCDGGEDFRAYEYI 95

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             HGL + +DYG YLGQD  CH     AT A++ G++N+T  + D L  ALA  GPVSV+
Sbjct: 96  RNHGLASDEDYGAYLGQDGVCHDTKVNATIASIKGYINIT--NRDDLLTALANVGPVSVS 153

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+ +SFSFY NGV+YD  C N  D LDHAVLAVGYG L G+PYW VKNSWSTYWGN G
Sbjct: 154 IDAALRSFSFYSNGVFYDPNCRNDTDSLDHAVLAVGYGTLQGEPYWLVKNSWSTYWGNDG 213

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           YVL+S KDNNCGV T  TYV +
Sbjct: 214 YVLISQKDNNCGVATQGTYVEL 235


>gi|327285051|ref|XP_003227248.1| PREDICTED: counting factor associated protein D-like [Anolis
           carolinensis]
          Length = 547

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 123/202 (60%), Positives = 155/202 (76%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF +TGA+EG+ ++K  +L  LSQQ LIDCSWG+GN  CDGGE++++++W+
Sbjct: 345 DQAVCGSCWSFSSTGALEGSLFLKTGQLIPLSQQILIDCSWGFGNYACDGGEEWQAFEWV 404

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +KHG + + + YGPY GQ+ YCH   T     ++G+VNVT  +  ALK A+ KHGPVSV+
Sbjct: 405 LKHGGIASTESYGPYKGQNGYCHSNKTHLVGKLSGYVNVTSGNITALKAAIYKHGPVSVS 464

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS ++FSFY NGVYY+ KC N    LDHAVLAVGYG L G+ YW VKNSWSTYWGN G
Sbjct: 465 IDASHRTFSFYSNGVYYEPKCGNKKGELDHAVLAVGYGVLQGELYWLVKNSWSTYWGNDG 524

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T  TY  M
Sbjct: 525 YILMSMKDNNCGVATDATYPLM 546



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 59/137 (43%), Gaps = 26/137 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V T QL      G   K+ P TTET +N   CF   G  ++ + PQ V PD+ +F+  
Sbjct: 69  GQVITLQLGCVEPYGTLFKITPATTETVVNAEKCFQLNGTKESRVRPQEVFPDMKNFQA- 127

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                  + EE   G     W+ ++  G K + Y++WV     S
Sbjct: 128 -----------------------LREEYYKGQYCTVWQNISYWGHKKNIYTMWV--TNTS 162

Query: 134 KDAIPVRYEMKGYNSLL 150
               PV YEM+GYN+LL
Sbjct: 163 CGVSPVHYEMRGYNTLL 179


>gi|241577796|ref|XP_002403652.1| midgut cysteine proteinase, putative [Ixodes scapularis]
 gi|215500253|gb|EEC09747.1| midgut cysteine proteinase, putative [Ixodes scapularis]
          Length = 564

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 127/209 (60%), Positives = 154/209 (73%), Gaps = 10/209 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHK-------KLAVLSQQALIDCSWGYGNNGCDGGED 203
           DQ+VCGSCWSFGT G +E A + K +       KL  LS+Q L+DCSW  GNNGCDGGED
Sbjct: 358 DQAVCGSCWSFGTVGELEAALFRKARRFSMFTGKLTRLSEQQLVDCSWNQGNNGCDGGED 417

Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAK 262
           FR+Y++I  HGL T +DYG YLGQD  CH     AT T +  ++NVT   +++L+ ALA 
Sbjct: 418 FRAYEYIRAHGLATDEDYGAYLGQDGICHDTKVNATVTTIKNYINVT--DKESLQKALAN 475

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
            GPVSV+IDA+ K+F+FY NGV+YD KC N  DGLDHAVLAVGYG L G+PYW +KNSWS
Sbjct: 476 VGPVSVSIDAAVKAFTFYSNGVFYDPKCRNDTDGLDHAVLAVGYGTLQGEPYWLIKNSWS 535

Query: 323 TYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           TYWGN GYVL+S KDNNCGV +  TYV +
Sbjct: 536 TYWGNDGYVLISQKDNNCGVASQGTYVEL 564



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/145 (32%), Positives = 74/145 (51%), Gaps = 22/145 (15%)

Query: 14  GMVKTYQLSKNGTNGLS-------LKVAPM-TTETELNKISCFHAYGIPDATIEPQSVLP 65
           GMV+T Q++ +   G S        K+A M   ET   + +CF   G  + T+  Q+VLP
Sbjct: 62  GMVQTIQVAPDAEEGRSETPYGANYKIAYMPDKETWQPRRTCFLVNGTANNTVPLQAVLP 121

Query: 66  DVSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSL 125
           DV+ F+       F+R      ++  +  T   +E       E+++LV +   +VSKY+L
Sbjct: 122 DVTSFE-------FVR------QESCWEGTTAVDEPAERASCERFQLVVTNENRVSKYTL 168

Query: 126 WVRYNKASKDAIPVRYEMKGYNSLL 150
           WV  +   + A+P RY M GYN+LL
Sbjct: 169 WVSRDGQGR-AVPRRYLMMGYNTLL 192


>gi|315131333|emb|CBM69275.1| venom protein Ci-80a [Chelonus inanitus]
          Length = 165

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 125/162 (77%), Positives = 137/162 (84%), Gaps = 1/162 (0%)

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           CSWGYGNNGCDGGEDFRSYQWIMKHG LPT++DYG YLGQD YCH  N T TA M+GFVN
Sbjct: 1   CSWGYGNNGCDGGEDFRSYQWIMKHGGLPTEEDYGGYLGQDGYCHSDNVTLTAKMSGFVN 60

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           VT N  +ALK+A+AKHGPVSVAIDAS K+FSFY NGVYYD  C N  D LDHAVLAVGYG
Sbjct: 61  VTSNDPNALKVAIAKHGPVSVAIDASHKTFSFYANGVYYDSSCGNRIDQLDHAVLAVGYG 120

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
           +L+G+ YW VKNSWS YWGN GY+LM+ KDNNCGVMT PTYV
Sbjct: 121 KLNGQDYWLVKNSWSNYWGNDGYILMAQKDNNCGVMTTPTYV 162


>gi|363746764|ref|XP_003643791.1| PREDICTED: digestive cysteine proteinase 2-like, partial [Gallus
           gallus]
          Length = 203

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 123/199 (61%), Positives = 152/199 (76%), Gaps = 1/199 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF TTGA+EGA ++K   L  LSQQ LIDCSWG+GN  CDGGE++R+Y+WI
Sbjct: 1   DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 60

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHG + + + YG Y GQ+  CH   +   A +TG+VNVT  +  A+K A+ KHGPV+V+
Sbjct: 61  KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 120

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY NG+YY+ KC N P  LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 121 IDASHKTFSFYSNGIYYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 180

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y+LM++KDNNCGV T  TY
Sbjct: 181 YILMAMKDNNCGVATEATY 199


>gi|291223823|ref|XP_002731907.1| PREDICTED: predicted protein-like, partial [Saccoglossus
           kowalevskii]
          Length = 208

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 122/202 (60%), Positives = 149/202 (73%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSFG  GA+EGA +MK  KL  LSQQ L+DCSWGYGNNGC GGE FRSYQW+
Sbjct: 7   DQAVCGSCWSFGAIGAIEGALFMKSNKLVRLSQQNLVDCSWGYGNNGCFGGEQFRSYQWV 66

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M +G + +++ YGPYLGQ+  CH   +     ++ +  V     D+LK  +A +GPVSV+
Sbjct: 67  MDNGGIASEEQYGPYLGQNGLCHFNKSMVATKISNYTQVPSGDIDSLKKTIANYGPVSVS 126

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +S SFY NGVYY+E+C N P+ LDHAVLAVGYG +  + YW VKNSWSTYWG  G
Sbjct: 127 IDASHRSLSFYSNGVYYEEECGNKPENLDHAVLAVGYGTMANQSYWLVKNSWSTYWGMNG 186

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           +VLMS KDNNCGV T  T VT+
Sbjct: 187 FVLMSQKDNNCGVATDATLVTI 208


>gi|326932936|ref|XP_003212567.1| PREDICTED: counting factor associated protein D-like [Meleagris
           gallopavo]
          Length = 573

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 122/199 (61%), Positives = 151/199 (75%), Gaps = 1/199 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF TTGA+EGA ++K   L  LSQQ LIDCSWG+GN  CDGGE++R+Y+WI
Sbjct: 371 DQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWI 430

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHG + + + YG Y GQ+  CH   +   A +TG+VNVT  +  A+K A+ KHGPV+V+
Sbjct: 431 KKHGGIASTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVS 490

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY NG+YY+ KC N    LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 491 IDASHKTFSFYSNGIYYEPKCANKSGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 550

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y+LM++KDNNCGV T  TY
Sbjct: 551 YILMAMKDNNCGVATEATY 569



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/126 (34%), Positives = 57/126 (45%), Gaps = 26/126 (20%)

Query: 25  GTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNIYRLFFLRPRF 84
           G  G+  K+ P TTE E+N   CF   G  +  ++ QSV P +  FK       FLR   
Sbjct: 106 GPYGMRYKITPETTEKEVNVRKCFQLPGSKEDVVKAQSVFPSLDGFK-------FLR--- 155

Query: 85  HENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMK 144
                         EE   G     W+ VT   +K + Y+LWV    +S    PV YEM+
Sbjct: 156 --------------EEYYQGQYCAVWQNVTRWEQKKNVYTLWV--TNSSCGVAPVHYEMR 199

Query: 145 GYNSLL 150
           GYNSLL
Sbjct: 200 GYNSLL 205


>gi|195175517|ref|XP_002028493.1| GL18069 [Drosophila persimilis]
 gi|194103698|gb|EDW25741.1| GL18069 [Drosophila persimilis]
          Length = 276

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 129/223 (57%), Positives = 157/223 (70%), Gaps = 25/223 (11%)

Query: 134 KDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDC 189
           +D IP +Y+ + + ++    DQSVCGSCWSFGT G +EGA+++K+   L  LSQQALIDC
Sbjct: 74  QDEIPAQYDWRLFGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGDLVRLSQQALIDC 133

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           SW +GNNGCDGGEDFR YQW+M+  G+PT+++YGPYLGQD YC I N T  A + GFVNV
Sbjct: 134 SWPFGNNGCDGGEDFRVYQWMMQVGGVPTEEEYGPYLGQDGYCRINNVTLVAPIKGFVNV 193

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T N  +A KLAL KHGP+S                     +C N  DGLDHAVLAVG+G 
Sbjct: 194 TSNDPNAFKLALLKHGPLS--------------------PECKNDVDGLDHAVLAVGFGT 233

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           + G+ YW VKNSWSTYWGN GY+LMS + NNCGVMT PTYV M
Sbjct: 234 IKGEDYWLVKNSWSTYWGNDGYILMSARKNNCGVMTMPTYVEM 276


>gi|62955235|ref|NP_001017633.1| uncharacterized protein LOC550326 precursor [Danio rerio]
 gi|62202194|gb|AAH92817.1| Zgc:110239 [Danio rerio]
          Length = 546

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 117/202 (57%), Positives = 154/202 (76%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF TTG +EGA ++K  +L  LSQQ L+DC+WG+GNNGCDGGE++R+++WI
Sbjct: 344 DQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWI 403

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T + YG Y+G +  CH   ++  A +TG+ NVT     ALK A+ K GPV+V+
Sbjct: 404 MKHGGISTAESYGAYMGMNGLCHYDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVS 463

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+ +SF+FY NGVYY+ +C N  + LDHAVLAVGYG ++ + YW VKNSWS+YWGN G
Sbjct: 464 IDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDG 523

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+LMS+KDNNCGV T   Y T+
Sbjct: 524 YILMSMKDNNCGVATDAIYATL 545



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 64/138 (46%), Gaps = 25/138 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G V T+ +  +   G   K+ P+TTETE N + CF   G  D  I PQ  LPDV  F   
Sbjct: 66  GQVCTFFVGNDLDYGAVYKITPVTTETEFNTMKCFQLNGTKDEPIIPQIALPDVQGF--- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKA 132
                       E EKI Y           G++ E W+ VT+ G K + Y LWV R    
Sbjct: 123 ------------EFEKIEY---------YAGVLCEVWKNVTTVGHKKNTYRLWVTRPEGI 161

Query: 133 SKDAIPVRYEMKGYNSLL 150
              A+P  YEM G+N+LL
Sbjct: 162 DSSAMPHHYEMMGFNTLL 179


>gi|260821900|ref|XP_002606341.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
 gi|229291682|gb|EEN62351.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
          Length = 546

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 125/207 (60%), Positives = 151/207 (72%), Gaps = 6/207 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSFG   A+EGA++ K   L  LSQQ L+DCSWGYGNN CDGGE++R+Y+W+
Sbjct: 340 DQAVCGSCWSFGAAEAIEGAWFKKTGLLVPLSQQNLMDCSWGYGNNACDGGEEWRAYEWV 399

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           MKHG + T   YGPY G D  CH       A +TG+VNVT  +  ALK+ALA  GPV+V 
Sbjct: 400 MKHGGIATAAHYGPYTGADGKCHFDPKHVGAVVTGYVNVTSENGTALKMALANFGPVAVG 459

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGK----PYWQVKNSWSTY 324
           IDA+ K+FSFY NGVYYD+ C N P+ LDHAVLAVGYG + D K     YW +KNSWST+
Sbjct: 460 IDAAVKTFSFYANGVYYDKDCGNKPEDLDHAVLAVGYGSMPDSKGVMQDYWLIKNSWSTH 519

Query: 325 WGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           WG+ GYVL+S KDNNCGV T  TYV +
Sbjct: 520 WGDNGYVLISQKDNNCGVATDATYVKV 546



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 58/136 (42%), Gaps = 25/136 (18%)

Query: 15  MVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVNI 74
           M+ T Q S     G   K+A MTTE   N+ SCF   G  +  + PQS+ PD + FK   
Sbjct: 63  MMSTIQKSDPKPGGAMYKMAYMTTEKVTNQKSCFQINGDKENPVMPQSIFPDTTKFKS-- 120

Query: 75  YRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASK 134
                                 IG     G+    + L T EG   +K + ++ Y    K
Sbjct: 121 ----------------------IGTFPYKGLDCTAYMLNTVEGGG-NKNNTYILYMSKGK 157

Query: 135 DAIPVRYEMKGYNSLL 150
             +PV Y MKGY+SLL
Sbjct: 158 TEVPVAYYMKGYDSLL 173


>gi|47229488|emb|CAF99476.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 478

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 128/256 (50%), Positives = 162/256 (63%), Gaps = 55/256 (21%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYY-------------------------------------- 172
           DQ++CGSCWSF TTG +EGA +                                      
Sbjct: 222 DQAICGSCWSFATTGTIEGALFLKTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWI 281

Query: 173 MKHKKLA----------------VLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-L 215
           MKH  +A                VLSQQ LIDCSWG+GNN CDGGE++R+Y+WIMKHG +
Sbjct: 282 MKHGGIASAETYGPYLGMTGSLQVLSQQMLIDCSWGFGNNACDGGEEWRAYEWIMKHGGI 341

Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
            + + YGPYLG + +CH+ ++  TA +  + NVT     ALKLAL K+GPV+V+IDAS +
Sbjct: 342 ASAETYGPYLGMNGFCHVNSSELTAQIQSYTNVTSGDALALKLALFKNGPVAVSIDASHR 401

Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI 335
           SF FY NGVYY+  C ++ + LDHAVLAVGYG L+G+PYW +KNSWSTYWGN GY+LMS+
Sbjct: 402 SFVFYSNGVYYEPACGSTVEDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDGYILMSM 461

Query: 336 KDNNCGVMTAPTYVTM 351
           KDNNCGV T  TYVT+
Sbjct: 462 KDNNCGVTTDATYVTL 477



 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 5/56 (8%)

Query: 100 ELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-----PVRYEMKGYNSLL 150
           E  +G + E W+ VT+ G K + Y+LWV  ++   +       PV YEM GYN+LL
Sbjct: 2   EYYSGSLCEVWQNVTTVGHKKNTYTLWVTRSEGDANLTEGPVSPVHYEMMGYNTLL 57


>gi|195995651|ref|XP_002107694.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
 gi|190588470|gb|EDV28492.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
          Length = 544

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 114/202 (56%), Positives = 152/202 (75%), Gaps = 1/202 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ VCGSCWSFGTTG +EG+ Y+K  KL  LSQQ +IDC+WG+GNNGCDGGE+FR+++WI
Sbjct: 342 DQGVCGSCWSFGTTGTIEGSLYLKSGKLVSLSQQNMIDCTWGFGNNGCDGGEEFRAFEWI 401

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHG + T+  YG YL QD  C +  T   A + G+V V   ++ ALKLA++  GPV+V 
Sbjct: 402 AKHGGIATEKSYGQYLAQDGKCKLNKTKIGAKIRGWVQVPHGNQSALKLAVSAVGPVAVG 461

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +DA+ KSFSFY +G+YYD++C N    LDHAVLAVGYG  +G+ YW +KNSWST+WG+ G
Sbjct: 462 MDAALKSFSFYSSGIYYDKQCGNKEQDLDHAVLAVGYGNENGQDYWIIKNSWSTHWGDDG 521

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           YV +S+K+NNCG+ T  ++V +
Sbjct: 522 YVKLSMKNNNCGIATDASFVNI 543



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 59/137 (43%), Gaps = 27/137 (19%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GM    Q S    +G   ++ P TTE   N   CF   G   A + PQ++LPD+S F   
Sbjct: 66  GMDIVLQRSDMMAHGAKYELVPETTEKFTNVRQCFSIKGSKAAPVTPQAILPDLSKF--- 122

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                                +  G +  N I  + W     +  +V+ Y+++V   + S
Sbjct: 123 ---------------------SKTGTKTYNKIKCDVWTYKNKQMGRVNTYTMYV--TQGS 159

Query: 134 KDAIPVRYEMKGYNSLL 150
           K + PV Y M+GY+SLL
Sbjct: 160 KPS-PVHYIMEGYDSLL 175


>gi|242008549|ref|XP_002425065.1| predicted protein [Pediculus humanus corporis]
 gi|212508730|gb|EEB12327.1| predicted protein [Pediculus humanus corporis]
          Length = 549

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 127/191 (66%), Positives = 153/191 (80%), Gaps = 1/191 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQSVCGSCWSFG+TGA+EGAY++K+ KL  +SQQALIDCSWG+GN GCDGGED  S+ +I
Sbjct: 355 DQSVCGSCWSFGSTGAIEGAYFLKNHKLVRVSQQALIDCSWGFGNGGCDGGEDSYSFDYI 414

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +KHG +PT+D YGPYLGQD YCH+  +     + G+V V     DALK+A+A HGPV VA
Sbjct: 415 LKHGGIPTEDSYGPYLGQDGYCHVNKSMDMVKIKGYVAVPSGDVDALKVAVAVHGPVGVA 474

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY NGVYYD  C N+P+ LDHAVL VG+G+L+G+ YW VKNSWS YWGN G
Sbjct: 475 IDASHKTFSFYSNGVYYDPNCGNTPNDLDHAVLVVGFGKLNGEDYWLVKNSWSNYWGNDG 534

Query: 330 YVLMSIKDNNC 340
           YVLMS K+NNC
Sbjct: 535 YVLMSQKNNNC 545



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 75/142 (52%), Gaps = 29/142 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           GMVKTYQL   G  G   KVAP+TTE   N  +CF   G  ++ +EPQ+VLPD+S+F+  
Sbjct: 68  GMVKTYQLGSEGQYGTLKKVAPVTTEKFQNVDTCFVVSGSSESPVEPQTVLPDISNFE-- 125

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVR----- 128
                                 Y GE+ VN I   KW L    G+KV++Y++W       
Sbjct: 126 ----------------------YAGEDTVNEIPCHKWTLTNKFGDKVNRYTMWTSSKVTW 163

Query: 129 YNKASKDAIPVRYEMKGYNSLL 150
            +     A+PVRYEMKG+NSLL
Sbjct: 164 KSNGETTAVPVRYEMKGFNSLL 185


>gi|390355503|ref|XP_001200245.2| PREDICTED: counting factor associated protein D-like
           [Strongylocentrotus purpuratus]
          Length = 509

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 120/224 (53%), Positives = 158/224 (70%), Gaps = 5/224 (2%)

Query: 133 SKDAIPVRYE---MKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S DA+P   +   +   + + DQ+VCGSCWSFG+   +EGA +M+  K   LSQQ L+DC
Sbjct: 283 SDDAVPDHIDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDC 342

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           +W  GNNGCDGGE++R Y+W+MK+G +P ++ YGPYLGQ+  CH   + A A++  + NV
Sbjct: 343 TWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSKAVASIKKYYNV 402

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG- 307
           T  ++  LK ALA  GP++V IDA+  SFSFY  G YYD  C N+ D LDHAVLAVGYG 
Sbjct: 403 TSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYGT 462

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           +  G+ YW +KNSWST+WGN GYV +S+KDNNCGV TA TYV +
Sbjct: 463 DSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAATYVEL 506


>gi|390363592|ref|XP_790934.3| PREDICTED: counting factor associated protein D-like
           [Strongylocentrotus purpuratus]
          Length = 560

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 123/225 (54%), Positives = 162/225 (72%), Gaps = 6/225 (2%)

Query: 133 SKDAIP--VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S DA+P  + + ++G  S + DQ+VCGSCWSFG+   +EGA +M+  K   LSQQ L+DC
Sbjct: 333 SDDAVPDHIDWNVRGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKRVRLSQQMLMDC 392

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVN 247
           +W  GNNGCDGGE++R Y+W+MK+ G+P ++ YGPYLGQ+  CH   +T A A++  + N
Sbjct: 393 TWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYGKSTPAVASIKKYYN 452

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           VT  ++  LK ALA  GP++V IDA+  SFSFY  G YYD  C N+ D LDHAVLAVGYG
Sbjct: 453 VTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDLDHAVLAVGYG 512

Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
            +  G+ YW +KNSWST+WGN GYV +S+KDNNCGV TA TYV +
Sbjct: 513 TDSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAATYVEL 557


>gi|449267783|gb|EMC78685.1| Digestive cysteine proteinase 1, partial [Columba livia]
          Length = 320

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 116/199 (58%), Positives = 146/199 (73%), Gaps = 7/199 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+VCGSCWSF T GA+EGA ++K   L  LSQQ LIDC+WG+GN      E++R+Y+WI
Sbjct: 124 DQAVCGSCWSFATAGAMEGALFLKTGVLTPLSQQVLIDCAWGFGN------EEWRAYEWI 177

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            KHG + T + YG Y GQ+  CH   +   A +TG+VNVT  +  A+K A+ KHGPV+V+
Sbjct: 178 KKHGGIATTESYGTYKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKAAIYKHGPVAVS 237

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS K+FSFY NG+YY+ KC N    LDHAVLAVGYG L G+ YW +KNSWSTYWGN G
Sbjct: 238 IDASHKTFSFYSNGIYYEPKCANKSGSLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDG 297

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y+LM+++DNNCGV T  TY
Sbjct: 298 YILMAMRDNNCGVATEATY 316


>gi|326430129|gb|EGD75699.1| hypothetical protein PTSG_07816 [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 113/230 (49%), Positives = 154/230 (66%), Gaps = 8/230 (3%)

Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNS------LLDQSVCGSCWSFGTTGAVEGAYYMKHK 176
           +S    Y   ++   PVR   + + +      + DQ +CGSCWSFG   A+EG Y++   
Sbjct: 309 FSSRTDYAYVAETPSPVRSAARDWRTTGAVTGVKDQGICGSCWSFGAAQAIEGQYFLATN 368

Query: 177 KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIAN 235
           +   +SQQAL+DCSWG+GNN CDGGE FR+Y+W++++G +PT+  YGPYL  D YCH   
Sbjct: 369 RTVPMSQQALMDCSWGFGNNACDGGEAFRAYEWVLQNGYIPTEASYGPYLMADGYCHPEK 428

Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
                 + G+VN+T    + +   L   GP++VAIDAS KSFSFY +GVYYD  C N+PD
Sbjct: 429 ADKGPGIKGYVNITSGDMNKVLDMLDNDGPLAVAIDASLKSFSFYSSGVYYDSDCGNTPD 488

Query: 296 GLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMT 344
            LDHAVLAVG+G  +DG+ YW +KNSWST +G++GYV MS ++NNCGV T
Sbjct: 489 DLDHAVLAVGFGTSVDGEDYWIIKNSWSTNYGDRGYVRMSRRNNNCGVAT 538


>gi|357621272|gb|EHJ73161.1| putative C1A cysteine protease precursor [Danaus plexippus]
          Length = 545

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 111/221 (50%), Positives = 147/221 (66%), Gaps = 7/221 (3%)

Query: 135 DAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCS 190
           D +P  Y+ + Y    S+ DQ  CGSCW+FGTT AVEGA   +   +L  L+ QALIDC+
Sbjct: 328 DTLPKEYDARMYGLVTSVKDQQDCGSCWTFGTTSAVEGALARINGGRLMRLANQALIDCA 387

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
           WGY N GCDGG D  +Y W++ +G+PT+++YGPY+ +D +C I N T T  + GF NVTP
Sbjct: 388 WGYENFGCDGGTDTGAYHWMLNYGMPTEEEYGPYVNKDGFCRIHNMTQTYKIKGFTNVTP 447

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
            S +ALK+AL  HGP+SV+IDA+    ++Y  G+Y D  C  S   L+H V  VGYGELD
Sbjct: 448 YSVEALKVALVNHGPLSVSIDATD-MLTYYNGGIYSDSDC--STTNLNHEVTLVGYGELD 504

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           G+ YW VKNSW   WG  GY  ++ +DN+CG+ T PTYV  
Sbjct: 505 GEEYWIVKNSWGRDWGVDGYFHITTRDNSCGITTEPTYVVF 545


>gi|167520318|ref|XP_001744498.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776829|gb|EDQ90447.1| predicted protein [Monosiga brevicollis MX1]
          Length = 543

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 111/204 (54%), Positives = 142/204 (69%), Gaps = 2/204 (0%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E      + DQ++CGSCWSFGT  ++EG   ++   L  LSQQ+L+DCSW YGNN CDGG
Sbjct: 335 EQGAVTEVKDQAICGSCWSFGTAESIEGQVAIQKGILQPLSQQSLMDCSWSYGNNACDGG 394

Query: 202 EDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
           E FR+Y+WIM++G +PT   YG Y G D  CH   TT  AT+  +VNVT    + +   L
Sbjct: 395 EAFRAYEWIMENGFIPTAASYGLYQGADGVCHSKGTTPGATLATYVNVTSADMNTVLDTL 454

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
           A  GP+++AIDAS +SF+FY +GVYYD  C N+ D LDHAVLAVGYG +  G  YW VKN
Sbjct: 455 ATKGPLAIAIDASLRSFNFYSSGVYYDADCGNTTDALDHAVLAVGYGTDALGNDYWIVKN 514

Query: 320 SWSTYWGNQGYVLMSIKDNNCGVM 343
           SWST++G+ GY+LMS KDNNCG +
Sbjct: 515 SWSTHYGDNGYILMSRKDNNCGTV 538


>gi|340378677|ref|XP_003387854.1| PREDICTED: counting factor associated protein D-like [Amphimedon
           queenslandica]
          Length = 540

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 112/230 (48%), Positives = 155/230 (67%), Gaps = 5/230 (2%)

Query: 124 SLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
           S +V  +  S + IP  V + +KG    + DQ++CGSCWSFGTTG +EG+ ++K   L  
Sbjct: 309 SEFVYTSSVSANDIPDYVNWWLKGAVTPVKDQAICGSCWSFGTTGTLEGSLFIKTGNLVA 368

Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTAT 239
           LS+QA +DCSWG GNNGCDGGE  R+++W++    LPT+  Y PY+ QD+ C     T  
Sbjct: 369 LSEQAFVDCSWGQGNNGCDGGESERAFEWLLLGRCLPTEATY-PYMMQDSTCKDKTATCG 427

Query: 240 ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDH 299
             ++ + NV   +   LK+ + + GP+S+AIDAS +SFSFY NGVY + KC N+P+ LDH
Sbjct: 428 VRISKYYNVMSTNVTDLKVKIFERGPISIAIDASHRSFSFYANGVYKEPKCGNTPNDLDH 487

Query: 300 AVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
            VLAVGYG+++G  Y  VKNSWST+WGN GYVLMS  +N CG+ T  ++ 
Sbjct: 488 QVLAVGYGDVNGDEYVIVKNSWSTHWGNDGYVLMSTDNNQCGIATDGSFA 537


>gi|254746342|emb|CAX16636.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 545

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 108/205 (52%), Positives = 141/205 (68%), Gaps = 4/205 (1%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           ++ +Q  CGSCW+FGTT AVEGA    +  KL  LS QALIDC+W +G  GCDGG D  +
Sbjct: 344 AVKNQDNCGSCWTFGTTAAVEGALAQHNGGKLLSLSNQALIDCAWPFGVRGCDGGSDNAA 403

Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Y+W+M++GLPT+ +YGPYL +D YCHI N T    + GF +VTP S  ALK+AL  HGP+
Sbjct: 404 YEWMMEYGLPTEAEYGPYLNRDNYCHIDNVTVKYPIKGFTDVTPYSVGALKVALVNHGPL 463

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           SV++ AS+ +F+ Y  G++YD +C+  P  LDH V  VGYG  D   +W +KNSW   WG
Sbjct: 464 SVSVHASE-AFTLYSGGIFYDTECD--PTSLDHEVALVGYGTRDEDTFWILKNSWGPMWG 520

Query: 327 NQGYVLMSIKDNNCGVMTAPTYVTM 351
             GY+L+S +DNNCGV T PTYV  
Sbjct: 521 IDGYILISSRDNNCGVATEPTYVNF 545


>gi|391341656|ref|XP_003745143.1| PREDICTED: uncharacterized protein LOC100900885 [Metaseiulus
            occidentalis]
          Length = 1356

 Score =  227 bits (578), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 110/204 (53%), Positives = 143/204 (70%), Gaps = 8/204 (3%)

Query: 151  DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            DQ++CGSCWSFGT G +EG Y++KH +L   ++Q L+DCSW  GN+ CDGG D+ +Y +I
Sbjct: 1158 DQAICGSCWSFGTVGHIEGQYFLKHGELVRFAEQQLVDCSWTSGNDACDGGLDYVAYDYI 1217

Query: 211  MKHGLPTQDDYGPYLGQDAYC---HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
             K+GL +   YGPY G D  C    I N   T T+  + N++    + L+ A+A  GP+S
Sbjct: 1218 KKYGLSSDAQYGPYRGIDGKCKDVEIENKPIT-TIQRYYNIS--GVENLRKAIAFVGPIS 1274

Query: 268  VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
            VAIDAS+ S SFY +GVY D  C+++   LDHAVLAVGYG L GKPYW +KNSWSTYWGN
Sbjct: 1275 VAIDASRPSLSFYAHGVYEDPDCSSTE--LDHAVLAVGYGVLHGKPYWLIKNSWSTYWGN 1332

Query: 328  QGYVLMSIKDNNCGVMTAPTYVTM 351
             GY+L+S KDN CGV + PTYV +
Sbjct: 1333 DGYILISQKDNMCGVASTPTYVEL 1356



 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 127/206 (61%), Gaps = 10/206 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           +Q  CGSCWSF     +E  Y++ + K  L   S+Q L+DCSW + N GC GG    ++ 
Sbjct: 377 NQGTCGSCWSFAVIAHLESQYFLNNGKENLTRFSEQQLVDCSWDFSNTGCSGGSIESAFS 436

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCH--IANTTAT-ATMTGFVNVTPNSEDALKLALAKHGP 265
           ++ ++GL T + YGPY  ++  C   +  T  T +T+ GF  +    ++ L+  +A  GP
Sbjct: 437 YVKEYGLFTDEQYGPYREEEGKCRDTVTGTEPTISTLEGFNAI--GGKECLRNYIALKGP 494

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           ++VAIDAS  SF +Y +GVY +  C      L+HAVLA+GYGEL+G+PYW +KNSW   W
Sbjct: 495 IAVAIDASSPSFVYYSHGVYKNPACGRD---LNHAVLAIGYGELNGEPYWLIKNSWGDIW 551

Query: 326 GNQGYVLMSIKDNNCGVMTAPTYVTM 351
           G++G++L+S ++N CG+    +Y  +
Sbjct: 552 GSEGFMLISQENNTCGIEDELSYADL 577



 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 65/140 (46%), Gaps = 24/140 (17%)

Query: 14  GMVKTYQLSKNGTNGLSLKVA-PMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
           G +KT+Q++  G  G S KV      ET + + +C+ A         PQ +LPD+SDFK 
Sbjct: 871 GEMKTFQIA--GETGGSYKVVWSPNNETHVPEQNCYAA-----GPAAPQGILPDLSDFK- 922

Query: 73  NIYRLFFLRPR--FHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYN 130
                 F+R    F +        T +   L N     ++     + ++VSKY  W   +
Sbjct: 923 ------FVRTESCFTDTS------TLLNTLLSNAAKCYRYENTVQKYDRVSKYVFWANQD 970

Query: 131 KASKDAIPVRYEMKGYNSLL 150
               + IPVRY M GY+SLL
Sbjct: 971 -GDNNVIPVRYMMMGYDSLL 989


>gi|391339556|ref|XP_003744114.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
           occidentalis]
          Length = 563

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 110/202 (54%), Positives = 140/202 (69%), Gaps = 5/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCWSFG TG +EG Y  KH KL   S+Q LIDCSW +GN GCDGG+D+++YQ+I
Sbjct: 361 DQAICGSCWSFGATGNIEGMYARKHGKLVRFSEQQLIDCSWKFGNGGCDGGQDYQAYQYI 420

Query: 211 MKHGLPTQDDYGPYLGQDAYCH--IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           M+HGL T  +YG Y+G D  CH   A      T+ G+VNVT   E+ LK A+A  GP+SV
Sbjct: 421 MQHGLSTDKEYGAYMGIDGKCHDGPALKRELPTLLGYVNVT--GENDLKRAVAFVGPISV 478

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
            I A+  S SFY  G++ D+ C N    LDHAVLAVGYG   +G+ +W VKNSWST WG+
Sbjct: 479 GIFAALPSLSFYHTGIFNDKDCKNGLADLDHAVLAVGYGVSHEGEAFWIVKNSWSTLWGD 538

Query: 328 QGYVLMSIKDNNCGVMTAPTYV 349
            GYV +++K+N CGV TA TY 
Sbjct: 539 DGYVKIAMKNNICGVTTAATYA 560


>gi|325180654|emb|CCA15059.1| cathepsin putative [Albugo laibachii Nc14]
          Length = 620

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 106/199 (53%), Positives = 132/199 (66%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSFGT+GA+EGA ++K KKL  +S+Q LIDCSW +GN  C+GG D++SY WI
Sbjct: 419 DQGSCGSCWSFGTSGAIEGALFLKEKKLVNVSEQNLIDCSWNFGNQACNGGFDYQSYNWI 478

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  G +   D YG Y     +CH   T A A MTGFVNVT   +  L  AL   GP+SV+
Sbjct: 479 ISSGGIEATDSYGDYRNLPDFCHFEPTRAVARMTGFVNVTGTEQ--LNDALLHVGPLSVS 536

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+Q SF FY  G Y DEKC +  D LDH VLAVG+    G+ +  +KNSW  +WG  G
Sbjct: 537 IDATQPSFYFYAGGYYDDEKCASDLDSLDHLVLAVGFTTFRGQRFTLIKNSWGKFWGEDG 596

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y+++S KDN CGV TA TY
Sbjct: 597 YIMISQKDNVCGVATAATY 615


>gi|300120790|emb|CBK21032.2| unnamed protein product [Blastocystis hominis]
          Length = 516

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 101/199 (50%), Positives = 141/199 (70%), Gaps = 2/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ +CGSCW++GT G +EG Y++K+ KL   S+Q L+DCSW +GN+GC+GGEDFR+Y W+
Sbjct: 315 DQCMCGSCWTYGTVGVLEGQYFLKYGKLVKFSEQNLLDCSWNFGNDGCNGGEDFRAYGWM 374

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + GL T +DYG YLG D +CH   + A   +T +V +TP S + L+ A+A  GP+SV 
Sbjct: 375 LHNGGLMTDEDYGHYLGIDGWCHFNKSAAAVKITDYVLITPGSVEELEDAVANVGPISVG 434

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           I A    F FY  GV+ + +C+++ +   HAVLAVGYG  +GK YW +KNSWSTYWG+ G
Sbjct: 435 I-AVTTDFLFYAEGVFDNPECSSAVEDQAHAVLAVGYGTENGKDYWLIKNSWSTYWGDNG 493

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           YV ++ K+N CGV TA +Y
Sbjct: 494 YVKIARKNNICGVATAASY 512


>gi|187944195|gb|ACD40324.1| cathepsin 1-like protease [Helicoverpa armigera]
          Length = 550

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 105/220 (47%), Positives = 146/220 (66%), Gaps = 5/220 (2%)

Query: 136 AIPVRYEMKG---YNSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCSW 191
            IP  Y+ +     +++ +Q  CGSCW+FGTT AVEGA   +   KL  LS QAL+DC+W
Sbjct: 332 GIPDEYDARLEGLVSNIKNQEECGSCWTFGTTAAVEGALARINGGKLMALSNQALVDCAW 391

Query: 192 GYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
            YG  GC+GG D  +Y+W+M++GLPT  +YG Y  +D  C+I N T T  + GFV+VTP 
Sbjct: 392 AYGAYGCEGGTDNAAYEWMMEYGLPTVAEYGQYTNKDGECNILNMTETFPIRGFVDVTPL 451

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           S  ALK+AL  HGP+SV++  ++ +F  Y +GV+YD +C+  P+ L+H V  VGYG+ DG
Sbjct: 452 SVGALKVALVNHGPLSVSVHVTE-AFLQYSSGVFYDTECSQLPNDLNHEVTPVGYGQRDG 510

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
             +W +KNSW  +WG  GY+ +S +DN CGV T PTYV  
Sbjct: 511 DTFWILKNSWGPHWGIGGYMYISSRDNICGVSTEPTYVVF 550



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 26/150 (17%)

Query: 14  GMVKTYQLSKNGT--NGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
           G VK++  SKN     G+  ++ P  +  +  ++ C    G     +EP ++LPD  +F+
Sbjct: 67  GNVKSFATSKNKQFRYGVQYEIHPEASRDQEVEVVCNAMEGGRYDRVEPSTILPDTDEFE 126

Query: 72  VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
                                   Y  EE++N     K+     + +  S+ +LW  YN+
Sbjct: 127 ------------------------YDDEEVMNDKDCVKFVNNVKDEQTDSEQTLWATYNE 162

Query: 132 ASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
             K  +PVRYE+  +NS L        W F
Sbjct: 163 DDKAWVPVRYEVLEFNSWLGSKDKHEIWDF 192


>gi|156392785|ref|XP_001636228.1| predicted protein [Nematostella vectensis]
 gi|156223329|gb|EDO44165.1| predicted protein [Nematostella vectensis]
          Length = 513

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 95/197 (48%), Positives = 138/197 (70%), Gaps = 4/197 (2%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           NS+  Q +CGSC++F   GA+EGA+++K      LS+Q ++DC+WG+GN GC GG  +R+
Sbjct: 309 NSVKSQGICGSCYAFAVAGALEGAHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGGYPYRA 368

Query: 207 YQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
            QWI+KH GL T++ YG YL Q+ YCH  NT+  A +  ++++   +   LKLA+A +GP
Sbjct: 369 MQWILKHGGLATEESYGRYLAQEGYCHFKNTSIGARLDKYMSIRQGNTSQLKLAVAFYGP 428

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VS+ ++   K+F FY +G+YYD +C ++   LDHA LAVGYGE  G  YW VKNSWS  W
Sbjct: 429 VSILVNTQPKTFKFYGSGIYYDTQCTHA---LDHAALAVGYGEEKGVSYWIVKNSWSAMW 485

Query: 326 GNQGYVLMSIKDNNCGV 342
           G +GY+ +++KD+NCGV
Sbjct: 486 GEEGYIKIAMKDDNCGV 502



 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 58/138 (42%), Gaps = 28/138 (20%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTE-TELNKISCFHAYGIPDATIEPQSVLPDVSDFKV 72
           GM +TYQ    G  G + K+ P  +E T      C H  G  +  I+ QS++P       
Sbjct: 36  GMDRTYQRGDLGKYGYACKIVPEFSERTGRTFKGCLHRRGNSNFQIKAQSIIPS------ 89

Query: 73  NIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKA 132
                    P+F         + + G E   G    KW    +   KV+ Y+L   Y   
Sbjct: 90  ---------PKF---------FKFKGHEDFRGKNCAKWEHSFNIYNKVNTYTL---YTTP 128

Query: 133 SKDAIPVRYEMKGYNSLL 150
           S+   PVRYEM GY++LL
Sbjct: 129 SRPYTPVRYEMMGYDTLL 146


>gi|254746346|emb|CAX16638.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
          Length = 552

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 107/228 (46%), Positives = 149/228 (65%), Gaps = 8/228 (3%)

Query: 128 RYNKASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQ 183
           + N+ S+D +P  Y+ +     + + +Q  CGSCW+FGTT AVEGA   +   KL  L+ 
Sbjct: 329 KLNELSED-MPKEYDTRLLGLVSHVKNQEDCGSCWTFGTTAAVEGALARINGGKLMALAN 387

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMT 243
           QAL+DC W +G  GCDGG D  +Y+W+M++GLPT+ +YGPY  +D  C+I N T T  + 
Sbjct: 388 QALVDCVWAFGAAGCDGGTDNAAYEWMMEYGLPTEAEYGPYTNKDGECNIINMTRTFPIK 447

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
           G+V+VTP S  ALK+AL  HGP+SV++  +  +FS Y +G++YD +CN     L+H V  
Sbjct: 448 GYVDVTPLSVGALKVALVNHGPLSVSVHVTD-TFSKYSSGIFYDTECNQL--HLNHEVTL 504

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           VGYGE DG  +W +KNSW   WG  GY+ +S +DN CGV T PTYV  
Sbjct: 505 VGYGERDGDTFWILKNSWGPQWGIGGYMYISARDNICGVATEPTYVVF 552


>gi|206725499|ref|NP_001128673.1| cathepsin L like protein precursor [Bombyx mori]
 gi|198041259|dbj|BAG70408.1| cathepsin L like protein [Bombyx mori]
          Length = 547

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 103/217 (47%), Positives = 145/217 (66%), Gaps = 7/217 (3%)

Query: 137 IPVRYEMK--GYNSLL-DQSVCGSCWSFGTTGAVEGAY-YMKHKKLAVLSQQALIDCSWG 192
           +P  Y+++  GY S + +Q  CGSCW+FGTT A EGA   +   +L  LS QA++DC+W 
Sbjct: 332 LPSEYDLRILGYVSKVKNQEDCGSCWTFGTTAAAEGALARINGGRLLSLSNQAILDCAWP 391

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
           YG +GC+GG D  +Y W+MK GLPT+++YG Y   D  C+I N +    + G+ +VTP +
Sbjct: 392 YGGSGCEGGSDNAAYDWMMKFGLPTEEEYGSYTNADGICNIKNMSTIYPIRGWTDVTPLN 451

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
            +ALK+A   HGP+SV+IDA+ K FS Y  G++YD  C  +   L+H +  VGYGE DG 
Sbjct: 452 VEALKVAAVNHGPLSVSIDATDK-FSLYTGGIFYDTTC--TTKRLNHEMALVGYGERDGD 508

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
            YW +KNSW   WG +GY+L+S ++NNCG+ T PTYV
Sbjct: 509 TYWIIKNSWGPDWGVEGYLLISSRNNNCGIATEPTYV 545



 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 31/139 (22%), Positives = 62/139 (44%), Gaps = 26/139 (18%)

Query: 14  GMVKTYQLSKNGTN--GLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFK 71
           G VK+  + +N     G+S ++ P T + E     C    G  D   + + +LP+  +F+
Sbjct: 65  GAVKSIIVDENAKYRYGVSYEIHPETDDGEDAIFKCNVMLGSEDNIFDLKIILPETDNFE 124

Query: 72  VNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNK 131
                  ++ P   +++ +     ++ E+              S+ E  +  ++W  Y++
Sbjct: 125 -------YVGP---DSQTVENTIKFVAED--------------SDLETKTIKTIWATYDE 160

Query: 132 ASKDAIPVRYEMKGYNSLL 150
            + +  PVRYE+K YN LL
Sbjct: 161 KNNNWHPVRYEVKNYNELL 179


>gi|301122899|ref|XP_002909176.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
 gi|262099938|gb|EEY57990.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
          Length = 598

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 107/202 (52%), Positives = 133/202 (65%), Gaps = 3/202 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+FGTTGA+EGA + + KKL  +SQQ L+DCSW YGNN C+GG D+++Y+WI
Sbjct: 397 DQGACGSCWTFGTTGALEGALFAQQKKLYNMSQQNLLDCSWDYGNNACNGGLDYQAYEWI 456

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + +G L T   YG Y     YCH     A   M GFVNVT  S  AL  ALA  GP+SV+
Sbjct: 457 IANGGLETTATYGAYRNAPDYCHFNVDNAIGRMKGFVNVT--SVQALNDALATIGPLSVS 514

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+  SF FY  G Y D +C +  D LDH+VLAVG    + + Y  VKNSWST+WG  G
Sbjct: 515 IDANLPSFYFYGGGYYDDVECKSDLDSLDHSVLAVGVTTHNDQKYTLVKNSWSTHWGEDG 574

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+ +S K+N CGV TA TY  +
Sbjct: 575 YIKISQKNNLCGVATAATYPVL 596


>gi|301090471|ref|XP_002895448.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
 gi|262098628|gb|EEY56680.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
          Length = 371

 Score =  207 bits (526), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 107/202 (52%), Positives = 133/202 (65%), Gaps = 3/202 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+FGTTGA+EGA + + KKL  +SQQ L+DCSW YGNN C+GG D+++Y+WI
Sbjct: 170 DQGACGSCWTFGTTGALEGALFAQQKKLYNMSQQNLLDCSWDYGNNACNGGLDYQAYEWI 229

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + +G L T   YG Y     YCH     A   M GFVNVT  S  AL  ALA  GP+SV+
Sbjct: 230 IANGGLETTATYGAYRNAPDYCHFNVDNAIGRMKGFVNVT--SVQALNDALATIGPLSVS 287

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+  SF FY  G Y D +C +  D LDH+VLAVG    + + Y  VKNSWST+WG  G
Sbjct: 288 IDANLPSFYFYGGGYYDDVECKSDLDSLDHSVLAVGVTTHNDQKYTLVKNSWSTHWGEDG 347

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y+ +S K+N CGV TA TY  +
Sbjct: 348 YIKISQKNNLCGVATAATYPVL 369


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 103/200 (51%), Positives = 135/200 (67%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 135 NQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYI 194

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T+  Y PY G+D  C    T+  AT +GFV++T   E+AL  A+A  GP+SVA
Sbjct: 195 KENHGIDTEKSY-PYEGEDETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+ +C  S + LDH VL VGYG  D + YW VKNSW T WG+ G
Sbjct: 254 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGG 311

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  +DNNCG+ T  +Y
Sbjct: 312 YIKMARDQDNNCGIATQASY 331


>gi|357624858|gb|EHJ75471.1| putative C1A cysteine protease precursor [Danaus plexippus]
          Length = 529

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/357 (35%), Positives = 190/357 (53%), Gaps = 48/357 (13%)

Query: 37  TTETELNKISCFHAYGIP---DATIEPQSVLPDVSDFKVNIYRLFFLRP----RFHENEK 89
           T   E +KI+ F+ Y  P   D  +   +   D +DF  ++  L  + P    R + +  
Sbjct: 173 TGALEGHKITTFYDYRKPTKDDFNVALVTECEDATDFYKDLRVLHPMIPSDVDRLYHSYT 232

Query: 90  IRYNWTYIGEE--LVNGIILEKWRLVTSEGEKVSKYSLWV-RYNKASKDAI--------- 137
             +N  Y  EE  L   I+ + W+ V    +K   + L + +Y+  +K+ +         
Sbjct: 233 KHHNRNYKAEEHSLRKSILEQNWQRVLLHNKKNLGFKLTLNKYSDRTKEELSFLTGTRPS 292

Query: 138 ----------------------PVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYY 172
                                 P  Y+M+     +++ +Q  CGSCW+F T  AVEGA  
Sbjct: 293 LGTGTVSFPHTDEEVEQMVLDLPENYDMRLEGTISAVKNQGRCGSCWTFSTAAAVEGALA 352

Query: 173 MKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYC 231
            K+    + LS+Q+++DC+WGY N GCDGG    ++++I+ +G+PTQ +YG YLG+D YC
Sbjct: 353 RKNGGRDLDLSEQSIVDCAWGYHNAGCDGGMIDTAFKYILDYGIPTQIEYGDYLGEDGYC 412

Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
           HI N T    + GFV V   S +A+K+AL K+GPVSVAI+A+ K    Y +G+++D +CN
Sbjct: 413 HIENVTDVYNIIGFVQVPSKSVNAMKVALYKYGPVSVAINAN-KLLVAYESGIFFDPECN 471

Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
              D ++HAV  VGYG  DG  YW VKNSW   WG  GY+L+S  DNNC ++    Y
Sbjct: 472 E--DHINHAVTVVGYGVRDGATYWIVKNSWGEDWGQDGYLLISATDNNCHILEYAYY 526


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 105/201 (52%), Positives = 140/201 (69%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+DCS  +GN GC GG   +++++I
Sbjct: 128 DQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYI 187

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+D Y PY  QD  C    +   AT TG+V+V   SE ALK A+A  GP+SVA
Sbjct: 188 KANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVA 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQ 328
           IDASQ SF FY +GVYY+E C+++   LDH VLAVGYGE + G+ YW VKNSW+T WGN+
Sbjct: 247 IDASQPSFQFYHDGVYYEEGCSSTM--LDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNK 304

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ MS  K NNCG+ +  +Y
Sbjct: 305 GYIQMSRDKKNNCGIASQASY 325


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 109/240 (45%), Positives = 146/240 (60%), Gaps = 6/240 (2%)

Query: 111 RLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
           R VT  G+++   +    YN    D++  R E      + +Q  CGSCW+F TTG++EG 
Sbjct: 103 RRVT--GDEIDSVTFIPAYNVTVPDSVDWRTE-GAVTEVKNQGQCGSCWAFSTTGSLEGQ 159

Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA 229
           ++   K+L  LS+Q LIDCS  YGNNGC GG    ++ +I  + G+ T+  Y PY G D 
Sbjct: 160 HFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSY-PYEGIDD 218

Query: 230 YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEK 289
            C      + AT  GFV++    E+ LKLA+A  GP+SVAIDAS +SF FY  GVYYD+ 
Sbjct: 219 KCRYKPQESGATDKGFVDIPQGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKG 278

Query: 290 CNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           C N  + LDH VLAVGYG  +GK YW VKNSW   WG  GY+ M+  K N+CG+ T+ +Y
Sbjct: 279 CGNGEEDLDHGVLAVGYGTENGKDYWLVKNSWGKRWGLDGYIKMARNKHNHCGIATSASY 338


>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
 gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
          Length = 334

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/228 (47%), Positives = 140/228 (61%), Gaps = 6/228 (2%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S + R  KA+     V +  KGY   + DQ  CGSCW+F  TG++EG  + K  KL  LS
Sbjct: 107 STFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLS 166

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  YGN GCDGG   +++Q+I    GL T+D Y PY  QD  C    +T  A+
Sbjct: 167 EQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSY-PYEAQDGECRFNPSTVGAS 225

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V++    E AL+ A+A  GP+SVAIDA   SF  Y +GVY +  C++S   LDH V
Sbjct: 226 CTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSE--LDHGV 283

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G  YW VKNSW   WG QGY+LMS  K N CG+ TA +Y
Sbjct: 284 LAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASY 331


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 104/201 (51%), Positives = 134/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH       AT  G+V++    E+ALK ALA  GPVS+A
Sbjct: 199 KDNGGIDTEKSY-PYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C++  + LDH VLAVGYG   +G+ YW VKNSW T WG+Q
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQ 315

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV M+  +DN+CGV T  +Y
Sbjct: 316 GYVKMARNRDNHCGVATCASY 336


>gi|195995653|ref|XP_002107695.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
 gi|190588471|gb|EDV28493.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
          Length = 549

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 132/204 (64%), Gaps = 1/204 (0%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ +C SCW+  TT A+E A Y++  +   LS Q+L+DCSW +GN GC GG    +  
Sbjct: 342 IRDQGMCSSCWALSTTQAIESALYIQTGEKVELSSQSLVDCSWPFGNQGCSGGFISHTLA 401

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           WI+K  G+PT+  YG YLG++ YCH  + T  A + G+  +   +  ALK+ALAK GPV+
Sbjct: 402 WIIKQRGIPTKSSYGQYLGRNGYCHHHDLTTGAKLKGYSQIPQGNMKALKVALAKVGPVT 461

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           V+I+ S KSF FY +GV+Y+++C      LDH VLA+GYG    + Y  +KNSWS YWG+
Sbjct: 462 VSINTSPKSFLFYSSGVFYNDQCLGDYAHLDHNVLAIGYGRQHDQDYILIKNSWSAYWGD 521

Query: 328 QGYVLMSIKDNNCGVMTAPTYVTM 351
            GY+ +S ++NNCG+ T   Y  +
Sbjct: 522 HGYIKISTRNNNCGLATNAYYPIL 545


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 103/207 (49%), Positives = 136/207 (65%), Gaps = 6/207 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+ DQ  CGSCWSF +TG++EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 135 SVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 194

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY G D  CH    T  AT TGFV++    E+A+  A+A  GPV
Sbjct: 195 RYIKDNGGVDTEKSY-PYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPV 253

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           +VAIDAS +SF  Y  GVY D  C  S D LDH VL VGYG + DG+ YW VKNSW T W
Sbjct: 254 AVAIDASNESFQLYSEGVYNDPNC--SSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTW 311

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           G+QGY+ M+  +DN CG+ TA ++ T+
Sbjct: 312 GDQGYIKMARNQDNQCGIATASSFPTV 338


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 105/201 (52%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 144 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYI 203

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH       AT  GFV++   +E AL  ALA  GPVSVA
Sbjct: 204 KDNKGIDTEKSY-PYEAIDDECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGPVSVA 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C++  + LDH VLAVGYG   DG+ YW VKNSW T WG+Q
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDS--EQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQ 320

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV M+  +DN+CG+ T  +Y
Sbjct: 321 GYVKMARNRDNHCGIATTASY 341


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 101/200 (50%), Positives = 133/200 (66%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTG++EG ++ K  KL  LS+Q LIDCS  +GNNGC+GG    ++++I
Sbjct: 132 NQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYI 191

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH   +   AT TGFV++    E+ LK A+A  GPVSVA
Sbjct: 192 KANKGIDTEQSY-PYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVA 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVY + +C++  + LDH VL VGYG  DG+ YW VKNSW T WG+ G
Sbjct: 251 IDASHESFQFYSEGVYDEPECDS--EQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGG 308

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ MS  KDN CG+ +A +Y
Sbjct: 309 YIYMSRNKDNQCGIASAASY 328


>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/228 (47%), Positives = 143/228 (62%), Gaps = 6/228 (2%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S ++R  + +     V +  KGY   + DQ  CGSCW+F TTG++EG  + K  KL  LS
Sbjct: 107 SAYLRLPEGADLPNSVDWREKGYVTDVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLS 166

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  YGN GC GG    ++++I  + G+ T+D Y PY  +D  C   +    AT
Sbjct: 167 EQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSY-PYEAEDGQCRYNSANIGAT 225

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V+V    EDALK ALA  GPVSVAIDAS  SF  Y +GVY + +C++S   LDH V
Sbjct: 226 CTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQLYESGVYDEPECSSSE--LDHGV 283

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G  YW VKNSW   WGN+GY++M+  K N CG+ TA +Y
Sbjct: 284 LAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 104/201 (51%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q++
Sbjct: 144 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYV 203

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH       AT  GFV++    E ALK ALA  GPVSVA
Sbjct: 204 KDNKGIDTEKAY-PYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVA 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C++  + LDH VLAVGYG   DG+ YW VKNSW T WG+Q
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDS--EQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQ 320

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV M+  ++N+CG+ T  +Y
Sbjct: 321 GYVKMARNRENHCGIATTASY 341


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 102/213 (47%), Positives = 140/213 (65%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + +  KGY   + +Q  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS  YGNNG
Sbjct: 119 IDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I ++ G+ T+  Y PYL +D  CH   +   A  TGFV++    E+AL
Sbjct: 179 CEGGLMDNAFQYIKENGGIDTEKSY-PYLAKDGVCHYNKSAIGAKDTGFVDIPTGDENAL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + ALA  GP+S+AIDASQ +F FY  GVY D  C+++   LDH VLAVGYG  DGK YW 
Sbjct: 238 QQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTR--LDHGVLAVGYGTDDGKDYWL 295

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           VKNSW   WG +GY+ ++  D++ CGV +  +Y
Sbjct: 296 VKNSWGPSWGEEGYIKIARNDHDKCGVASKASY 328


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 104/201 (51%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH       AT  G+V++    E+ALK ALA  GPVS+A
Sbjct: 199 KDNGGIDTEKSY-PYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C++  + LDH VLAVGYG   +G+ YW VKNSW T WG+Q
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQ 315

Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
           GYV M+   DN+CGV T  +Y
Sbjct: 316 GYVKMARNHDNHCGVATCASY 336


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 112/250 (44%), Positives = 154/250 (61%), Gaps = 17/250 (6%)

Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWS 160
           +NG++L K +         +  +  V  + A+ DA+  R E  GY   + +Q  CGSCW+
Sbjct: 87  MNGLLLHKHQ---------NGSTFLVPSHTAAPDAVDWRSE--GYVTPVKNQGQCGSCWA 135

Query: 161 FGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQD 219
           F +TGA+EG ++ K  +L  LS+Q L+DCS  YGNNGC+GG    ++ +I  + G+ T+ 
Sbjct: 136 FSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTET 195

Query: 220 DYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSF 279
            Y PY GQD  C  + ++  A  TGFV++    EDALK A+A  GPVSVAIDAS  SF F
Sbjct: 196 GY-PYEGQDGTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQF 254

Query: 280 YVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-N 338
           Y +GVY + +C  SP  LDH VL VGYG  +GK YW VKNSW T WG +GY+ MS  + N
Sbjct: 255 YHSGVYDEPQC--SPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQN 312

Query: 339 NCGVMTAPTY 348
            CG+ +  +Y
Sbjct: 313 QCGIASKASY 322


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 102/200 (51%), Positives = 130/200 (65%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 140 DQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYI 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G    +   PY   D  CH       AT  GFV++    E AL  A+A  GPVSVAI
Sbjct: 200 KDNGGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAI 259

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQG 329
           DAS +SF FY  GVYY+ +C++  + LDH VLAVGYG   +G+ YW VKNSW T WG+QG
Sbjct: 260 DASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 317

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           YV M+  +DN+CG+ TA +Y
Sbjct: 318 YVKMARNRDNHCGIATAASY 337


>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 322

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/224 (45%), Positives = 150/224 (66%), Gaps = 9/224 (4%)

Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A  + +P  V +  KG    + DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+
Sbjct: 99  EADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 158

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  +GN GC GG   +++++I ++ G+ T++ Y PY  QD  C   ++   AT TGFV
Sbjct: 159 DCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY-PYEAQDGKCRFDSSNVGATDTGFV 217

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           ++    E++L  A+A  GP+SVAIDAS  SF FY  GVYY+++C+++   LDH VLA+GY
Sbjct: 218 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTM--LDHGVLAIGY 275

Query: 307 GEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GE  DGK YW VKNSW+T WG++G++ MS  K NNCG+ +  +Y
Sbjct: 276 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASY 319


>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 306

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/224 (45%), Positives = 150/224 (66%), Gaps = 9/224 (4%)

Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A  + +P  V +  KG    + DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+
Sbjct: 83  EADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 142

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  +GN GC GG   +++++I ++ G+ T++ Y PY  QD  C   ++   AT TGFV
Sbjct: 143 DCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY-PYEAQDGKCRFDSSNVGATDTGFV 201

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           ++    E++L  A+A  GP+SVAIDAS  SF FY  GVYY+++C+++   LDH VLA+GY
Sbjct: 202 DIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSSTM--LDHGVLAIGY 259

Query: 307 GEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GE  DGK YW VKNSW+T WG++G++ MS  K NNCG+ +  +Y
Sbjct: 260 GETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIASQASY 303


>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 107/228 (46%), Positives = 143/228 (62%), Gaps = 6/228 (2%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S ++R  + +     V +  KGY   + DQ  CGSCW+F TTG++EG  + K  KL  LS
Sbjct: 107 SAYLRLPEGADLPNSVDWREKGYVTEVKDQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLS 166

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  YGN GC GG    ++++I  + G+ T+D Y PY  +D  C   +    AT
Sbjct: 167 EQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSY-PYEAEDGQCRYNSANIGAT 225

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V+V    EDALK A+A  GPVSVAIDAS  SF  Y +GVY + +C++S   LDH V
Sbjct: 226 CTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDEPECSSSE--LDHGV 283

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G  YW VKNSW   WGN+GY++M+  K N CG+ TA +Y
Sbjct: 284 LAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 101/201 (50%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTG++EG ++ K KKL  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 137 DQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYI 196

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  CH       AT  GFV++    E+ LK A+A  GP+SVA
Sbjct: 197 KDNGGIDTEQSY-PYKAEDEKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPISVA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS  +F  Y  GVYY+ +C  S + LDH VL VGYG + DG  YW VKNSW   WG+Q
Sbjct: 256 IDASHPTFQQYSEGVYYEPEC--SSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQ 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DNNCG+ T  +Y
Sbjct: 314 GYIKMARNRDNNCGIATQASY 334


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 100/200 (50%), Positives = 131/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TG++EG +++K  KL  LS+Q L+DCS  YGN GC+GG    S+ +I
Sbjct: 134 DQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYI 193

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+D Y PY  +D  C        AT TGFV++   SE  L+ A+A  GPVSVA
Sbjct: 194 KANGGIDTEDSY-PYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDASQ+SF  Y  GVY +  C  S + LDH VLAVGYG  +GK YW VKNSW+  WG  G
Sbjct: 253 IDASQQSFQLYSEGVYDEPNC--SSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDG 310

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+LMS  K+N CG+ ++ +Y
Sbjct: 311 YILMSRDKNNQCGIASSASY 330


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 100/200 (50%), Positives = 132/200 (66%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG  + K  KL  LS+Q LIDCS  YGN GC+GG   +++Q+I
Sbjct: 135 DQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 194

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY  +D  C        A   GFV++    ED LK A+A  GPVSVA
Sbjct: 195 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+  C++  D LDH VL VGYG  +GK YW VKNSWS +WG++G
Sbjct: 254 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 311

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  + N+CGV +A +Y
Sbjct: 312 YIKMARNRKNHCGVASAASY 331


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 98/200 (49%), Positives = 134/200 (67%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG ++ K +KL  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 136 NQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYI 195

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH   +   AT TGFV++    E+ LK A+A  GPVSVA
Sbjct: 196 KSNKGIDTEWSY-PYNATDGVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPVSVA 254

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVY + +C  S + LDH VL VGYG  DG+ YW VKNSW T WG++G
Sbjct: 255 IDASHESFQFYSEGVYDEPEC--SSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEG 312

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  KDN CG+ ++ +Y
Sbjct: 313 YIYMTRNKDNQCGIASSASY 332


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 100/200 (50%), Positives = 132/200 (66%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG  + K  KL  LS+Q LIDCS  YGN GC+GG   +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY  +D  C        A   GFV++    ED LK A+A  GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+  C++  D LDH VL VGYG  +GK YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 315

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ ++  + N+CGV TA +Y
Sbjct: 316 YIKIARNRKNHCGVATAASY 335


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 104/219 (47%), Positives = 142/219 (64%), Gaps = 7/219 (3%)

Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           A  V +  KG    + DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  YG
Sbjct: 99  AADVDWRTKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG 158

Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
           N+GC GG    ++ +I  + G+ T+  Y PY  QD  C     +  AT TGFV V  ++E
Sbjct: 159 NDGCGGGWMTSAFDYIKDNGGIDTESSY-PYEAQDRSCRFDANSIGATCTGFVEVQ-HTE 216

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
           +AL  A++  GP+SVAIDAS  SF FY +GVYY++KC  SP  LDH VLAVGYG    + 
Sbjct: 217 EALHEAVSDIGPISVAIDASHFSFQFYSSGVYYEKKC--SPTNLDHGVLAVGYGTESTED 274

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           YW VKNSW + WG+ GY+ MS  +DNNCG+ + P+Y T+
Sbjct: 275 YWLVKNSWGSGWGDAGYIKMSRNRDNNCGIASEPSYPTV 313


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 103/236 (43%), Positives = 148/236 (62%), Gaps = 16/236 (6%)

Query: 116 EGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
           E E +  +SL    +  +K A+ PV+          +Q  CGSCW+F  TG++EG ++ K
Sbjct: 110 EPENIEDFSLPKTVDWRTKGAVTPVK----------NQGQCGSCWAFSATGSLEGQHFRK 159

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHI 233
              +  LS+Q L+DCS  +GNNGC+GG    ++++I  + G+ T+  Y PY G D  CH 
Sbjct: 160 SGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSY-PYNGTDGTCHF 218

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
             +T  AT +GFV++   SE  LK A+A  GP+SVAIDAS +SF FY +GVY + +C++ 
Sbjct: 219 KKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDS- 277

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            + LDH VL VGYG L+G  YW VKNSW T WG++GY+ MS  K N CG+ ++ +Y
Sbjct: 278 -ESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSASY 332


>gi|2584884|gb|AAB82743.1| cysteine proteinase [Oncorhynchus mykiss]
          Length = 151

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 88/151 (58%), Positives = 114/151 (75%), Gaps = 1/151 (0%)

Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQ 218
           SF TTGAVEGA ++K   L VLSQQ L+DCSWG+GNNGCDGGE++R+Y+WIMKHG + T 
Sbjct: 1   SFATTGAVEGALFLKSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATT 60

Query: 219 DDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFS 278
           + YG Y+G +  CH+  +  TA +  + NVT    +ALK+AL KHGPV+V+IDA  +SF 
Sbjct: 61  ETYGSYMGMNGLCHLNTSQLTARVQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFV 120

Query: 279 FYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
           FY +G+YY+ KC N+ D LDHAVLAVGYG +
Sbjct: 121 FYSHGIYYEPKCGNTTDSLDHAVLAVGYGVM 151


>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
          Length = 530

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 99/200 (49%), Positives = 133/200 (66%), Gaps = 2/200 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ VCGSCW+FG+TG++EG   +   KL  LS+Q L+DC++   + GC+GG    ++Q+I
Sbjct: 328 DQGVCGSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYI 387

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M  G    +   PYL Q+ YC  +++  +   +  +VNVT  SE AL+ A+A  GPV++A
Sbjct: 388 MNFGGIAYESTYPYLMQNGYCKDSSSQLSNIKVKSYVNVTSFSEPALQNAVATVGPVAIA 447

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS   F FY +GVYY   C N  D LDH VLAVGYG L+G  YW VKNSWST++G +G
Sbjct: 448 IDASAPDFRFYSSGVYYSSVCKNGLDDLDHEVLAVGYGTLNGADYWIVKNSWSTHYGAEG 507

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+LMS  + NNCGV + PTY
Sbjct: 508 YILMSRNRGNNCGVASQPTY 527


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 104/238 (43%), Positives = 146/238 (61%), Gaps = 8/238 (3%)

Query: 116 EGEKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYY 172
            G++ S+ S ++     +  ++P  V +  KG    + DQ  CGSCW+F  TG++EG ++
Sbjct: 95  RGQRTSRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHF 154

Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYC 231
           +K  +L  LS+Q L+DCS  +GNNGC+GG    ++++I  + G+  ++ Y PY   D  C
Sbjct: 155 LKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESY-PYEAMDDKC 213

Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
                   AT TGFV++   SED LK A+A  GP+SVAIDA   SF  Y  GVY + +C 
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPEC- 272

Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            S + LDH VLAVGYG  DGK YW VKNSW   WG+ GY+LMS  K+N CG+ +A +Y
Sbjct: 273 -SSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASY 329


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 99/200 (49%), Positives = 132/200 (66%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG  + K  KL  LS+Q LIDCS  YGN GC+GG   +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY  +D  C        A   GFV++    ED LK A+A  GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+  C++  D LDH VL VGYG  +GK YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEG 315

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ ++  + N+CG+ TA +Y
Sbjct: 316 YIKIARNRKNHCGIATAASY 335


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 100/201 (49%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG ++ K  KL  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  CH       AT  G+V++   +ED L+ A+A  GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF  Y  GVYY+ +C  SP  LDH VL VGYG E DG  YW VKNSW   WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPEC--SPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DNNCG+ T  +Y
Sbjct: 314 GYIKMARNRDNNCGIATEASY 334


>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 110/249 (44%), Positives = 148/249 (59%), Gaps = 7/249 (2%)

Query: 107 LEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTG 165
            EK+R    E  +     L  R N++      V +  +GY   + +Q VCGSCW+F TTG
Sbjct: 85  FEKYRGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTG 144

Query: 166 AVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPY 224
           A+EG ++ +   L  LS+Q L+DCS  YGN GC+GG    ++++I    GL T+  Y PY
Sbjct: 145 ALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSY-PY 203

Query: 225 LGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGV 284
            G+D  CH       A +TGFV+V    E+ALK A    GPVSVAIDAS ++F FY +GV
Sbjct: 204 TGKDGTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGV 263

Query: 285 YYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGV 342
           Y +  C+++   LDH VL VGYG   DGK YW VKNSW + WG  GY+ MS  K+N CG+
Sbjct: 264 YDEITCSSTS--LDHGVLVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRNKENQCGI 321

Query: 343 MTAPTYVTM 351
            T  +Y T+
Sbjct: 322 ATMASYPTV 330


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 99/200 (49%), Positives = 130/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG +++K   L  LS+Q L+DCS  +GN+GC+GG    ++Q+I
Sbjct: 134 NQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYI 193

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  C        AT TGFV++   SED LK A+A  GPVSVA
Sbjct: 194 KANGGIDTEKSY-PYEAEDGECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG  DGK YW VKNSW+  WG+ G
Sbjct: 253 IDASHSSFQLYSEGVYDETEC--SSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNG 310

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ MS  KDN CG+ +A +Y
Sbjct: 311 YIKMSRDKDNQCGIASAASY 330


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  193 bits (491), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 104/224 (46%), Positives = 148/224 (66%), Gaps = 9/224 (4%)

Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA  + +P  V +  KG    + DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+
Sbjct: 103 KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 162

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  +GN GC GG   +++++I  + G+ T+D Y PY  QD  C    +   AT TG+V
Sbjct: 163 DCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 221

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           +V   SE ALK A+A  GP+SV IDASQ +F FY  GVY+D+ C+++   LDH VLAVGY
Sbjct: 222 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 279

Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G + +G  +W VKNSW+T WG++GY+ MS  ++NNCG+ +  +Y
Sbjct: 280 GSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASY 323


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  193 bits (491), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 100/203 (49%), Positives = 136/203 (66%), Gaps = 6/203 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS  +GNNGC+GG    +++
Sbjct: 139 IKDQGQCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFE 198

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++ ++ G+ T++ Y PY  +D  CH     A A   GFV+V   SE ALK A+A  GPVS
Sbjct: 199 YVKENGGIDTEESY-PYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVS 257

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
           VAIDAS +SF FY +GVY + +C  SP+ LDH VL VGYG + DG  YW VKNSW T WG
Sbjct: 258 VAIDASHESFQFYSHGVYIEPEC--SPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWG 315

Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
           +QGYV M+  +DN CG+ ++ ++
Sbjct: 316 DQGYVKMARNRDNQCGIASSASF 338


>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
          Length = 344

 Score =  193 bits (491), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 139/224 (62%), Gaps = 6/224 (2%)

Query: 128 RYNKASKDAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           R+NK +     V +  KG  S + DQ  CGSCW+F +TGA+E   ++K  +   LS+Q L
Sbjct: 107 RHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCWAFSSTGALEAHTFLKKGRRVSLSEQNL 166

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           IDCS  YGNNGC+GG   +++Q++  + G+ T++ Y PY G+D+ C        AT  GF
Sbjct: 167 IDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTEEAY-PYEGEDSECRFKKNNVGATDAGF 225

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
           V +    E AL  A+A  GP+S+AIDAS  SF FY  GVYY+ +C+++   LDH VL VG
Sbjct: 226 VTIPSGDEQALMEAVATQGPLSIAIDASNPSFQFYSEGVYYEPECSSAQ--LDHGVLLVG 283

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YG    + YW VKNSWS  WG  GY+ M+  KDNNCG+ T  ++
Sbjct: 284 YGVEKDQKYWLVKNSWSEQWGENGYIKMARNKDNNCGIATQASF 327


>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
          Length = 324

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 102/219 (46%), Positives = 143/219 (65%), Gaps = 8/219 (3%)

Query: 135 DAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           D +P  V +  KG    + DQ  CGSCW+F  TG++EG +++K  KL  LS+Q L+DCS 
Sbjct: 106 DTLPAEVDWRTKGAVTPVKDQKACGSCWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCST 165

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
             G++GC GG    ++ +I  + G+ T+  Y PY   D  C      + AT+TG+V+V  
Sbjct: 166 KQGDHGCGGGLMDFAFTYIKDNGGIDTEASY-PYEATDGKCQYNPANSGATVTGYVDVEH 224

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
           +SEDAL+ A+A  GP+SVAIDAS+ +F FY  GVYYD++C+++   LDH VLAVGYG  D
Sbjct: 225 DSEDALQKAVATIGPISVAIDASRSTFHFYHKGVYYDKECSSTS--LDHGVLAVGYGTQD 282

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G  YW VKNSW+  WGN G++ MS  ++NNCG+ T  +Y
Sbjct: 283 GTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGIATQASY 321


>gi|228245|prf||1801240C Cys protease 3
          Length = 321

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/203 (47%), Positives = 134/203 (66%), Gaps = 5/203 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG +++K+ +L  LS+Q L+DCS  YGN+GC GG    ++ +I
Sbjct: 122 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 181

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  C     +  A  TG V +  ++E+AL+ A++  GP+SVA
Sbjct: 182 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVA 240

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVYY++ C  SP  LDH VLAVGYG    K YW VKNSW + WG+ G
Sbjct: 241 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 298

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+ MS  +DNNCG+ + P+Y T+
Sbjct: 299 YIKMSRNRDNNCGIASEPSYPTV 321


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 99/200 (49%), Positives = 130/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CG CW+F +TGA+EG  + K  KL  L +Q LIDCS  YGN GC+GG   +++Q+I
Sbjct: 135 DQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYI 194

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY  +D  C        A   GFV++    ED LK A+A  GPVSVA
Sbjct: 195 KDNKGIDTENTY-PYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVYY+  C++  D LDH VL VGYG  +GK YW VKNSWS +WG+QG
Sbjct: 254 IDASHESFQFYSKGVYYEPSCDS--DDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDQG 311

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ ++  + N+CGV TA +Y
Sbjct: 312 YIKIARNRKNHCGVATAASY 331


>gi|254746348|emb|CAX16639.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
          Length = 539

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 94/219 (42%), Positives = 140/219 (63%), Gaps = 9/219 (4%)

Query: 135 DAIPVRYEMK---GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKH-KKLAVLSQQALIDCS 190
           + +P  ++++       + +Q  CGSCW+F TT AVEGA    +  +L  LS+Q+L+DC+
Sbjct: 322 EELPENFDLRMEGAVTPVKNQGHCGSCWAFCTTAAVEGAVARANGDRLVDLSEQSLVDCA 381

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
           WGY N GC+GG    + ++++ HG+PT+++YG Y   D  C I N + T  + GF  VTP
Sbjct: 382 WGYENQGCNGGTLDGAMKYVLTHGIPTEEEYGLYEANDGLCRIQNMSTTYNIRGFAQVTP 441

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS-PDGLDHAVLAVGYGEL 309
            + +ALK+AL K+GPV+VAI AS +    Y +G++YD  C+ + P   +H V  +GYG  
Sbjct: 442 RNPNALKVALNKYGPVTVAIHAS-RLMQHYASGLFYDFSCDETYP---NHGVTVIGYGTR 497

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           DG+ YW VKNSW   WG  G++LMS K+NNC V+ +P Y
Sbjct: 498 DGEDYWIVKNSWGESWGEDGFILMSAKNNNCFVLDSPYY 536


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 103/205 (50%), Positives = 137/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+EG  + K  +L  LS+Q L+DCSW  GN GC GG    ++Q+I
Sbjct: 134 NQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQGNQGCHGGIVDLAFQYI 193

Query: 211 MK-HGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           ++  G+ ++D Y PY  +D A C      ATA +TGFV++ P+SE+AL  A+A  GPVSV
Sbjct: 194 LQNQGIDSEDCY-PYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVSV 252

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
            IDAS  SF FY +G++YD KC  S + LDHAVL VGYG    +  GK YW VKNSW  +
Sbjct: 253 GIDASSTSFRFYQSGIFYDPKC--SSESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKH 310

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GYV MS  + N+CG+ T  +Y
Sbjct: 311 WGDRGYVYMSKDRGNHCGIATVASY 335


>gi|449670651|ref|XP_002168787.2| PREDICTED: uncharacterized protein LOC100201231 [Hydra
           magnipapillata]
          Length = 1244

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 91/224 (40%), Positives = 136/224 (60%), Gaps = 5/224 (2%)

Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           + ++D IP   + + +    S+  Q +C  C++F   GA EG ++ K  KL  +S Q L+
Sbjct: 611 QVNEDEIPEEIDWREFGAVTSVKGQGICSGCYAFTAAGAAEGGWFRKTGKLIEVSSQQLV 670

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCSWGYGN+GC GG    +  WI  HG+ T   YG YLGQ+ YCH  +T   A + GF  
Sbjct: 671 DCSWGYGNHGCKGGHYASALSWIFTHGVSTDKSYGKYLGQEGYCHFNDTLFGAQIDGFSY 730

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           + P +  ALK  +AK+GPV+V+I+    SF FY  G+Y D++C    +   H+ L VGYG
Sbjct: 731 IQPYNISALKRVVAKYGPVAVSINTKPLSFKFYSKGIYDDKEC--LSNQTHHSALIVGYG 788

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           +L+G+ YW +KNSWS+ WG  GY+ ++++++ CGV   P  V +
Sbjct: 789 KLNGREYWIIKNSWSSSWGEGGYMKLAMENHLCGVTENPVAVVI 832


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 105/237 (44%), Positives = 145/237 (61%), Gaps = 8/237 (3%)

Query: 117 GEKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYM 173
           GE+  + S ++     +  ++P  V +  KG    + DQ  CGSCW+F  TG++EG +++
Sbjct: 76  GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFL 135

Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCH 232
           K  KL  LS+Q LIDCS  +GN GC GG    ++++I  + G+ T++ Y PY   D  C 
Sbjct: 136 KSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESY-PYEAMDGDCR 194

Query: 233 IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNN 292
                  AT TGFV++   SED L+ A+A  GP+SVAIDAS  SF  Y  GVY +  C  
Sbjct: 195 FKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEGVYDEPNC-- 252

Query: 293 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           S + LDH VLAVGYG  +GK YW VKNSW+  WG+ GY+LMS  KDN CG+ ++ +Y
Sbjct: 253 SSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIASSASY 309


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 133/200 (66%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG ++ K  ++  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 159 NQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYI 218

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  CH   +   AT TGFV++   +E  LK A+A  GPVSVA
Sbjct: 219 KANGGIDTELSY-PYNGTDGICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVA 277

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVY + +C  S + LDH VL VGYG  DG+ YW VKNSW T WG+ G
Sbjct: 278 IDASHESFQFYSQGVYDEPEC--SSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDG 335

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  K+N CG+ ++ +Y
Sbjct: 336 YIYMTRNKENQCGIASSASY 355


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 98/200 (49%), Positives = 131/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG  + K  KL  LS+Q LIDCS  YGN GC+GG   +++Q+I
Sbjct: 139 DQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY  +D  C        A   GFV++    ED LK A+A  GPVSVA
Sbjct: 199 KDNKGIDTENTY-PYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  G YY+  C++  D LDH VL VGYG  +G+ YW VKNSWS +WG++G
Sbjct: 258 IDASHESFQFYSKGXYYEPSCDS--DDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEG 315

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ ++  + N+CGV TA +Y
Sbjct: 316 YIKIARNRKNHCGVATAASY 335


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ K  KL  LS+Q L+DCS  +GN+GC+GG    +++++
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYV 198

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T+  Y PY   D  CH    T+ AT  GFV++    E+ L  A+A  GPVSVA
Sbjct: 199 KYNHGIDTEASY-PYHADDEKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF  Y  GVYYD +C  S + LDH VL VGYG + +G+ YW VKNSW   WG Q
Sbjct: 258 IDASHESFQLYSEGVYYDPEC--SSEELDHGVLVVGYGTDENGQDYWIVKNSWGESWGEQ 315

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DNNCG+ T  +Y
Sbjct: 316 GYIKMARNRDNNCGIATQASY 336


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 108/234 (46%), Positives = 145/234 (61%), Gaps = 7/234 (2%)

Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
           K +K S ++  N   +    V +  KGY   + DQ  CGSCW+F TTG++EG ++    K
Sbjct: 102 KAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGK 161

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANT 236
           L  LS+Q L+DCS   GN GCDGG   +++Q+I+K  G+ T++ Y PY   D  CH    
Sbjct: 162 LVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEESY-PYKAVDGECHFKKA 220

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
              AT+TG+ +VT +SE AL+ A+A  GP+SVAIDAS  SF  Y +GVY +  C+++   
Sbjct: 221 NIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTL-- 278

Query: 297 LDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LDH VLAVGYG   DG  YW VKNSW+  WG  GY+ MS  KDN CG+ T  +Y
Sbjct: 279 LDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASY 332


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 104/216 (48%), Positives = 143/216 (66%), Gaps = 11/216 (5%)

Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+
Sbjct: 122 WREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCN 181

Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           GG   +++Q++  ++GL +++ Y PYLG  D  CH     + A  TGFV++    E AL 
Sbjct: 182 GGLMDQAFQYVKDQNGLDSEESY-PYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALM 240

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DGK 
Sbjct: 241 KAIAAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKK 298

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 299 YWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASY 334


>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
          Length = 310

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 104/216 (48%), Positives = 143/216 (66%), Gaps = 11/216 (5%)

Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+
Sbjct: 95  WREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCN 154

Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           GG   +++Q++  ++GL +++ Y PYLG  D  CH     + A  TGFV++    E AL 
Sbjct: 155 GGLMDQAFQYVKDQNGLDSEESY-PYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALM 213

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DGK 
Sbjct: 214 KAIAAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKK 271

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 272 YWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASY 307


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 106/217 (48%), Positives = 141/217 (64%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++ K   L  LS+Q L+DCS   GN G
Sbjct: 119 VDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q+I  +G    ++  PYLG D   CH   +  +A  TGFV+V   SE AL
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERAL 238

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA  +SF FY +G+YYD++C  S + LDH VL VGYG    ++DGK
Sbjct: 239 MKAVASVGPVSVAIDAGHESFQFYHSGIYYDKEC--SSEELDHGVLVVGYGFEGKDVDGK 296

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WG++GY+ M+  K N+CG+ TA +Y
Sbjct: 297 KYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASY 333


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/203 (47%), Positives = 133/203 (65%), Gaps = 5/203 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG +++K  KL  LS+Q L+DCS   GN GC+GG   +++++I
Sbjct: 124 NQGQCGSCWAFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYI 183

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            K+ G+ T+  Y PY   D  C    +   AT TG+V++    E+AL  A+ K GPVSVA
Sbjct: 184 KKNGGIDTEASY-PYQAHDERCRFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVA 242

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+ +C  S   LDH VLA+GYG   G  YW VKNSW T WG +G
Sbjct: 243 IDASHSSFQLYRSGVYYEREC--SQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEG 300

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y++MS  ++NNCG+ T  +Y T+
Sbjct: 301 YIMMSRNRNNNCGIATEASYPTV 323


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 105/228 (46%), Positives = 139/228 (60%), Gaps = 6/228 (2%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S + R  + +     V +  KGY   + DQ  CGSCW+F  TG++EG  + K  KL  LS
Sbjct: 108 SAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKLVSLS 167

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  YGN GC+GG    ++++I ++ G+ T+  Y PY  +D  C        A 
Sbjct: 168 EQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKSY-PYEAEDGQCRFKPENVGAK 226

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V+VT   EDALK A+A  GPVSV IDAS  SF  Y +GVY ++ C  S   LDH V
Sbjct: 227 CTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYDEQDC--SSQDLDHGV 284

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G+ YW VKNSW   WG +GY++MS  KDN CG+ TA +Y
Sbjct: 285 LAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDNQCGIATAASY 332


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 130/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ G+ T+  Y PY   D  C        AT TG+V +   SED LK A+A  GP+SVA
Sbjct: 193 KENDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 100/213 (46%), Positives = 140/213 (65%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY  ++ +Q  CGSCWSF TTG++EG ++ K  KL  LS+Q L+DCS  +GN G
Sbjct: 135 VDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEG 194

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++++I+ + G+ T+++Y PY  +   CH   +   AT +G V+V    E  L
Sbjct: 195 CNGGLMDQAFEYIITNGGIETEEEY-PYDARQERCHFKKSEVAATASGCVDVKSGDETDL 253

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K ++A+ GPVS+AIDAS +SF  Y  GVY + KC+++   LDH VL VGYG  DG+ YW 
Sbjct: 254 KNSVAEVGPVSIAIDASHQSFQLYSGGVYDEPKCSSTE--LDHGVLVVGYGTDDGQDYWL 311

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW T WG +GYV MS  +DN CGV T  +Y
Sbjct: 312 VKNSWGTTWGLEGYVKMSRNQDNQCGVATQASY 344


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 102/201 (50%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG +Y +   L  LS+Q L+DCS  +GNNGC+GG    ++Q+I
Sbjct: 142 DQGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYI 201

Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            +  G+ T+  Y PY  +D  C      A A   GFV+V   +E+ALK A+A  GPVSVA
Sbjct: 202 KVNGGIDTEKSY-PYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVA 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
           IDASQ SF FY +GVY D  C  S + LDH VLAVGYG   DG+ YW VKNSWS  WG+Q
Sbjct: 261 IDASQDSFQFYQHGVYSDPDC--SAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQ 318

Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
           GY+ ++   NN CG+ +A +Y
Sbjct: 319 GYIKIARNQNNMCGIASAASY 339


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SED LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG ++ K  KL  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  CH       AT  G+V++   +ED L+ A+A  GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF  Y  GVYY+  C+ S   LDH VL VGYG E DG  YW VKNSW   WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPDCSASQ--LDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DNNCG+ T  +Y
Sbjct: 314 GYIKMARNRDNNCGIATEASY 334


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SED LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 103/224 (45%), Positives = 147/224 (65%), Gaps = 9/224 (4%)

Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA  + +P  V +  KG    + DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+
Sbjct: 102 KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  + N GC GG   +++++I  + G+ T+D Y PY  QD  C    +   AT TG+V
Sbjct: 162 DCSDKFRNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 220

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           +V   SE ALK A+A  GP+SV IDASQ +F FY  GVY+D+ C+++   LDH VLAVGY
Sbjct: 221 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 278

Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G + +G  +W VKNSW+T WG++GY+ MS  ++NNCG+ +  +Y
Sbjct: 279 GSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASY 322


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 138/213 (64%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTG++EG ++ K  KL  LS+Q L+DCS  YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNG 171

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG    ++ +I ++ G+ ++  Y PY  +D  C    ++  AT TGFV++   +E+ L
Sbjct: 172 CDGGLMDNAFTYIKENKGIDSEASY-PYTAEDGKCVFKKSSVAATDTGFVDIPEGNENKL 230

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS +SF FY +GVY +  C+++   LDH VL VGYG   GK YW 
Sbjct: 231 KEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTE--LDHGVLVVGYGTESGKDYWL 288

Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           VKNSW+T WG++GY+ M     N CG+ T  +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASY 321


>gi|348531585|ref|XP_003453289.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 366

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 100/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ +CGSCW+F TTG +EG ++ K  KL  LS+Q L+DCS  +GNNG
Sbjct: 154 VDWRDKGYVTEVKDQKICGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNG 213

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q+I  + G+ T+  Y PY  +   C        A  TG+V V P++EDAL
Sbjct: 214 CNGGSVKRAFQYIQANGGIDTEASY-PYEAKGQQCRYKPDGIGAKCTGYVEVKPSNEDAL 272

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SV IDAS  SF FY +GVY +  C+ +   L+H VLAVGYG  +G  YW 
Sbjct: 273 KEAVATIGPISVGIDASHNSFRFYQSGVYDEPDCSKTV--LNHDVLAVGYGTENGHDYWL 330

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           +KNSW   WG++GY+ MS  K N CG+ +  TY
Sbjct: 331 IKNSWGIRWGDKGYIKMSRNKSNQCGIASDATY 363


>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 326

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 91/201 (45%), Positives = 132/201 (65%), Gaps = 3/201 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF +TG++EG +++K   L+ LS+Q L+DCS  +GN+GC GG    S++++
Sbjct: 125 NQGKCGSCWSFSSTGSLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYL 184

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
                   ++  PY  +D +C   ++ A A  TG+ ++    EDALK A+A  GP+SVAI
Sbjct: 185 ETVAGDMSEEMYPYTAEDGFCRYRSSEAIAKDTGYKDIPRGDEDALKEAVATVGPISVAI 244

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA  +SF  Y  G+YY+  C+++   LDH VLAVGYG  +G+ YW VKNSW   WGN+GY
Sbjct: 245 DAGHRSFQLYHEGIYYEPACSSTK--LDHGVLAVGYGTGEGEEYWLVKNSWGPSWGNEGY 302

Query: 331 VLMSI-KDNNCGVMTAPTYVT 350
           V+MS  ++NNCG+ T  +Y T
Sbjct: 303 VMMSRNRENNCGIATQASYPT 323


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 145/218 (66%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTGA+EG  + K +KL  LS+Q L+DCS   GN G
Sbjct: 120 VDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEG 179

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL +++ Y PYLG D   C+     + A  TGF+++    E A
Sbjct: 180 CNGGLMDQAFQYIQDNSGLDSEEAY-PYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHA 238

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DG
Sbjct: 239 LMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDG 296

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+LM+  + N+CG+ TA +Y
Sbjct: 297 KKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASY 334


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 104/217 (47%), Positives = 136/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG +Y K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY  + D  CH      +A  TGFV+V   SE  L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEKDL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVA+DA  KSF FY +G+YYD +C  S + LDH VL VGYG    ++DGK
Sbjct: 238 MKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPEC--SSEDLDHGVLVVGYGFEGEDVDGK 295

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSWS  WGN GY+ +   + N+CG+ TA +Y
Sbjct: 296 RYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASY 332


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 131/200 (65%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG ++ K   +  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 138 NQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYI 197

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  CH   +   AT TGFV++   +E  LK A+A  GP+SVA
Sbjct: 198 KANGGIDTEKSY-PYNGTDGTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPISVA 256

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVY + +C  S + LDH VL VGYG  D + YW VKNSW T WG+ G
Sbjct: 257 IDASHQSFQFYSQGVYDEPEC--SSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGG 314

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  KDN CG+ ++ +Y
Sbjct: 315 YIYMTRNKDNQCGIASSASY 334


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 132/202 (65%), Gaps = 5/202 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCW+F TTG++EG +      L  LS+Q L+DCS   GN GC+GG+  + +Q
Sbjct: 129 IKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQ 188

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I+++ G+ T+  Y PY  ++  C   N+   ATM+ F +VT   EDALK A A  GP+S
Sbjct: 189 YIIQNKGIDTEQCY-PYKAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPIS 247

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           V IDAS +SF FY +GVY + +C+++   LDH VL VGYG    K YW VKNSW T WGN
Sbjct: 248 VGIDASHQSFQFYSSGVYNEFECSSTK--LDHGVLVVGYGTYGSKDYWLVKNSWGTVWGN 305

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           +GY++MS  KDN CGV T  ++
Sbjct: 306 EGYIMMSRNKDNQCGVATDASF 327


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 97/199 (48%), Positives = 125/199 (62%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG ++ K  KL  LS+Q L+DCS  +GN GC+GG     +Q+I
Sbjct: 134 NQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYI 193

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G    ++  PY  QD  C        AT  GFV++   SED LK A+A  GPVSVAI
Sbjct: 194 KANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAI 253

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y  GVY +  C++S   LDH VL VGYG  +GK YW VKNSW   WG+ GY
Sbjct: 254 DASHGSFQLYSQGVYDEPDCSSSQ--LDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGY 311

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           +LMS  KDN CG+ ++ +Y
Sbjct: 312 ILMSRDKDNQCGIASSASY 330


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 129/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +    ED LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 106/222 (47%), Positives = 138/222 (62%), Gaps = 12/222 (5%)

Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           AIPV       +  +GY   + DQ  CGSCWSF TTGA+EG ++ K  KL  LS+Q LID
Sbjct: 130 AIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLID 189

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           CS  YGNNGC+GG    ++Q+I  + G  T+D Y PY   D  C        AT TG+ +
Sbjct: 190 CSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSY-PYEAADGPCRFKKEYVGATDTGYTD 248

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           +    E+ +K A+A  GPVSVAIDAS  SF  Y +GVY + +C+  P+GLDH VL VGYG
Sbjct: 249 LPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECD--PEGLDHGVLVVGYG 306

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
              G+ YW VKNSW T WG++GY+ MS  K+N CG+ +  +Y
Sbjct: 307 TELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSMASY 348


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 105/213 (49%), Positives = 142/213 (66%), Gaps = 11/213 (5%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+GG 
Sbjct: 125 KGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGL 184

Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
             +++Q+I  ++GL +++ Y PY+G  D  CH     + A  TGFV++    E AL  A+
Sbjct: 185 MDQAFQYIKDQNGLDSEESY-PYVGTDDQPCHYDPKYSAANDTGFVDIPSGKEHALMKAI 243

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
           A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DGK YW 
Sbjct: 244 AAVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDGKKYWI 301

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSWS  WG++GYV M+  + N+CG+ TA +Y
Sbjct: 302 VKNSWSENWGDKGYVYMAKDRHNHCGIATAASY 334


>gi|323450461|gb|EGB06342.1| hypothetical protein AURANDRAFT_59223 [Aureococcus anophagefferens]
          Length = 539

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 100/204 (49%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWS+GTTGA+EG +++KH  L  LSQQ L+DCSW  GNN CDGG D  +Y W+
Sbjct: 338 DQGSCGSCWSYGTTGAIEGQHFLKHGALVELSQQNLMDCSWPQGNNACDGGLDSNAYDWM 397

Query: 211 MK---HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++     +   + YG Y   D  CH A+    A +TG+VNVT     A   ALA  GP+S
Sbjct: 398 LEVNGGKVAAAEPYGGYKNADGACHAADGF-HAKITGYVNVT--GLPAFNDALANVGPLS 454

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           ++IDA+ + F +Y +G YY + C    D LDH VLAVG      + Y  VKNSWST+WG+
Sbjct: 455 ISIDATPEDFYYYKSGFYYSDDCKGGVDDLDHTVLAVGVKVHGDQRYTIVKNSWSTHWGD 514

Query: 328 QGYVLMSIKDNNCGVMTAPTYVTM 351
            GYV +S K+N CGV TA TY T+
Sbjct: 515 AGYVYISQKNNCCGVATAATYPTI 538


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 135/212 (63%), Gaps = 4/212 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG ++ K KKL  LS+Q L+DCS   GNNG
Sbjct: 121 VDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    +++++  +G    +   PYLGQD  C      + A +TGFV++   +E AL 
Sbjct: 181 CSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAECSGANVTGFVDIPSMNERALM 240

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GP+SVAIDA   SF FY +GVYY+ +C++S   LDH VL VGYG +    YW V
Sbjct: 241 KAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQ--LDHGVLVVGYGSIGKDEYWIV 298

Query: 318 KNSWSTYWGNQGYVLMS-IKDNNCGVMTAPTY 348
           KNSW   WG +GYVLM+  ++N+CG+ TA +Y
Sbjct: 299 KNSWGEEWGKKGYVLMAKFRNNHCGIATAASY 330


>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
          Length = 347

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 98/225 (43%), Positives = 143/225 (63%), Gaps = 7/225 (3%)

Query: 127 VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           V +N  + D++  R E      + +Q +CGSCW+F  TGA+EG ++    KL  LS+Q L
Sbjct: 124 VPFNVKAPDSVDWR-EHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNL 182

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           +DCS  YGN+GC+GG    ++++I   HG+ T++ Y PY+G++  CH       A   GF
Sbjct: 183 VDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGF 241

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
           V++    EDALK+A+A  GP+S+AIDA  +SF  Y  GVY+DE+C  S + LDH VL VG
Sbjct: 242 VDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVG 299

Query: 306 YG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YG + +   YW +KNSW T WG +GYV ++  ++N+CGV T  +Y
Sbjct: 300 YGTDPEAGDYWIIKNSWGTKWGEKGYVRIARNRNNHCGVATKASY 344


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T++ Y PY+G D   CH     + A  TGFV++    E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHA 239

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           +  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335


>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 330

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/213 (48%), Positives = 139/213 (65%), Gaps = 7/213 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F  TGA+EG Y+ K  KL  LS+Q L+DCS  + NNG
Sbjct: 119 VDWRKQGYVTHVKDQKECGSCWAFSATGALEGQYFKKTGKLVSLSEQQLVDCSRKFRNNG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GGE   ++Q+I  + GL T++ Y  Y  +D  CH    +  A  +G+VNV+P  EDAL
Sbjct: 179 CEGGEPHWAFQYIRYNGGLDTEESY-HYEAKDGQCHYNPDSVGAKCSGYVNVSP-FEDAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAID S+ SF  Y +GVY +  C+N    L+HAVLAVGYG  +G  YW 
Sbjct: 237 KEAVATIGPISVAIDISRVSFQLYHSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW + WGN+GY+ M+  KDN CG+ T  +Y
Sbjct: 295 VKNSWGSEWGNKGYIKMTRNKDNQCGIATEASY 327


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 102/205 (49%), Positives = 135/205 (65%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q+I
Sbjct: 97  DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 156

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH      +A  TGFV++    E AL  A+A  GPVSV
Sbjct: 157 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVSV 215

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA  +SF FY +G+YY++ C  S + LDH VL VGYG    ++DGK YW VKNSWS  
Sbjct: 216 AIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 273

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 274 WGDKGYIYMAKDRKNHCGIATAASY 298


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 139/234 (59%), Gaps = 19/234 (8%)

Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKL 178
           K+ K+  WV+    +    PV+          DQ  CGSCW+F  TGA+EG ++ K K L
Sbjct: 124 KLPKHVDWVKLGAVT----PVK----------DQGHCGSCWAFSATGALEGLHFRKTKVL 169

Query: 179 AVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTT 237
             LS+Q LIDCS   GNNGC+GG   +++Q++ +  G+ T+  Y PY G +  C      
Sbjct: 170 VSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSY-PYEGNNDVCRYEPEN 228

Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
           + A  TG+ +V    EDALK A+A  GPVSVAIDASQ+SF  Y +GVY++  C N P+ L
Sbjct: 229 SGAIDTGYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESL 288

Query: 298 DHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           DH VL VGYG  E   + YW VKNSW   WG  GY+ M+   DN CG+ T P++
Sbjct: 289 DHGVLVVGYGTDEETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSF 342


>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
          Length = 369

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 132/204 (64%), Gaps = 6/204 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG +  K  +L  LS+Q L+DC+  YGN GC+GG    ++Q+I
Sbjct: 169 NQGQCGSCWAFSATGALEGQHARKTGQLVSLSEQNLVDCTKKYGNMGCNGGLMDNAFQYI 228

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+  +  Y PY  +   CH       AT TGF +V    ED LKLA+A  GPVSVA
Sbjct: 229 KDNEGIDKEMTY-PYKAKAGRCHFKRNDVGATDTGFFDVAEGDEDKLKLAVATQGPVSVA 287

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y +GVY++E+CN  P+ LDH VL VGYG + +   YW VKNSWST+WG Q
Sbjct: 288 IDAGHRSFQLYKHGVYFEEECN--PEELDHGVLVVGYGTDPEHGDYWIVKNSWSTHWGEQ 345

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  ++NNCG+ +  +Y T+
Sbjct: 346 GYIRMAPNRNNNCGIPSHASYPTV 369


>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
          Length = 341

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/201 (50%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTGA+EG ++ K   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAYGNNGCNGGLMDNAFKYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C      + A   GFV++    E  L LALA  GPVSVA
Sbjct: 201 KDNDGIDTEKTY-PYEAVDDKCRYNPKNSGAEDVGFVDIPAGDEHKLMLALATVGPVSVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ+SF  Y +GVYYDE C  S + LDH VL VGYG + DG  YW VKNSW   WG++
Sbjct: 260 IDASQESFQLYSDGVYYDENC--SSENLDHGVLVVGYGTDEDGGDYWLVKNSWGPSWGDE 317

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DN+CG+ ++ +Y
Sbjct: 318 GYIKMARNRDNHCGIASSASY 338


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/202 (49%), Positives = 132/202 (65%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF  TG++EG  + K  KL  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 133 NQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              +G+ T+  Y PY G D +CH        +  GFV++   SE  L+ ALA  GP+SVA
Sbjct: 193 QDNNGIDTEASY-PYEGIDGHCHYDPKNKGGSDIGFVDIKKGSEKDLQKALATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDAS  SF FY +GVY ++KC  SP+ LDH VLAVGYG  E+ G+ YW VKNSWS  WG 
Sbjct: 252 IDASHMSFQFYSHGVYSEKKC--SPENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGE 309

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
            GY+ M+  KDN CG+ ++ +Y
Sbjct: 310 DGYIKMARNKDNMCGIASSASY 331


>gi|440800456|gb|ELR21495.1| cathepsin Llike proteinase [Acanthamoeba castellanii str. Neff]
          Length = 557

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 132/200 (66%), Gaps = 8/200 (4%)

Query: 159 WSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY------GNNGCDGGEDFRSYQWIMK 212
           W+FGTTG +EGA+++K  KL  LS+Q ++DC+WGY      GN GCDGG    + QW++ 
Sbjct: 359 WTFGTTGTLEGAWFLKTGKLVSLSEQQIVDCAWGYEWSGPSGNLGCDGGYASGAIQWVID 418

Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
           +G    ++  PYL QDA+C   + ++   +  +VNV+  SE AL++A+A  GPV+VAIDA
Sbjct: 419 NGGIALEESYPYLMQDAWCLKTDRSSGVKVQAYVNVSA-SEAALQMAVATAGPVAVAIDA 477

Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVL 332
           S   F FY +GV+Y  +C N  + LDH VLAVGYG  +G+ YW VKNSWST+WG  GYV 
Sbjct: 478 SHPEFLFYSSGVFYLAECKNDANDLDHEVLAVGYGTENGQDYWLVKNSWSTHWGENGYVK 537

Query: 333 MSI-KDNNCGVMTAPTYVTM 351
           M+  + NNCG+ T   Y  +
Sbjct: 538 MARNRGNNCGIATQANYAII 557


>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
          Length = 347

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG ++    KL  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 147 NQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 206

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY+G++  CH       A   GFV++    EDALK+A+A  GP+S+A
Sbjct: 207 KDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIA 265

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVY+DE+C  S + LDH VL VGYG + +   YW +KNSW T WG +
Sbjct: 266 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEK 323

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV ++  ++N+CGV T  +Y
Sbjct: 324 GYVRIARNRNNHCGVATKASY 344


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH      +A  TGF+++    E AL  A+A  GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSV 250

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DGK YW VKNSWS  
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 308

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH      +A  TGF+++    E AL  A+A  GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVSV 250

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DGK YW VKNSWS  
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 308

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333


>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
          Length = 352

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG ++    KL  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 152 NQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 211

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY+G++  CH       A   GFV++    EDALK+A+A  GP+S+A
Sbjct: 212 KDNHGIDTEEGY-PYVGKEMRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPISIA 270

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVY+DE+C  S + LDH VL VGYG + +   YW +KNSW T WG +
Sbjct: 271 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEK 328

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV ++  ++N+CGV T  +Y
Sbjct: 329 GYVRIARNRNNHCGVATKASY 349


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T++ Y PY+G D   CH     + A  TGFV++    E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHA 239

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           +  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/204 (49%), Positives = 130/204 (63%), Gaps = 6/204 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  CH    T  AT TGFV++    E+ +K A+A  GPVSVA
Sbjct: 199 KDNGGIDTEKSY-PYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF  Y  GVY + +C+     LDH VL VGYG +  G  YW VKNSW T WG Q
Sbjct: 258 IDASHESFQLYSEGVYNEPECDEQ--NLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQ 315

Query: 329 GYVLMSIKDNN-CGVMTAPTYVTM 351
           GY+ M+   NN CG+ TA +Y T+
Sbjct: 316 GYIKMARNQNNQCGIATASSYPTV 339


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q++
Sbjct: 132 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 191

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH   T  +   TGFV++    E AL  A+A  GPVSV
Sbjct: 192 KDNQGLDSEDSY-PYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSV 250

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DGK YW VKNSWS  
Sbjct: 251 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEK 308

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 309 WGDKGYIYMAKDRKNHCGIATAASY 333


>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
          Length = 376

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TG++EG +      L  LS+Q L+DCS  YGNNGC+GG    ++Q+I
Sbjct: 176 NQGMCGSCWAFSATGSLEGQHKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYI 235

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T+  Y PY  +   CH   ++  A  TGF+++    ED LK+A+A  GP+SVA
Sbjct: 236 KENHGIDTETSY-PYKARQKKCHFQRSSVGADDTGFMDLPEGDEDQLKIAVATQGPISVA 294

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVYY+++C  S + LDH VL VGYG + D   YW VKNSW T WG Q
Sbjct: 295 IDAGHRSFQLYKTGVYYEKEC--SSEQLDHGVLVVGYGTDPDHGDYWIVKNSWGTTWGEQ 352

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GYV M+  K+N+CG+ T  +Y
Sbjct: 353 GYVRMARNKNNHCGIATKASY 373


>gi|57964592|ref|XP_560736.1| Anopheles gambiae str. PEST AGAP012900-PA [Anopheles gambiae str.
           PEST]
 gi|55246680|gb|EAL42129.1| AGAP012900-PA [Anopheles gambiae str. PEST]
          Length = 135

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 87/134 (64%), Positives = 103/134 (76%)

Query: 218 QDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
           +D+YG Y GQD YCH+ N T  A +TG+VNVT    DA KLAL KHGP+S+AIDA  KSF
Sbjct: 2   EDEYGGYFGQDGYCHVQNMTLYAPITGWVNVTSGDPDAFKLALFKHGPLSIAIDAGHKSF 61

Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD 337
           SFY NGVYY+ +C N  D LDHAVLAVGYG+L+G+ YW +KNSWS YWGN GY LM++KD
Sbjct: 62  SFYSNGVYYEPECKNKLDELDHAVLAVGYGQLNGEDYWLIKNSWSNYWGNDGYALMAVKD 121

Query: 338 NNCGVMTAPTYVTM 351
           NNCG+ T  TYV M
Sbjct: 122 NNCGLTTDATYVLM 135


>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 294

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 140/230 (60%), Gaps = 6/230 (2%)

Query: 122 KYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
           K S + R  + +     V +  KGY   + DQ  CGSCW+F  TG++EG  Y K  KL  
Sbjct: 65  KGSAFFRLAEGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLVS 124

Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTAT 239
           LS+Q L+DCS  YGN GC GG    ++++I ++ G+ T++ Y PY  +D  C        
Sbjct: 125 LSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESY-PYEAEDGKCRFKPQNIG 183

Query: 240 ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDH 299
           A  TG+V+VT   EDALK A+A  GPVSVAIDAS  SF  Y +GVY + +C  S + LDH
Sbjct: 184 AKCTGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELEC--SSEDLDH 241

Query: 300 AVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VLAVGYG  +G+ YW VKNSW   WG +GY++MS  K N CG+ +  +Y
Sbjct: 242 GVLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRNKHNQCGIASMASY 291


>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/290 (40%), Positives = 162/290 (55%), Gaps = 22/290 (7%)

Query: 70  FKVNIYRLFFLRPRFHENE---KIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
           FK N  R+     RF   E   K+ YN  Y   ++    + EK     S  ++ S +   
Sbjct: 52  FKENAIRIAKHNDRFASGEVTFKVGYN-QYA--DMHTHEVTEKLNGYRSGLKQASAF--- 105

Query: 127 VRYNKASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL 181
              + AS D+ P    V +  KG    + DQ  CGSCWSF  TG++EG  ++K+K L  L
Sbjct: 106 --VHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSL 163

Query: 182 SQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATA 240
           S+Q L+DCSW +GN GC+GG    +++++  + G+ T++ Y PY  +D  C         
Sbjct: 164 SEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSYGGIDTEESY-PYTAEDGTCLYKAANNAG 222

Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
             TG+ +V   SE AL+ A+ K GPVSVAIDAS  SF  Y +G+YY+  C  S D LDH 
Sbjct: 223 VNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPAC--SSDSLDHG 280

Query: 301 VLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VLAVGYG E   K +W VKNSW T WG +GY+ M+  K NNCG+ T  +Y
Sbjct: 281 VLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 330


>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
          Length = 321

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 134/203 (66%), Gaps = 6/203 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG +++K+ +L  LS+Q L+DCS  YGN+GC GG    ++ +I
Sbjct: 123 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 182

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  C     +  A  TG V V  ++E+AL+ A++  GP+SVA
Sbjct: 183 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPISVA 240

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVYY++ C  SP  LDH VLAVGYG    K YW VKNSW + WG+ G
Sbjct: 241 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 298

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+ MS  +DNNCG+ + P+Y T+
Sbjct: 299 YIKMSRNRDNNCGIASEPSYPTV 321


>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 338

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T++ Y PY+G D   CH     + A  TGFV++    E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHA 239

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           +  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335


>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/290 (40%), Positives = 162/290 (55%), Gaps = 22/290 (7%)

Query: 70  FKVNIYRLFFLRPRFHENE---KIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
           FK N  R+     RF   E   K+ YN  Y   ++    + EK     S  ++ S +   
Sbjct: 52  FKENAIRIAKHNDRFASGEVTFKVGYN-QYA--DMHTHEVTEKLNGYRSGLKQASAF--- 105

Query: 127 VRYNKASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL 181
              + AS D+ P    V +  KG    + DQ  CGSCWSF  TG++EG  ++K+K L  L
Sbjct: 106 --VHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSL 163

Query: 182 SQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATA 240
           S+Q L+DCSW +GN GC+GG    +++++  + G+ T++ Y PY  +D  C         
Sbjct: 164 SEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESY-PYTAEDGTCLYKAANNAG 222

Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
             TG+ +V   SE AL+ A+ K GPVSVAIDAS  SF  Y +G+YY+  C  S D LDH 
Sbjct: 223 VNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPAC--SSDSLDHG 280

Query: 301 VLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VLAVGYG E   K +W VKNSW T WG +GY+ M+  K NNCG+ T  +Y
Sbjct: 281 VLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 330


>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 320

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 134/203 (66%), Gaps = 6/203 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG +++K+ +L  LS+Q L+DCS  YGN+GC GG    ++ +I
Sbjct: 122 DQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYI 181

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  C     +  A  TG V V  ++E+AL+ A++  GP+SVA
Sbjct: 182 KDNGGIDTESSY-PYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPISVA 239

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVYY++ C  SP  LDH VLAVGYG    K YW VKNSW + WG+ G
Sbjct: 240 IDASHFSFQFYSSGVYYEQNC--SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAG 297

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+ MS  +DNNCG+ + P+Y T+
Sbjct: 298 YIKMSRNRDNNCGIASEPSYPTV 320


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/236 (42%), Positives = 147/236 (62%), Gaps = 6/236 (2%)

Query: 115 SEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
           S G+KV + +++        D +  R +      + DQ  CGSCW+F  TG++EG +++ 
Sbjct: 91  SAGKKVPRGTMYQPLVDELPDTVDWR-DKGAVTPVKDQKACGSCWAFSATGSLEGQHFLS 149

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHI 233
             KL  LS+Q L+DCS  YGN GC GG    ++++I   +G+ T++ Y PY  ++  C  
Sbjct: 150 TGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEESY-PYEAKNGPCRF 208

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
            +    AT++ +V++   SED L+ A+A+ GPVSVAIDAS  +F FY  G+YYDEKC++S
Sbjct: 209 NSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFYSRGIYYDEKCSSS 268

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
              LDH VLAVGYG  D   YW VKNSW+  WG+ GY+ MS  ++NNCG+ +  +Y
Sbjct: 269 F--LDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNRNNNCGIASQASY 322


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 140/218 (64%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+ZG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 119 VDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C GG   +++Q++    GL ++D Y PYLG  D  CH      +   TGFV++    E A
Sbjct: 179 CGGGLMDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKEHA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DG
Sbjct: 238 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDVDG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASY 333


>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
          Length = 341

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 105/228 (46%), Positives = 143/228 (62%), Gaps = 9/228 (3%)

Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V + K+    IP  + +  KGY   + +Q  CGSCWSF  TG++EG ++ K   L  LS+
Sbjct: 114 VTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q LIDCS  YGNNGC+GG    ++++I  + GL T+  Y PY  +D  C      + AT 
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            GFV++    EDAL  ALA  GPVS+AIDAS + F FY  GV+Y+ +C+++   LDH VL
Sbjct: 233 KGFVDIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290

Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           AVGYG +  G  YW VKNSW   WG+QGY++M+  K NNCGV ++ +Y
Sbjct: 291 AVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASY 338


>gi|310975575|gb|ADP55136.1| truncated cathepsin L-like protein [Miichthys miiuy]
          Length = 246

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 102/205 (49%), Positives = 136/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q++
Sbjct: 42  DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 101

Query: 211 M-KHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH      +A  TGF++V    E AL  A+A  GPVSV
Sbjct: 102 KDNQGLDSEDAY-PYLGTGDQPCHYDPNYNSANDTGFIDVPSGKEHALMKAVAAVGPVSV 160

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDAS +SF FY +G+YY++ C  S + LDH VL VGYG    ++DGK YW VKNSWS  
Sbjct: 161 AIDASHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEK 218

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 219 WGDKGYIYMAKDRKNHCGIATAASY 243


>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
          Length = 336

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 104/221 (47%), Positives = 144/221 (65%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y   GY + + DQ  CGSCW+F TTGA+EG  + +  KL  LS+Q L+DCS   GN G
Sbjct: 119 IDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T+D Y PYLG  D  CH     + A  TGFV++    E A
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDTEDAY-PYLGTDDQDCHYDPKYSAANDTGFVDIPEGKERA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+Y++++C+++   LDH VL VGYG    ++DG
Sbjct: 238 LMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTE--LDHGVLVVGYGFEGEDVDG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y  M
Sbjct: 296 KKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336


>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
 gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
           Precursor
 gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
          Length = 531

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 101/226 (44%), Positives = 144/226 (63%), Gaps = 7/226 (3%)

Query: 129 YNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           ++  S  +IP   + +  N +    DQ +CGSCW+FG+TG++EG   + + +L  LS+Q 
Sbjct: 301 HDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQ 360

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATA-TMT 243
           L+DC+   G+ GC GG    ++Q++M+ G L T+ +Y PYL Q+  C     T +  ++T
Sbjct: 361 LVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY-PYLMQNGLCRDRTVTPSGVSIT 419

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
           G+VNVT  SE AL+ A+A  GPV++AIDAS   F +Y++GVY +  C N  D LDH VLA
Sbjct: 420 GYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLA 479

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           +GYG   G+ Y+ VKNSWST WG  GYV M+  DNN CGV +  TY
Sbjct: 480 IGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 146/235 (62%), Gaps = 16/235 (6%)

Query: 116 EGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
           E E +  +SL    +  +K A+ PV+          +Q  CGSCW+F  TG++EG ++ K
Sbjct: 110 EPENIEDFSLPKTVDWRTKGAVTPVK----------NQGQCGSCWAFSATGSLEGQHFRK 159

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHI 233
              +  LS+Q L+ CS  +GNNGC+GG    ++++I  + G+ T+  Y PY G D  CH 
Sbjct: 160 SGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSY-PYNGTDGTCHF 218

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
             +T  AT +GFV++   SE  LK A+A  GP+SVAIDAS +SF FY +GVY + +C++ 
Sbjct: 219 KKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDEPECDS- 277

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
            + LDH VL VGYG L+G  YW VKNSW T WG++GY+ MS  K N CG+ ++ +
Sbjct: 278 -ESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSAS 331


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 133/201 (66%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F +TGA+EG +     KL  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 155 NQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 214

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T+D Y PY+G++  CH    T  A   GFV++    E+ALK A+A  GP+S+A
Sbjct: 215 KENHGVDTEDSY-PYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIA 273

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVY+DE+C  S + LDH VL VGYG + +   YW VKNSW   WG +
Sbjct: 274 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEK 331

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 332 GYIRIARNRNNHCGVATKASY 352


>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
          Length = 337

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 105/213 (49%), Positives = 138/213 (64%), Gaps = 11/213 (5%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+GG 
Sbjct: 125 KGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGL 184

Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
             +++Q++    GL ++D Y PYLG  D  CH      +A  TGFV+V    E AL  A+
Sbjct: 185 MDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDVPSGKERALMKAV 243

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
           A  GPVSVAIDA  +SF FY +G+YY++ C  S + LDH VL VGYG    ++DGK YW 
Sbjct: 244 AAVGPVSVAIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGYEGEDVDGKKYWI 301

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 302 VKNSWSEKWGDKGYIYMAKDRKNHCGIATAASY 334


>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)

Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V + K+    +P  + +  KGY   + +Q  CGSCWSF  TG++EG ++ K   L  LS+
Sbjct: 114 VTFLKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q LIDCS  YGNNGC+GG    ++++I  + GL T+  Y PY  +D  C      + AT 
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            GFV++    EDAL  ALA  GPVS+AIDAS + F FY  GV+Y+ +C+++   LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290

Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           AVGYG +  G  YW VKNSW   WG+QGY++M+  K NNCGV ++ +Y
Sbjct: 291 AVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASY 338


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 100/213 (46%), Positives = 134/213 (62%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS  YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNG 171

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I + +G+ ++  Y PY  +D  C        AT TGFV++    E+ L
Sbjct: 172 CNGGLMDNAFTYIKENNGIDSEASY-PYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKL 230

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS  SF FY  GVY + KC+++   LDH VL VGYG   GK YW 
Sbjct: 231 KEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTE--LDHGVLVVGYGTESGKDYWL 288

Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           VKNSW+T WG++GY+ MS    N CG+ T  +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMSRNAKNQCGIATNASY 321


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 100/203 (49%), Positives = 132/203 (65%), Gaps = 6/203 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF  TGA+EG +Y    KL  LS+Q LIDCS  YGNNGC+GG   +++Q
Sbjct: 137 IKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQ 196

Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I   HGL T+  Y PY  ++  C        AT +G+V++   +E  LK A+A  GPVS
Sbjct: 197 YIKDNHGLDTEISY-PYEAENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVS 255

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG-KPYWQVKNSWSTYWG 326
           VAIDAS +SF FY  GVYY+ +C  S + LDH VL VGYG  D  + YW VKNSW   WG
Sbjct: 256 VAIDASAESFQFYREGVYYEPRC--SSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTWG 313

Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
           ++GY+ M+  KDN+CG+ ++ +Y
Sbjct: 314 DEGYIKMARNKDNHCGIASSASY 336


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 103/228 (45%), Positives = 145/228 (63%), Gaps = 10/228 (4%)

Query: 126 WVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           W  Y   S   IP  V +  KGY   + +Q  CGSCW+F TTG+VEG ++ K   L  LS
Sbjct: 101 WATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLS 160

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q LIDCS  YGNNGC GG    ++++I  + G+ T+  Y PYLGQ   CH +++   A 
Sbjct: 161 EQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSY-PYLGQQGSCHFSSSHVGAR 219

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
           +TG+ ++   SE AL+ A+A  GPVSVA+DASQ  + FY +GVY +  C+++   LDH V
Sbjct: 220 VTGYQDIPQGSEQALQSAVATVGPVSVAVDASQ--WQFYSSGVYDNPYCSSTQ--LDHGV 275

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           L +GYG  +G+ YW VKNSW   WG +GY++MS  K+N CG+ ++ +Y
Sbjct: 276 LVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIASSASY 323


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 141/218 (64%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T++ Y PY+G D   CH     + A  TGFV++    E A
Sbjct: 181 CNGGLMDQAFQYIQDNAGLDTEESY-PYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHA 239

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           +  A+A  GPVSVAIDA  +SF FY  G+YY+++C  S + LDH VL VGYG    ++DG
Sbjct: 240 MMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDG 297

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 298 KKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSY 335


>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
 gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
 gi|228243|prf||1801240A Cys protease 1
          Length = 322

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 140/224 (62%), Gaps = 7/224 (3%)

Query: 132 ASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           A+ ++  V +  KG    + DQ  CGSCW+F TTG +EG +++K  +L  LS+Q L+DC+
Sbjct: 102 AAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCA 161

Query: 191 WG-YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
            G Y N GC+GG   R+  ++  +G + T+  Y PY  +D  C   + T  AT TG+V +
Sbjct: 162 GGSYYNQGCNGGWVERAIMYVRDNGGVDTESSY-PYEARDNTCRFNSNTIGATCTGYVGI 220

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              SE ALK A    GP+SVAIDAS +SF  Y  GVYY+  C++S   LDHAVLAVGYG 
Sbjct: 221 AQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQ--LDHAVLAVGYGS 278

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             G+ +W VKNSW+T WG  GY+ M+  ++NNCG+ T   Y T+
Sbjct: 279 EGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYPTV 322


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 98/201 (48%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 140 DQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 199

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  CH    T  AT  GF ++    E  L  A+A  GPVSVA
Sbjct: 200 KDNGGIDTEKSY-PYEGIDDSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVA 258

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY + +C+  P  LDH VL VGYG + +GK YW VKNSW T WG++
Sbjct: 259 IDASHESFQFYSTGVYDEPQCD--PQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDK 316

Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
           G++ M+  D+N CG+ TA +Y
Sbjct: 317 GFIKMARNDDNQCGIATASSY 337


>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
 gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
          Length = 341

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)

Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V + K+    IP  V +  KGY   + +Q  CGSCWSF  TG++EG ++ K   L  LS+
Sbjct: 114 VTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q LIDCS  YGNNGC+GG    ++++I  + GL T+  Y PY  +D  C      + AT 
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            GFV++    EDAL  ALA  GPVS+AIDAS + F FY  GV+Y+ +C+++   LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290

Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           AVG+G +  G  YW VKNSW   WG++GY++M+  K NNCGV ++ +Y
Sbjct: 291 AVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG ++ +  KL  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 137 DQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYI 196

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  CH       AT  G+V++   +ED L+ A+A  GPVSVA
Sbjct: 197 KANGGIDTEQAY-PYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF  Y  GVYY+  C+ S   LDH VL VGYG E DG  YW VKNSW   WG+Q
Sbjct: 256 IDASHQSFQLYSGGVYYEPDCSASQ--LDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQ 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  ++NNCG+ T  +Y
Sbjct: 314 GYIKMARNRNNNCGIATEASY 334


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 101/213 (47%), Positives = 135/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 124 VDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTEGNMG 183

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   + +Q+++  HG+ ++D Y PY  +D  CH   +  +A +TGF +VT   E AL
Sbjct: 184 CEGGLMDQGFQYVIDNHGIDSEDCY-PYDAEDETCHYKASCDSAEVTGFTDVTSGDEQAL 242

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
             A+A  GPVSVAIDAS +SF  Y +GVY + +C++S   LDH VL VGYG   GK YW 
Sbjct: 243 MEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSE--LDHGVLVVGYGTDGGKDYWL 300

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG  GY+ MS  K N CG+ T+ +Y
Sbjct: 301 VKNSWGETWGLSGYIKMSRNKSNQCGIATSASY 333


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 99/207 (47%), Positives = 133/207 (64%), Gaps = 6/207 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +T A+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 134 AVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 193

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY G D  CH   +   AT TGFV++    E+AL  A+A  GPV
Sbjct: 194 RYIKDNGGIDTEKSY-PYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPV 252

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           SVAIDAS +SF  Y  GVY + +C+     LDH VL VGYG +  G  YW VKNSW T W
Sbjct: 253 SVAIDASHESFQLYSEGVYNEPECD--AQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTW 310

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           G+QGY+ M+  +DN CG+ TA +Y T+
Sbjct: 311 GDQGYIKMARNQDNQCGIATASSYPTV 337


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SE  LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
          Length = 341

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 100/201 (49%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF +TGA+EG +Y +   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  C        A   GFV++    E  L  A+A  GPVSVA
Sbjct: 201 KDNRGIDTEKSY-PYEGIDDKCRYNPKNTGADDNGFVDIPSGDEGKLMAAVATVGPVSVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ SF FY +GVY+DE C++S   LDH VL VGYG + +G  YW VKNSW   WG+ 
Sbjct: 260 IDASQSSFQFYSDGVYFDENCSSSS--LDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDL 317

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DN+CG+ TA +Y
Sbjct: 318 GYIKMARNRDNHCGIATAASY 338


>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 104/228 (45%), Positives = 143/228 (62%), Gaps = 9/228 (3%)

Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V + K+    IP  V +  KGY   + +Q  CGSCWSF  TG++EG ++ K   L  LS+
Sbjct: 114 VTFLKSENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q LIDCS  YGNNGC+GG    ++++I  + GL T+  Y PY  +D  C      + AT 
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPENSGATD 232

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            GFV++    EDAL  ALA  GPVS+AIDAS + F FY  GV+Y+ +C+++   LDH VL
Sbjct: 233 KGFVDIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290

Query: 303 AVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           AVG+G +  G  YW VKNSW   WG++GY++M+  K NNCGV ++ +Y
Sbjct: 291 AVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    +++++
Sbjct: 141 DQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYV 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y  Y G D  CH    +  AT  GF ++   +E  L  A+A  GPVSVA
Sbjct: 201 KDNGGIDTEKSY-AYEGIDDSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVSVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ+SF FY  GVY +  C  S + LDH VL VGYG E DG  YW VKNSW T WG++
Sbjct: 260 IDASQQSFQFYSEGVYDEPNC--SAENLDHGVLVVGYGTEKDGSDYWLVKNSWGTTWGDK 317

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           G++ MS  K+N CG+ +A +Y
Sbjct: 318 GFIKMSRNKENQCGIASASSY 338


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SE  LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYKAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
 gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
          Length = 336

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 102/215 (47%), Positives = 135/215 (62%), Gaps = 9/215 (4%)

Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+
Sbjct: 121 WREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCN 180

Query: 200 GGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
           GG   +++Q+I  +G    + + PYLG  D  CH   + + A  TGFV++    E AL  
Sbjct: 181 GGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMK 240

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPY 314
           A+   GPVSVAIDA  +SF FY +G+YY+  C  S + LDH VL VGYG     +DGK Y
Sbjct: 241 AVTAVGPVSVAIDAGHESFQFYQSGIYYEADC--SSEDLDHGVLVVGYGYEGENVDGKKY 298

Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           W VKNSWS  WGN+GY+ M+  + N+CG+ TA +Y
Sbjct: 299 WIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASY 333


>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
 gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
          Length = 307

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 136/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS  +GN G
Sbjct: 95  VDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKLVSLSEQNLVDCSGEFGNQG 154

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ T+D Y PY  +D  C        AT+TG+ +++   E AL
Sbjct: 155 CNGGLMDDAFKYIKANGGIDTEDSY-PYEARDGKCRFKPADVGATVTGYTDISEGDEGAL 213

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
             A+A  GP+SVAIDAS  +F  Y +GVYY+ +C+++   LDH VLAVGYG   GK YW 
Sbjct: 214 TQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTE--LDHGVLAVGYGTEGGKDYWL 271

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG  GY++MS  K+N CG+ T+ +Y
Sbjct: 272 VKNSWGEVWGQNGYIMMSRNKNNQCGIATSASY 304


>gi|156367164|ref|XP_001627289.1| predicted protein [Nematostella vectensis]
 gi|156214194|gb|EDO35189.1| predicted protein [Nematostella vectensis]
          Length = 514

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 119/195 (61%), Gaps = 1/195 (0%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q +CGSC++    GAVEGAY+MK  KL  LS Q +IDCSWG GN GC GG   ++  WI 
Sbjct: 321 QGICGSCYALAAVGAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIY 380

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            HG+ + + YGPYLGQ+  C I      A +  F  V   +  ALK+++A+ GP  V+I+
Sbjct: 381 LHGIASAESYGPYLGQEGTCRIEGLRRAAAIDAFAFVPKYNNTALKISVARFGPAVVSIN 440

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
            +  S  FY  G+Y D +C      + H+VL VGYG  DG+PYW VKNSWST WG  GY+
Sbjct: 441 ENPLSLKFYSWGLYDDPECGRDTAAV-HSVLVVGYGVEDGEPYWLVKNSWSTTWGMDGYI 499

Query: 332 LMSIKDNNCGVMTAP 346
            ++ K N CGV   P
Sbjct: 500 KIAWKRNTCGVTRNP 514



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 59/137 (43%), Gaps = 26/137 (18%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G  +T+Q +  G +G + K+ PM T+ +   I C+H  G     I  Q +L   +     
Sbjct: 45  GTDRTFQRADVGPHGEAFKIVPMYTDEKGGYIGCWHLEGTERTPIVVQPILIIST----- 99

Query: 74  IYRLFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKAS 133
                             ++W + G E+ +G    KW         ++ +++WV     +
Sbjct: 100 ------------------HDWKFAGYEVYHGASTAKWEYRYLAFGLMNAHTIWV---TTA 138

Query: 134 KDAIPVRYEMKGYNSLL 150
               PVRYEMKGY++LL
Sbjct: 139 DPPRPVRYEMKGYDNLL 155


>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 100/218 (45%), Positives = 136/218 (62%), Gaps = 8/218 (3%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P  V +  KGY   + DQ  CGSCW+F TTG +EG ++ K  KL  LS+Q L+DCS  
Sbjct: 117 ALPDTVDWRDKGYVTEVKDQKQCGSCWAFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHS 176

Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           +GNNGC+GG   R+ Q+I  + G+ T+  Y PY  +   C        A  TG+V+V P+
Sbjct: 177 FGNNGCNGGSVKRALQYIQANGGIDTETSY-PYKAKGQRCRYKPDGIGAKCTGYVHVKPS 235

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           +E+ LK A+A  GP+SV IDAS+ SF FY +GVY D  C+ +   LDH  LAVGYG  +G
Sbjct: 236 NEETLKKAVATLGPISVGIDASRHSFQFYQSGVYDDPDCSKTV--LDHGALAVGYGTENG 293

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
             YW +KNSW   WG++GY+ MS  K N CG+ +  +Y
Sbjct: 294 HDYWLIKNSWGLRWGDKGYIKMSRNKSNQCGIASEASY 331


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SE  LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SE  LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 128/200 (64%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG +++K+ +L  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 133 DQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C        AT TG+V +   SE  LK A+A  GP+SVA
Sbjct: 193 KANDGIDTEKSY-PYEAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVY + +C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 252 IDASHSSFQLYSEGVYDEPEC--SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 309

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+LMS  +NN CG+ +  +Y
Sbjct: 310 YILMSRDNNNQCGIASQASY 329


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG+  ++ +Q  CGSCW+F +TG++EG ++ K  KL  LS+Q L+DCS  YGNNG
Sbjct: 113 VDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNG 172

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ T+  Y PY  +D  CH    +  AT+TG+ +V   SE  L
Sbjct: 173 CEGGLMDYAFKYIKNNDGIDTEQSY-PYTARDGQCHFKPGSVGATVTGYTDVQRGSEGDL 231

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDA   SF  Y  GVY +  C+++   LDH VLAVGYG  DGK YW 
Sbjct: 232 QSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQ--LDHGVLAVGYGAEDGKDYWL 289

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG  GY+ MS  KDN CG+ T  +Y
Sbjct: 290 VKNSWGEGWGMNGYIKMSRNKDNQCGIATQASY 322


>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
          Length = 324

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 136/216 (62%), Gaps = 6/216 (2%)

Query: 136 AIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           A  V +  KG  S + +Q  CGSCWSF  TG++EG + +K  +L  LS+Q L+DCS  +G
Sbjct: 109 AASVDWRQKGVVSEVKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFG 168

Query: 195 NNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
           N+GC GG    ++++++  HG+ T+  Y PY  +D YC        AT T + ++   SE
Sbjct: 169 NHGCKGGIMDDAFRYVISNHGVDTESSY-PYTAKDGYCRFNQNNVGATETSYRDIARGSE 227

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
            +L  A A+ GP+SVAIDAS +SF FY NGVYY+  C++S   LDH VL VGYG   G+ 
Sbjct: 228 SSLTQASAQIGPISVAIDASHRSFQFYKNGVYYEPSCSSSR--LDHGVLVVGYGTEGGQD 285

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           Y+ VKNSW T WG  GY++MS  + NNCG+ +  +Y
Sbjct: 286 YFIVKNSWGTRWGMDGYIMMSRNRRNNCGIASQASY 321


>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
          Length = 337

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 142/218 (65%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + +  KL  LS Q L++CS   GN G
Sbjct: 120 VDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPEGNEG 179

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++    GL ++D Y PYLG  D  CH     + A  TGFV++   +E A
Sbjct: 180 CNGGLMDQAFQYVKDNQGLDSEDSY-PYLGTDDQPCHYDPKFSAANDTGFVDIPSGNERA 238

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    ++DG
Sbjct: 239 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFQGEDVDG 296

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K +W VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 297 KKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASY 334


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 97/200 (48%), Positives = 127/200 (63%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTG+VEG +  K  +L  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 135 DQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYI 194

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ T+  Y PY  +D  C        AT++ F ++T  SE  L+ A+A  GPVSVA
Sbjct: 195 ITNKGIDTEASY-PYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS+ SF  Y +GVY ++KC  S   LDH VLA GYG  +G PYW VKNSW + WG  G
Sbjct: 254 IDASKNSFQLYTSGVYNEKKC--SSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAG 311

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+ MS   NN CG+ T+ +Y
Sbjct: 312 YIWMSRNANNQCGIATSASY 331


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 137/213 (64%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTG++EG ++ K  KL  LS+Q L+DCS  YGNNG
Sbjct: 112 VDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNG 171

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I ++ G+ ++  Y PY  +D  C     +  AT TGFV++   +E+ L
Sbjct: 172 CNGGLMDNAFTYIKENKGIDSEASY-PYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKL 230

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS +SF FY +GVY +  C+++   LDH VL VGYG   GK YW 
Sbjct: 231 KEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTE--LDHGVLVVGYGTESGKDYWL 288

Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           VKNSW+T WG++GY+ M     N CG+ T  +Y
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASY 321


>gi|228244|prf||1801240B Cys protease 2
          Length = 323

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 107/285 (37%), Positives = 159/285 (55%), Gaps = 18/285 (6%)

Query: 79  FLRPRFHENEKI--RYNWTYIGEELVNGIILEKWRLVTSEG----------EKVSKYSLW 126
           + R  F +N+K    +N  Y   E+   + + K+  +T E            + +  S++
Sbjct: 39  YRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRSAPVSVF 98

Query: 127 VRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
               +    A  V +  KG    + DQ  CGSCW+F TTG++EG +++K   L  L++Q 
Sbjct: 99  YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQ 158

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DCS  YG  GC+GG    ++ +I   +G+ T+  Y PY  +D  C   + +  AT +G
Sbjct: 159 LVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASY-PYEARDGSCRFDSNSVAATCSG 217

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
             N+   SE  L+ A+   GP+SV IDA+  SF FY +GVYY+  C  SP  LDHAVLAV
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAV 275

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GYG   G+ +W VKNSW+T WG+ GY+ MS  ++NNCG+ T  +Y
Sbjct: 276 GYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASY 320


>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 340

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 3/213 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY N + +Q  CGSCW+F TTGA+EG  + K   L  LS+Q L+DC+  YGN  
Sbjct: 125 VDWRTKGYVNPVKNQGQCGSCWAFSTTGALEGQTFRKTGVLPDLSEQNLVDCTQSYGNEA 184

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ ++  Y  Y     YC+       A+ TGFV++    EDAL
Sbjct: 185 CNGGWMDNAFKYISDNKGIDSEAGYPYYAKALGYCYYNQQFNVASDTGFVDIASGDEDAL 244

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K+A+A  GP+SVAIDA++ SF  Y +GVYY+  C N  + LDHAVL VGYG  DG+ +W 
Sbjct: 245 KVAVATVGPISVAIDATKDSFMRYQSGVYYEPTCGNGLENLDHAVLVVGYGTEDGRDFWL 304

Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           VKNSW   WG+QGY+ MS    N CG+ T  +Y
Sbjct: 305 VKNSWDITWGDQGYIKMSRNMSNQCGIATKASY 337


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 144/234 (61%), Gaps = 9/234 (3%)

Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
           K +K S ++  N   K    V +  KGY   + DQ  CGSCW+F  TG++EG ++ K  K
Sbjct: 102 KAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGK 161

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
           L  LS+Q L+DCS    N GC+GG   R++Q+I+  G + T++ Y PY+  D  CH    
Sbjct: 162 LVSLSEQNLVDCS--DKNYGCNGGLMDRAFQYIIDAGGIDTEESY-PYIAMDGNCHFKTA 218

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
              AT+TG+ +VT  SE AL+ A+A  GP+SVAIDAS  SF  Y +GVY +  C+++   
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTL-- 276

Query: 297 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LDH VLAVGYG  +DG  YW VKNSW+  WG  GY+ MS  KDN CG+ T  +Y
Sbjct: 277 LDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASY 330


>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
          Length = 336

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 143/221 (64%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y   GY + + DQ  CGSCW+F TTGA+EG  + +  KL  LS+Q L+DCS   GN G
Sbjct: 119 IDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL T+D Y PYLG  D  CH     + A  TGFV++    E A
Sbjct: 179 CNGGLMDQAFQYIKDNGGLDTEDAY-PYLGTDDQDCHYDPKYSAANDTGFVDIPEGKERA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  + F FY +G+Y++++C+++   LDH VL VGYG    ++DG
Sbjct: 238 LMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTE--LDHGVLVVGYGFEGEDVDG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y  M
Sbjct: 296 KKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336


>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
          Length = 334

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 100/213 (46%), Positives = 134/213 (62%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TG++EG  + K  KL  LS+Q L+DCS  YGN G
Sbjct: 122 VDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I    G+ T++ Y PY  +D  C        AT TG+V+V+   EDAL
Sbjct: 182 CGGGLMDDAFRYIQATGGIDTEESY-PYEAEDGECRYKPDAVGATCTGYVDVSSGDEDAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SV IDAS  SF  Y +G+Y + +C++S   LDH VLAVGYG  +G+ YW 
Sbjct: 241 QEAVATIGPISVGIDASHISFQLYESGLYDEPQCSSSE--LDHGVLAVGYGSENGQDYWL 298

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG+QGY+ MS  K N CG+ TA +Y
Sbjct: 299 VKNSWGLTWGDQGYIKMSKNKSNQCGIATAASY 331


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 103/214 (48%), Positives = 132/214 (61%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF TTG++EG ++ +  KL  LS+Q L+DCS  +GN G
Sbjct: 147 VDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEG 206

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I    GL  +DDY PY  +   CH+  +   A  TG  +V    EDAL
Sbjct: 207 CNGGLMDNAFEYIKSIGGLEGEDDY-PYTAKQGKCHLKKSLFKANDTGCTDVESGDEDAL 265

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
           K ALA  GP+SVAIDAS  SF  Y  GVY +E+C  S   LDH VL VGYG E +G  YW
Sbjct: 266 KDALASVGPISVAIDASHASFQSYDGGVYDEEEC--SSQNLDHGVLTVGYGTEENGGDYW 323

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG +GY+ MS  KDN CG+ T  +Y
Sbjct: 324 LVKNSWGEMWGEEGYIKMSRNKDNQCGIATQASY 357


>gi|308474437|ref|XP_003099440.1| CRE-CPL-1 protein [Caenorhabditis remanei]
 gi|308266846|gb|EFP10799.1| CRE-CPL-1 protein [Caenorhabditis remanei]
          Length = 337

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +  K  KL  LS+Q L+DCS  YGN+GC+GG   +++++I
Sbjct: 137 NQGMCGSCWAFSATGALEGQHARKLGKLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 196

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T+D Y PY G+D  CH +     A   G+ ++    E+ LK+A+A  GP+S+A
Sbjct: 197 RDNHGVDTEDSY-PYKGRDMKCHFSKKDVGADDKGYTDLPEGDEEQLKIAVATQGPISIA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW T WG +
Sbjct: 256 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 314 GYIRIARNRNNHCGVATKASY 334


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 101/217 (46%), Positives = 135/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG +Y K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY  + D  CH      +A  TGFV+V    E  L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA  +SF FY +G+YY+ +C  S + LDH VL VGYG    ++DGK
Sbjct: 238 MKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVDGK 295

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSWS  WG+ GY+ +   + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASY 332


>gi|391328503|ref|XP_003738728.1| PREDICTED: digestive cysteine proteinase 3-like [Metaseiulus
           occidentalis]
          Length = 506

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 98/214 (45%), Positives = 136/214 (63%), Gaps = 6/214 (2%)

Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           Y ++GY   + DQ  CGSCW+F TTG++EG ++    KL  LS+Q L+DCS   GNNGC+
Sbjct: 296 YWLEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGDEGNNGCE 355

Query: 200 GGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
           GG   + + +I  + G+ T++ Y PY  +D  C   +    A +TGFV++   SE AL+ 
Sbjct: 356 GGLMDQGFTYIKNNGGIDTEESY-PYNAEDGDCAFKSNAVGARVTGFVDIDSGSEKALQK 414

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
           A+A  GPVSVAIDAS  SF  Y  G+Y +  C+++   LDH VLAVGYG  +G  YW VK
Sbjct: 415 AVATVGPVSVAIDASNDSFQLYKEGIYDEPACSSTQ--LDHGVLAVGYGSENGVDYWLVK 472

Query: 319 NSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           NSW+T WG  GY+ M+  KDN CG+ +  +Y T+
Sbjct: 473 NSWNTVWGQDGYIKMARNKDNQCGIASQASYPTV 506



 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 109/179 (60%), Gaps = 7/179 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y   G+   + +Q +CGSCW+F  TG++EG   +++  L  LS+Q L+DCS    N G
Sbjct: 124 VDYRKSGHVTPVKNQGLCGSCWAFSATGSLEGQLSIQNGTLVSLSEQNLLDCS--RENQG 181

Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG   +++++I K+G + T++ Y PY G+   C        A +TG V+V    E AL
Sbjct: 182 CDGGYMDKAFEYIKKNGGIDTEESY-PYTGRKGKCMFKKKNIGARVTGHVDVPAEDEQAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           KLA+AK GP+SV IDAS+ SF FY  G+Y +  C+ S   LDH VL VGYG   GK YW
Sbjct: 241 KLAVAKIGPISVGIDASKDSFRFYKEGIYDESSCSTSQ--LDHGVLVVGYGSEKGKDYW 297


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 101/216 (46%), Positives = 134/216 (62%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF  TG++EG  + K  KL  LS+Q L+DCS   GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHG 177

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I   +G+ T+  Y PY  +D  C   +    AT TGFV++    E+AL
Sbjct: 178 CEGGLMDDAFTYIKANNGIDTEASY-PYKARDGKCEFKSADVGATDTGFVDIKTKDEEAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS  SF  Y  GVY+D  C+ +   LDH VLAVGYG  D K YW 
Sbjct: 237 KQAVATVGPISVAIDASHMSFQLYRTGVYHDWFCSQTK--LDHGVLAVGYGTEDSKDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW   WG +GY+ MS  + NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIQMSRNRRNNCGIATSASYPTV 330


>gi|268560858|ref|XP_002638172.1| C. briggsae CBR-CPL-1 protein [Caenorhabditis briggsae]
          Length = 336

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +  K  +L  LS+Q L+DCS  YGN+GC+GG   +++++I
Sbjct: 136 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 195

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY G+D  CH    T  A   G+V+     E+ LK+A+A  GP+S+A
Sbjct: 196 RDNHGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIA 254

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW T WG +
Sbjct: 255 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 312

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 313 GYIRIARNRNNHCGVATKASY 333


>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
 gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 323

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 107/285 (37%), Positives = 159/285 (55%), Gaps = 18/285 (6%)

Query: 79  FLRPRFHENEKI--RYNWTYIGEELVNGIILEKWRLVTSEG----------EKVSKYSLW 126
           + R  F +N+K    +N  Y   E+   + + K+  +T E            + +  S++
Sbjct: 39  YRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRSAPVSVF 98

Query: 127 VRYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
               +    A  V +  KG    + DQ  CGSCW+F TTG++EG +++K   L  L++Q 
Sbjct: 99  YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQ 158

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DCS  YG  GC+GG    ++ +I   +G+ T+  Y PY  +D  C   + +  AT +G
Sbjct: 159 LVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY-PYEARDGSCRFDSNSVAATCSG 217

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
             N+   SE  L+ A+   GP+SV IDA+  SF FY +GVYY+  C  SP  LDHAVLAV
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSC--SPSYLDHAVLAV 275

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GYG   G+ +W VKNSW+T WG+ GY+ MS  ++NNCG+ T  +Y
Sbjct: 276 GYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASY 320


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 141/218 (64%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 120 LDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEG 179

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I   +GL +++ Y PYLG  D  CH       A  TGFV++    E A
Sbjct: 180 CNGGLMDQAFQYIKDNNGLDSEEAY-PYLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHA 238

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+Y++++C  S + LDH VL VGYG    ++DG
Sbjct: 239 LMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKEC--SSEELDHGVLVVGYGFEGEDVDG 296

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 297 KKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASY 334


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 92/201 (45%), Positives = 132/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F +TGA+EG +     KL  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 154 NQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T+D Y PY+G++  CH       A   GFV++    E+ALK A+A  GP+S+A
Sbjct: 214 KENHGVDTEDSY-PYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIA 272

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVY+DE+C  S + LDH VL VGYG + +   YW VKNSW   WG +
Sbjct: 273 IDAGHRSFQLYKKGVYFDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEK 330

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 331 GYIRIARNRNNHCGVATKASY 351


>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/217 (46%), Positives = 136/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG +Y    K+  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY  + D  CH      +A  TGFV+VT  SE  L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSESEKDL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVA+DA  +SF FY +G+YY+ +C  S + LDH VL VGYG    + DGK
Sbjct: 238 MNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVLVVGYGFEGEDEDGK 295

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WGN GY+ ++  + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASY 332


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 138/221 (62%), Gaps = 8/221 (3%)

Query: 133 SKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           SK A+P  V +  KGY   + DQ  CGSCW+F TTGA+EG ++ +  KL  LS+Q L+DC
Sbjct: 113 SKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDC 172

Query: 190 SWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           S   GN GC+GG   +++++I + +G+ T+D Y PY   D  C        AT TGF ++
Sbjct: 173 SGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSY-PYEAVDNQCRFKAANVGATDTGFTDI 231

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T   E AL+ A+A  GP+SVAIDA   SF  Y +GVY +  C+ +   LDH VLAVGYG 
Sbjct: 232 TSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTR--LDHGVLAVGYGT 289

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
             GK YW VKNSW   WG++GY+ M+  K N CG+ TA +Y
Sbjct: 290 DSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAASY 330


>gi|341878328|gb|EGT34263.1| CBN-CPL-1 protein [Caenorhabditis brenneri]
          Length = 336

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +  K  +L  LS+Q L+DCS  YGN+GC+GG   +++++I
Sbjct: 136 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 195

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY G+D  CH    T  A   G+V+     E+ LK+A+A  GP+S+A
Sbjct: 196 RDNHGVDTEESY-PYKGRDMKCHFNKKTIGADDKGYVDTPEGDEEQLKIAVATQGPISIA 254

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW T WG +
Sbjct: 255 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEK 312

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 313 GYIRIARNRNNHCGVATKASY 333


>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
 gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
          Length = 531

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 94/200 (47%), Positives = 132/200 (66%), Gaps = 3/200 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ VCGSCW+FG+TG++EG   + +  L  LS+Q L+DC++  G+ GC+GG    ++Q+I
Sbjct: 326 DQGVCGSCWTFGSTGSLEGTNCVTNGYLVSLSEQQLVDCAYLMGSQGCNGGFAASAFQYI 385

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVSV 268
           M  G + T+ DY  YL Q+A C   +TT +   ++ +VNVT  S +AL  A+A  GPV++
Sbjct: 386 MDAGGIATESDY-QYLMQNALCKDKSTTFSGVGVSSYVNVTAGSINALLNAVATQGPVAI 444

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AIDAS   F +Y +G+Y +  C N PD LDH VLA+GYG L+G  YW VKNSWST WG +
Sbjct: 445 AIDASVDDFRYYQSGIYSNPSCKNGPDDLDHEVLAIGYGTLNGVDYWLVKNSWSTNWGME 504

Query: 329 GYVLMSIKDNNCGVMTAPTY 348
           GY ++   +N CG  +  TY
Sbjct: 505 GYFMLERANNLCGPASQATY 524


>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 98/201 (48%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 138 DQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYI 197

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PYL +D  CH     + AT  GFV++   +ED LK A+A  GPVS+A
Sbjct: 198 KDNGGIDTEKSY-PYLAEDEKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIA 256

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS ++F  Y +GVY D +C  S   LDH VL VGYG   DG+ YW VKNSW   WG  
Sbjct: 257 IDASHETFQLYSDGVYSDPEC--SSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLN 314

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DN CGV +  +Y
Sbjct: 315 GYIKMARNQDNMCGVASQASY 335


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 110/258 (42%), Positives = 155/258 (60%), Gaps = 13/258 (5%)

Query: 98  GEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCG 156
           GEE V   ++   +   S   +  K+  + ++   + D++  R E  GY   + DQ  CG
Sbjct: 84  GEEFVQ--MMNGLKFDASRERQGIKFLSYAKFQ--APDSVDWRDE--GYVTPVKDQGQCG 137

Query: 157 SCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GL 215
           SCW+F TTG++EG ++     L  LS+Q L+DCS  YGNNGC+GG    ++Q+I  + G+
Sbjct: 138 SCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGI 197

Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
            T+D Y PY  +D  C  +     AT +G+V+V    EDALK A A +GP+SVAIDAS +
Sbjct: 198 DTEDKY-PYEAEDDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHE 256

Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMS 334
           SF  Y +GVY +E C++    LDH VL VGYG +  G  YW VKNSW   WG +GY+ MS
Sbjct: 257 SFQLYESGVYDEESCSSIE--LDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMS 314

Query: 335 I-KDNNCGVMTAPTYVTM 351
             KDN CG+ T+ +Y T+
Sbjct: 315 RNKDNQCGIATSASYPTV 332


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 112/277 (40%), Positives = 152/277 (54%), Gaps = 14/277 (5%)

Query: 84  FHENEKIRYNWTYIGEELVNGII-------LEKWRLVTSEGEKVSKYSLWVRYNKASKDA 136
           F +N      W   G + + G+         E  R+       V K +L +     SK  
Sbjct: 55  FQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDVSKAP 114

Query: 137 IPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
             V +   G   ++ +Q  CG C+SF TTG+VEG + +  K+L  LS+Q ++DCS   GN
Sbjct: 115 ASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGN 174

Query: 196 NGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
           NGCDGG    S+++I+   GL T+  Y PY G    C        AT+TG+ NV   SE 
Sbjct: 175 NGCDGGLMTNSFEYIIAVGGLDTEASY-PYEGVVGKCKFNKANIGATITGYKNVKSGSES 233

Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
            L+ A+A   PVSVAIDASQ SF  Y +GVYY+  C+++   LDH VLAVGYG   G+ Y
Sbjct: 234 DLQTAVAAQ-PVSVAIDASQNSFQLYSSGVYYEPACSSTQ--LDHGVLAVGYGSQSGQDY 290

Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           W VKNSW   WG +G++LM+  K NNCG+ T  +Y T
Sbjct: 291 WIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPT 327


>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
 gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/217 (46%), Positives = 136/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG +Y    K+  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY  + D  CH      +A  TGFV+VT  SE  L
Sbjct: 178 CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSGSEKDL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVA+DA  +SF FY +G+YY+ +C  S + LDH VL VGYG    + DGK
Sbjct: 238 MNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPEC--SSEDLDHGVLVVGYGFEGEDEDGK 295

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WGN GY+ ++  + N+CG+ TA +Y
Sbjct: 296 KYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASY 332


>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 691

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 131/216 (60%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTG++EG  +    KL   S+Q L+DCS  YGN G
Sbjct: 479 VDWRTKGYVTEVKDQGACGSCWAFSTTGSMEGQSFKNTGKLVSFSEQQLVDCSGSYGNMG 538

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG   +++ +I  +G+  + DY PY  +D  C    + A AT TG+ ++    E AL+
Sbjct: 539 CGGGLMDQAFAYIEDYGIEPEADY-PYTAKDDPCSYDTSKAVATNTGYTDIATMDEKALQ 597

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQ 316
            A+A  GP+SVAIDAS  SF  Y +GVY +  C+ +   LDH VLAVGYG   DG  YW 
Sbjct: 598 QAVATVGPISVAIDASHSSFRLYKSGVYDEPACSQTM--LDHGVLAVGYGTTDDGNDYWI 655

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW + WGNQGY+ MS   DN CG+ T  +Y  M
Sbjct: 656 VKNSWGSTWGNQGYIHMSRNNDNQCGIATNASYPLM 691


>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
          Length = 325

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 131/205 (63%), Gaps = 7/205 (3%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F TTG+VEG Y++K+KKL   S+Q L+DCS  + N GC+GG    ++
Sbjct: 121 AVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAF 180

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T+D Y PY   D  C    T A   ++ F +V   SED LKLA+A+ GP+
Sbjct: 181 KYLIANKGIATEDTY-PYTATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPI 239

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--PYWQVKNSWSTY 324
           SVAIDAS   F FY  GVY DE+C  S   LDH VLAVGYG   G    YW VKNSWS  
Sbjct: 240 SVAIDASSGDFQFYKKGVYVDEEC--SSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSAS 297

Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
           WG+QGY+ M+    N CG+ +  +Y
Sbjct: 298 WGDQGYIKMARNHKNMCGIASLASY 322


>gi|395514296|ref|XP_003761355.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 262

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/205 (46%), Positives = 136/205 (66%), Gaps = 7/205 (3%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+ D+  CGSCW+F  TG++EG ++ K  KL  LS+Q L+DCS   GN+GC GG    ++
Sbjct: 58  SIKDKGQCGSCWAFSATGSLEGQWFHKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAF 117

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           +++ K+ G+ T++ Y PY+G+D  CH  +  + A +TG+V++    E AL  A+A  GP+
Sbjct: 118 EYVKKNGGIDTEESY-PYVGKDGTCHYNSQCSGANVTGYVDIPAGVERALAKAVATVGPI 176

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTY 324
           SVAIDA   SF FY +GVYY+ +C  S + LDH VL VG+G    +GK YW VKNSW   
Sbjct: 177 SVAIDAGHSSFQFYRSGVYYEPEC--SSEELDHGVLVVGFGVEGKNGKKYWIVKNSWGEE 234

Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
           WG++GYVLM+   NN CG+ TA +Y
Sbjct: 235 WGDRGYVLMTRDHNNHCGIATAASY 259


>gi|195995655|ref|XP_002107696.1| hypothetical protein TRIADDRAFT_19901 [Trichoplax adhaerens]
 gi|190588472|gb|EDV28494.1| hypothetical protein TRIADDRAFT_19901 [Trichoplax adhaerens]
          Length = 569

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 93/252 (36%), Positives = 145/252 (57%), Gaps = 15/252 (5%)

Query: 115 SEGEKVSKYSLWVRYNKASKDAIPVRYE------------MKGYNSLLD-QSVCGSCWSF 161
           +E E  +      R+  A+K+ IP +Y+             KG  + +  Q+ CGSC++F
Sbjct: 313 TEKELTNMRGRLRRFYHATKNYIPWKYDPHEKPLESIDWRKKGAVTPIKYQAHCGSCFAF 372

Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
                +E A + +  +L  LS+Q++IDCSWGYGN  C+GG ++ +YQ++ KHG+P+   Y
Sbjct: 373 SAVTPIESALFRQTGRLIPLSEQSIIDCSWGYGNYACNGGMEYAAYQFVTKHGIPSAKCY 432

Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSE-DALKLALAKHGPVSVAIDASQKSFSFY 280
           GPYL    YCH+ N T    + G+  V P+    A++LA++K GP+S+A +    SF FY
Sbjct: 433 GPYLASPGYCHLLNCTHRVRLDGYSIVLPSGNVSAVRLAVSKIGPISIAFNVHSLSFHFY 492

Query: 281 VNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM-SIKDNN 339
            +G++ D  C N    ++HA+  VGYG+L  + YW ++NSWS +WG  GY+ M S   N 
Sbjct: 493 SSGIFEDPTCYNDIHHINHALNLVGYGKLGNEHYWIIRNSWSEHWGENGYMRMKSTNANE 552

Query: 340 CGVMTAPTYVTM 351
           C V T  T+  +
Sbjct: 553 CAVATFATFPVL 564



 Score = 41.6 bits (96), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 4/63 (6%)

Query: 14  GMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAYGIPDATIEPQSVLPDVSDFKVN 73
           G+  T     +G  G + K+ P+TTET+ N  SC   +G P     PQ+ +P +    VN
Sbjct: 82  GVSTTIIRRDDGPFGSAYKIMPITTETQYNVKSCVAIHGRPRHPYRPQTAIPSL----VN 137

Query: 74  IYR 76
           I+R
Sbjct: 138 IFR 140


>gi|405966500|gb|EKC31778.1| Cathepsin L [Crassostrea gigas]
          Length = 271

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 133/213 (62%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCWSF  TG++EG ++   KKL  LS+Q L+DCS   GN+G
Sbjct: 59  VDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSQREGNHG 118

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I  + G+ T++ Y PY  ++ +CH       AT TG+V++    ED L
Sbjct: 119 CQGGLMDNAFRYIESNKGIDTEESY-PYTAKNGFCHFKKENVGATDTGYVDIPHMQEDKL 177

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDA  KSF  Y  GVY +  C++S   LDH VLAVGYG   G  YW 
Sbjct: 178 QEAVATVGPISVAIDAGHKSFQLYREGVYSEPACSSSK--LDHGVLAVGYGTESGDDYWL 235

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW T WG QGYV+M+  K N CG+ T  +Y
Sbjct: 236 VKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 268


>gi|124487918|gb|ABN12042.1| putative cathepsin L precursor [Maconellicoccus hirsutus]
          Length = 211

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 129/204 (63%), Gaps = 1/204 (0%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSC++F TTG++EG  + K   L  LS+Q +IDCS  YGN GC+GG    ++ 
Sbjct: 8   VKDQGDCGSCYAFSTTGSIEGQQFRKSGTLKSLSEQQIIDCSVKYGNGGCEGGVMENAFN 67

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +++ +G    +   PY+ ++  C      + A +  F  +    E+ LKLA+AK GP+S+
Sbjct: 68  YVIDNGGIDSEGSYPYIDRETQCAYKPENSAANIKDFATLPVGDEEMLKLAVAKVGPISI 127

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AI+ S +SF  Y +GVYYD+ C + PD L HAVL VGYG  DGK YW VKNSW+T WG  
Sbjct: 128 AINTSPRSFKLYKSGVYYDKDCKSDPDDLTHAVLVVGYGTEDGKDYWLVKNSWNTDWGEN 187

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  K+N+CG+ +  TY T+
Sbjct: 188 GYIKMARNKNNHCGIASYATYPTV 211


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 136/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCW+F TTG++EG ++ K  KL  LS+Q LIDCS   GN+G
Sbjct: 113 VDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDG 172

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I + +G+ T+  Y PY G+D  C    T   A  TG++++   SED L
Sbjct: 173 CGGGFMDDAFEYIKLNNGIDTEASY-PYEGRDDICRYKKTNKGAIDTGYMDIKQYSEDDL 231

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS KSF  Y  GVY++ +C+ +   LDH VL VGYG  +G+ YW 
Sbjct: 232 KAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTV--LDHGVLVVGYGTENGEDYWL 289

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW T WG  GY+ MS  + NNCG+ T  +Y
Sbjct: 290 VKNSWGTDWGMNGYIKMSRNRSNNCGIATNASY 322


>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
          Length = 897

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 133/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 687 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDCV--SENDG 744

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 745 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 803

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   GK +W 
Sbjct: 804 KKAVARVGPISVAIDASLSSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQKGKKHWI 861

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 862 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 897


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/210 (45%), Positives = 133/210 (63%), Gaps = 6/210 (2%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E     ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG
Sbjct: 130 EKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGG 189

Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
               ++++I  + G+ T+  Y PY G D  CH    +  AT  GF ++   +E  +  A+
Sbjct: 190 LMDNAFRYIKDNGGIDTEKSY-PYEGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAV 248

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
           A  GPVSVAIDAS +SF FY  G+Y + +CN+    LDH VL VGYG +  GK YW VKN
Sbjct: 249 ATIGPVSVAIDASHESFQFYSEGIYNEPECNSQ--NLDHGVLVVGYGTDESGKDYWLVKN 306

Query: 320 SWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           SW T WG++G++ M+  +DN CG+ +A +Y
Sbjct: 307 SWGTTWGDKGFIKMARNEDNQCGIASASSY 336


>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 133/204 (65%), Gaps = 5/204 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + +Q  CGSCWSF +TG++EG +++    L  LS+Q L+DCS  YGN+GC+GG    S++
Sbjct: 124 IKNQGQCGSCWSFSSTGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFR 183

Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++    G  T+D+Y PY  ++  C   ++ A  T   +V++    ED+LK A+A  GP+S
Sbjct: 184 YLKSVAGDETEDNY-PYTAENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPIS 242

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDAS  SF  Y +GVYY   C+++   LDH VLA+GYG  DGK YW VKNSW T WG 
Sbjct: 243 VAIDASHSSFQLYNSGVYYASTCSSTQ--LDHGVLAIGYGTEDGKDYWLVKNSWGTSWGM 300

Query: 328 QGYVLMSI-KDNNCGVMTAPTYVT 350
           +GY+ MS  ++NNCG+ T  +Y T
Sbjct: 301 EGYIKMSRNRNNNCGIATQASYPT 324


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 101/213 (47%), Positives = 132/213 (61%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TG++EG ++ K   L  LS+Q L+DCS  YGN G
Sbjct: 122 VDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q+I  + G+ T++ Y PY  ++  C        AT TG+  V+   EDAL
Sbjct: 182 CMGGLMDYAFQYIQANGGIDTEESY-PYEAENGKCRYNPDNIGATSTGYTEVSQGDEDAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SV IDASQ SF FY +GVY +  C++    LDH VLAVGYG  DG  YW 
Sbjct: 241 KEAVATIGPISVGIDASQMSFQFYESGVYNEPDCSSLE--LDHGVLAVGYGTEDGNDYWL 298

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG++GY+ MS  K N CG+ TA +Y
Sbjct: 299 VKNSWGLEWGDKGYIKMSRNKSNQCGIATAASY 331


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 99/205 (48%), Positives = 136/205 (66%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+EG  + K  +L  LS+Q L+DCSW  GN GC+GG    ++Q+I
Sbjct: 184 NQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQGNQGCNGGIVDFAFQYI 243

Query: 211 MKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +++ G+ ++D Y PY  +D A C      ATA +TGFV++ P+SE+AL  A+A  GPVSV
Sbjct: 244 LENRGIDSEDCY-PYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVSV 302

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA   SF FY +G++Y+ KC  S + L+HAVL VGYG    +  GK YW VKNSW   
Sbjct: 303 AIDAHPTSFRFYQSGIFYEPKC--SSERLNHAVLVVGYGYEGEDEAGKKYWIVKNSWGKQ 360

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG+ GY  +S  + N+CG+ T  +Y
Sbjct: 361 WGDHGYFYLSKDRGNHCGIATTASY 385


>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
          Length = 823

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/202 (47%), Positives = 128/202 (63%), Gaps = 5/202 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           L+ +  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS  +GN+GC+GG    +++
Sbjct: 622 LVAKGQCGSCWAFSTTGSLEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFE 681

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I    G+  + DY PYL +D  C    +   AT TG+V++    E+ALK A+A  GP+S
Sbjct: 682 YIKAAPGIEGEMDY-PYLAKDGRCMFDQSKVVATDTGYVDIPSMDENALKEAVATIGPIS 740

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDA   SF  Y +GVY +  C  S + LDH VLAVGYG  DG+ YW VKNSW   WG 
Sbjct: 741 VAIDAGHPSFQMYKSGVYNEPGC--SSERLDHGVLAVGYGTEDGQDYWLVKNSWGDSWGQ 798

Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
            GY++MS   NN CG+ T  +Y
Sbjct: 799 AGYIMMSRNMNNQCGIATQASY 820


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 98/202 (48%), Positives = 127/202 (62%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW- 209
           DQ  CGSCW+F  TG+VEG  ++K KKL  LS+Q L+DCS   GN GC GG    ++++ 
Sbjct: 125 DQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYF 184

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I   G+  +  Y PY  +D  C    + + AT++ F +V    ED LK+A+A  GPVSVA
Sbjct: 185 IANKGIANEKSY-PYTAKDNDCKYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVA 243

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDAS   F FY +GVYYDE C  S + LDH VLAVGYG  +  G  +W VKNSW+  WG 
Sbjct: 244 IDASSSKFQFYESGVYYDENC--SSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGL 301

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
            GY+ M+  KDNNCG+ T  +Y
Sbjct: 302 NGYIKMARNKDNNCGIATMASY 323


>gi|392922426|ref|NP_001256718.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
 gi|3879367|emb|CAB07275.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
          Length = 337

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 92/201 (45%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +  K  +L  LS+Q L+DCS  YGN+GC+GG   +++++I
Sbjct: 137 NQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYI 196

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY G+D  CH    T  A   G+V+     E+ LK+A+A  GP+S+A
Sbjct: 197 RDNHGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  +SF  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW   WG +
Sbjct: 256 IDAGHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEK 313

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 314 GYIRIARNRNNHCGVATKASY 334


>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
 gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EG +Y K  +L  LS+Q LIDCS  YGNNGC+GG    ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226

Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + G+ ++  Y PY+  D      C   +T   A +TG++N+    E AL  A+A  GP
Sbjct: 227 RDNEGIDSEISY-PYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGP 285

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVAI+A   SFS Y +G+Y D +C ++ + LDH VL VGYG  DGKPYW +KNSW   W
Sbjct: 286 VSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++GYV ++    N CGV +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCGVASAASY 369


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 110/255 (43%), Positives = 149/255 (58%), Gaps = 10/255 (3%)

Query: 99  EELVNGIIL-EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCG 156
            + +NG  L  K +L  S+  K       +  N    D++  R E  GY   + DQ  CG
Sbjct: 104 RQRMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKE--GYVTKVKDQGSCG 161

Query: 157 SCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGL 215
           SCW+F  TG++EG +Y +  KL  LS+Q L+DC     + GC+GG    ++Q++    G+
Sbjct: 162 SCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGI 221

Query: 216 PTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQK 275
            T+  Y PY G+D  C   +    AT TGFV++   +E  L+ A+A  GPVSVAIDA+  
Sbjct: 222 DTEASY-PYKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASF 280

Query: 276 SFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMS 334
            F FY +GVYYD  C  SP+ LDH VLAVGY    DGK Y+ VKNSWS  WG+ GY+LMS
Sbjct: 281 KFQFYSHGVYYDRSC--SPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMS 338

Query: 335 -IKDNNCGVMTAPTY 348
             K+NNCG+ T  +Y
Sbjct: 339 RRKNNNCGIATMASY 353


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTGA+EG ++ +  +L  LS+Q L++CS   GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY+G  D  CH       A  TGFV++    E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+Y++ +C+++   LDH VL VGYG    + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WG  GY+LM+  KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335


>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
          Length = 331

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 137/217 (63%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K K+L  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++  + GL +++ Y PY  QD  C      + A  TGF+++ P  E++L
Sbjct: 178 CSGGLMDYAFQYVKDNGGLDSEESY-PYRAQDESCKYKPEQSAANDTGFMDIHPE-EESL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
           KLA+A  GP+S AIDAS  +F FY  G+YYD  C  S + LDH +L VGYG    + + +
Sbjct: 236 KLAVATVGPISAAIDASLSTFQFYHKGIYYDPDC--SSENLDHGILVVGYGSQGEDSEKQ 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW T WG QGY+LM+  +DN+CG+ TA ++
Sbjct: 294 KYWIVKNSWGTDWGTQGYILMAKDRDNHCGIATAASF 330


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  184 bits (466), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTGA+EG ++ +  +L  LS+Q L++CS   GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY+G  D  CH       A  TGFV++    E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+Y++ +C+++   LDH VL VGYG    + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WG  GY+LM+  KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  184 bits (466), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 137/217 (63%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTGA+EG ++ +  +L  LS+Q L++CS   GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY+G  D  CH       A  TGFV++    E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+Y++ +C+++   LDH VL VGYG    + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WG  GY+LM+  KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASY 335


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 96/202 (47%), Positives = 127/202 (62%), Gaps = 5/202 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTG+VEG +  K  +L  LS+Q L+DCS   GN GC+GG   +++Q
Sbjct: 123 IKDQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQ 182

Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I+  +G+ T+  Y PY  QD  C   +    AT+  + ++   SE  L+ A+A  GP+S
Sbjct: 183 YIISNNGIDTESSY-PYTAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPIS 241

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDASQ SF FY +GVY +  C++S   LDH VLAVGYG      YW VKNSW T WG 
Sbjct: 242 VAIDASQPSFQFYSSGVYNEPACSSSQ--LDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQ 299

Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
            GY+ M+   NN CG+ TA +Y
Sbjct: 300 SGYIWMTRNSNNQCGIATAASY 321


>gi|348531515|ref|XP_003453254.1| PREDICTED: cathepsin L2-like [Oreochromis niloticus]
          Length = 333

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 99/228 (43%), Positives = 142/228 (62%), Gaps = 7/228 (3%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S + R  K +K    V +  +GY   +  Q  CGSCW+F  TGA+EG ++ K +KL  LS
Sbjct: 107 STFNRLPKGTKLPKTVDWRKQGYVTKVKHQKECGSCWAFSATGALEGQHFRKTRKLVSLS 166

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  +GN+GC+GG    ++Q+I  + GL T+D Y PY  +D  CH    +  A 
Sbjct: 167 EQQLVDCSRSFGNHGCNGGWMNPAFQYIRYNGGLDTEDSY-PYKAKDGICHYNPNSVGAI 225

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            +G V+V+P+ E ALK A+A  GP+S+A+DAS +SF  Y +GVY + +CN     + HA+
Sbjct: 226 CSGHVDVSPD-EAALKQAVATIGPISIAVDASHESFQLYQSGVYDEHRCNKKH--VTHAM 282

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           L VGYG   G  YW +KNSW   WG++GY+ M+  K N CG+ TA +Y
Sbjct: 283 LVVGYGTEGGHDYWLIKNSWGLQWGDKGYIKMTRNKGNQCGIATAASY 330


>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 326

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + +  +GY   + DQ  CGSCW+F +TG++EG  + K  KL  LS+Q L+DCS  YGN G
Sbjct: 114 IDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMG 173

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG   +++ +I   G  ++D Y PY G D  C    +   AT TG+ ++    E+AL+
Sbjct: 174 CGGGWMDQAFSYIKDKGEESEDGY-PYTGTDDTCVYDASKVVATDTGYTDIPEMDENALQ 232

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
            A+A  GP+SVAIDA+  SF FY +GVY + +C+ +   LDHAVLAVGYG   +G  YW 
Sbjct: 233 QAVATVGPISVAIDATHSSFQFYESGVYDEPECSQT--NLDHAVLAVGYGTSEEGLDYWI 290

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSWST WG QGY+ MS  KDN CG+ +  +Y
Sbjct: 291 VKNSWSTGWGMQGYIEMSRNKDNQCGIASKASY 323


>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
          Length = 372

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EG +Y K  +L  LS+Q LIDCS  YGNNGC+GG    ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226

Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + G+ ++  Y PY+  D      C   +T   A +TG++N+    E AL  A+A  GP
Sbjct: 227 RDNKGIDSEISY-PYISGDGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGP 285

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVAI+A   SFS Y +G+Y D +C ++ + LDH VL VGYG  DGKPYW +KNSW   W
Sbjct: 286 VSVAINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++GYV ++    N CGV +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCGVASAASY 369


>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
 gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
 gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
          Length = 330

 Score =  183 bits (465), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 101/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 120 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   GK +W 
Sbjct: 237 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQKGKKHWI 294

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 295 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+EG +Y K  +L  LS+Q L+DCS  YGNNGC GG    +++++
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYV 224

Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + G+ ++  Y PY+  D      C    +   A +TG+VN+    E AL  A+A  GP
Sbjct: 225 RDNEGIDSEISY-PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGP 283

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVAI+A   SFS Y +G+Y D  C  + D LDH VL VGYGE +G+ YW +KNSW   W
Sbjct: 284 VSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEW 343

Query: 326 GNQGYVLMSIKDNN-CGVMTAPTY 348
           G +GY+ +S   +N CGV +A +Y
Sbjct: 344 GEKGYIKISKGSHNMCGVASAASY 367


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG +Y K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 217 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 276

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH    T  AT  GFV++   +E  L  A+A  GPVSVA
Sbjct: 277 KDNGGIDTEKSY-PYEALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVA 335

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY +  C+     LDH VL VG+G +  G+ YW VKNSW T WG++
Sbjct: 336 IDASHESFQFYSEGVYVEPACD--AQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDK 393

Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
           G++ ++  KDN CG+ +A +Y
Sbjct: 394 GFIKMLRNKDNQCGIASASSY 414


>gi|449663703|ref|XP_002169139.2| PREDICTED: uncharacterized protein LOC100198320 [Hydra
            magnipapillata]
          Length = 1092

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 85/196 (43%), Positives = 123/196 (62%), Gaps = 7/196 (3%)

Query: 152  QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
            QS C SCW+F   GA+EGAY + +  L  ++ Q L+DCSWGY NNGC GG  +R+  ++ 
Sbjct: 838  QSFCASCWAFSAVGAIEGAYTIYNGTLKEMAVQQLVDCSWGYNNNGCRGGWSWRALNFVG 897

Query: 212  KHGLPTQDDYGPYLGQDAYCHIA---NTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
            +HGL T++ YG Y+ Q+ YCH     N      M+ +  +   + + LK AL  +GP SV
Sbjct: 898  EHGLATREAYGNYIAQEGYCHCGKNDNCDGVKKMS-YRTIEKKNAEKLKAALTLYGPGSV 956

Query: 269  AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
            A+  ++K+F FY +GVY D KC    +  DHAV+ +GYG  + KPYW +KNSW   WG+ 
Sbjct: 957  AVQCARKTFKFYSSGVYDDPKCT---EVTDHAVVIIGYGVENNKPYWLIKNSWGKLWGDN 1013

Query: 329  GYVLMSIKDNNCGVMT 344
            GY+ + + +N CGV+T
Sbjct: 1014 GYMKIDMNNNLCGVLT 1029


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 96/205 (46%), Positives = 132/205 (64%), Gaps = 6/205 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ +Q  CG C++F TTG+VEG + +  ++L  LS+Q ++DCS   GNNGCDGG    S+
Sbjct: 125 AVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSF 184

Query: 208 QWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I+   GL T+  Y PY G+   C        AT+TG+ NV   SE  L+ A+A   PV
Sbjct: 185 EYIIAVGGLDTEASY-PYTGEVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQ-PV 242

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           SVAIDASQ SF  Y +GVYY+ +C+++   LDH VLAVGYG   G+ YW VKNSW   WG
Sbjct: 243 SVAIDASQSSFQLYASGVYYEPECSSTQ--LDHGVLAVGYGSQSGQDYWIVKNSWGADWG 300

Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVT 350
             G++LM+  KDNNCG+ T  ++ T
Sbjct: 301 ENGFILMARNKDNNCGIATMASFPT 325


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  C      + A   GFV++    E+ L  A+A  GPVSVA
Sbjct: 201 KDNGGIDTEKAY-PYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ+SF FY +GVYYDE C+++   LDH V+ VGYG +  G  YW VKNSW   WG+ 
Sbjct: 260 IDASQESFQFYSDGVYYDENCSSTD--LDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDL 317

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K+N+CG+ ++ +Y
Sbjct: 318 GYIKMARNKNNHCGIASSASY 338


>gi|291383486|ref|XP_002708337.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 141/232 (60%), Gaps = 13/232 (5%)

Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           + KA    +P  V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q 
Sbjct: 106 FGKAVFQEVPSSVDWRDKGYVTPVKNQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQN 165

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           LIDCSW  GN GC GG    ++Q++  + GL ++D Y PY  +D  C  +   + A  TG
Sbjct: 166 LIDCSWPAGNYGCRGGLPDHAFQYVKDNGGLDSEDSY-PYEARDGLCRYSPQESVANDTG 224

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           FV + P  E+AL  A+A  GP++VAIDAS  SF FY  G+YY+  C  S + LDHAVL V
Sbjct: 225 FVQI-PEQEEALMEAVATVGPIAVAIDASHSSFLFYKEGIYYEPNC--SRENLDHAVLVV 281

Query: 305 GYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           GYG    E D + YW VKNSW   WG  GY+ M+  ++N+CG+ TA +Y T+
Sbjct: 282 GYGFEGAESDNQKYWLVKNSWGKGWGMDGYMKMAKDRNNHCGIATAASYPTV 333


>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
          Length = 330

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 101/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G+ +W 
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGRKHWI 294

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GYVLM+   NN CG+    ++  M
Sbjct: 295 IKNSWGENWGNKGYVLMARNKNNACGIANLASFPRM 330


>gi|198432221|ref|XP_002130541.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
          Length = 330

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/223 (43%), Positives = 143/223 (64%), Gaps = 5/223 (2%)

Query: 130 NKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           +K +++   V +  +GY   + +Q  CGSCW+F  TG++EG ++ K KKL  LS+Q LID
Sbjct: 106 DKFAQNPTTVDWRTQGYVTPVKNQLQCGSCWAFSATGSLEGQHFAKTKKLVSLSEQQLID 165

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           CS   G+ GC GG    ++ +I +  G+ ++ +Y PY  ++  C    +   AT+TG V+
Sbjct: 166 CSTKQGDLGCGGGYPDWAFAYINQVGGIESETNY-PYEAKNDVCRFNVSEVAATLTGCVD 224

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           +TP+SE  L+ A+   GPVSV IDAS  SF  Y +G+YY+++C++SP  LDH VLAVGYG
Sbjct: 225 ITPDSETQLEKAVGSIGPVSVLIDASHISFQLYGSGIYYEQQCSSSPASLDHGVLAVGYG 284

Query: 308 ELDGKPYWQVKNSWSTYWGN-QGYVLMSI-KDNNCGVMTAPTY 348
             +G+ YW VKNSW   WG   GY+ M+  K+NNCG+ T  +Y
Sbjct: 285 ADNGQEYWMVKNSWGEGWGKLGGYIKMAKNKNNNCGIATQASY 327


>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 341

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 6/228 (2%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S + R  K +     V +  KGY  ++ +Q  CGSCW+F  TG++EG ++ K  KL  LS
Sbjct: 114 STFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAFSATGSLEGQHFRKTGKLVSLS 173

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  +GN GC+GG    ++Q+I  + G+ T++ Y PY  +D  C     +  AT
Sbjct: 174 KQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDTEESY-PYEAEDGKCRYNPKSTGAT 232

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V+V P +E+ LK A+A  GP+SVAIDA   SF FY +GVY +  C+++   LDHAV
Sbjct: 233 CTGYVDVQPANEETLKEAVATIGPISVAIDAFHPSFQFYESGVYDEPDCSSTM--LDHAV 290

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G  YW VKNS    WG +GY+ MS  K N CG+ TA +Y
Sbjct: 291 LAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRNKSNQCGIATAASY 338


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 128/202 (63%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ + KKL  LS+Q L+DCS  +GNNGC+GG    ++++I
Sbjct: 137 DQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYI 196

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY+G+D     +     AT  GFV++    ED LK A+A  GP+S+A
Sbjct: 197 KNNGGIDTEAAY-PYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIA 255

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDAS +SF  Y NGVY D  C+++   LDH VL VGYG  E  G  YW VKNSW   WG 
Sbjct: 256 IDASHESFQLYSNGVYSDPTCSSTE--LDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGL 313

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
            GY+ M+  +DN CGV T  +Y
Sbjct: 314 DGYIKMARNQDNQCGVATQASY 335


>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 130/212 (61%), Gaps = 4/212 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CG+CWSF   GA+EG ++ K  KL  LS Q LIDC+   GNNG
Sbjct: 121 VDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CDGG    ++Q++  +G    ++  PY+ QD  C      + A +TGFV++    E AL 
Sbjct: 181 CDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTECKYKPECSGANITGFVDIPSMDERALM 240

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GP+SV ID++  SF FY +GVYY+  C++S   LDH VL VGYG +    YW V
Sbjct: 241 EAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQ--LDHGVLVVGYGSIGKDEYWIV 298

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           KNSW   WG+ GY+LM+  KDN+CG+ T  +Y
Sbjct: 299 KNSWGEAWGDNGYILMAKDKDNHCGIATEASY 330


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 98/202 (48%), Positives = 131/202 (64%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TG++EG  + K  KL  LS+Q L+DCS  YGNNGC+GG    ++ +I
Sbjct: 131 NQGQCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G    CH   +   ++  GFV+V   SE+ L  A+A  GPVSVA
Sbjct: 191 RDNKGIDTEGSY-PYEGVGGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVA 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDAS  SF FY +GVY++ KC  SP+ LDH VL VGYG  E  G+ YW VKNSWS  WG+
Sbjct: 250 IDASHMSFQFYSHGVYFESKC--SPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGD 307

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  K N CG+ ++ +Y
Sbjct: 308 QGYIKMARNKKNMCGIASSASY 329


>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
          Length = 341

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/228 (44%), Positives = 142/228 (62%), Gaps = 9/228 (3%)

Query: 127 VRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V + K+    IP  + +  KGY   + +Q  CGSCWSF  TG++EG ++ K   L  LS+
Sbjct: 114 VTFLKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSE 173

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q LIDCS  YGNNGC+GG    ++++I  + GL T+  Y PY  +D  C      + AT 
Sbjct: 174 QNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSY-PYEAEDDKCRYNPDNSGATD 232

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
            GFV++    E+AL  ALA  GPVS+AIDAS + F FY  GV+Y+ +C+++   LDH VL
Sbjct: 233 NGFVDIPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTE--LDHGVL 290

Query: 303 AVGY-GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           AVG+  +  G  YW VKNSW   WG++GY++M+  K NNCGV ++ +Y
Sbjct: 291 AVGFRTDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASY 338


>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
          Length = 331

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCWSF  TG++EG ++   KKL  LS+Q L+DCS   GN+G
Sbjct: 119 VDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKKEGNHG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I  + G+ T++ Y PY  ++ +CH       AT TG+V++    ED L
Sbjct: 179 CQGGLMDNAFRYIESNKGIDTEESY-PYTAKNGFCHFKAENVGATDTGYVDIPHMQEDKL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SV IDA  KSF  Y  GVY +  C++S   LDH VLAVGYG   G  YW 
Sbjct: 238 QEAVATVGPISVGIDAGHKSFQLYREGVYSEPACSSSK--LDHGVLAVGYGTESGDDYWL 295

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW T WG QGYV+M+  K N CG+ T  +Y
Sbjct: 296 VKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 328


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/199 (48%), Positives = 123/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF  TG++EG +  K  KL  LS+Q LIDCS   GN+GC+GG   +++++I
Sbjct: 141 NQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYI 200

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              G    + Y PY  +D  C    T + AT TGFV++    E+ LK A A  GP+SVAI
Sbjct: 201 KIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDIKSGDEEMLKEAAATVGPISVAI 260

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF FY NGVY +  C+++   LDH VL VGYG  +GK YW VKNSW   WG  GY
Sbjct: 261 DASHTSFQFYSNGVYSETACSSTM--LDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGY 318

Query: 331 VLMSIK-DNNCGVMTAPTY 348
           + MS   DN CG+ T  +Y
Sbjct: 319 IKMSRNADNQCGIATQASY 337


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 133/213 (62%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS  YGNNG
Sbjct: 118 VDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I ++ G+ T++ Y PY  ++  C    +   A  TGFV+VT   E+AL
Sbjct: 178 CQGGLMDYAFKYIKENGGIDTEESY-PYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A    GP+SVAIDA   SF FY +GVY +  C+++   LDH VL VGYG   G  YW 
Sbjct: 237 KTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTS--LDHGVLVVGYGTYQGSDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG +GY++MS  K+N CGV T  +Y
Sbjct: 295 VKNSWGERWGMEGYIMMSRNKNNQCGVATQASY 327


>gi|405963298|gb|EKC28885.1| Cathepsin L [Crassostrea gigas]
          Length = 265

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 103/216 (47%), Positives = 133/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCW+F TTG +EG +Y K  KL  LS+Q L+DCS    N G
Sbjct: 55  VDWSKEGYVTPVKNQGQCGSCWAFSTTGGLEGQHYRKTGKLVSLSEQNLLDCS--KENMG 112

Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   ++Y++I ++G + T++ Y PYLG+   C    +   AT TGFV VT   E AL
Sbjct: 113 CNGGLPQKAYKYIKENGGIDTEESY-PYLGKKETCSFRPSEVGATCTGFVQVTAGDELAL 171

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP++V IDASQ SF  Y  GVY ++ CN  P   DHAVL VGYG   GK YW 
Sbjct: 172 KKAVASVGPITVCIDASQPSFQLYKGGVYDEQSCN--PIVFDHAVLIVGYGVYQGKDYWL 229

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           VKNSW T WG  GY++MS   NN CG+     Y T+
Sbjct: 230 VKNSWGTSWGMDGYIMMSRNQNNQCGIANHAVYPTV 265


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 131/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG ++ +   L  LS+Q LIDCS  YGNNGC+GG   +++ +I
Sbjct: 139 DQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY G+D  C     ++ A+  GFV++    E  LK A+A  GPVSVA
Sbjct: 199 KDNKGLDTEKTY-PYEGEDDKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVATVGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY +G+Y++ +C  S   LDH VL VGYG + +G+ YW VKNSW   WG +
Sbjct: 258 IDASHQSFQFYSDGIYFEPEC--SSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEK 315

Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
           GY+ M+   DN+CG+ ++ +Y
Sbjct: 316 GYIKMARNIDNHCGIASSASY 336


>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
          Length = 329

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 133/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG L G  +W 
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGILKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/201 (46%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 139 DQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  CH    T  AT  G V++    E  +  A+A  GPVSVA
Sbjct: 199 KDNGGIDTEKSY-PYEGIDDSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  G+Y + +C+  P  LDH VL VGYG +  G+ YW VKNSW T WG++
Sbjct: 258 IDASHESFQFYSEGIYNEPQCD--PQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDK 315

Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
           G++ M+   DN CG+ +A +Y
Sbjct: 316 GFIKMARNADNQCGIASASSY 336


>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/205 (46%), Positives = 133/205 (64%), Gaps = 7/205 (3%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG    ++
Sbjct: 168 AIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAF 227

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I ++ GL T+  Y PY  ++  C      + A+  GFV++    ED LK A+A  GP+
Sbjct: 228 RYIKENKGLDTEKSY-PYEAENDQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPI 286

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD--GKPYWQVKNSWSTY 324
           SVAIDAS +SF FY  GVYY+ +C  SP  LDH VL VGYG     G+ YW VKNSW   
Sbjct: 287 SVAIDASHESFHFYSEGVYYEPEC--SPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGET 344

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG +GY+ M+  K+N+CG+ ++ +Y
Sbjct: 345 WGEKGYIKMARNKENHCGIASSASY 369


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/214 (47%), Positives = 132/214 (61%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF TTG++EG ++ K  KL  LS+  L+DCS  +GN G
Sbjct: 147 VDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEG 206

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I    GL +++DY PY  +   C   +T   AT TG V+V   SE AL
Sbjct: 207 CNGGLMDNAFKYIKSVGGLESEEDY-PYKPKQGTCKFDDTKVAATDTGCVDVESGSESAL 265

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYW 315
           K A+++ GPVSVAIDAS  SF  Y  GVY + +C  S + LDH VL VGYG  D G+ YW
Sbjct: 266 KKAVSEVGPVSVAIDASHSSFQSYAGGVYDEPEC--SSEQLDHGVLCVGYGTDDQGQDYW 323

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG  GYV MS  K N CG+ T  +Y
Sbjct: 324 IVKNSWGAEWGEDGYVKMSRNKKNQCGIATQASY 357


>gi|156354963|ref|XP_001623448.1| predicted protein [Nematostella vectensis]
 gi|156210148|gb|EDO31348.1| predicted protein [Nematostella vectensis]
          Length = 392

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 90/221 (40%), Positives = 133/221 (60%), Gaps = 13/221 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 177 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 236

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K G+ T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 237 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 296

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 297 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 353

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
            +G PYW +KNSWS  WGN G++   IK   CG+   P  V
Sbjct: 354 DNGVPYWLIKNSWSHKWGNNGFI--KIKQGLCGIEKRPFVV 392


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 108/234 (46%), Positives = 141/234 (60%), Gaps = 9/234 (3%)

Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
           K +K S ++  N   +    V +  KGY   + DQ  CGSCW+F TTG+VEG ++    K
Sbjct: 102 KAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGK 161

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
           L  LS+Q L+DCS    + GCDGG   R++Q+I+  G + T+  Y PY   D  CH    
Sbjct: 162 LVSLSEQNLVDCS--GRDAGCDGGFMDRAFQYIIDAGGIDTEASY-PYKAVDGKCHFKKA 218

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
              AT+TG+ +VT  SE AL+ A+A  GP+SVAIDAS  SF  Y +GVY +  C+++   
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTV-- 276

Query: 297 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LDH VLAVGYG   DG  YW VKNSW+  WG  GYV MS  KDN CG+ T  +Y
Sbjct: 277 LDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASY 330


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +     K+  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 154 NQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY+G++  CH       A   GFV++    E+ALK+A+A  GP+S+A
Sbjct: 214 KDNHGIDTEESY-PYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIA 272

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  ++F  Y  GVYYDE+C  S + LDH VL VGYG + +   YW +KNSW   WG +
Sbjct: 273 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEK 330

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  + N+CGV T  +Y
Sbjct: 331 GYIRIARNRSNHCGVATKASY 351


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +     K+  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 154 NQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 213

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY+G++  CH       A   GFV++    E+ALK+A+A  GP+S+A
Sbjct: 214 KDNHGIDTEESY-PYVGRETKCHFKKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIA 272

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  ++F  Y  GVYYDE+C  S + LDH VL VGYG + +   YW +KNSW   WG +
Sbjct: 273 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEK 330

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  + N+CGV T  +Y
Sbjct: 331 GYIRIARNRSNHCGVATKASY 351


>gi|17224950|gb|AAL37181.1|AF320084_1 cathepsin L-like protease [Ancylostoma caninum]
          Length = 214

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 90/201 (44%), Positives = 132/201 (65%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +     ++  LS+Q L+DCS  YGN+GC+GG    ++++I
Sbjct: 14  NQGMCGSCWAFSATGALEGQHARASGQMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYI 73

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T++ Y PY+G+D  CH       A   G+V++    E+ALK+A+A  GP+S+A
Sbjct: 74  KDNHGIDTEESY-PYVGRDMKCHFKKKDIGAVDNGYVDLPEGDEEALKIAVATQGPISIA 132

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA  ++F  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW T WG +
Sbjct: 133 IDAGHRTFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEAGDYWLVKNSWGTGWGEK 190

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ ++  ++N+CGV T  +Y
Sbjct: 191 GYIRIARNRNNHCGVATKASY 211


>gi|2706547|emb|CAA75862.1| putative cathepsin L [Xenopus laevis]
          Length = 231

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 100/217 (46%), Positives = 134/217 (61%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+  TTGA+EG +Y K  KL  LS+Q L+DCS   GN G
Sbjct: 14  VDWRKKGYVTPVKDQGQCGSCWAPSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEG 73

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY  + D  CH      +A  TGFV+V    E  L
Sbjct: 74  CNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDL 133

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA  +SF FY +G+YY+ +C  S + LDH VL VGYG    ++DGK
Sbjct: 134 MKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPEC--SSEDLDHGVLVVGYGFESEDVDGK 191

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSWS  WG+ GY+ +   + N+CG+ TA +Y
Sbjct: 192 KYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASY 228


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 130/200 (65%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG ++MK  KL  LS+Q L+DCS  +GN GC+GG   +++++I
Sbjct: 127 DQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYI 186

Query: 211 MKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             +G    ++  PY+ +D   C    + + AT++ + ++    E AL  A+   GPVSVA
Sbjct: 187 KSNGGIDTEECYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVA 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS KS  FY +G+Y + +C+ +   LDH VLAVGYG +DG  YW VKNSW + WG+ G
Sbjct: 247 IDASHKSLRFYKSGIYDEPECSRTK--LDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMG 304

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           YV M+  K+N CG+ T  +Y
Sbjct: 305 YVKMTRNKNNQCGIATKASY 324


>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
 gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
 gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
 gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
          Length = 334

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 124 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 182 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 241 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 298

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 299 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 334


>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
          Length = 329

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPQGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANMASFPKM 329


>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
 gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
 gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
          Length = 330

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)

Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           S+    V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC  
Sbjct: 114 SRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              N+GC GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +  
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
            +E ALK A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           G  +W +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330


>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
 gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
          Length = 330

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 97/200 (48%), Positives = 126/200 (63%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTG+VEGA+ +K   +  LS+Q L+DCS  +GNNGCDGG    ++++I
Sbjct: 131 DQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M   G+ T+D Y PY      C    +   A ++G+  +T  SE  L+ AL K  PVS+A
Sbjct: 191 MSQGGVATEDSY-PYNAVQGKCKFTKSMVGANISGYKEITQGSELELQAALTKQ-PVSIA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDASQ+SF  Y +GVY + +C  S   LDH VLAVGYG  +GK Y+ VKNSW+  WG  G
Sbjct: 249 IDASQQSFQLYKSGVYDEPEC--SSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDG 306

Query: 330 YVLMSIK-DNNCGVMTAPTY 348
           Y+ MS    N CGV T  +Y
Sbjct: 307 YIFMSRNAKNQCGVATMASY 326


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 125/201 (62%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTGA+EG  + K  +L  LS+Q LIDCS  YGNNGC GG    ++ +I
Sbjct: 136 NQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYI 195

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T++ Y PY G+   C      +    TGFV++   +E AL  ALA  GPVSVA
Sbjct: 196 KENHGIDTEESY-PYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVA 254

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY    C++    LDH VLAVGYG   DG+ Y+ +KNSW   WG +
Sbjct: 255 IDASHESFQFYHEGVYNPPDCDSHS--LDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQE 312

Query: 329 GYVLMSIKD-NNCGVMTAPTY 348
           GYVLM+    N CGV T  +Y
Sbjct: 313 GYVLMARNSKNECGVATQASY 333


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 137/218 (62%), Gaps = 11/218 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 119 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C GG   +++Q++    GL ++D Y PY G  D  CH      +A  TGFV+V    E A
Sbjct: 179 CGGGLMDQAFQYVTDNQGLDSEDSY-PYTGTDDQPCHYDPLYNSANDTGFVDVPSGKEHA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VLAVGYG    +  G
Sbjct: 238 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLAVGYGFEGEDKMG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K +W VKNSW   WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 296 KKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 333


>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
          Length = 330

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 120 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 178 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 294

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 295 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330


>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
          Length = 329

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 119 VDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEG 178

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +     +D+ PY G  D  C      +    TGFV++    E AL
Sbjct: 179 CNGGLMDQAFQYVKDNQGLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERAL 238

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA  +SF FY +G+Y++++C  S D LDH VL VGYG    ++DGK
Sbjct: 239 MKAVASVGPVSVAIDAGHESFQFYQSGIYFEKEC--SSDELDHGVLVVGYGFEGEDVDGK 296

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS  WG++G++ M+  + N+CG+ TA +Y
Sbjct: 297 KYWIVKNSWSEKWGDKGFIYMAKDRHNHCGIATAASY 333


>gi|156355026|ref|XP_001623478.1| predicted protein [Nematostella vectensis]
 gi|156210181|gb|EDO31378.1| predicted protein [Nematostella vectensis]
          Length = 306

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 90/221 (40%), Positives = 133/221 (60%), Gaps = 13/221 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 91  VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 150

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K G+ T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 151 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 210

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 211 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 267

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
            +G PYW +KNSWS  WGN G++   IK   CG+   P  V
Sbjct: 268 DNGVPYWLIKNSWSHKWGNNGFI--KIKQGLCGIEKRPFVV 306


>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
          Length = 383

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 173 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 230

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 231 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 289

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 290 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 347

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 348 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 383


>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
          Length = 334

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 103/226 (45%), Positives = 137/226 (60%), Gaps = 12/226 (5%)

Query: 132 ASKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           AS D+ P    V +  KG    + DQ  CGSCWSF  TG++EG  ++K+K L  LS+Q L
Sbjct: 109 ASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNL 168

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTG 244
           +DCSW +GN GC+GG    +++++  + G+ T++ Y PY   D   C           TG
Sbjct: 169 VDCSWDFGNEGCNGGLMDSAFEYVESNGGIDTEESY-PYTAVDGDSCLYKAANNAGVNTG 227

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           + +V   SE AL+ A+ K GPVSVAIDAS  SF  Y +G+YY+  C  S D LDH VLAV
Sbjct: 228 YKDVQAKSESALRDAVEKAGPVSVAIDASNWSFQMYSSGIYYESAC--SSDYLDHGVLAV 285

Query: 305 GYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GYG E   K +W VKNSW T WG +GY+ M+  K NNCG+ T  +Y
Sbjct: 286 GYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 331


>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
          Length = 327

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 92/200 (46%), Positives = 126/200 (63%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG +++K   L  LS+Q L+DCS   GN GC GG   +++++I
Sbjct: 127 NQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYI 186

Query: 211 MKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             +G    ++  PY G+D   C    + + AT++ FV+V    EDALK A A  GP+SV 
Sbjct: 187 KTNGGIDTEECYPYKGRDERKCEYKASCSGATLSSFVDVKTGDEDALKQASATIGPISVG 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVY++++C  S   LDH VL VGYG    K YW VKNSW   WG +G
Sbjct: 247 IDASHPSFQLYDHGVYHEKRC--SSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGMEG 304

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y++MS  KDN CG+ T  +Y
Sbjct: 305 YIMMSRNKDNQCGIATQASY 324


>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
          Length = 332

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 107/256 (41%), Positives = 145/256 (56%), Gaps = 13/256 (5%)

Query: 99  EELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGS 157
           EE+V  +   K  L  S+      +  W      +K    + Y  KGY   + +Q  CGS
Sbjct: 87  EEVVQKMTGLKVPLSRSQNNDTLYFPDW-----ETKTPDSIDYRKKGYVTPVKNQGQCGS 141

Query: 158 CWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLP 216
           CW+F + GA+EG    K  KL  LS Q L+DC     N+GC GG    ++Q++ K+ G+ 
Sbjct: 142 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRGID 199

Query: 217 TQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKS 276
           ++D Y PY+G+D  C    T   A   G+  +   SE ALK A+A+ GPV+VAIDAS  S
Sbjct: 200 SEDAY-PYIGEDESCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSS 258

Query: 277 FSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIK 336
           F FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W +KNSW   WGN+GY+LM+  
Sbjct: 259 FQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQRGTKHWIIKNSWGEQWGNKGYILMARN 316

Query: 337 DNN-CGVMTAPTYVTM 351
            NN CG+    ++  M
Sbjct: 317 KNNACGIANLASFPKM 332


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 125/201 (62%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTGA+EG  + K  +L  LS+Q LIDCS  YGNNGC GG    ++ +I
Sbjct: 131 NQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYI 190

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T++ Y PY G+   C      +    TGFV++   +E AL  ALA  GPVSVA
Sbjct: 191 KENHGIDTEESY-PYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVA 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY    C++    LDH VLAVGYG   DG+ Y+ +KNSW   WG +
Sbjct: 250 IDASHESFQFYHEGVYNPPDCDSHS--LDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQE 307

Query: 329 GYVLMSIKD-NNCGVMTAPTY 348
           GYVLM+    N CGV T  +Y
Sbjct: 308 GYVLMARNSKNECGVATQASY 328


>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
          Length = 330

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)

Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           S+    + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC  
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              N+GC GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +  
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
            +E ALK A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           G  +W +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330


>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
          Length = 338

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 128 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 185

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 186 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKAL 244

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 245 KRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 302

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 303 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 338


>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
 gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
          Length = 330

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 134/222 (60%), Gaps = 8/222 (3%)

Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           S+    + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC  
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              N+GC GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +  
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPE 230

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
            +E ALK A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGIQK 288

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           G  +W +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330


>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
          Length = 334

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 103/226 (45%), Positives = 138/226 (61%), Gaps = 12/226 (5%)

Query: 132 ASKDAIP----VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           AS D+ P    V +  KG  + + DQ  CGSCWSF  TG++EG  ++K+K L  LS+Q L
Sbjct: 109 ASNDSWPWSKKVDWRSKGAATPIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNL 168

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTG 244
           +DCSW +GN GC+GG    +++++  + G+ T++ Y PY   D   C           TG
Sbjct: 169 VDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESY-PYTAVDGDSCLYRAANNAGVNTG 227

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           + +V   SE AL+ A+ K GPVSVAIDAS  SF  Y +G+YY+  C  S D LDH VLAV
Sbjct: 228 YKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYSSGIYYESAC--SSDYLDHGVLAV 285

Query: 305 GYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           GYG E   K +W VKNSW T WG +GY+ M+  K NNCG+ T  +Y
Sbjct: 286 GYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASY 331


>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
          Length = 341

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG +Y +   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDNAFKYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C      + A   GF+++    E  L  A+A  GPVSVA
Sbjct: 201 KDNKGIDTEKSY-PYEAVDDKCRYNPRNSGADDVGFIDIPSGDEGKLMAAVATVGPVSVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ++F FY +GVY+DE C+++   LDH VL VGYG + +G  YW VKNSW   WG+ 
Sbjct: 260 IDASQETFQFYSDGVYFDENCSSTS--LDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDL 317

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DN+CG+ TA ++
Sbjct: 318 GYIKMARNRDNHCGIATAASF 338


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 331

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 97/214 (45%), Positives = 136/214 (63%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTG++EG ++ K  KL  LS+Q L+DCS   GN+G
Sbjct: 118 VDWRKEGYVTPVKDQKQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDA 255
           C GG     +++I  + G+ T+  Y PY+ + +  C    + + AT+TG V++   SE A
Sbjct: 178 CQGGLMDLGFEYIFDNGGIDTESSY-PYMAKNEPQCMYKRSNSGATLTGCVDIKRGSESA 236

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+A  GP+SVAIDA  KSF  Y +GVYY+  C  S   LDH VLAVG+G  +G+ +W
Sbjct: 237 LMKAVADVGPISVAIDAGHKSFQMYKSGVYYEPSC--SSVKLDHGVLAVGFGADNGEDFW 294

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG +GY++MS  +DNNCG+ T  +Y
Sbjct: 295 LVKNSWGPIWGMEGYIMMSRNRDNNCGIATQASY 328


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 129/216 (59%), Gaps = 7/216 (3%)

Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           D+I  R E    + + DQ  CGSCWSF TTGAVEGA+ +K   +  LS+Q L+DCS  YG
Sbjct: 119 DSIDWR-EKGAVSQVKDQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYG 177

Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
           N GC+GG    ++++I+ + G+ T+  Y PY      C    +   A + G+  +    E
Sbjct: 178 NQGCEGGLMVNAFEYIIDNGGIATESSY-PYTAAQGRCKFTKSMNGANIIGYKEIPQGEE 236

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
           D+L  ALAK  PVSVAIDAS  SF  Y +GVY +  C  S + LDH VLAVGYG L+GK 
Sbjct: 237 DSLTAALAKQ-PVSVAIDASHMSFQLYSSGVYDEPAC--SSEALDHGVLAVGYGTLEGKD 293

Query: 314 YWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           Y+ +KNSW   WG  GY+ MS    N CGV T  +Y
Sbjct: 294 YYIIKNSWGPTWGQDGYIFMSRNAQNQCGVATMASY 329


>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
 gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 370

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+EG +Y K  +L  LS+Q L+DCS  YGNNGC GG    +++++
Sbjct: 165 NQGQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYV 224

Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + G+ ++  Y PY+  D      C    +   A +TG+VN+    E AL  A+A  GP
Sbjct: 225 RDNEGIDSEISY-PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGP 283

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVAI+A   SFS Y +G+Y D  C  + D LDH VL VGYGE +G+ YW +KNSW   W
Sbjct: 284 VSVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEW 343

Query: 326 GNQGYVLMSIKDNN-CGVMTAPTY 348
           G +GY+ +S   +N CGV +A +Y
Sbjct: 344 GEKGYIKISKGSHNMCGVASAASY 367


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 TANGGLDTEESY-PYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
          Length = 388

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 129/202 (63%), Gaps = 8/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +  K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 188 NQGMCGSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYI 247

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T+  Y PY G++  CH    T  A   G+V++    E+ LK+A+A  GP+SVA
Sbjct: 248 KDNHGVDTEASY-PYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVA 306

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDA   SF  Y  GVYY+ +C  S + LDH VL VGYG  E+DG  YW VKNSW   WG 
Sbjct: 307 IDAGHPSFQMYRKGVYYEPQC--SSESLDHGVLVVGYGTDEIDG-DYWIVKNSWGPGWGE 363

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           +GYV ++  +DN+CG+ +  +Y
Sbjct: 364 KGYVRIARNRDNHCGIASKASY 385


>gi|38146075|gb|AAR11477.1| cathepsin L [Litopenaeus vannamei]
          Length = 297

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 136/206 (66%), Gaps = 8/206 (3%)

Query: 131 KASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA  + +P  V +  KG    + DQ  CGSCW+F TTG++EG +++K  KL  LS+Q L+
Sbjct: 95  KADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLV 154

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  +GN GC GG   +++++I  + G+ T+D Y PY  QD  C    +   AT TG+V
Sbjct: 155 DCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSY-PYEAQDGKCRFDASNVGATDTGYV 213

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           +V   SE ALK A+A  GP+SV IDASQ +F FY  GVY+D+ C+++   LDH VLAVGY
Sbjct: 214 DVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTM--LDHGVLAVGY 271

Query: 307 G-ELDGKPYWQVKNSWSTYWGNQGYV 331
           G + +G  +W VKNSW+T WG++GY+
Sbjct: 272 GSDENGGDFWLVKNSWNTSWGDKGYI 297


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 142 DQGKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYI 201

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D  C      + A   GFV++    E+ L  A+A  GPVSVA
Sbjct: 202 KDNGGIDTEKTY-PYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVA 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ SF FY  GVYYD +C+++   LDH VL VGYG +  G  YW VKNSWS  WG  
Sbjct: 261 IDASQNSFQFYSGGVYYDTECSSTD--LDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGEL 318

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DN+CG+ T  +Y
Sbjct: 319 GYIKMARNRDNHCGIATDASY 339


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
          Length = 219

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 9   IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 66

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 67  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPQGNEKAL 125

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 126 KRAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNS--DNLNHAVLAVGYGIQKGNKHWI 183

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 184 IKNSWGENWGNKGYILMARNKNNACGIANMASFPKM 219


>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
 gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
          Length = 330

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
          Length = 329

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPVGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
          Length = 333

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++  + GL +++ Y PYL QD  C      + A  TGF ++  + E++L
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESY-PYLAQDGRCKYKPEQSAANDTGFADIHQD-EESL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
            L++A  GP+SVAIDAS  +F FY  G+YYD  C  S + LDH VL VGYG    E + K
Sbjct: 236 MLSVATVGPISVAIDASLDTFRFYYKGIYYDPNC--SSEDLDHGVLVVGYGSDEREAENK 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW T WG QGY+LM+  + N+CG+ T+ ++
Sbjct: 294 NYWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASF 330


>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
          Length = 343

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 190

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 249

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 250 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 307

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 308 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 343


>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
          Length = 338

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 135/213 (63%), Gaps = 11/213 (5%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY   + DQ  CGSCW+F  TGA+EG  + K  K+  LS+Q L++CS   GN GC+GG 
Sbjct: 126 KGYVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGL 185

Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
             +++Q++    GL +++ Y PYLG  D  CH          TGFV++   SE AL  A+
Sbjct: 186 MDQAFQYVKDNQGLDSEESY-PYLGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAV 244

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQ 316
              GP+SVAIDA  +SF FY +G+YY+ +C  S + LDH VL VGYG    ++DGK YW 
Sbjct: 245 TAVGPISVAIDAGHESFQFYQSGIYYEPEC--SSEELDHGVLLVGYGFEGEDVDGKKYWI 302

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSWS  WG++GYV M+  + N+CG+ TA +Y
Sbjct: 303 VKNSWSEKWGDKGYVYMAKDRQNHCGIATAASY 335


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN G
Sbjct: 136 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 195

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    ++  PY  +D   C        A  TGFV++    E AL
Sbjct: 196 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 255

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+YY+  C  S + LDH VL VGYG    ++DGK
Sbjct: 256 MKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 313

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 314 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 350


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F TTG++EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++ +I K+ G+ T+  Y PY G D  C        AT++GFV+V    E+AL
Sbjct: 178 CNGGLMDQAFTYIKKNGGIDTEAAY-PYTGSDGTCRFLENKVGATVSGFVDVKSGDENAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+SVAIDAS   F FY  GVY    C+++   LDH VL VGYG   GK YW 
Sbjct: 237 KEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTE--LDHGVLVVGYGTEGGKDYWL 294

Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTYVTM 351
           VKNSW + WG +GY+ M   K N CG+ T  +Y T+
Sbjct: 295 VKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYPTV 330


>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
          Length = 343

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 133 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 190

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 191 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 249

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 250 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 307

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 308 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 343


>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
 gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
           Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
           Precursor
 gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
 gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
 gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
 gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
 gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
 gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
 gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
 gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
 gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
          Length = 329

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
          Length = 329

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN G
Sbjct: 226 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 285

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    ++  PY  +D   C        A  TGFV++    E AL
Sbjct: 286 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 345

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+YY+  C  S + LDH VL VGYG    ++DGK
Sbjct: 346 MKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 403

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 404 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 440


>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
          Length = 329

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
          Length = 329

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
 gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
 gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
 gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
 gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
 gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
 gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
 gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
 gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
 gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
          Length = 329

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 96/204 (47%), Positives = 133/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 133 DQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 192

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GP+SV
Sbjct: 193 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPISV 251

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VL VGYG ++    + +W VKNSW   W
Sbjct: 252 AIDAGHESFQFYSSGVYDEPQC--SSEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNW 309

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  KDN CG+ T+ +Y
Sbjct: 310 GDQGYIMMSRNKDNQCGIATSASY 333


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 96/221 (43%), Positives = 132/221 (59%), Gaps = 10/221 (4%)

Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           AIPV       +  +GY   + +Q  CGSCW+F TTG++EG ++ K  KL  LS+Q L+D
Sbjct: 129 AIPVSVPAEVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVD 188

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS  YGN GC+GG    ++Q+I  +     +   PY   D  C   +    AT TG+ ++
Sbjct: 189 CSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPYEAVDGTCRFKSVCVGATCTGYTDL 248

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               E  +K A+A  GPVSVAIDAS  SF  Y +G+Y +++C  SP  LDHAVL VGYG 
Sbjct: 249 PKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIYVEQEC--SPKQLDHAVLVVGYGT 306

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
             G+ YW VKNSW T WG++GY+ M+   DN CG+ +  +Y
Sbjct: 307 EQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQASY 347


>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
 gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
          Length = 362

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 103/228 (45%), Positives = 132/228 (57%), Gaps = 4/228 (1%)

Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           Y  WV  +++  D      E      + DQ  CGSCWSF  TGA+EG       KL  LS
Sbjct: 124 YLDWVDADESKLDKSVDWREKGAVTEVKDQGQCGSCWSFSATGALEGQMAQVFGKLPDLS 183

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATAT 241
           +Q L+DCS   GN GC+GG    ++Q++        +D+ PY G D   C    +   A 
Sbjct: 184 EQNLVDCSRPEGNQGCNGGLMDAAFQYVKDQDGLDGEDWYPYEGVDNKECRYDKSHREAD 243

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TGF  +   +E ALK ALAK GPVSVAIDAS  SF FY +GVYY+  C  SP+ LDH V
Sbjct: 244 DTGFKMIPEGNEKALKHALAKVGPVSVAIDASNPSFQFYQSGVYYEPNC--SPENLDHGV 301

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  DG+ Y+ VKNSWS  WG+ GY+ M+  K+N+CG+ +   Y
Sbjct: 302 LAVGYGTEDGEHYYLVKNSWSEAWGDNGYIKMARNKENHCGIASYAVY 349


>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
 gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
          Length = 314

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 104 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 161

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 162 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 220

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 221 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 278

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 279 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 314


>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
          Length = 348

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 138 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 195

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 196 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 254

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 255 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 312

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 313 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 348


>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
          Length = 333

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 132/216 (61%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F +TG++EG ++ K  +L  LS+Q L DCS   GN G
Sbjct: 121 VDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSLSEQNLTDCSQKQGNMG 180

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++ +I + +G+ T+  Y PY   D  CH       AT TG+ ++    E+AL
Sbjct: 181 CNGGLMDQAFTYIKENNGIDTESSY-PYKAVDEKCHFKAADVGATDTGYTDIAQQDENAL 239

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDAS  SF  Y +G Y +  C  S   LDH VLAVGY   DGK Y+ 
Sbjct: 240 QSAIATVGPISVAIDASHSSFQLYRSGAYNERAC--SATQLDHGVLAVGYDSEDGKDYYI 297

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW T WG +GY+ M+  K+N CG+ T  TY T+
Sbjct: 298 VKNSWGTSWGQKGYIWMTRNKNNQCGIATMSTYPTV 333


>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
          Length = 329

 Score =  180 bits (456), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
 gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
          Length = 334

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TG++EG  + K  KL  LS+Q L+DCS  YGN G
Sbjct: 122 VDWRDKGYVAEVKDQKNCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++++I  + G+ T++ Y PY   D  C     T  AT TG+V++    E+AL
Sbjct: 182 CGGGLMDLAFEYIEDNKGIDTEESY-PYEATDGDCRFKPATVGATCTGYVDINSEDENAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDA   SF  Y +G+Y +  C  S + LDH VLAVGYG  + + YW 
Sbjct: 241 QKAVANIGPISVAIDAGHISFQLYGSGIYNEPNC--SSEDLDHGVLAVGYGTDNQQDYWL 298

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG+QGY+ M+  K+N CG+ TA +Y
Sbjct: 299 VKNSWGLDWGDQGYIKMTRNKNNQCGIATAASY 331


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/216 (45%), Positives = 133/216 (61%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF  TG++EG  + K  KL  LS+Q L+DCS   GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q+I  + G+ T+  Y PY  ++  C        AT +GF ++   SE  L
Sbjct: 178 CQGGLMDDAFQYIKDNSGIDTESSY-PYEAKNGKCRFNAANVGATDSGFTDIKSKSESDL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDAS  SF  Y +GVY++  C+ +   LDH VLAVGYG   GK YW 
Sbjct: 237 QSAVATVGPISVAIDASHMSFQLYRSGVYHEFFCSETR--LDHGVLAVGYGTESGKDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW   WG +GY++MS  K NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIMMSRNKRNNCGIATSASYPTV 330


>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
          Length = 332

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 131/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 122 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SKNDG 179

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  +   SE AL
Sbjct: 180 CGGGYMTNAFQYVQENRGIDSEDAY-PYIGQDESCMYNPTGKAAKCRGYREIPEGSEKAL 238

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPV+VAIDAS  SF FY  GVYYDE CN   D L+HAVLAVGYG   G  +W 
Sbjct: 239 KRAVARVGPVAVAIDASLSSFQFYSKGVYYDENCNG--DNLNHAVLAVGYGIQRGTKHWI 296

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+  K N CG+    ++  M
Sbjct: 297 IKNSWGEEWGNKGYILMARNKKNACGIANLASFPKM 332


>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
          Length = 338

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 102/219 (46%), Positives = 139/219 (63%), Gaps = 12/219 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG ++ K  KL  LS+Q LIDCS   GN G
Sbjct: 120 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGPEGNQG 179

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I   +G+ +++ Y PY+G+D   C       +A  TGFV++    E A
Sbjct: 180 CNGGLMDQAFQYIKDNNGIDSEESY-PYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERA 238

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELD 310
           L  A+A  GP+SVAIDAS  SF FY +GVYY+ +CN+    LDH VL VGYG     + +
Sbjct: 239 LMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEE--LDHGVLVVGYGYEGTDDDN 296

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            K YW VKNSWS  WG+QGY+ M+  + NNCG+ +A +Y
Sbjct: 297 KKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASY 335


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 96/202 (47%), Positives = 129/202 (63%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF  TGA+EG  + K  KL  LS+Q L+DCS  +GNNGC+GG    ++ +I
Sbjct: 135 NQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYI 194

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY G D +CH        +  GFV++   SE  LK A+A  GP+SVA
Sbjct: 195 RDNKGIDTEASY-PYEGIDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDAS  SF FY +GVY + KC  S + LDH VL VG+G   + G+ YW VKNSWS  WG+
Sbjct: 254 IDASHMSFQFYSHGVYVESKC--SSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGD 311

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  K+N CG+ ++ +Y
Sbjct: 312 QGYIKMARNKENMCGIASSASY 333


>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
          Length = 288

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 78  VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 135

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 136 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 194

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 195 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 252

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 253 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 288


>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
          Length = 329

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/222 (44%), Positives = 136/222 (61%), Gaps = 8/222 (3%)

Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           +A+P  V +  KGY   + +Q  CGSCW+F TTG++EG ++    KL  LS+Q L+DCS 
Sbjct: 105 EALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSR 164

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
             GNNGC+GG     + +I ++ G+ T++ Y PY G+D  C     +  A + GFV+V  
Sbjct: 165 VEGNNGCNGGLMDNGFTYIQQNGGIDTEESY-PYTGKDGDCAFNENSVGARVKGFVDVPQ 223

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
             E AL+ A+A  GPVSVAIDAS  SF +Y  GVY +  C+ S   LDH VL VGYG  +
Sbjct: 224 RDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQ--LDHGVLVVGYGTEN 281

Query: 311 GKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           G  YW VKNSW   WG  GY+ +M  K+N CG+ +  +Y T+
Sbjct: 282 GVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASYPTV 323


>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
          Length = 329

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/219 (45%), Positives = 133/219 (60%), Gaps = 8/219 (3%)

Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           S+    + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC  
Sbjct: 114 SRAPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 172

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              N+GC GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +  
Sbjct: 173 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCKGYREIPE 230

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
            +E ALK A+A+ GP+SVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   
Sbjct: 231 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS--DNLNHAVLAVGYGVQK 288

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           G  +W +KNSW   WGN+GY+LM+   NN CG+    ++
Sbjct: 289 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN G
Sbjct: 102 VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 161

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    ++  PY  +D   C        A  TGFV++    E AL
Sbjct: 162 CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 221

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+YY+  C  S + LDH VL VGYG    ++DGK
Sbjct: 222 MKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGEDVDGK 279

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 280 KYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 316


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 136/217 (62%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F TTGA+EG ++ +  +L  LS+Q L++CS   GN G
Sbjct: 121 VDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    +D  PY+G  D  CH       A  TGFV++    E AL
Sbjct: 181 CNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GPVSVAIDA   SF FY +G+Y++ +C+++   LDH VL VGYG    + DGK
Sbjct: 241 MKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTD--LDHGVLVVGYGVEKRDTDGK 298

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSWS   G  GY+LM+  KDN+CG+ TA +Y
Sbjct: 299 KYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASY 335


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 96/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG   +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY  ++  C      + A   G+V++   +E  LK A+A  GPVSVA
Sbjct: 199 KDNKGLDTEVTY-PYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C  S + LDH VLAVGYG + +G+ YW VKNSW   WG+ 
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDN 315

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K N+CG+ +  +Y
Sbjct: 316 GYIKMARNKLNHCGIASTASY 336


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 97/219 (44%), Positives = 135/219 (61%), Gaps = 8/219 (3%)

Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           +A+P  V +  KG    + +Q  CGSCW+F    ++EG + +K  KL  LS+Q L+DCS 
Sbjct: 119 EALPATVDWTTKGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSA 178

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
             G+ GC GG    ++++++++ G+ T+  Y PY   D  C     +  AT+  FV+V  
Sbjct: 179 AEGDMGCSGGWMDYAFKYVIQNRGIDTEASY-PYKAIDESCEFKRNSIGATIHSFVDVKT 237

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
             E AL+ A+A  GP+SVAIDASQ SF FY +GVY +  C  S + LDH V AVGYG L+
Sbjct: 238 GDESALQNAVASIGPISVAIDASQPSFQFYSSGVYNEPDC--STEILDHGVTAVGYGTLN 295

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G PYW+VKNSW T WG +GY+ MS  K N CG+ T  +Y
Sbjct: 296 GVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGIATKASY 334


>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
          Length = 344

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/201 (46%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 144 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKYI 203

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C      + A   GFV++    E+ L  A+A  GP+SVA
Sbjct: 204 KDNGGIDTEKSY-PYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVGPISVA 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ++F FY  GVYYDE C+++   LDH V+ VGYG E +G  YW VKNSW   WG  
Sbjct: 263 IDASQETFQFYSKGVYYDENCSSTD--LDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGEL 320

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K+N+CG+ ++ +Y
Sbjct: 321 GYIKMAHNKNNHCGIASSASY 341


>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
          Length = 336

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 142/221 (64%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS  +GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/219 (43%), Positives = 135/219 (61%), Gaps = 8/219 (3%)

Query: 135 DAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           +A+P  V +  KG    + +Q  CGSCW+F    ++EG + +K  KL  LS+Q L+DCS 
Sbjct: 119 EALPATVDWTTKGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSA 178

Query: 192 GYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
             G+ GC GG    ++++++++ G+ T+  Y PY   D  C     +  AT+  FV+V  
Sbjct: 179 AEGDMGCSGGWMDYAFKYVIQNRGIDTEASY-PYKAIDESCEFKRNSVGATIHSFVDVKT 237

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
             E AL+ A+A  GP+SVAIDA+Q SF FY +GVY +  C  S + LDH V AVGYG L+
Sbjct: 238 GDESALQNAVASIGPISVAIDAAQPSFQFYSSGVYNEPDC--STEILDHGVTAVGYGTLN 295

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G PYW+VKNSW T WG +GY+ MS  K N CG+ T  +Y
Sbjct: 296 GAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGIATKASY 334


>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
          Length = 503

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/216 (44%), Positives = 133/216 (61%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNAG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    +++++  +G    ++  PYL QD  C      + A  TGF ++  + E++L 
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAANDTGFADIHQD-EESLM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
           L++A  GP+SVAIDAS  +F FY  G+YYD  C  S + LDH VL VGYG    E + K 
Sbjct: 237 LSVATVGPISVAIDASLDTFRFYYKGIYYDPNC--SSEDLDHGVLVVGYGSDEREAENKN 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNSW T WG QGY+LM+  + N+CG+ T+ ++
Sbjct: 295 YWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASF 330



 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 58/113 (51%), Positives = 71/113 (62%), Gaps = 8/113 (7%)

Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
           + A +TG VNV P  E+A+ LA+A  GPVS AI AS  SF F   G+YYD  C  S + L
Sbjct: 390 SAADVTGPVNV-PQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNC--SSEDL 446

Query: 298 DHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTA 345
           DH VL VGYG    E + K YW VKNSW T WG QGY+L+    DN+C + T+
Sbjct: 447 DHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDNHCEITTS 499


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/201 (47%), Positives = 130/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG   +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY  ++  C      + A   G+V++   +E  LK A+A  GPVSVA
Sbjct: 199 KDNKGLDTEVTY-PYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C  S + LDH VLAVGYG + +G+ YW VKNSW   WG+ 
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDN 315

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K N+CG+ +  +Y
Sbjct: 316 GYIKMARNKLNHCGIASTASY 336


>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With The Covalent Inhibitor Nvp-Abe854
 gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With The Covalent Inhibitor Nvp-Abi491
 gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With The Covalent Inhibitor Nvp-Abj688
 gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
           Complex With Human Cathepsin K
          Length = 217

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 7   VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 65  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 124 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 181

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 182 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 217


>gi|281398206|gb|ADA67926.1| putative peptidase [Bombyx mori]
          Length = 696

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 92/241 (38%), Positives = 131/241 (54%), Gaps = 16/241 (6%)

Query: 111 RLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGA 170
           R    + E++ +   W R+   +    PVR+          Q  C SCW+F   GAVEGA
Sbjct: 423 RRARRDAERLPRNFDWRRHGAVT----PVRF----------QGSCSSCWAFAVVGAVEGA 468

Query: 171 YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY 230
            +++  +L  LS+Q LIDC+  YG +GC G     +Y +I   GL   D+Y PY  +   
Sbjct: 469 LFLRTSRLVPLSEQDLIDCAHPYGGHGCSGTWPRAAYDYIQDQGLRALDEYLPYEAKVLQ 528

Query: 231 CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC 290
           C    T     ++G VNVT NSE ALK A+ ++GP  V +D   K F ++  GV YD +C
Sbjct: 529 CAARQTRPVTHISGHVNVTRNSELALKAAIRRNGPNVVLVDGQSKGFIYHKKGVLYDNRC 588

Query: 291 NNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
                  +HAVLAVG+GE  G+ ++ VKNSWS+ W N+GY  +    N CGV+T P+Y  
Sbjct: 589 KKR--SRNHAVLAVGWGERRGESFFVVKNSWSSNWANEGYGWLHAPSNTCGVLTEPSYPV 646

Query: 351 M 351
           +
Sbjct: 647 L 647


>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
          Length = 336

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 101/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CGV T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336


>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
           Vinyl Sulfone Inhibitor
 gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Azepanone Inhibitor
 gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Azepanone Inhibitor
 gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Azepanone Inhibitor
 gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Azepanone Inhibitor
 gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
           Oxoethylcarbamate
 gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
 gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
 gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
 gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
 gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
           Inhibitor
 gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
           Inhibitor
 gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
           Myocrisin
 gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
 gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
 gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
           Substituted Azepan-3-One Compound
 gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
           Substituted Azepan-3-One Compound
 gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With The Covalent Inhibitor E-64
 gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Symmetric Diacylaminomethyl
           Ketone Inhibitor
 gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Propanone Inhibitor
 gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Pyrrolidinone Inhibitor
 gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
           K In Complex With A Covalent Pyrrolidinone Inhibitor
 gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
           In Complex With A Covalent Symmetric Biscarbohydrazide
           Inhibitor
 gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
           In Complex With A Covalent Thiazolhydrazide Inhibitor
 gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
           In Complex With A Covalent
           Benzyloxybenzoylcarbohydrazide Inhibitor
 gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
           In Complex With A Covalent Peptidomimetic Inhibitor
 gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin Sulfate
           Complex.
 gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
           Triazine Ligand
 gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
           Pyrimidine Inhibitor
 gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective 2-Cyano-Pyrimidine
           Inhibitor
 gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
           Inhibitor
 gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
           Inhibitor
 gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
           Inhibitor With A Benzyl P3 Group.
 gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
           Inhibitor With Improved Selectivity Over Herg
 gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
 gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
 gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
          Length = 215

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 5   VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 63  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 121

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 179

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 180 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215


>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
           Norleucine Aldehyde
          Length = 214

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 4   VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 61

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 62  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 120

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 121 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 178

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 179 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 214


>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/214 (45%), Positives = 132/214 (61%), Gaps = 6/214 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  C SCW+F   GA+EG ++ K  KL  LS Q L+DCS   GNNG
Sbjct: 121 VDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CDGG    ++Q++  +G    ++  PY+ QD  C      + A +TGFV +    E AL 
Sbjct: 181 CDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNECKYQPECSGANVTGFVKIPSTDERALM 240

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYW 315
            A+A  GP+SVAIDA   SF FY +GVYYD +C++S   L+H VL VGYG    +G+ YW
Sbjct: 241 KAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQ--LNHGVLVVGYGSEGKNGRKYW 298

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG+ GYVLM+  +DN+CG++T  +Y
Sbjct: 299 IVKNSWGENWGDNGYVLMAKDEDNHCGIITDASY 332


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/201 (46%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG   +++Q+I
Sbjct: 145 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 204

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY  ++  C      + A   G++++    E  LK A+A  GPVSVA
Sbjct: 205 KDNKGLDTEASY-PYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATIGPVSVA 263

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C  S + LDH VL +GYG   +G+ YW VKNSW   WGN 
Sbjct: 264 IDASHQSFQFYSEGVYYEPEC--SSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNN 321

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K N+CG+ ++ +Y
Sbjct: 322 GYIKMARNKLNHCGIASSASY 342


>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
           Ketoamide Warhead
          Length = 213

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 3   VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 60

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 61  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 119

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 120 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGIQKGNKHWI 177

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 178 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 213


>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
 gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
          Length = 336

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 101/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CGV T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/201 (46%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG   +++Q+I
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY  ++  C      + A   G++++   +E  LK A+A  GPVSVA
Sbjct: 199 KDNKGLDTEASY-PYEAENDKCRYNPANSGAIDVGYIDIPTGNEKLLKAAVATIGPVSVA 257

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVYY+ +C  S + LDH VL +GYG   +G+ YW VKNSW   WGN 
Sbjct: 258 IDASHQSFQFYSEGVYYEPEC--SSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNN 315

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K N+CG+ ++ +Y
Sbjct: 316 GYIKMARNKLNHCGIASSASY 336


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 134/204 (65%), Gaps = 8/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGAVEGA+ +K   +   S+Q L+DCS  YGNNGCDGG    ++++I
Sbjct: 136 DQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYI 195

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVSV 268
           + + G+ T++ Y PY      C + NTT   T ++G+ +V   SE AL  A++K  PV+V
Sbjct: 196 IDNDGIATEEAY-PYTATQNRC-VYNTTMLGTAISGYKDVPRGSESALTAAISKQ-PVAV 252

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AIDAS  +F  Y +GVY +  C  S   L+H VLAVGYG L+GK Y+ VKNSW+  WGNQ
Sbjct: 253 AIDASPITFQLYKSGVYQEATC--SSYRLNHGVLAVGYGTLEGKDYYIVKNSWAETWGNQ 310

Query: 329 GYVLMSIKDNN-CGVMTAPTYVTM 351
           GY+LM+   NN CG+ T  +Y ++
Sbjct: 311 GYILMARNANNHCGIATMASYASV 334


>gi|392922428|ref|NP_001256719.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
 gi|379657173|emb|CCG28194.1| Protein CPL-1, isoform b [Caenorhabditis elegans]
          Length = 198

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 91/198 (45%), Positives = 129/198 (65%), Gaps = 6/198 (3%)

Query: 154 VCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-K 212
           +CGSCW+F  TGA+EG +  K  +L  LS+Q L+DCS  YGN+GC+GG   +++++I   
Sbjct: 1   MCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDN 60

Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
           HG+ T++ Y PY G+D  CH    T  A   G+V+     E+ LK+A+A  GP+S+AIDA
Sbjct: 61  HGVDTEESY-PYKGRDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDA 119

Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYV 331
             +SF  Y  GVYYDE+C  S + LDH VL VGYG + +   YW VKNSW   WG +GY+
Sbjct: 120 GHRSFQLYKKGVYYDEEC--SSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYI 177

Query: 332 LMSI-KDNNCGVMTAPTY 348
            ++  ++N+CGV T  +Y
Sbjct: 178 RIARNRNNHCGVATKASY 195


>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/213 (46%), Positives = 132/213 (61%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F  TGA+EG ++ K   L  LS+Q L+DCS  YGN G
Sbjct: 122 VDWREQGYVTGVKDQKQCGSCWAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ T+  Y PY  +D  C     +  AT +G+V+V    E+AL
Sbjct: 182 CNGGWMDSAFRYIEANGGIDTEASY-PYEAEDWLCRYNPASVGATCSGYVDVNKYDEEAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GPVSVAIDAS  SF FY +GVY +  C++    LDH VLAVGYG  +G  YW 
Sbjct: 241 KEAVATIGPVSVAIDASHASFQFYTSGVYDEPGCSSIE--LDHGVLAVGYGTENGHDYWL 298

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG  GY+ MS  K N CG+ +A +Y
Sbjct: 299 VKNSWGRGWGEMGYIKMSRNKHNQCGIASAASY 331


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/204 (47%), Positives = 134/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V  ++E ALK A+A  GPVSV
Sbjct: 191 KANGGLDTEESY-PYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VL VGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 133/216 (61%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF  TG++EG  + K  KL  LS+Q L+DCS   GN+G
Sbjct: 118 VDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHG 177

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q+I   +G+ T+  Y PY  ++  C        AT +GF ++   SE  L
Sbjct: 178 CQGGLMDDAFQYIKDNNGIDTESSY-PYEAKNGKCRFNAANVGATDSGFTDIKSKSESDL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP++VAIDAS  SF  Y +GVY++  C+ +   LDH VLAVGYG   GK YW 
Sbjct: 237 QSAVATVGPIAVAIDASHMSFQLYKSGVYHEFFCSETR--LDHGVLAVGYGTESGKDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW   WG +GY++MS  K NNCG+ T+ +Y T+
Sbjct: 295 VKNSWGESWGQKGYIMMSRNKRNNCGIATSASYPTV 330


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 93/189 (49%), Positives = 120/189 (63%), Gaps = 4/189 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + +Q  CGSCW+F TTG++EG + +K  KL  LS+Q L+DCS   GN+GCDGG    ++ 
Sbjct: 128 IKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFT 187

Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I K +G+ T+  Y PY G+D  C    +   AT+TGFV+VT  SE  L+ A A  GP+S
Sbjct: 188 YIKKNNGIDTEQSY-PYTGEDGTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPIS 246

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDAS   F  Y +GVY    C+ +   LDH VL VGYG  DG  YW VKNSW T WG+
Sbjct: 247 VAIDASSWDFQLYESGVYDVSDCSTTE--LDHGVLVVGYGTDDGTAYWLVKNSWGTDWGH 304

Query: 328 QGYVLMSIK 336
            GY+ MS K
Sbjct: 305 HGYIQMSRK 313


>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EG +Y K  +L  LS+Q LIDCS  YGNNGC+GG    ++Q++
Sbjct: 167 NQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYV 226

Query: 211 MKH-GLPTQDDYGPYLGQDA----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + G+ ++  Y PY+  D      C    T   A +TG++N+    E AL  A+   GP
Sbjct: 227 RDNEGIDSEISY-PYISGDGDENVRCLFNFTNIMAQVTGYINIHEGDERALMNAVTTIGP 285

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVAI+A   SFS Y +G+Y D +C ++ + LDH VL VGYG  DGKPYW +KNSW   W
Sbjct: 286 VSVAINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDW 345

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++GYV ++    N C V +A +Y
Sbjct: 346 GDKGYVKILKDSKNMCSVASAASY 369


>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 365

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/215 (44%), Positives = 139/215 (64%), Gaps = 9/215 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG ++ K  KL  LS+Q L+DCS   GN+G
Sbjct: 152 VDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSG 211

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++ ++ G+ T++ Y PY+  D  C      + A +TG+V++    E AL
Sbjct: 212 CQGGLMDNAFEYVKENGGIDTEESY-PYIAADDTCQYKPQYSGANITGYVDIPSRMEKAL 270

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--PY 314
           + A+A  GP+SVAIDA   SF FY +GVYY+ +C  S + LDH VLAVGYG + GK   Y
Sbjct: 271 EKAVATVGPISVAIDAGHSSFQFYRSGVYYEPEC--SSEDLDHGVLAVGYG-VQGKNGKY 327

Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           W VKNSW   WG+ GY+LM+  ++N+CG+ TA +Y
Sbjct: 328 WIVKNSWGEEWGDSGYILMARDRNNHCGIATAASY 362


>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
          Length = 215

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 131/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 5   IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 63  CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 121

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE C  S D L+HAVLAVGYG   G  +W 
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENC--SSDNLNHAVLAVGYGIQKGNKHWI 179

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 180 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 215


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 168 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 227

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +  A+A  GPV
Sbjct: 228 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 286

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           SVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW VKNSW T W
Sbjct: 287 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 344

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  K+N CG+ +A +Y
Sbjct: 345 GDKGFIKMLRNKENQCGIASASSY 368


>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
 gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
 gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
 gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
          Length = 331

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 126/201 (62%), Gaps = 7/201 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G++EG  + K  KL  LS Q L+DCSW  GN GCDGG    ++Q++
Sbjct: 132 DQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYV 191

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T   Y PY   +  C      + AT+TGFVNV  +SEDAL  A+A  GP+SV 
Sbjct: 192 KDNGGLDTSVSY-PYEALNGTCRYNPKNSAATVTGFVNVQ-SSEDALMKAVATVGPISVG 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID   KSF FY  G+YY+  C+++   LDHAVL VGYG E DG+ YW VKNSW   WG  
Sbjct: 250 IDTKHKSFQFYKEGMYYEPDCSSTV--LDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMN 307

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  ++NNCG+ +  +Y
Sbjct: 308 GYIKMAKDRNNNCGIASDASY 328


>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
          Length = 344

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 144 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 203

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  C      + A   GFV++    E+ L  A+A  GP+SVA
Sbjct: 204 KDNGGIDTEKSY-PYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLMQAVATVGPISVA 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDASQ++F FY  GVYYDE C+++   LDH V+ VGYG E DG   W VKNSW   WG  
Sbjct: 263 IDASQETFQFYSKGVYYDENCSSTD--LDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGEL 320

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  K+N+CG+ ++ +Y
Sbjct: 321 GYIKMARNKNNHCGIASSASY 341


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/218 (45%), Positives = 137/218 (62%), Gaps = 10/218 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTG++EG ++ K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQG 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++++I  +G    ++  PY+ +D   C   +    A  TGFV+V    E AL
Sbjct: 181 CNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERAL 240

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDG 311
             A+A  GPVSVAIDAS  +F FY +G+YYD  C  S + LDH VL VGYG     + + 
Sbjct: 241 MKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDC--SSEELDHGVLVVGYGFEGTDDDNK 298

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSWS  WG++GY+LM+  ++N+CG+ TA +Y
Sbjct: 299 KKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASY 336


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 106/234 (45%), Positives = 138/234 (58%), Gaps = 9/234 (3%)

Query: 119 KVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
           K +K S ++  N   K    V +  KGY   + DQ  CGSCW+F  TG++EG  + K  K
Sbjct: 102 KAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGK 161

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANT 236
           L  LS+Q L+DCS  Y N GC GG   R++Q+I+  G + T+  Y  Y   D  CH    
Sbjct: 162 LVSLSEQNLVDCS--YRNYGCHGGFMDRAFQYIIDAGGIDTEATY-SYRAVDGNCHFKKA 218

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
              AT+TG+ +VT  SE AL+ A+A  GP+SVAIDAS K F FY +GVY +  C+ +   
Sbjct: 219 NVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTR-- 276

Query: 297 LDHAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           L HAVL VGYG   DG  YW VKNSW+  WG  GY+ MS  KDN CG+ +  +Y
Sbjct: 277 LGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASY 330


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 172 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 231

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +  A+A  GPV
Sbjct: 232 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 290

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           SVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW VKNSW T W
Sbjct: 291 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 348

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  K+N CG+ +A +Y
Sbjct: 349 GDKGFIKMLRNKENQCGIASASSY 372


>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/205 (46%), Positives = 129/205 (62%), Gaps = 7/205 (3%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++     L  LS+Q LIDCS  YGNNGC+GG    ++
Sbjct: 133 AVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAF 192

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Q+I  + GL T+  Y PY  ++  C      + AT  G+V++    E+ LK A+A  GP+
Sbjct: 193 QYIKDNKGLDTEKTY-PYEAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPI 251

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTY 324
           SVAIDAS +SF  Y  GVYYD  C  S + LDH VL VGYG  E  G  YW VKNSW   
Sbjct: 252 SVAIDASHESFQLYSEGVYYDPDC--SAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKT 309

Query: 325 WGNQGYVLMSI-KDNNCGVMTAPTY 348
           WG +GY+ M+  K+N+CG+ ++ +Y
Sbjct: 310 WGQKGYIKMARNKNNHCGIASSASY 334


>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
 gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
          Length = 336

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 142/221 (64%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +G+   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CGV T+ +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPLM 336


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 127/200 (63%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           D   CGSCW+F +TG++ G  ++K+KKL  LS+Q L+DCS  YGN+GCDGG   +++Q+I
Sbjct: 129 DPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYI 188

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  +D  C     +   T  G+V++    E+ALK A+A+ GP+SVA
Sbjct: 189 KGNGGIDTEGSY-PYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVA 247

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA   SF FY  G+Y +  C+N+   LDH VL VGYG  +G+ YW VKNSW   WG  G
Sbjct: 248 IDAGNLSFQFYSEGIYDEPFCSNTE--LDHGVLVVGYGTENGQDYWLVKNSWGPSWGENG 305

Query: 330 YVLMSIKDNN-CGVMTAPTY 348
           Y+ ++   NN CG+ +  +Y
Sbjct: 306 YIKIARNHNNHCGIASMASY 325


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 140/221 (63%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS  +GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C +    LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQ---LDHAVLVVGYGYQGADVAG 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335


>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
 gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
          Length = 336

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 141/221 (63%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336


>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
          Length = 329

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 131/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SDNDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SV IDAS  SF FY  GVYYDE CN+  D ++HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPISVGIDASLTSFQFYSKGVYYDESCNS--DNVNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 133/221 (60%), Gaps = 8/221 (3%)

Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S D +P  V +  KG    + DQ  CGSCWSF  TG++EG  ++K  KL  LS+Q L+DC
Sbjct: 110 SNDKLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDC 169

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           S  YGNNGC+GG   +++Q++  + G+ T+  Y PY  +D  C         T  G+V++
Sbjct: 170 SKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSY-PYEARDYACRFKKDKVGGTDKGYVDI 228

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               E AL+ ALA  GP+SVAIDAS +SF FY  GVY +  C  S   LDH VLAVGYG 
Sbjct: 229 PEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYC--SSYDLDHGVLAVGYGT 286

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
            +G+ YW VKNSW   WG  GY+ ++    N+CG+ +  +Y
Sbjct: 287 ENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASY 327


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +  A+A  GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPV 256

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           SVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G  YW VKNSW T W
Sbjct: 257 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTW 314

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  KDN CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKDNQCGIASASSY 338


>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 96/204 (47%), Positives = 133/204 (65%), Gaps = 9/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CG CW+F TTG++EG +  K  KL  LS+Q L+DCS  +GN GC GG   +++Q+I
Sbjct: 131 DQGECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + GL T++ Y PY    D  C   N++  AT+ G+ +V   +E ALK A+A  GPVSV
Sbjct: 191 PANGGLDTEESY-PYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSV 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---KPYWQVKNSWSTYW 325
           AIDA  +SF FY +GVY + +C  S + LDH VLAVGYG ++    + +W VKNSW   W
Sbjct: 250 AIDAGHESFQFYSSGVYDEPQC--STEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSW 307

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G+QGY++MS  K+N CG+ T+ +Y
Sbjct: 308 GDQGYIMMSRNKNNQCGIATSASY 331


>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
 gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
 gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
 gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
 gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
          Length = 329

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 98/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFEYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVY+DE CN+  D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSRGVYFDESCNS--DNLNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +  A+A  GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 256

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           SVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW VKNSW T W
Sbjct: 257 SVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 314

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  K+N CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKENQCGIASASSY 338


>gi|188501707|gb|ACD54818.1| cathepsin L precursor-like protein [Adineta vaga]
          Length = 331

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 8/213 (3%)

Query: 139 VRYEMKGYNSLL-DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY + + DQ  CGSCW+F TTGA+EG ++ K  +L  LS+Q L+DCS    N G
Sbjct: 121 VDWRTKGYVTYVKDQGQCGSCWAFSTTGALEGQHFAKTSQLIPLSEQNLVDCS--LLNFG 178

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +Y +I M HG+ +++ Y PY  +   C        AT+TG   +  ++E  L
Sbjct: 179 CNGGNQDLAYDYIKMHHGISSEESY-PYWAERDICQFNKIHIAATITGHARIKKHNETDL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP++V+IDA Q SF FY +GVY +  C  S   LDHAVLAVGYG L+ K Y+ 
Sbjct: 238 QAAIATIGPIAVSIDAGQGSFQFYSSGVYDEPNC--STKRLDHAVLAVGYGTLNSKDYYI 295

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW + WG +GY+LMS  K N CG+ T+  Y
Sbjct: 296 VKNSWGSSWGIRGYILMSRNKQNQCGIATSALY 328


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 92/201 (45%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG +Y K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH    T  AT  GFV++   +E  +  A+A  GPV+VA
Sbjct: 201 KDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVAVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY +  C+     LDH VL VG+G +  G+ YW VKNSW T WG++
Sbjct: 260 IDASHESFQFYSEGVYNEPACD--AQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDK 317

Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
           G++ ++  K+N CG+ +A +Y
Sbjct: 318 GFIKMLRNKENQCGIASASSY 338


>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 282

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 95/218 (43%), Positives = 136/218 (62%), Gaps = 9/218 (4%)

Query: 135 DAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
           DA+  R E  GY   + +Q +CGSCW+F  TG++EG +     KL  LS+Q L+DCS  +
Sbjct: 67  DAVDWRDE--GYVTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADF 124

Query: 194 GNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
           GNNGC+GG    +++++ + HG+ T++ Y PY  +   CH       A  TGFV++    
Sbjct: 125 GNNGCNGGLMDFAFEYVKQNHGIDTEESY-PYKAKQKKCHFQKANVGADDTGFVDLPEAD 183

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDG 311
           E+ LK A+A  GPVSVAIDA  +SF  Y  GVYY++ C  SP+ LDH VL VGYG + + 
Sbjct: 184 EEQLKAAVASQGPVSVAIDAGHRSFRLYKTGVYYEKHC--SPEQLDHGVLVVGYGTDPEH 241

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
             YW VKNSW   WG +GYV ++  ++N+CG+ +  +Y
Sbjct: 242 GDYWIVKNSWGEEWGEKGYVRIARNRNNHCGIASKASY 279


>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
          Length = 336

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 101/221 (45%), Positives = 140/221 (63%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGLMDLAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPSGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CGV T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336


>gi|156368930|ref|XP_001627944.1| predicted protein [Nematostella vectensis]
 gi|156214907|gb|EDO35881.1| predicted protein [Nematostella vectensis]
          Length = 315

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 112 VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 171

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 172 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 231

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 232 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 288

Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
            +G PYW +KNSWS  WGN G++
Sbjct: 289 DNGVPYWLIKNSWSHKWGNNGFI 311


>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
          Length = 618

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 96/216 (44%), Positives = 130/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  +L  LS Q L+DC     N+G
Sbjct: 408 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGRLLDLSPQNLVDCV--ASNDG 465

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++    G+ ++D Y PY+GQD  C  + T   A   G+  V    E AL
Sbjct: 466 CGGGYMTNAFQYVHDNRGIDSEDAY-PYVGQDEPCRYSPTGKAAKCRGYREVPVGDEKAL 524

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPV+VAIDAS  SF FY  GVY+DE CN +   L+HA+LAVGYG   G  +W 
Sbjct: 525 KRAVARVGPVAVAIDASLSSFQFYSKGVYFDENCNGA--NLNHALLAVGYGAQKGAKHWI 582

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GYVLM+   NN CG+ +  ++  M
Sbjct: 583 IKNSWGEEWGNKGYVLMARNKNNACGIASLASFPKM 618


>gi|340368360|ref|XP_003382720.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 326

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 91/220 (41%), Positives = 133/220 (60%), Gaps = 5/220 (2%)

Query: 134 KDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           K A  V +  KG   ++ +Q  CGSCWSF TTG++EG +++K   L  LS+Q  +DCS  
Sbjct: 106 KVAATVDWREKGAVTAIKNQGKCGSCWSFSTTGSLEGQHFLKTGTLLSLSEQQFVDCSTK 165

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
           +GN+GC GG    +++++        +   PY  +D +C   +T       G+ ++  + 
Sbjct: 166 FGNHGCKGGTMDNAFRYLETVSGDETEMMYPYTAEDGFCKFRSTEGKVKCEGYKDIPRDD 225

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG- 311
           EDAL+ A+A  GP+SVAIDA   SF  Y  GVYY+  C+++   LDH VLAVGYG  +G 
Sbjct: 226 EDALREAVATVGPISVAIDAGHSSFQLYKEGVYYNPTCSSTK--LDHGVLAVGYGTYEGS 283

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           + YW VKNSW   WG +GY++MS  ++NNCG+ T  +Y T
Sbjct: 284 EEYWLVKNSWGPSWGMEGYIMMSRNRENNCGIATMASYPT 323


>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
 gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
 gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
 gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
 gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
          Length = 329

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 126/215 (58%), Gaps = 6/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--SENYG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    ++Q++ ++G    +D  PY+GQD  C    T   A   G+  +   +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A+ GPVSV+IDAS  SF FY  GVYYDE C+   D ++HAVL VGYG   G  YW +
Sbjct: 237 RAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGNKYWII 294

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           KNSW   WGN+GYVL++   NN CG+    ++  M
Sbjct: 295 KNSWGESWGNKGYVLLARNKNNACGITNLASFPKM 329


>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
 gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
          Length = 336

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 101/221 (45%), Positives = 139/221 (62%), Gaps = 11/221 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++ ++ GL ++  Y PYL +D   C        A  TGFV++   +E A
Sbjct: 179 CNGGLMDLAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKSTGFVDIPSGNEPA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C++S   LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSR--LDHAVLVVGYGYQGADVAG 295

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CGV T  +Y  M
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336


>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
 gi|1582620|prf||2119193A cathepsin L-related Cys protease
          Length = 324

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 142/227 (62%), Gaps = 11/227 (4%)

Query: 133 SKDAIP----VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           S DA P    V +  KG    + DQ  CGSCW+F  TG++EG +++K+ +L  L++Q L+
Sbjct: 101 STDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSATGSLEGQHFLKYGELVSLAEQQLV 160

Query: 188 DCSWG-YGNNGCDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           DC+ G Y N GC+GG   +++++I  +G + T+  Y PY  +D  C   + +  AT +GF
Sbjct: 161 DCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSY-PYEARDNTCRFNSNSVAATCSGF 219

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
           V++   SE          GP+SVAIDA+ +SF  Y +GVYY+  C++S   LDHAVLAVG
Sbjct: 220 VSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSSGVYYEPSCSSSQ--LDHAVLAVG 277

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           YG   G+ +W VKNSW T WG+ GY+ M+  ++NNCG+ T  +Y T+
Sbjct: 278 YGSEGGQDFWLVKNSWGTSWGSAGYINMARNRNNNCGIATDASYPTV 324


>gi|156368926|ref|XP_001627942.1| predicted protein [Nematostella vectensis]
 gi|156214905|gb|EDO35879.1| predicted protein [Nematostella vectensis]
          Length = 310

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 107 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 166

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 167 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 226

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  +Y +G+  D+ C+N     DHAVL +GYG 
Sbjct: 227 KRFNPIALKKALSYHGPATISINANPKSLKYYSDGIMSDKHCSNK---TDHAVLLIGYGS 283

Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
            +G PYW +KNSWS  WGN G++
Sbjct: 284 ENGVPYWLIKNSWSNKWGNNGFI 306


>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 96/212 (45%), Positives = 120/212 (56%), Gaps = 4/212 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF  TG++EG ++ K   L  LS+Q L+DCSW YGN G
Sbjct: 112 VDWRNQGYVTPVKDQGQCGSCWSFSATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYG 171

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    +Y +I   G    +   PY  Q+  CH   + A AT TG V +    E +L 
Sbjct: 172 CSGGLMESAYDYIRDAGGVQLESAYPYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLM 231

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+   GPV+VAIDAS   F  Y +GVY   +C++S   LDH VLA GYG   G  YW V
Sbjct: 232 QAVGTVGPVAVAIDASGYDFQLYESGVYDRSRCSSSS--LDHGVLAAGYGTEGGNDYWLV 289

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           KNSW   WG QGY+ MS  K N CG+ T   Y
Sbjct: 290 KNSWGPGWGAQGYIKMSRNKSNQCGIATMACY 321


>gi|1272388|gb|AAB17051.1| cysteine protease, partial [Spirometra mansonoides]
          Length = 216

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 88/212 (41%), Positives = 126/212 (59%), Gaps = 4/212 (1%)

Query: 141 YEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDG 200
           +E     S+ +Q  CGSCWSF   GA+EGA  +K   L  LS+Q L+DCSW YGN GC+G
Sbjct: 8   HEKGAVTSVKNQGQCGSCWSFSANGAIEGAIQIKMGILPTLSEQQLVDCSWEYGNQGCNG 67

Query: 201 GEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
           G    ++Q+  ++G+  + DY  Y  +D +C        A +TG+  +    E +L+ A+
Sbjct: 68  GFMSLAFQYAQRYGVEAEVDY-RYTAKDGFCRYQQDMVVANVTGYAELPQGDEASLQRAV 126

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
           A  GP+SV IDA+   F  Y +GV+  + C  SPD ++H VL +GYG  + +PYW VKNS
Sbjct: 127 AVIGPISVGIDANDPGFMSYSHGVFVSKTC--SPDDINHGVLVIGYGTENDEPYWLVKNS 184

Query: 321 WSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           W   WG QGYV M+   NN CG+ +  +Y T+
Sbjct: 185 WGRSWGEQGYVKMARNKNNMCGIASVASYPTV 216


>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
 gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
           Precursor
 gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
          Length = 329

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 98/216 (45%), Positives = 130/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE C  S D ++HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENC--SSDNVNHAVLAVGYGIQKGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 294 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 329


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C +    LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C +    LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 94/214 (43%), Positives = 132/214 (61%), Gaps = 7/214 (3%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG   ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNG
Sbjct: 128 VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 187

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +
Sbjct: 188 CNGGLMDNAFRYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKM 246

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GPVSVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G  YW
Sbjct: 247 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGDDYW 304

Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            VKNSW T WG++G++ ++  K+N CG+ +A +Y
Sbjct: 305 LVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY 338


>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
          Length = 331

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 130/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 121 IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 178

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++ ++ K  G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 179 CGGGYMTNAFHYVQKNQGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKAL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYD+ CN+  D L+HAVLAVGYG    K +W 
Sbjct: 238 KRAVARVGPISVAIDASLTSFQFYSKGVYYDKNCNS--DNLNHAVLAVGYGIQKRKKHWI 295

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+   NN CG+    ++  M
Sbjct: 296 IKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 331


>gi|156355008|ref|XP_001623469.1| predicted protein [Nematostella vectensis]
 gi|156210172|gb|EDO31369.1| predicted protein [Nematostella vectensis]
          Length = 301

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 82/185 (44%), Positives = 119/185 (64%), Gaps = 8/185 (4%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGGEDFRS 206
           Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W      +GNNGC GG  +++
Sbjct: 116 QGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGGWTWKA 175

Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           + W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V   +  ALK AL+ HGP 
Sbjct: 176 FSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALSYHGPA 235

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG  +G PYW +KNSWS  WG
Sbjct: 236 TISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSNNGVPYWLIKNSWSHKWG 292

Query: 327 NQGYV 331
           N G++
Sbjct: 293 NNGFI 297


>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
 gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
          Length = 335

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C +    LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 139/221 (62%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF +TGA+EG  + K  KL  +S+Q L+DCS   GN G
Sbjct: 119 VDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q++ ++ GL ++  Y PYL +D   C        A +TGFV++   +E A
Sbjct: 179 CNGGIMDQAFQYVKENKGLDSEQSY-PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDAS +S  FY +G+YY+  C +    LDHAVL VGYG    ++ G
Sbjct: 238 LMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR---LDHAVLVVGYGYQGADVAG 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSWS  WG++GY+ M+  K+N+CG+ T  +Y  M
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335


>gi|28932702|gb|AAO60045.1| midgut cysteine proteinase 2 [Rhipicephalus appendiculatus]
          Length = 564

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 119/292 (40%), Positives = 168/292 (57%), Gaps = 15/292 (5%)

Query: 69  DFKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLW 126
           D + +I+R  L F+  +   N        ++ +     I + + RL + +G   S+   +
Sbjct: 279 DRRRDIFRQNLRFIDSKNRANLGYNLAVNHLADRTREEISVLRGRLQSKDGS--SRAEPF 336

Query: 127 VRYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
            R+   +K  +P + + + Y ++    DQ+VCGSCWSFGT G +EGAY+ K  +L  LS+
Sbjct: 337 PRHRFTAK--LPDQIDWRPYGAVTPVKDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSE 394

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATM 242
           Q L+DCSW  GNNGCDGGEDFR+Y++I  HGL + +DYG Y+GQD  CH +   +T +++
Sbjct: 395 QQLVDCSWNNGNNGCDGGEDFRAYEYIADHGLASDEDYGAYIGQDGVCHDSKVNSTISSI 454

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
             +VN+T  + D L  ALA  GPVSV+IDA+ +SFSFY              D LDH+VL
Sbjct: 455 KSYVNIT--NRDDLPTALANVGPVSVSIDAALRSFSFYPTVSSMIPTAAMDTDSLDHSVL 512

Query: 303 AVGYGE-LDGKPYWQVKNSWSTYWGN-QGYVLMSIKDNNC-GVMTAPTYVTM 351
                  L G+PYW VKNSW    G   GYVL+S K     GV T  TYV +
Sbjct: 513 RQSATRTLQGEPYWGVKNSWVYLLGEMMGYVLISPKGTTTGGVATQGTYVEL 564



 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLL 150
           E+++L     ++VSKY+LWV  +   + A+P RY M+GYN+LL
Sbjct: 154 ERFQLTVPTRDRVSKYTLWVSRDSQGR-AVPRRYLMRGYNTLL 195


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 91/201 (45%), Positives = 128/201 (63%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F +TGA+EG +Y K   L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH    +  AT  GFV++   +E  +  A+A  GPV+VA
Sbjct: 201 KDNGGIDTEKSY-PYEAIDDSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVAVA 259

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS +SF FY  GVY +  C+     LDH VL VG+G +  G+ YW VKNSW T WG++
Sbjct: 260 IDASHESFQFYSEGVYNEPACD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 317

Query: 329 GYV-LMSIKDNNCGVMTAPTY 348
           G++ ++  K+N CG+ +A +Y
Sbjct: 318 GFIKMLRNKENQCGIASASSY 338


>gi|344258279|gb|EGW14383.1| Cathepsin L1 [Cricetulus griseus]
          Length = 295

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 134/217 (61%), Gaps = 8/217 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +   GY   + DQ  CG+CW+F   G++ G  + K  KL  LS+Q L+DCSW +GN G
Sbjct: 83  VDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPLSEQNLVDCSWSHGNIG 142

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++M + GL T + Y PY  ++  C      + A +TGFV + P +E +L
Sbjct: 143 CHGGLMQNAFQYVMDNGGLDTSESY-PYESRNTTCRYNPENSAANVTGFVKI-PANEYSL 200

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GP+S AID    SF FY  G+YY+ +C++S   LDHAVL VGYG E DG+ YW
Sbjct: 201 MKAVAIVGPISAAIDTKHHSFQFYRGGMYYEPECSSS--NLDHAVLVVGYGEESDGRKYW 258

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            VKNSW TYWG  GY+ M+  ++NNCG+ T   Y T+
Sbjct: 259 LVKNSWGTYWGMNGYIKMARDRNNNCGIATYAMYPTV 295


>gi|21617827|sp|P09648.1|CATL1_CHICK RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain
          Length = 218

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/213 (45%), Positives = 129/213 (60%), Gaps = 5/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++    KL  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGNQG 64

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++  +G    ++  PY  +D   C        A  TGFV++    E AL
Sbjct: 65  CNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERAL 124

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
             A+A  GPVSVAIDA   SF FY +G+YY+  C  S + LDH VL VGYG   GK YW 
Sbjct: 125 MKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC--SSEDLDHGVLVVGYGFEGGKKYWI 182

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW   WG++GY+ M+  + N+CG+ TA +Y
Sbjct: 183 VKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 215


>gi|156325009|ref|XP_001618537.1| hypothetical protein NEMVEDRAFT_v1g44827 [Nematostella vectensis]
 gi|156199245|gb|EDO26437.1| predicted protein [Nematostella vectensis]
          Length = 283

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 121/192 (63%), Gaps = 8/192 (4%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
           N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W      +GNNGC GG
Sbjct: 93  NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPEVYHGNNGCLGG 152

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
             ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V   +  ALK AL+
Sbjct: 153 WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALS 212

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
            HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG  +G PYW +KNSW
Sbjct: 213 YHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSDNGVPYWLIKNSW 269

Query: 322 STYWGNQGYVLM 333
           S  WGN G++ +
Sbjct: 270 SHKWGNNGFIKI 281


>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
          Length = 349

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW YGN GC+GG    ++Q++
Sbjct: 150 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 209

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T++ Y  Y   D  C      +   +TGFV V P SEDAL  A+A  GPVSV 
Sbjct: 210 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 267

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID    SF FY  G YY+  C+++   LDHAVL VGYG E DG+ YW VKNSW   WG  
Sbjct: 268 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 325

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  +DNNCG+ T   Y T+
Sbjct: 326 GYIKMAKDRDNNCGIATYAIYPTV 349


>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
          Length = 338

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW YGN GC+GG    ++Q++
Sbjct: 139 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 198

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T++ Y  Y   D  C      +   +TGFV V P SEDAL  A+A  GPVSV 
Sbjct: 199 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 256

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID    SF FY  G YY+  C+++   LDHAVL VGYG E DG+ YW VKNSW   WG  
Sbjct: 257 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 314

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  +DNNCG+ T   Y T+
Sbjct: 315 GYIKMAKDRDNNCGIATYAIYPTV 338


>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
          Length = 330

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW YGN GC+GG    ++Q++
Sbjct: 131 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T++ Y  Y   D  C      +   +TGFV V P SEDAL  A+A  GPVSV 
Sbjct: 191 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID    SF FY  G YY+  C+++   LDHAVL VGYG E DG+ YW VKNSW   WG  
Sbjct: 249 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 306

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  +DNNCG+ T   Y T+
Sbjct: 307 GYIKMAKDRDNNCGIATYAIYPTV 330


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +  A+A  GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPV 256

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           +VAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW VKNSW T W
Sbjct: 257 AVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTW 314

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  K+N CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKENQCGIASASSY 338


>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
           tropicalis]
 gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
          Length = 329

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 126/215 (58%), Gaps = 6/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q +CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVHNQGICGSCWAFSSVGALEGQLMKKTGKLVSLSPQNLVDCD--TDNYG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++ ++  +G    D   PY+GQD  CH       AT  G+  +   SE ALK
Sbjct: 177 CEGGYMTNAFGYVRDNGGIDSDAEYPYVGQDEGCHYNPADKAATCKGYKEIPVGSEKALK 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GPVSV+IDAS  SF FY  GVYYD  CN  PD ++HAVL VGYG   G  +W +
Sbjct: 237 RAVANVGPVSVSIDASLPSFQFYKKGVYYDSSCN--PDAVNHAVLVVGYGNEKGIKHWII 294

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           KNSW  +WG +GYVL++  K N CG+ +  ++  M
Sbjct: 295 KNSWGDWWGKKGYVLLARDKKNACGIASLASFPVM 329


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 127/204 (62%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNGC+GG    ++
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I  + G+ T+  Y PY   D  CH       AT  GF ++    E  +  A+A  GPV
Sbjct: 198 RYIKDNGGIDTEKSY-PYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPV 256

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYW 325
           +VAIDAS +SF FY  GVY + +C+     LDH VL VGYG +  G  YW VKNSW T W
Sbjct: 257 AVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTW 314

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G++G++ ++  KDN CG+ +A +Y
Sbjct: 315 GDKGFIKMLRNKDNQCGIASASSY 338


>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
          Length = 330

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW YGN GC+GG    ++Q++
Sbjct: 131 DQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T++ Y  Y   D  C      +   +TGFV V P SEDAL  A+A  GPVSV 
Sbjct: 191 KENRGLDTRESYA-YEAWDGPCRYDPKYSAVNITGFVKV-PLSEDALMNAVASVGPVSVG 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID    SF FY  G YY+  C+++   LDHAVL VGYG E DG+ YW VKNSW   WG  
Sbjct: 249 IDTHHHSFRFYRGGTYYEPDCSST--NLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMD 306

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  +DNNCG+ T   Y T+
Sbjct: 307 GYIKMAKDRDNNCGIATYAIYPTV 330


>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
 gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
          Length = 376

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 105/242 (43%), Positives = 136/242 (56%), Gaps = 43/242 (17%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTG+VEGA+ +K  KL  LS+Q L+DCS   GN GCDGG    ++ 
Sbjct: 137 IKDQGQCGSCWSFSTTGSVEGAHQIKTGKLVSLSEQNLVDCSGAEGNLGCDGGLMDNAFI 196

Query: 209 WIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           +I+++ G+ T+  Y PY  Q    C    T+  AT++G+VN+T  SE  L+ A+AK+GPV
Sbjct: 197 YIIQNKGIDTESSY-PYKAQSGTKCLFKPTSIGATLSGYVNITAGSESQLETAVAKNGPV 255

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDGKP-------- 313
           SVAIDAS  SF  Y +GVYY+ KC  SP  LDH VL VGYG     E +  P        
Sbjct: 256 SVAIDASHNSFQLYSSGVYYEPKC--SPTELDHGVLVVGYGVAKKDENNASPNKHQIRIR 313

Query: 314 ------------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
                                   YW VKNSW   WG QG++ MS  + NNCG+ +  +Y
Sbjct: 314 HNDDFGIDEIVTDSSSDDGRKTSQYWLVKNSWGVSWGMQGFIQMSKNRKNNCGIASCASY 373

Query: 349 VT 350
            T
Sbjct: 374 PT 375


>gi|297287735|ref|XP_002803218.1| PREDICTED: putative cathepsin L-like protein 6-like [Macaca
           mulatta]
          Length = 270

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 102/259 (39%), Positives = 151/259 (58%), Gaps = 12/259 (4%)

Query: 101 LVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGS 157
           ++  + L ++R V + G +  K+     + +     IP  V +  KGY   + +Q +CGS
Sbjct: 16  IIGSLALTEFRQVMN-GFQYQKHRKGKEFQEQHLLEIPTSVDWREKGYVTPVKNQGMCGS 74

Query: 158 CWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPT 217
           CW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G +GG    S++++ ++G   
Sbjct: 75  CWAFSATGALEGQMFWKTGKLISLSEQNLVDCSWPQGNEGYNGGFMDNSFRYVQENGGLD 134

Query: 218 QDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
            +   PY G+   C      + A  TGFV++ P+ E  L  A+A  GP+SVA+DAS  SF
Sbjct: 135 SEASYPYEGKVKTCRYNPKYSVANDTGFVDI-PSREKDLAKAVATVGPISVAVDASHFSF 193

Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLM 333
            FY  G+Y++ +C+  P+GLDHA+L VGYG    + D   YW VKNSW   WG  GY+ M
Sbjct: 194 QFYKKGIYFEPRCD--PEGLDHAMLTVGYGYEGADSDNNKYWLVKNSWGKNWGMDGYIKM 251

Query: 334 SI-KDNNCGVMTAPTYVTM 351
           +  + NNCG+ TA +Y T+
Sbjct: 252 AKDRRNNCGIATAASYPTV 270


>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
          Length = 351

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 135/220 (61%), Gaps = 11/220 (5%)

Query: 137 IPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           + V +  KG  S + +Q  CGSCW+F TTG++EG +  K   +  LS+Q L+DCS  YGN
Sbjct: 131 VEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDCSTSYGN 190

Query: 196 NGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
           +GC+GG    ++++I  + G+ T++ Y PY G+D  C        AT+TGFV +   +E 
Sbjct: 191 DGCNGGLMTNAFKYIKDNKGIDTEEAY-PYAGRDGDCKFKKNKVGATVTGFVEIPAGNEK 249

Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
            L+ ALA  GPVSVAIDA+ +SF  Y +GVY + +C+++   LDH VLAVGYG + GK Y
Sbjct: 250 KLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPECDSAQ--LDHGVLAVGYGSIHGKDY 307

Query: 315 WQVKNSWSTYWGNQGYVLMSIK------DNNCGVMTAPTY 348
           + VKNSW T WG QGY+  S           CG++   +Y
Sbjct: 308 YIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILLDASY 347


>gi|320543907|ref|NP_001188921.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
 gi|318068589|gb|ADV37168.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
          Length = 249

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 94/214 (43%), Positives = 133/214 (62%), Gaps = 7/214 (3%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG   ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNG
Sbjct: 36  VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 95

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++I  + G+ T+  Y PY   D  CH    T  AT  GF ++    E  +
Sbjct: 96  CNGGLMDNAFRYIKDNGGIDTEKSY-PYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKM 154

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GPVSVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW
Sbjct: 155 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 212

Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            VKNSW T WG++G++ ++  K+N CG+ +A +Y
Sbjct: 213 LVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY 246


>gi|156354961|ref|XP_001623447.1| predicted protein [Nematostella vectensis]
 gi|156210147|gb|EDO31347.1| predicted protein [Nematostella vectensis]
          Length = 294

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 126/203 (62%), Gaps = 11/203 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 91  VPDQLDWRDYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 150

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K G+ T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 151 PGVYHGNNGCLGGWTWKAFSWVKKFGIATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 210

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 211 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 267

Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
            +G PYW +KNSWS  WGN G++
Sbjct: 268 DNGVPYWLIKNSWSHKWGNNGFI 290


>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
          Length = 324

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 133/220 (60%), Gaps = 8/220 (3%)

Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           KD++P  V +  +GY   + DQ  CGSCW+F TTG++EG ++   K+L  LS+  L+DCS
Sbjct: 105 KDSLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTGSLEGQHFKATKQLVSLSESNLVDCS 164

Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
             +GN GC+GG    ++++I  + G+ T+  Y PY  +D  C+       AT   + ++T
Sbjct: 165 KKWGNQGCNGGLMDNAFKYIADNKGIDTEKSY-PYKPEDRKCNFKKANVGATDKLYKDIT 223

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
             SEDAL+ A+A  GP+SVAIDAS  SF  Y  GVY ++ C  S   LDH VLAVGY   
Sbjct: 224 SGSEDALQEAVATIGPISVAIDASHDSFQLYSGGVYNEKAC--STKTLDHGVLAVGYDSK 281

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           +G  YW VKNSW   WG  GY+ MS  K N CG+ T  +Y
Sbjct: 282 NGDDYWIVKNSWGKSWGIDGYIWMSRNKKNQCGIATMASY 321


>gi|327289219|ref|XP_003229322.1| PREDICTED: cathepsin K-like, partial [Anolis carolinensis]
          Length = 289

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 127/216 (58%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+E    MK  KL  LS Q L+DC     N+G
Sbjct: 79  VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEAQLKMKTGKLLNLSPQNLVDCV--SNNDG 136

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++ +  G+ + D Y PY+GQD  C    T   A   G+  +    E AL
Sbjct: 137 CGGGYMTNAFEYVHVNRGIDSDDTY-PYIGQDENCMYNPTGKAAKCRGYKEIPEGDEKAL 195

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSV IDAS  SF FY  GVYYDE CN   D ++HAVLAVGYG   G  +W 
Sbjct: 196 KRAVARKGPVSVGIDASLASFQFYSRGVYYDENCN--ADNINHAVLAVGYGSQKGTKHWI 253

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           VKNSW   WG++GY+LM+   NN CG+    ++  M
Sbjct: 254 VKNSWGEDWGDKGYILMARNMNNACGIANLASFPKM 289


>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
 gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
          Length = 334

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 130/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  VTP  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVTPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|45384464|ref|NP_990302.1| cathepsin K precursor [Gallus gallus]
 gi|25089842|sp|Q90686.1|CATK_CHICK RecName: Full=Cathepsin K; AltName: Full=JTAP-1; Flags: Precursor
 gi|1017831|gb|AAC59739.1| JTAP-1 [Gallus gallus]
          Length = 334

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 95/223 (42%), Positives = 132/223 (59%), Gaps = 8/223 (3%)

Query: 132 ASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           +S+    V +  KGY   + DQ  CGSCW+F + GA+EG    +  KL  LS Q L+ C 
Sbjct: 117 SSRAPAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCV 176

Query: 191 WGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
               NNGC GG    +++++ +  G+ ++D Y PY+GQD  C  + T   A   G+  + 
Sbjct: 177 --SNNNGCGGGYMTNAFEYVRLNRGIDSEDAY-PYIGQDESCMYSPTGKAAKCRGYREIP 233

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
            ++E ALK A+A+ GPVSV IDAS  SF FY  GVYYD  CN  P+ ++HAVLAVGYG  
Sbjct: 234 EDNEKALKRAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCN--PENINHAVLAVGYGAQ 291

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
            G  +W +KNSW T WGN+GYVL++      CG+    ++  M
Sbjct: 292 KGTKHWIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASFPKM 334


>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
          Length = 316

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/190 (48%), Positives = 125/190 (65%), Gaps = 9/190 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q++
Sbjct: 128 DQGQCGSCWAFSTTGALEGQHFRKSGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 187

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
               GL ++D Y PYLG  D  CH      +A  TGFV++    E AL  A+A  GPVSV
Sbjct: 188 KDNQGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDIPSGKEHALMKAVAAVGPVSV 246

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTY 324
           AIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DGK YW V NSWS  
Sbjct: 247 AIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFEGQDVDGKKYWIVNNSWSEK 304

Query: 325 WGNQGYVLMS 334
           WG++GY+ ++
Sbjct: 305 WGDEGYIYLA 314


>gi|156355022|ref|XP_001623476.1| predicted protein [Nematostella vectensis]
 gi|156210179|gb|EDO31376.1| predicted protein [Nematostella vectensis]
          Length = 195

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 83/190 (43%), Positives = 120/190 (63%), Gaps = 8/190 (4%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
           N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W      +GNNGC GG
Sbjct: 7   NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNGCLGG 66

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
             ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V   +  ALK AL+
Sbjct: 67  WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRVKRFNPIALKKALS 126

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
            HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG  +G PYW +KNSW
Sbjct: 127 YHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGSDNGVPYWLIKNSW 183

Query: 322 STYWGNQGYV 331
           S  WGN G++
Sbjct: 184 SHKWGNNGFI 193


>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
            griseus]
          Length = 1632

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 83/198 (41%), Positives = 121/198 (61%), Gaps = 3/198 (1%)

Query: 152  QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
            Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC+GG   +++++I+
Sbjct: 1432 QGSCGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYIL 1491

Query: 212  -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
               G+  +D Y PY G+D +C      A A +    N+T N E A+  A+A + PVS A 
Sbjct: 1492 YNKGIMGEDTY-PYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAF 1550

Query: 271  DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE DG PYW VKNSW T WG++GY
Sbjct: 1551 EVTD-DFMLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNWGDKGY 1609

Query: 331  VLMSIKDNNCGVMTAPTY 348
             L+    N CG+    +Y
Sbjct: 1610 FLIERGKNMCGLAACASY 1627


>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
          Length = 332

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 96/216 (44%), Positives = 131/216 (60%), Gaps = 10/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY  ++ +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G
Sbjct: 118 VDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  + GL +++ Y PY G+D  C     ++ A  TG+V++ P  E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESY-PYFGKDGSCKYKPQSSAANDTGYVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY---GELDGKP 313
             A+A  GP+SV IDAS +SF FY  G+Y++ +C  S + LDH VL VGY   G      
Sbjct: 236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQC--SSEDLDHGVLVVGYGVEGAHSNNK 293

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           YW VKNSW   WG  GY+ M+   NN CG+ T  +Y
Sbjct: 294 YWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASY 329


>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
          Length = 329

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 132/216 (61%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  V   +E AL
Sbjct: 177 CGGGYMTNAFQYVQQNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREVPVGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GP+SVAIDAS  SF FY  GVYYDE C+   D L+HAVLAVGYG   G  +W 
Sbjct: 236 KRAVARVGPISVAIDASLTSFQFYSKGVYYDESCDG--DNLNHAVLAVGYGIQRGHKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +KNSW   WGN+GYVL++  K+N CG+    ++  M
Sbjct: 294 LKNSWGENWGNKGYVLLARNKNNTCGIANLASFPKM 329


>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
          Length = 334

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 92/202 (45%), Positives = 122/202 (60%), Gaps = 6/202 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN+GC+GG   R++Q+++
Sbjct: 137 QGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVI 196

Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             HG+ +   Y PY+G+D  CH    T  A  + +  +    E+ALK  LA  GP+SVAI
Sbjct: 197 DNHGIDSDASY-PYIGRDDQCHYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA +  FSFY +GVY D  C      ++H VLAVGYG L+G+ YW VKNSW T +G+QGY
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQK---VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGDQGY 312

Query: 331 VLMSIK-DNNCGVMTAPTYVTM 351
           + M+    N CG+   P Y  M
Sbjct: 313 IRMARNTGNQCGIALYPCYPVM 334


>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
           Compl Chondroitin-4-Sulfate
 gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
           Compl Chondroitin-4-Sulfate
          Length = 215

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 99/216 (45%), Positives = 130/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 5   VDYREKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 62

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 63  CGGGYMTNAFQYVQKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 121

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN+  D L+HAVLAVGYGE  G  +W 
Sbjct: 122 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGYGESKGNKHWI 179

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WG  GY+ M+   NN CG+    ++  M
Sbjct: 180 IKNSWGENWGMGGYIKMARNKNNACGIANLASFPKM 215


>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
 gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
          Length = 334

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  VTP  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVTPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKKNHCGIATAASY 331


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 95/216 (43%), Positives = 130/216 (60%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F +TG++EG  + K+ KL  LS+Q L+DCS   GN G
Sbjct: 114 VDWRTKGYVTEVKNQGQCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMG 173

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG   +++ +I +  G+ T+  Y PY      C        A  TG+ ++   SE  L
Sbjct: 174 CGGGLMDQAFTYIKVNDGIDTETSY-PYEAASGKCRFNKANVGANDTGYTDIKSKSESDL 232

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP++VAIDAS  SF  Y +GVY+   C+ +   LDH VLAVGYG   GK YW 
Sbjct: 233 QSAVATVGPIAVAIDASHMSFQLYKSGVYHYIFCSQTR--LDHGVLAVGYGTDSGKDYWL 290

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW   WG QGY++MS  +DNNCG+ T  +Y T+
Sbjct: 291 VKNSWGATWGQQGYIMMSRNRDNNCGIATQASYPTV 326


>gi|47076309|emb|CAD89795.1| putative cathepsin L protease [Meloidogyne incognita]
          Length = 383

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 136/222 (61%), Gaps = 12/222 (5%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           AIP  V +  KG   S+ +Q +CGSCW+F  TGA+EG +  K   L  LS+Q LIDC+ G
Sbjct: 162 AIPESVDWRDKGLVTSVKNQGMCGSCWAFSATGALEGQHSRKLGTLVSLSEQNLIDCTKG 221

Query: 193 --YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNV 248
             YGN GC+GG    ++Q+I  + G+ T++ Y PY  ++   C    +   AT TG+V++
Sbjct: 222 EPYGNMGCNGGLMDNAFQYIEDNKGVDTENSY-PYKAKNGKKCLFKRSNVGATDTGYVDL 280

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               ED LK+A+A  GP+SVAIDA  +SF  Y +GVY +E C  SPD L H VL VGYG 
Sbjct: 281 PSGDEDKLKIAVATQGPISVAIDAGHRSFQLYAHGVYDEEAC--SPDNLGHGVLVVGYGT 338

Query: 309 LD-GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            D    YW VKNSW  +WG  GY+ MS  KDN CG+ +  +Y
Sbjct: 339 DDIHGDYWLVKNSWGEHWGENGYIRMSRNKDNQCGIASKASY 380


>gi|37963625|gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor]
          Length = 330

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 133/218 (61%), Gaps = 7/218 (3%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + DQ  CGSCWSF TTGAVEG   ++  +L  LS+Q LIDCS  YGN
Sbjct: 118 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 177

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GCDGG    ++ +I  +G+ ++  Y PY  Q  YC   ++ +  T++G+ ++    E++
Sbjct: 178 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 236

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+ + GPV+VAIDA+ +   FY  G++YD+ CN S   L+H VL VGYG  +G+ YW
Sbjct: 237 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 293

Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
            +KNSW + WG  GY    +++  NNCG+ TA +Y  +
Sbjct: 294 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 330


>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
           erinaceieuropaei]
 gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
           erinaceieuropaei]
          Length = 336

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 139/251 (55%), Gaps = 14/251 (5%)

Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
           + GI+L K R          K ++ V   +   D++  R E     S+ +Q  CGSCWSF
Sbjct: 99  LRGIVLTKLR---------RKEAVSVPLKENLPDSVNWR-ERGAVTSVKNQGQCGSCWSF 148

Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
              GA+EGA  +K   L  LS+Q L+DCSW YGN GC+GG   +++Q+  ++G+  + DY
Sbjct: 149 SANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY 208

Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
             Y  +D  C        A +TG+  +    E  L+ A+A  GP+SV IDA+   F  Y 
Sbjct: 209 -RYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267

Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-C 340
           +GV+  + C  SP  +DH VL VGYG  +G+ YW VKNSW + WG  GYV M+   NN C
Sbjct: 268 HGVFVSKTC--SPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSWGEGGYVKMARNRNNMC 325

Query: 341 GVMTAPTYVTM 351
           G+ +  +Y T+
Sbjct: 326 GIASMASYPTV 336


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 6/204 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   G++EG ++    KL  LS+Q L+DCS   GN+GC+GG   ++++++
Sbjct: 185 NQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYV 244

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              HG+ T+D Y PY+G D  CH  N +  AT+ GF++V    E+AL+ A+   GPVSVA
Sbjct: 245 KDNHGIDTEDSY-PYVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVA 303

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS   F FY  GVY    C+ S   LDH VL VGYG +  GK +W VKNSW   WG  
Sbjct: 304 IDASSMLFQFYRGGVYNVPWCSTSE--LDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIY 361

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ MS  K N CG+ +  +  T+
Sbjct: 362 GYIEMSRNKGNQCGIASKASIPTV 385


>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
          Length = 359

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/218 (44%), Positives = 134/218 (61%), Gaps = 8/218 (3%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P  V +   GY   + +Q  C SCW+F  TGA+EG  + K  KL  LS+Q L+DCS  
Sbjct: 115 ALPNTVNWTQMGYVTEVEEQKQCNSCWAFSATGALEGQTFKKTGKLVSLSKQQLVDCSKK 174

Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           +GNNGC GG    +++++ ++ GL T++ Y PY  +D  C     T   T TG V +   
Sbjct: 175 FGNNGCKGGLMNWAFEYVKENGGLHTEESY-PYEAKDGSCRDNLGTVGVTCTGHVQINSE 233

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
            E+AL+ A+A  GP+SVAIDA+  SF  Y +G+Y +  C  S   ++H VLAVGYG  DG
Sbjct: 234 DENALQEAVATIGPISVAIDANHTSFQLYESGLYDEPDC--SCTDMNHGVLAVGYGTDDG 291

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW +KNSW   WG++GY+ MS  K+N CG+ TA +Y
Sbjct: 292 KDYWLIKNSWGINWGDKGYIKMSRNKNNQCGIATAASY 329


>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
          Length = 327

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/222 (43%), Positives = 140/222 (63%), Gaps = 7/222 (3%)

Query: 134 KDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           ++   + +  +GY   + +Q  CGSCW+F TTG++EG ++ K K L  LS+Q L+DCS+ 
Sbjct: 109 ENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLEGQHFAKTKNLVSLSEQQLMDCSFK 168

Query: 193 YGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
            G+ GC GG    ++ +I +  G+ ++ DY PY  ++ +C   N++  AT+TG V+VT  
Sbjct: 169 EGDEGCGGGIMDYAFDYIFLAGGVESEADY-PYEARNDHCRFDNSSIAATLTGCVDVTSG 227

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           SE  L+ A+   GPVSVAIDAS  SF  Y +GV Y+  C+ +   LDH VLAVGYG  +G
Sbjct: 228 SETQLEKAVGSIGPVSVAIDASHISFQLYGSGVNYEPMCSTTT--LDHGVLAVGYGADNG 285

Query: 312 KPYWQVKNSWSTYWGN-QGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSW   WG+  GY+ MS  ++NNCG+ T  +Y T+
Sbjct: 286 NEYWIVKNSWGEGWGHLNGYIKMSKNRNNNCGIATQASYPTV 327


>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
          Length = 331

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 93/197 (47%), Positives = 126/197 (63%), Gaps = 6/197 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  +G++EG ++ K  KL  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 138 DQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYI 197

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PYL +D  CH     + AT  GFV++   +ED LK A+A  GP+S+A
Sbjct: 198 KDNGGIDTEQSY-PYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIA 256

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS ++F  Y +GVY D +C +    LDH VL VGYG   DG+ YW VKNSW    G  
Sbjct: 257 IDASYETFQLYSDGVYSDPECISQE--LDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLN 314

Query: 329 GYVLMSI-KDNNCGVMT 344
           GY+ M+  +DN CGV +
Sbjct: 315 GYIKMARNQDNMCGVAS 331


>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
           purpuratus]
 gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 334

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 98/214 (45%), Positives = 132/214 (61%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F T GA+EG +++K   L  LS+Q L+DCS   GN+G
Sbjct: 121 VDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDG 180

Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    + ++I  +G + T+  Y PY G D  CH   +   AT+TGF  V  +SE AL
Sbjct: 181 CNGGWPAWADEYIKSNGGIDTEVGY-PYEGVDDSCHYRTSDVGATITGFAEVEADSEKAL 239

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
           + ALA+ GP+SV IDA+Q SF  Y +GVY +  C+++   LDH V AVGY    DG  Y+
Sbjct: 240 EKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSST--ALDHCVTAVGYDSTADGDKYY 297

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW T WG +GY+ MS  K   CG+ T  TY
Sbjct: 298 IVKNSWGTTWGQEGYIWMSRDKQKQCGIATNATY 331


>gi|86279347|gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 328

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 133/218 (61%), Gaps = 7/218 (3%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + DQ  CGSCWSF TTGAVEG   ++  +L  LS+Q LIDCS  YGN
Sbjct: 116 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 175

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GCDGG    ++ +I  +G+ ++  Y PY  Q  YC   ++ +  T++G+ ++    E++
Sbjct: 176 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 234

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+ + GPV+VAIDA+ +   FY  G++YD+ CN S   L+H VL VGYG  +G+ YW
Sbjct: 235 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 291

Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
            +KNSW + WG  GY    +++  NNCG+ TA +Y  +
Sbjct: 292 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 328


>gi|186688053|gb|ACC86112.1| cathepsin K [Paralichthys olivaceus]
          Length = 330

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 105/250 (42%), Positives = 140/250 (56%), Gaps = 18/250 (7%)

Query: 105 IILEKWRLVT-SEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGT 163
           + LE+ R  T +  E+VSK   +V Y K      PV+          +Q  CGSCW+F +
Sbjct: 96  VPLERQRSFTMALDERVSKLPKFVDYRKEGM-VTPVK----------NQGSCGSCWAFSS 144

Query: 164 TGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGP 223
            GA+EG    K  +L  LS Q  +DC     NNGC GG    ++Q++ ++G    ++  P
Sbjct: 145 AGALEGQLAKKTGQLMDLSPQNPVDCV--TENNGCGGGYMTNAFQYVQENGGIDSEEAYP 202

Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
           Y+G+D  C   ++   A   G+  V    E AL +AL K GPVSV IDASQ SF FY  G
Sbjct: 203 YVGEDQSCRYNSSGMAAQCKGYKEVPVGDEHALAVALFKVGPVSVGIDASQSSFQFYQRG 262

Query: 284 VYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCG 341
           VYYD  CN   D ++HAVLAVGYG    GK YW +KNSWS  WG +GY+LM+  +DN CG
Sbjct: 263 VYYDRNCNK--DDINHAVLAVGYGISSKGKKYWIIKNSWSENWGKKGYILMARNRDNLCG 320

Query: 342 VMTAPTYVTM 351
           +    +Y  M
Sbjct: 321 IANLASYPIM 330


>gi|86279345|gb|ABC88768.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 328

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 130/217 (59%), Gaps = 5/217 (2%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + DQ  CGSCWSF TTGAVEG   ++   L  LS+Q LIDCS  YGN
Sbjct: 116 AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAVEGQLALQRGGLTSLSEQNLIDCSSSYGN 175

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GCDGG    ++ +I  +G+ ++  Y PY  QD YC   ++ +  T++G+ ++    E++
Sbjct: 176 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQDDYCRFDSSQSVTTLSGYYDLPSGDENS 234

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+ + GPV+VAIDA+ +   FY  G++YD+ CN S   L+H V  VGYG  +G+ YW
Sbjct: 235 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVFVVGYGSDNGQDYW 291

Query: 316 QVKNSWSTYWGNQGY-VLMSIKDNNCGVMTAPTYVTM 351
            +KNSW + WG  GY   +    NNCG+ TA +Y  +
Sbjct: 292 ILKNSWGSGWGENGYWTQVRNYGNNCGIATAASYPAL 328


>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 330

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 97/222 (43%), Positives = 129/222 (58%), Gaps = 7/222 (3%)

Query: 133 SKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           SK    V Y  KG   S+ +Q  CGSCW+F   GA+EG       KL  LS Q L+DCS 
Sbjct: 113 SKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSG 172

Query: 192 GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
            YGN+GC+GG   R++Q+++  HG+ +   Y PY G+D  C     T  A  + +  +  
Sbjct: 173 KYGNHGCNGGFMTRAFQYVIDNHGIDSDASY-PYTGRDEQCRYNPATRAANCSSYQFLPE 231

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD 310
             E+ALK ALA  GP+SVAIDA +  FSFY +GVY D  C      ++H VLAVGYG L+
Sbjct: 232 GDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQE---VNHGVLAVGYGSLN 288

Query: 311 GKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
           G+ YW VKNSW + +G+QGY+ M+    N CG+     Y  M
Sbjct: 289 GQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPVM 330


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 126/203 (62%), Gaps = 7/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T+GA+EG +++K+ +L  LS+Q ++DCSW   + GC+GG+   + +++
Sbjct: 272 DQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDCSWL--DFGCNGGQPMLAMEYV 329

Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               GL  +  Y PY G    CH    +A A +TGF      SE AL+ A+AK GP+SV 
Sbjct: 330 RFNGGLELETAY-PYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVG 388

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +DAS + F  Y +G+Y  E C  S  GLDHAVLAVGYG  D   YW VKNSW+T WG +G
Sbjct: 389 MDASGEDFQHYKSGIYNPESC--SSIGLDHAVLAVGYGTSDDGDYWLVKNSWNTSWGEKG 446

Query: 330 YV-LMSIKDNNCGVMTAPTYVTM 351
           Y  L   K N CG+ T P Y T+
Sbjct: 447 YFKLPRNKGNKCGIATTPIYPTV 469


>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 333

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 93/213 (43%), Positives = 131/213 (61%), Gaps = 7/213 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG ++ K  +L  LS+Q L+DCS  +GN G
Sbjct: 122 VDWRKKGYVTKVKNQQQCGSCWAFSATGALEGQHFKKTGRLVYLSEQQLVDCSRNFGNRG 181

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG    ++++I  + G+ T+  Y PY   D  CH    +  A   G+V+V+P+ E+AL
Sbjct: 182 CDGGWMNNAFKYIKDNGGIQTEASY-PYQAMDGLCHYNPNSVGAICNGYVDVSPD-EEAL 239

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GP+S+A+DAS +SF  Y +GVY + +CN+    L H +L VGYG   G  YW 
Sbjct: 240 KEAVATIGPISIAMDASHESFQLYQSGVYDEHRCNDYY--LSHGMLVVGYGTEGGLDYWL 297

Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTY 348
           +KNSW   WG  GY+ M   K N CG+ TA +Y
Sbjct: 298 IKNSWGLGWGKMGYIKMVRNKRNQCGIATAASY 330


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 122/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEEAY-PYTGKDGTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   KSF  Y +GVY D  C N+P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVV-KSFRLYKSGVYTDSHCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354


>gi|449681105|ref|XP_002158608.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 339

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 91/213 (42%), Positives = 131/213 (61%), Gaps = 3/213 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  CGSCW+F +TGA+EG  + K   L  +S+Q L+DC+  YGN  
Sbjct: 124 VDWRTEGYVTRVKNQGQCGSCWAFSSTGALEGQTFRKTGVLQEISEQNLVDCTQSYGNEA 183

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I  + G+ ++  Y  Y     YC+       A+ TGFV++    E+AL
Sbjct: 184 CNGGWMDNAFTYIKDNKGIDSEVGYPYYARALGYCYYNQQYNVASDTGFVDIPSGDENAL 243

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K+A+A  GP+SVAIDA++ SF  Y +GVY +  C N  + LDHAVL VGYG  +G+ +W 
Sbjct: 244 KVAVATVGPISVAIDATKASFMSYQSGVYNEPTCGNGIENLDHAVLVVGYGTEEGRDFWI 303

Query: 317 VKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           VKNSW T WG+QGY+ MS    N CG+ T  +Y
Sbjct: 304 VKNSWDTTWGDQGYIKMSRNMSNQCGIATKASY 336


>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
 gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
          Length = 334

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 131/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V + +KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTLKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++ ++G    ++  PYLG D   C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYVKENGGLDSEESYPYLGTDTDSCKYKPECSAANDTGFVDI-PQREKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHQSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGTNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
 gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
          Length = 334

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVAPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAVDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 93/202 (46%), Positives = 128/202 (63%), Gaps = 6/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTG+ EGA ++K  +L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 131 NQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ T+  Y PY      C      +  ++T + +V+   E+AL  A+A   P SVA
Sbjct: 191 INNKGIDTEASY-PYQTAQYTCQYNPANSGGSLTSYTDVSSGDENALLNAVATE-PTSVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY  GVYY+  C+++   LDH VLAVG+G  DG+ YW VKNSW   WG  G
Sbjct: 249 IDASHNSFQFYSGGVYYESACSSTQ--LDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAG 306

Query: 330 YVLMSI-KDNNCGVMTAPTYVT 350
           Y+ M+  + NNCG+ T+ +Y T
Sbjct: 307 YIKMARNRSNNCGIATSASYPT 328


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 99/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY G D  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVKDNSGLDSEESY-PYEGMDGTCKYKPECSVANDTGFVDI-PGHEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+S AIDA   SF FY +G+YYD  C  S   LDH +L VGYG      +  
Sbjct: 236 LRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDC--SSKDLDHGILVVGYGFEGTNSNAT 293

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW T WG++GYV ++  KDN+CG+ TA +Y T+
Sbjct: 294 KYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPTV 333


>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
          Length = 331

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 94/197 (47%), Positives = 125/197 (63%), Gaps = 6/197 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  +G++EG ++ K  KL  LS+Q L+DCS  YGN GC+GG    ++++I
Sbjct: 138 DQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYI 197

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PYL +D  CH     + AT  GFV++   +ED LK A+A  GPVS+A
Sbjct: 198 KDNGGIDTEQSY-PYLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIA 256

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS ++F  Y +GVY D +C  S   LDH VL VGYG   DG+ YW VKNSW    G  
Sbjct: 257 IDASYETFQLYSDGVYSDPEC--SSQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLN 314

Query: 329 GYVLMSI-KDNNCGVMT 344
           GY+ M+  +DN CGV +
Sbjct: 315 GYIKMARNQDNMCGVAS 331


>gi|1149525|emb|CAA64218.1| preprocathepsin K [Mus musculus]
          Length = 329

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/215 (42%), Positives = 125/215 (58%), Gaps = 6/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--TENYG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    ++Q++ ++G    +D  PY+GQD  C    T   A   G+  +   +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAFPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A+ GP+SV+IDAS  SF FY  GVYYDE C+   D ++HAVL VGYG   G  +W +
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGSKHWII 294

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           KNSW   WGN+GY L++   NN CG+    ++  M
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFPKM 329


>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
          Length = 330

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 96/204 (47%), Positives = 128/204 (62%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW YGN GC+GG    ++Q++
Sbjct: 131 NQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T + Y  Y  QD  C      + A +TGFV V P SED L  A+A  GPVSV 
Sbjct: 191 KENRGLDTGESYA-YEAQDGLCRYNPKYSAANVTGFVKV-PLSEDDLMSAVASVGPVSVG 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           ID+  +SF FY  G+YY+  C+++   +DHAVL VGYG E DG  YW VKNSW   WG  
Sbjct: 249 IDSHHQSFRFYSGGMYYEPDCSSTE--MDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMD 306

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  ++NNCG+ T   Y T+
Sbjct: 307 GYIKMAKDQNNNCGIATYAIYPTV 330


>gi|348531521|ref|XP_003453257.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 333

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 99/228 (43%), Positives = 140/228 (61%), Gaps = 7/228 (3%)

Query: 124 SLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           S ++R  + +     V +  KGY   + +Q  CGSCW+F   GA+EG  + K  KL  LS
Sbjct: 107 STFLRLPEGTDLPDTVDWRDKGYVTDVQNQMQCGSCWAFSAIGALEGQNFRKTGKLVSLS 166

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS  +GN+GC+GG    ++++I    G+ T+  Y PY  ++  CH    T  AT
Sbjct: 167 KQQLVDCSQSFGNHGCNGGWMDWAFKYIQATGGIDTEASY-PYEAEEGNCHYNPETVGAT 225

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TG+V+V+PN EDALK A+A  GP+S+A+DAS +SF FY +GVY +  C  S     HA+
Sbjct: 226 CTGYVDVSPN-EDALKEAVATIGPISIAMDASHESFQFYQSGVYDEPSCITSR--FSHAM 282

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           LAVGYG  +G  YW VKNS+   WG +GY+ MS  K N CG+ +  +Y
Sbjct: 283 LAVGYGTENGHDYWLVKNSFGLGWGEKGYIKMSRNKSNQCGIASKASY 330


>gi|340370384|ref|XP_003383726.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
          Length = 337

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 90/228 (39%), Positives = 134/228 (58%), Gaps = 8/228 (3%)

Query: 129 YNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           Y++     +P   + +  N++    DQ  CGSC++F   GA+EGA  + H KL  LS+Q 
Sbjct: 113 YSRPDDPNLPEEVDWRTKNAVTGVKDQGQCGSCYAFSAVGALEGAQALAHDKLVHLSEQN 172

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           ++DCS  YGN GC+GG  + S+++I+ + G+  +D Y  Y G+   C            G
Sbjct: 173 IVDCSIPYGNKGCNGGNMYESFRYIIDNDGIDREDGY-KYTGRQGQCKFDRKAIGGRQVG 231

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
            +++   SE  L+ ALA  GPVSVAID S  +F FY  GV+ +  C+ +   L HA L +
Sbjct: 232 IIHIPTGSEAELQSALATAGPVSVAIDGSSNAFRFYEKGVFDEPNCSTTK--LTHAGLII 289

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           GYG+  GKPYW VKNSW  +WG +GY++M+  K N CG+ TA ++ T+
Sbjct: 290 GYGKKKGKPYWLVKNSWGPHWGMKGYIMMARNKANQCGIATAASFPTL 337


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 96/225 (42%), Positives = 135/225 (60%), Gaps = 8/225 (3%)

Query: 131 KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A    IP  ++ +   ++    +Q  CGSCWSF TTG+ EGA ++K  +L  LS+Q LI
Sbjct: 108 EAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLI 167

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  YGNNGC+GG    ++++I+ + G+ T+  Y         C         ++TG+ 
Sbjct: 168 DCSVSYGNNGCNGGLMDYAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYT 227

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           +VT   E+AL  A  K  PVSVAIDAS  SF FY  GVYY+  C+++   LDH VL VG+
Sbjct: 228 DVTSGDENALLNAAVKE-PVSVAIDASHNSFQFYSGGVYYESACSSTQ--LDHGVLVVGW 284

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           G  +G+ +W VKNSW   WG  GY+ MS  ++NNCG+ TA +Y T
Sbjct: 285 GSENGQDFWWVKNSWGASWGLNGYIKMSRNQNNNCGIATAASYPT 329


>gi|156354996|ref|XP_001623463.1| predicted protein [Nematostella vectensis]
 gi|156210166|gb|EDO31363.1| predicted protein [Nematostella vectensis]
          Length = 312

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 124/203 (61%), Gaps = 11/203 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 109 VPDQLDWRDYGAVNPAKGQGTCGSCWPFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 168

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNNGC GG  ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V
Sbjct: 169 PGVYHGNNGCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLTVGARITSYRRV 228

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL  HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 229 KRFNPIALKKALYYHGPATISINANPKSLKFYSDGIISDKHCSNK---TDHAVLLIGYGS 285

Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
            +G PYW ++NSWS  WGN G++
Sbjct: 286 DNGVPYWLIRNSWSHKWGNNGFI 308


>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
          Length = 398

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 91/202 (45%), Positives = 124/202 (61%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG +  K  +L  LS+Q L+DCS  YGNNGC+GG    ++++I
Sbjct: 197 DQGQCGSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYI 256

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
              HG+ T++ Y PY G +   CH       A   G+ ++    E+ALK+A+A  GP+SV
Sbjct: 257 KDNHGIDTEESY-PYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEALKVAVATIGPISV 315

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA   SF  Y  G+Y + +C  SP+ LDH VL VGYG + +   YW VKNSW T WG 
Sbjct: 316 AIDAGHISFQNYRKGIYTENEC--SPEDLDHGVLVVGYGTDENAGDYWIVKNSWGTRWGE 373

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
            GY+ M+  K N CG+ +  +Y
Sbjct: 374 HGYIRMARNKRNQCGIASKASY 395


>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
          Length = 333

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 94/216 (43%), Positives = 129/216 (59%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  C  CW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G
Sbjct: 118 VDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++Q++  +G    ++  PYL ++  C      + A +T F  +  N ED L 
Sbjct: 178 CNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPIL-NEEDGLM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
             +A  GPVS A+D+S +SF FY  G+YYD KC+N    L+H VL VGYG    E D K 
Sbjct: 237 TTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKL--LNHGVLVVGYGFEGAESDNKK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNSW T WG QGY+L++  +DN+CG+ T  +Y
Sbjct: 295 YWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASY 330


>gi|340374922|ref|XP_003385986.1| PREDICTED: cathepsin L-like [Amphimedon queenslandica]
          Length = 325

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 96/235 (40%), Positives = 141/235 (60%), Gaps = 4/235 (1%)

Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKH 175
           EG K +  +  +  N A  D      +     S+ +Q  CGSCW+F  TGA+E  +++  
Sbjct: 91  EGFKRNMMNESLHANNAKADTAVDWRDRGAVTSVKNQGQCGSCWAFSATGALEAMHFLNK 150

Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIAN 235
             L  LS+Q L+DCS+ YGN GC GG   R++ ++   G    ++  PY+G    C  ++
Sbjct: 151 GTLVSLSEQQLVDCSYRYGNAGCQGGLMDRAFAYVRNKGYICSEESYPYVGYMWACKSSS 210

Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
            +  A+  G+ N+    E++L  A++  G VSVAIDAS+ SF FY +GV+YD  C++S  
Sbjct: 211 CSPAASCRGYKNIASGDEESLTNAVSSVGTVSVAIDASRYSFQFYSSGVFYDSDCSSSR- 269

Query: 296 GLDHAVLAVGYGE-LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            L+H VL VGYG   DG  Y+ VKNSWS+ WG++GYVLM+  KDNNCG+ +A +Y
Sbjct: 270 -LNHGVLVVGYGTYYDGTEYYIVKNSWSSDWGDKGYVLMARNKDNNCGIASAASY 323


>gi|313235898|emb|CBY11285.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 106/244 (43%), Positives = 140/244 (57%), Gaps = 25/244 (10%)

Query: 126 WVRYN------KASKDAIP------------VRYEMKGY-NSLLDQSVCGSCWSFGTTGA 166
           WV YN      KA+ D  P            V +  +GY   + DQ  CGSCW+F TTG+
Sbjct: 84  WVEYNRLRNGAKANSDCKPMPSSSAQANPDAVDWRPQGYVTPIKDQGQCGSCWAFSTTGS 143

Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG-EDFRSYQWIMKHGLPTQDDYGPYL 225
            EGA++ K  KL +LS+Q L+DCS   G++GC+GG  DF     I   G+ T+  Y PY 
Sbjct: 144 TEGAHFKKTGKLVMLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTESAY-PYK 202

Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
            QD  C  +  TA AT++   +V   SE  L+ A+A  GP+SVAIDA   SF  Y  G+Y
Sbjct: 203 AQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRLYKQGIY 261

Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
           +D  C+++   LDH VLAVGY       YW VKNSW+T WGN+GY+ M+  K N CG+ T
Sbjct: 262 HDRLCSSTR--LDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTCGIAT 319

Query: 345 APTY 348
           A +Y
Sbjct: 320 AASY 323


>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
           Irreversible Vinyl Sulfone Inhibitor
 gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
           Irreversible Vinyl Sulfone Inhibitor
          Length = 221

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 5   VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 65  CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVAQDTGFTVVAPGKEKAL 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 124 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNS 181

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 182 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218


>gi|254746344|emb|CAX16637.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 541

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 122/200 (61%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           +Q  CGSCW+F +  AVE    +K+    + LS+Q+L+DC+WG+   GC+G      +++
Sbjct: 343 NQGNCGSCWAFSSVAAVEATLALKNGGRNLELSEQSLVDCAWGFEAMGCNGASPDSGFKY 402

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I++HG+PT  +YGPYL ++ +C   N +    +TGF  VTP + +  K+ L ++GPV VA
Sbjct: 403 ILEHGVPTDMEYGPYLEKNGFCEARNMSKLYHITGFGRVTPRNPEITKVVLNRYGPVLVA 462

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           I A   S   Y +GV+YD  C NS    +H V  VGYG  DG+ YW VKNSW   WG  G
Sbjct: 463 IHAGN-SMKLYSSGVFYDVTCENSTP--NHGVALVGYGVRDGELYWMVKNSWGEDWGEDG 519

Query: 330 YVLMSIKDNNCGVMTAPTYV 349
           Y+LMS  +NNC +M +  Y 
Sbjct: 520 YILMSATNNNCLLMDSGFYA 539


>gi|313241067|emb|CBY33367.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 106/244 (43%), Positives = 139/244 (56%), Gaps = 25/244 (10%)

Query: 126 WVRYN------KASKDAIP------------VRYEMKGY-NSLLDQSVCGSCWSFGTTGA 166
           WV YN      KA+ D  P            V +  +GY   + DQ  CGSCW+F TTG+
Sbjct: 84  WVEYNRLRNGAKANSDCKPMPSSSAEANPDSVDWRNEGYVTPIKDQGQCGSCWAFSTTGS 143

Query: 167 VEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG-EDFRSYQWIMKHGLPTQDDYGPYL 225
            EGA++ K  KL  LS+Q L+DCS   G++GC+GG  DF     I   G+ T+  Y PY 
Sbjct: 144 TEGAHFKKTGKLVTLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTESAY-PYK 202

Query: 226 GQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVY 285
            QD  C  +  TA AT++   +V   SE  L+ A+A  GP+SVAIDA   SF  Y  G+Y
Sbjct: 203 AQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRLYKQGIY 261

Query: 286 YDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
           +D  C+++   LDH VLAVGY       YW VKNSW+T WGN+GY+ M+  K N CG+ T
Sbjct: 262 HDRLCSSTR--LDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTCGIAT 319

Query: 345 APTY 348
           A +Y
Sbjct: 320 AASY 323


>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
 gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
          Length = 336

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 96/251 (38%), Positives = 138/251 (54%), Gaps = 14/251 (5%)

Query: 102 VNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
           + GI+L K R          K ++ V   +   D++  R E     S+ +Q  CGSCWSF
Sbjct: 99  LRGIVLTKLR---------RKEAVSVPLKENLPDSVNWR-ERGAVTSVKNQGQCGSCWSF 148

Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDY 221
              GA+EGA  +K   L  LS+Q L+DCSW YGN GC+GG   +++Q+  ++G+  + DY
Sbjct: 149 SANGAIEGAIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRYGVEAEVDY 208

Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
             Y  +D  C        A +TG+  +    E  L+ A+A  GP+SV IDA+   F  Y 
Sbjct: 209 -RYTERDGVCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYS 267

Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-C 340
           +GV+  + C  SP  +DH VL VGYG  +G  YW VKNSW + WG  GY+ M+   NN C
Sbjct: 268 HGVFVSKTC--SPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSWGEDGYLKMARNRNNMC 325

Query: 341 GVMTAPTYVTM 351
           G+ +  +Y T+
Sbjct: 326 GIASMASYPTV 336


>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
 gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
 gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
 gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
 gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
 gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
 gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
          Length = 329

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 92/215 (42%), Positives = 125/215 (58%), Gaps = 6/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--TENYG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    ++Q++ ++G    +D  PY+GQD  C    T   A   G+  +   +E ALK
Sbjct: 177 CGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALK 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A+ GP+SV+IDAS  SF FY  GVYYDE C+   D ++HAVL VGYG   G  +W +
Sbjct: 237 RAVARVGPISVSIDASLASFQFYSRGVYYDENCDR--DNVNHAVLVVGYGTQKGSKHWII 294

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           KNSW   WGN+GY L++   NN CG+    ++  M
Sbjct: 295 KNSWGESWGNKGYALLARNKNNACGITNMASFPKM 329


>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
          Length = 353

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 153 DQGNCGSCWTFSTTGALEAAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 212

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C          +   +N+T  +ED LK A+A   PVSVA
Sbjct: 213 KYNGGLDTEEAY-PYTGKDGVCKFTAKNVAVRVIDSINITLGAEDELKQAVAFVRPVSVA 271

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + + K F FY NGVY    C ++P  ++HAVLAVGYG  DG PYW +KNSW + WG+ G
Sbjct: 272 FEVA-KDFRFYNNGVYTSTICGSTPMDVNHAVLAVGYGVEDGVPYWIIKNSWGSNWGDNG 330

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 331 YFKMELGKNMCGVATCASY 349


>gi|348545637|ref|XP_003460286.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 97/218 (44%), Positives = 133/218 (61%), Gaps = 8/218 (3%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P  V +  KGY   + DQ  CGSCW+F  TGA+EG ++ K   L  LS+Q L+DCS  
Sbjct: 117 ALPNTVDWRDKGYVTEVKDQKQCGSCWAFSATGALEGQHFRKTGTLVSLSEQQLVDCSSN 176

Query: 193 YGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           +GN+GC GG    ++++I    G+ T++ Y PY  ++  C     +  AT +G++ V   
Sbjct: 177 FGNSGCMGGWMDFAFKYIKYNRGIDTEEFY-PYEAKNGLCRYKRDSIGATCSGYIIVKRF 235

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
            E ALK A+A  GP+SV IDAS+ SF  Y +GVYYD+ C +    L+HAVLAVGYG  +G
Sbjct: 236 EEQALKEAVATVGPISVTIDASRPSFQLYESGVYYDDGCGSIF--LNHAVLAVGYGTENG 293

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
             YW VKNSW   WG +GY+ MS  K N CG+ +   Y
Sbjct: 294 HDYWLVKNSWGLGWGEKGYIRMSRNKKNQCGIASVARY 331


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 95/202 (47%), Positives = 126/202 (62%), Gaps = 6/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTG+ EGA ++K   L  LS+Q LIDCS  YGNNGC+GG    ++++I
Sbjct: 132 NQGQCGSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI 191

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ T+  Y PY      C      +  ++T + +V+   E+AL  A+A   P SVA
Sbjct: 192 INNKGIDTEASY-PYETAQYNCRYNPANSGGSLTSYTDVSSGDENALLNAVAIE-PTSVA 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY  GVYY+  C  S   LDH VLAVG+G  +G+ YW VKNSW   WG QG
Sbjct: 250 IDASHNSFQFYSGGVYYESSC--SSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQG 307

Query: 330 YVLMSI-KDNNCGVMTAPTYVT 350
           Y+ M+  + NNCG+ TA +Y T
Sbjct: 308 YIKMARNRHNNCGIATAASYPT 329


>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
 gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
          Length = 308

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 129/201 (64%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TG++EG +++K   L  LS+Q L+DCS   GN GC GG   +++++I
Sbjct: 108 NQEQCGSCWAFSATGSLEGQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYI 167

Query: 211 -MKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
            M  G+ T++ Y  Y G+D + C   ++ + AT++ + ++    E AL  A++  GP+SV
Sbjct: 168 KMNGGIDTEECY-SYRGRDESMCRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISV 226

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AIDA  KSF  Y +GVY + KC+++   LDH VLAVGYG  +G  YW VKNSW T WG +
Sbjct: 227 AIDAGHKSFQLYHHGVYDEPKCSSTH--LDHGVLAVGYGSSNGSDYWLVKNSWGTEWGME 284

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY++MS  K N CG+ T   Y
Sbjct: 285 GYIMMSRNKHNQCGIATRAIY 305


>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
          Length = 384

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 130/214 (60%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG Y+ K+ KL  LS+  L+DCS  +GN G
Sbjct: 170 VDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNEG 229

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++    G+ ++ DY PY  +   C    T   AT++G V+V   SE +L
Sbjct: 230 CNGGFMENAFKYVKSVGGIESESDY-PYKARQRTCAFDKTKVIATVSGCVDVESGSESSL 288

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
           K  +++ GPVSVAIDA   SF  Y  GVY +  C+ S   L+H VL VGYG  L GK YW
Sbjct: 289 KEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEPLCSTSR--LNHGVLCVGYGTSLQGKDYW 346

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG +GY+ MS  K+N CG+ +  +Y
Sbjct: 347 IVKNSWGVRWGVEGYIKMSRNKNNQCGIASEASY 380


>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
          Length = 334

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 127/200 (63%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF +TGA+EG  + K  +L  LS+Q L+DCS  YGN GC+GG    ++++I
Sbjct: 135 NQGSCGSCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYI 194

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + K G+ T+D Y PY GQ   C        AT TG+ ++   +E ALK A+A  GPVSVA
Sbjct: 195 VNKGGIHTEDSY-PYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           I AS +SF  Y +GVY +  C  S   LDHAVL VGYG   G+ YW VKNSW   WG+QG
Sbjct: 254 IHASDQSFQLYHSGVYNNPYC--SGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQG 311

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ MS  + N CG+ +A ++
Sbjct: 312 YIKMSRNRYNQCGIASAASF 331


>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
 gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
          Length = 343

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 126/210 (60%), Gaps = 4/210 (1%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E+     + +Q  C  CW+F   GA+EG  + K  +L  LS Q LIDCS  YGN+GC GG
Sbjct: 132 EVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRKTGRLVELSPQNLIDCSTNYGNDGCSGG 191

Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
               +Y+++  + G+ T+D Y PY  ++  C     T  A  TG+V++    E  L+ A+
Sbjct: 192 LMNPAYEYVRTNPGIDTEDSY-PYEARNGPCRFRPETVGAYCTGYVDIAEGDEQGLEAAI 250

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKN 319
           A  GPVS A+DA ++SF FY +G+YYD +C N PD ++HAVL VGYG E +G+ YW VKN
Sbjct: 251 ATLGPVSAAMDAGRQSFQFYSDGIYYDPQCGNRPDDVNHAVLVVGYGTEPNGQKYWLVKN 310

Query: 320 SWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           S+   WG  GYV ++   NN CG+    +Y
Sbjct: 311 SYGPQWGIGGYVKLAKDANNHCGIAIQASY 340


>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
          Length = 334

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
          Length = 333

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/225 (44%), Positives = 137/225 (60%), Gaps = 13/225 (5%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           +IP  V +  KGY   + +Q  CGSCW+F TTGA+EG  + K  KL  LS+Q L+DCS  
Sbjct: 113 SIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQP 172

Query: 193 YGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
            GN GC GG    ++Q+++   GL +++ Y PY G    C+     + A  TGFV++ P 
Sbjct: 173 EGNRGCHGGLMDNAFQYVLDVGGLDSEESY-PYTGLVGTCNYNPKNSAANETGFVDL-PK 230

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---- 307
            E+AL  A+A  GP+SVA+DAS  SF FY +G+YY+ KC +  + +DH VL VGYG    
Sbjct: 231 QENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKS--ESVDHGVLVVGYGFEGA 288

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           + D   YW VKNSW  +WG  GY+ M+   NN CG+ T  +Y T+
Sbjct: 289 DSDDNKYWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPTV 333


>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
 gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
 gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
           Full=Cathepsin V; Flags: Precursor
 gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
 gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
 gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
 gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
 gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
 gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
 gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
 gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
          Length = 334

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
          Length = 329

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 131/216 (60%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+G
Sbjct: 119 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDG 176

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E AL
Sbjct: 177 CGGGYMTNAFQYVQENRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPVGNEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A+ GPVSVAIDAS  SF FY  GVYYDE CN   + L+HA+LAVGYG   G  +W 
Sbjct: 236 KRAVARVGPVSVAIDASLSSFQFYSKGVYYDESCNG--EDLNHALLAVGYGMQRGNKHWI 293

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN+GYVL++   NN CG+    ++  M
Sbjct: 294 LKNSWGENWGNKGYVLLARNKNNACGIANLASFPKM 329


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 159/286 (55%), Gaps = 17/286 (5%)

Query: 70  FKVNIYR--LFFLRPRFHENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWV 127
           ++ N++R    ++    H+N+         G+ L N    + ++ ++   ++  + S   
Sbjct: 48  YRWNVWRENYLYIEAHNHQNKSFHLAMNKFGD-LTNAEFNKLFKGLSITADQAKQES--- 103

Query: 128 RYNKASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
             + A    +P  ++ +   ++    +Q  CGSCWSF TTG+ EGA ++KH +L  LS+Q
Sbjct: 104 --DIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQ 161

Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMT 243
            L+DCS  YGN+GC+GG    ++++I+++ G+ T++ Y PY      C      +   + 
Sbjct: 162 NLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESY-PYHASQGTCRYNKQHSGGELV 220

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
            + NV   +E AL  A+A   P SVAIDAS  SF FY  GVY +  C++S   LDH VLA
Sbjct: 221 SYTNVPSGNEGALLNAVATQ-PTSVAIDASHSSFQFYKGGVYDEPACSSSR--LDHGVLA 277

Query: 304 VGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VG+G  DGK YW VKNSW   WG  GY+ MS  K N CG+ TA ++
Sbjct: 278 VGWGVRDGKDYWLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASH 323


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 99/220 (45%), Positives = 133/220 (60%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG    ++Q+I ++ GL +++ Y PY   D  C      + A  TGFV++ P  E AL
Sbjct: 178 CDGGLMDYAFQYIKENGGLDSEESY-PYDAMDESCKYRPEYSVANDTGFVDI-PKEEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  GVY++ +C  S D +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGVYFEPEC--SSDNVDHGVLVVGYGYEETESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            +W VKNSW   WG  GY+ M+  + N+CG+ TA +Y T+
Sbjct: 294 KFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPTV 333


>gi|215259621|gb|ACJ64302.1| cathepsin l [Culex tarsalis]
          Length = 125

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 80/123 (65%), Positives = 97/123 (78%)

Query: 227 QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYY 286
           QDAYC + N T  A +TG+VNVT    DA+K+AL KHGP+S+AIDA  KSFS+Y NGVYY
Sbjct: 1   QDAYCRLENKTLYAPITGWVNVTSGDTDAMKVALFKHGPLSIAIDAGHKSFSYYANGVYY 60

Query: 287 DEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
           + +C N+ DGLDHAVLAVGYG+L+G+ YW +KNSWS  WGN GY LMS+KDNNCG+ T  
Sbjct: 61  EPECKNNLDGLDHAVLAVGYGKLNGEDYWLIKNSWSNLWGNDGYALMSMKDNNCGLATDA 120

Query: 347 TYV 349
           TYV
Sbjct: 121 TYV 123


>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
 gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
          Length = 327

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 125/200 (62%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG +++K   L  LS+Q L+DCS   GN GC GG   +++++I
Sbjct: 127 NQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMDQAFKYI 186

Query: 211 MKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             +G    ++  PY G++   C   ++ + AT++ +V++    EDAL  A A  GP+SV 
Sbjct: 187 KTNGGIDTEECYPYKGKNERKCEYKSSCSGATLSSYVDIKTGDEDALMQASATIGPISVG 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVY++++C  S   LDH VL VGYG    K YW VKNSW   WG +G
Sbjct: 247 IDASHPSFQLYDHGVYHEKRC--SSKKLDHGVLVVGYGTDGEKDYWLVKNSWGEEWGMEG 304

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ MS  KDN CG+ T  +Y
Sbjct: 305 YIKMSRNKDNQCGIATQASY 324


>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
          Length = 334

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 126/216 (58%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG   +++Q++ ++G    ++  PY+  D  C      + A  TGF  + P  E AL 
Sbjct: 178 CNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFTVILPGKEKALM 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D   
Sbjct: 238 KAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNSK 295

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 296 YWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|156354979|ref|XP_001623456.1| predicted protein [Nematostella vectensis]
 gi|156210156|gb|EDO31356.1| predicted protein [Nematostella vectensis]
          Length = 310

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 124/203 (61%), Gaps = 11/203 (5%)

Query: 137 IPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG- 192
           +P + + + Y   N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W  
Sbjct: 107 VPDQLDWRNYGAVNPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWST 166

Query: 193 ----YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
               +GNN C GG  ++++ W+ K GL T   YG Y GQ+ +C  +N    A +T +  V
Sbjct: 167 PGVYHGNNCCLGGWTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNLIVGARITSYRRV 226

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +  ALK AL+ HGP +++I+A+ KS  FY +G+  D+ C+N     DHAVL +GYG 
Sbjct: 227 KRFNPIALKKALSYHGPATISINANPKSLKFYSDGIMSDKHCSNK---TDHAVLLIGYGS 283

Query: 309 LDGKPYWQVKNSWSTYWGNQGYV 331
            +G PYW +KNSWS  WGN G++
Sbjct: 284 DNGVPYWLIKNSWSHKWGNNGFI 306


>gi|156394055|ref|XP_001636642.1| predicted protein [Nematostella vectensis]
 gi|156223747|gb|EDO44579.1| predicted protein [Nematostella vectensis]
          Length = 200

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 121/195 (62%), Gaps = 5/195 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   G +EGAY +K  KL   S+Q L+DCSWG+GNNGC GG  +R   W  
Sbjct: 9   QGRCGSCWTFSAMGPIEGAYALKTGKLKDFSEQQLLDCSWGFGNNGCRGGFSWRGLVWAR 68

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATM--TGFVNVTPNSEDALKLALAKHGPVSVA 269
           K G+ +   YG YL Q+ YCH A T    T+     + V   ++ AL+  LA +GP S+A
Sbjct: 69  KFGVTSSRTYGRYLAQEGYCHCAKTDKCDTVHFERVMTVPKYNQTALRRGLALYGPASIA 128

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           I+ ++KS  FY  GVY D +C   P   +H V+ VGYG  +G+PYW VKNSW  +WG QG
Sbjct: 129 INVNRKSLKFYSGGVYDDPQC---PPNTNHGVVVVGYGVEEGQPYWIVKNSWGKFWGEQG 185

Query: 330 YVLMSIKDNNCGVMT 344
           ++ +S K N CGV++
Sbjct: 186 FIRISRKKNQCGVVS 200


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG  ++K  KL  LS+Q L+DCS  YGNNGC+GG   +++Q++
Sbjct: 131 DQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  ++  C         T  G V++    E AL+ ALA  GP+SVA
Sbjct: 191 SDNKGIDTEASY-PYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVA 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA+  SF FY  GVY +  C  S   LDH VLAVGYG  +G+ YW VKNSW   WG  G
Sbjct: 250 IDANHGSFQFYSKGVYNEPNC--SSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENG 307

Query: 330 YVLMSIK-DNNCGVMTAPTY 348
           Y+ ++    N+CG+ +  +Y
Sbjct: 308 YIKIARNHSNHCGIASMASY 327


>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
          Length = 345

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 126/216 (58%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 129 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 188

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG   +++Q++ ++G    ++  PY+  D  C      + A  TGF  + P  E AL 
Sbjct: 189 CNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTGFTVILPGKEKALM 248

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D   
Sbjct: 249 KAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNSK 306

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 307 YWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 342


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 124/200 (62%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TG++EG  ++K  +L  LS+Q L+DCS  YGN+GC+GG   +++Q++
Sbjct: 117 DQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYV 176

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY  ++  C         T  G+V++   SE  L+ A+A  GP+SV 
Sbjct: 177 RDNKGIDTEASY-PYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVR 235

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GVY ++ C  SP  LDH VL VGYG  +G+ YW VKNSW   WG  G
Sbjct: 236 IDASHESFQFYSEGVYKEQYC--SPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESG 293

Query: 330 YVLMSIK-DNNCGVMTAPTY 348
           Y+ ++    N+CG+ +  +Y
Sbjct: 294 YIKIARNHKNHCGIASMASY 313


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/214 (42%), Positives = 129/214 (60%), Gaps = 12/214 (5%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E  G   + +Q +CGSCW+F TTGA+EGA ++  K+L  +S+Q L+DC    G+ GC+GG
Sbjct: 124 EQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN-GDMGCNGG 182

Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
               +++W+  H GL  ++DY PY  ++  C +        +T F +V  N E ALK A+
Sbjct: 183 LMDNAFKWVKTHKGLCKEEDY-PYHAKEGTCALKKCKPVTKVTAFHDVPANDEQALKAAV 241

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
           AK  PVSVAI+A Q  F FY +GV+ D+ C      LDH VL VGYGE  GK YW+VKNS
Sbjct: 242 AKQ-PVSVAIEADQPEFQFYKSGVF-DKSCGTK---LDHGVLVVGYGEEGGKKYWKVKNS 296

Query: 321 WSTYWGNQGYVLMSI----KDNNCGVMTAPTYVT 350
           W   WG++GY+ ++     +   CGV   P+Y T
Sbjct: 297 WGADWGDKGYIKLAREFGPETGQCGVAMVPSYPT 330


>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
          Length = 334

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 94/217 (43%), Positives = 129/217 (59%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 178 CNGGFMGKAFQYVKENGGLDSEESY-PYVAMDEICKYRPENSVANDTGFTVVPPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY  G+Y++  C  S + LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYNQGIYFEPDC--SSENLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|431897851|gb|ELK06685.1| Cathepsin L1 [Pteropus alecto]
          Length = 331

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 95/216 (43%), Positives = 127/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 116 VDWRQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSQSQGNEG 175

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CDGG    ++Q++  +G    ++  PYL +D  C      + A  +GFV++    E +L 
Sbjct: 176 CDGGLMDNAFQYVKDNGGLDSEESYPYLARDESCKYKPEFSAANDSGFVDIH-KQERSLM 234

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SV IDAS  SF FY  G+YY+ +C  S + L+H VL VGYG    E +   
Sbjct: 235 KAVASVGPISVGIDASYSSFQFYEKGIYYEPEC--SSEDLNHGVLVVGYGFERAESNKNK 292

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           YW VKNSW T WG  GY+ M+   NN CG+ TA +Y
Sbjct: 293 YWIVKNSWGTNWGMNGYINMAKDQNNHCGIATAASY 328


>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
          Length = 329

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 93/215 (43%), Positives = 125/215 (58%), Gaps = 6/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N G
Sbjct: 119 IDYRKKGYVTPVKNQGECGSCWAFSSAGALEGQLKKKTGKLLNLSPQNLVDCV--SENYG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    +++++  +G    +D  PY+GQD  C    T   A   G+  +   SE ALK
Sbjct: 177 CGGGYMTTAFRYVQTNGGIDSEDAYPYVGQDQSCMYNPTAKAAKCRGYREIPVGSEKALK 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A+ GP+SV+IDAS  SF FY  GVYYDE C+   D ++HAVL VGYG   G  +W +
Sbjct: 237 RAVARVGPISVSIDASLTSFQFYSRGVYYDENCDG--DNVNHAVLVVGYGAQKGNKHWII 294

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           KNSW   WGN+GYVL++   NN CG+    ++  M
Sbjct: 295 KNSWGESWGNKGYVLLARNRNNACGITNLASFPKM 329


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 97/214 (45%), Positives = 132/214 (61%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG ++ K  KL  LS+Q L+DCS   GN G
Sbjct: 121 VDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEGNMG 180

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++Q+I+   G+ T+  Y PY   D  CH       AT TG+ +VT  SE AL
Sbjct: 181 CEGGLMDQAFQYILDVGGIDTEMSY-PYTAMDGQCHFNKANIGATDTGYTDVTTGSESAL 239

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
           ++A+A  GP+SVAIDAS +SF  Y +GVY +  C+++   LDH VLAVGYG   DG  Y+
Sbjct: 240 QMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTL--LDHGVLAVGYGTSSDGTDYF 297

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
              +SW   WG  GY+ MS  KDN CG+ T  +Y
Sbjct: 298 FFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASY 331


>gi|345493482|ref|XP_001602523.2| PREDICTED: cathepsin L-like [Nasonia vitripennis]
          Length = 514

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 125/202 (61%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TG++EG ++  +  L  LS+Q L+DCS  +GN+GCDGG    ++ ++
Sbjct: 312 NQGNCGSCWAFSATGSLEGQHFRHNGSLISLSEQNLVDCSGRFGNDGCDGGLMNNAFTYV 371

Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            +  GL ++  Y PY  +D  C      + A   G+VN+   SE  L+ A+A  GP+SVA
Sbjct: 372 KVNRGLDSEKSY-PYEAEDDRCRYNPKNSAADDAGYVNIPTGSESKLQAAVATVGPISVA 430

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           IDA   SF FY +GVYY+  C+ +   LDH VLA+GYG     GK +W VKNSW   WG 
Sbjct: 431 IDADSDSFMFYHSGVYYEPDCSRTD--LDHGVLAIGYGTDSKTGKQFWLVKNSWGEDWGE 488

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           +GY+ MS  + NNCG+ TA +Y
Sbjct: 489 KGYIRMSRNRHNNCGIATAASY 510



 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/195 (44%), Positives = 115/195 (58%), Gaps = 16/195 (8%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF  TGA+EG ++ +  KL  LS+Q LIDCS  YGNNGC+GG    +++
Sbjct: 137 IKDQGHCGSCWSFSATGALEGQHFRQTGKLVSLSEQNLIDCSGKYGNNGCNGGLMDNAFK 196

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I  + GL T+  Y PY  +D  C      + A   GFV++    E+ LK A+A  GPVS
Sbjct: 197 YIRDNKGLDTESTY-PYEAEDDECRYNARNSGAEDVGFVDIPEGDEEKLKAAIATIGPVS 255

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
           VAIDAS ++F FY  GVYY+ +C+++   LDH VL VGYG   DG+ YW+          
Sbjct: 256 VAIDASHQTFQFYSTGVYYEPECSSTE--LDHGVLVVGYGTSEDGQDYWK---------- 303

Query: 327 NQGYVLMSIKDNNCG 341
            QG V       NCG
Sbjct: 304 -QGAVTPVKNQGNCG 317


>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 135/222 (60%), Gaps = 14/222 (6%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++  + GL +++ Y PYL  D + C+     + A  TGFV++ P  E A
Sbjct: 178 CNGGLMDNAFQYVQDNGGLDSEESY-PYLATDTHTCNYKPECSAANDTGFVDI-PQREKA 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--- 312
           L  A+A  GP+SVAIDA  +SF FY +G+YY+  C  S   LDH VL VGYG  +GK   
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYEPGC--SSKDLDHGVLLVGYG-FEGKDSE 292

Query: 313 --PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
              +W VKNSW T WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 293 NNKFWIVKNSWGTSWGTNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
          Length = 379

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 126/202 (62%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F +TGA+E  +  +  +L  LS+Q LIDCS  YGN GC+GG    ++Q+I
Sbjct: 178 NQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 237

Query: 211 M-KHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
              +G+  + DY PY  +    C        AT TGF ++    E+ LK+A+A  GP SV
Sbjct: 238 KDNNGVDKELDY-PYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASV 296

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA  +SF  Y +GVY++++C  SP+ LDH VL VGYG +     YW VKNSW  +WG 
Sbjct: 297 AIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGE 354

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  + NNCG+ +  +Y
Sbjct: 355 QGYIRMARNRKNNCGIASHASY 376


>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
          Length = 379

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 126/202 (62%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F +TGA+E  +  +  +L  LS+Q LIDCS  YGN GC+GG    ++Q+I
Sbjct: 178 NQGMCGSCWAFSSTGALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 237

Query: 211 M-KHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
              +G+  + DY PY  +    C        AT TGF ++    E+ LK+A+A  GP SV
Sbjct: 238 KDNNGVDKELDY-PYKAKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASV 296

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA  +SF  Y +GVY++++C  SP+ LDH VL VGYG +     YW VKNSW  +WG 
Sbjct: 297 AIDAGHRSFQLYTHGVYFEKEC--SPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGE 354

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  + NNCG+ +  +Y
Sbjct: 355 QGYIRMARNRKNNCGIASHASY 376


>gi|55740404|gb|AAV63978.1| cathepsin L2 precursor [Artemia franciscana]
          Length = 226

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 123/199 (61%), Gaps = 3/199 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  C SCW+F +TGA+E   + K  KL  LS+Q LIDCS  YGN GC GG   +++++I 
Sbjct: 26  QGQCASCWAFSSTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIK 85

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T++ Y  Y  ++ +C        A   GFVN+    ED LK A+A  GPVS  I
Sbjct: 86  DNKGIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVI 144

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D S + F FY  GVYY+  C  S + L+HAVL +GYG  +G+ YW VKNSWS +WG++GY
Sbjct: 145 DVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGYGSDNGEDYWLVKNSWSKHWGDEGY 204

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + ++  + N+CGV TA  Y
Sbjct: 205 LKIARNRKNHCGVATAALY 223


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 96/213 (45%), Positives = 131/213 (61%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG+   + +Q  CGSCWSF  TG++EG ++     L  LS+Q L+DCS   GN+G
Sbjct: 112 VDWSKKGWVTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHG 171

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++++++K +G+ T+  Y PY   D+ C        AT++G+V+VT +SE  L
Sbjct: 172 CNGGLMDDAFEYVIKNNGIDTEASY-PYRAVDSTCKFNTADVGATISGYVDVTKDSESDL 230

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           ++A+A  GPVSVAIDAS  SF FY +GVY    C  S   LDH VLAVGYG    K YW 
Sbjct: 231 QVAVATIGPVSVAIDASHISFQFYSSGVYDPLIC--SSTNLDHGVLAVGYGTDGSKDYWL 288

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           VKNSW   WG  GY+ M    NN CG+ T+ +Y
Sbjct: 289 VKNSWGASWGMSGYIEMVRNHNNKCGIATSASY 321


>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
          Length = 337

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/202 (44%), Positives = 124/202 (61%), Gaps = 6/202 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG    K  KL  LS Q L+DCS  YGN+GC+GG   +++Q+++
Sbjct: 140 QGACGSCWAFSAAGALEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVI 199

Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              G+ ++D Y PY G+D  C     T  A  + +  +    E ALK A+A  GP+SVAI
Sbjct: 200 DNQGIDSEDSY-PYRGRDQQCQYNPATRAANCSRYDFLPEGDEQALKEAIATIGPISVAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA +  F+FY +GVY D  C  +   ++HAVLAVGYG L G+ YW VKNSW T +G+QGY
Sbjct: 259 DARRPRFAFYRSGVYDDSSCTQN---VNHAVLAVGYGSLGGQDYWLVKNSWGTSFGDQGY 315

Query: 331 VLMSI-KDNNCGVMTAPTYVTM 351
           + M+  K++ CG+     Y  M
Sbjct: 316 IRMARNKNDQCGIALYACYPIM 337


>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 11/243 (4%)

Query: 118 EKVSKYSLWVRYNKASKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
            + ++ S   RY+  + +A+P  V +  KG    + +Q  CGSCW+F TTG++EG  ++K
Sbjct: 89  RRSARKSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLK 148

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIA 234
              L  LS+Q L+DCS  YGN+GC GG    ++++I  +G    +   PY  ++  C   
Sbjct: 149 KGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQ 208

Query: 235 NTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSP 294
            +   AT TG+ ++  +  D L+ A+A  GP+SVA+DAS  SF  Y  GVY    C+++ 
Sbjct: 209 QSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTR 268

Query: 295 DGLDHAVLAVGYG-ELDG-----KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
             LDH VLAVGYG E  G     KPYW VKNSW   WG QGY  +  KDN CG+ T  +Y
Sbjct: 269 --LDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASY 326

Query: 349 VTM 351
            T+
Sbjct: 327 PTV 329


>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 330

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 96/213 (45%), Positives = 133/213 (62%), Gaps = 7/213 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   +  Q  CGSCW+F  TGA+EG ++ K   L  LS+Q L+DCS  Y NNG
Sbjct: 119 VDWREQGYVTDVKHQQQCGSCWAFSATGALEGQHFKKTGTLVPLSEQQLVDCSRKYRNNG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGGE   ++Q+I  + G+ T+  Y  Y  +D  C   + +  A   G+V+V+P  E+AL
Sbjct: 179 CDGGEPNWAFQYIRDNGGVDTEKSY-RYEAKDGQCRYRSNSIGAKCNGYVDVSP-FEEAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
             A+A  GP+SV+ID S+ SF  Y +GVY +  C+N    L+HAVLAVGYG  +G  YW 
Sbjct: 237 MEAVATIGPISVSIDDSRVSFQLYQSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWL 294

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           VKNSW + WGN+GY+ M+  K N CG+ T  +Y
Sbjct: 295 VKNSWGSGWGNKGYIKMTRNKGNQCGIATEASY 327


>gi|156354971|ref|XP_001623452.1| predicted protein [Nematostella vectensis]
 gi|156210152|gb|EDO31352.1| predicted protein [Nematostella vectensis]
          Length = 301

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 82/190 (43%), Positives = 118/190 (62%), Gaps = 8/190 (4%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-----YGNNGCDGG 201
           N    Q  CGSCW+F T GAVE A++++  +L  L++Q L+DC+W      +GNN C GG
Sbjct: 111 NPAKGQGTCGSCWAFATAGAVEAAHFIQKGELLNLAEQQLLDCTWSTPGVYHGNNCCLGG 170

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
             ++++ W+ K GL T   YG Y GQ+ +C  +N T  A +T +  V   +  ALK AL+
Sbjct: 171 WTWKAFSWVKKFGLATTKSYGHYRGQEGFCKTSNRTVGARITSYRRVKRFNPIALKKALS 230

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
            HGP +++I+A+ KS  FY NG+  D+ C+N      HAVL +GYG  +G PYW +KNSW
Sbjct: 231 YHGPATISINANPKSIKFYSNGIMSDKHCSNK---TYHAVLLIGYGSDNGVPYWLIKNSW 287

Query: 322 STYWGNQGYV 331
           S  WGN G++
Sbjct: 288 SHKWGNNGFI 297


>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
          Length = 359

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 219 KSNGGLDTEEAY-PYTGEDGTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLRPVSIA 277

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 278 FEVIH-SFRLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 336

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 337 YFKMEMGKNMCGIATCASY 355


>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
 gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
 gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
 gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
          Length = 328

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 127/217 (58%), Gaps = 5/217 (2%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +  K    + DQ  CGSCWSF TTGAVEG   +  K L  LS+Q L+DCS  YGN
Sbjct: 116 AAEVDWRSKAVTEVKDQGQCGSCWSFSTTGAVEGQLAISGKGLTSLSEQNLVDCSSQYGN 175

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GC+GG    ++ +I  +G+ ++  Y PY   D  C    + +  ++ G+ ++    E A
Sbjct: 176 AGCNGGWMDSAFDYIHDNGIMSESAY-PYTAMDGNCRFDASQSVTSLQGYYDIPSGDESA 234

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L+ A+A +GPV+VA+DA+++    Y  GV YD  C  S   L+H VL VGYG   G+ YW
Sbjct: 235 LQDAVANNGPVAVALDATEE-LQLYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYW 291

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            VKNSW + WG QGY   +  ++NNCG+ TA +Y  +
Sbjct: 292 IVKNSWGSGWGEQGYWRQARNRNNNCGIATAASYPAL 328


>gi|1705639|sp|Q10991.1|CATL1_SHEEP RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
          Length = 217

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 100/217 (46%), Positives = 131/217 (60%), Gaps = 8/217 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+D S   GN G
Sbjct: 5   VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I ++ GL +++ Y PY   D  C+     + A  TGFV++ P  E AL
Sbjct: 65  CNGGLMDNAFQYIKENGGLDSEESY-PYEATDTSCNYKPEYSAAKDTGFVDI-PQREKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG E     +W
Sbjct: 123 MKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTNNKFW 180

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            VKNSW   WGN+GYV M+   NN CG+ TA +Y T+
Sbjct: 181 IVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217


>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 325

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 123/199 (61%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ +CGSCW+F TTG+VE A+++K   L  LS+Q L+DC+      GC GG   ++ ++I
Sbjct: 127 DQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTCY-GCGGGWMDKALEYI 185

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            K G+ ++ DY PY G D  C    +   A ++ F  +  N E+ LK A+A  GP+SVAI
Sbjct: 186 EKGGIMSEKDY-PYEGVDDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAI 244

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  +F  YV+G+  D +C+N  D L+H VL VGYG  +GK YW +KNSW   WG  GY
Sbjct: 245 DASA-TFQLYVSGILDDTECSNEFDSLNHGVLVVGYGTENGKDYWIIKNSWGVNWGMDGY 303

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + MS  K+N CG+ T   Y
Sbjct: 304 IRMSRNKNNQCGITTDGVY 322


>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
          Length = 333

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 127/203 (62%), Gaps = 6/203 (2%)

Query: 149 LLDQSVCGSCWSF-GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           + +Q  CGSCW+F     ++EG + +K  KL  LS+Q L+DCS   GN GC+GG   +++
Sbjct: 131 IKNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAF 190

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Q+++ + G+ T+  Y PY   D        +  AT+  +V+V   SE +L+ A+A  GP+
Sbjct: 191 QYVIANKGIDTEMSY-PYKAIDESWEFKKNSVGATIKSYVDVKTGSESSLQSAVATVGPI 249

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           SV IDASQ SF FY +GVY +  C+ +   LDH V AVGYG L+G PYW+VKNSW T WG
Sbjct: 250 SVGIDASQLSFQFYSSGVYEEPACSTTI--LDHGVTAVGYGALNGTPYWKVKNSWGTSWG 307

Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
             GY+ MS  K N CG+ TA ++
Sbjct: 308 MSGYIFMSRNKQNQCGIATAASW 330


>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++ ++ GL +++ Y PY+  D  C   +  + A  TGF  V    E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRSENSVANDTGFEVVPAGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW   WG+ GYV +   KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334


>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
 gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
          Length = 335

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 129/204 (63%), Gaps = 6/204 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ +Q+ CGSCW+F +TG++EGA      KL   S+Q L+DCS  +GN+GC+GG    S+
Sbjct: 132 AVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSF 191

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
            +++ + GL ++  Y PY  Q   C      +  T++ F +V+   E  LK A+   GPV
Sbjct: 192 NYLIHNKGLESEASY-PYEAQKKECRYKKALSKGTISSFTDVSQFDEKDLKRAVGLVGPV 250

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQVKNSWSTYW 325
           S+AIDASQ SF  Y +GVY +E C+ +   L+H VLAVGYG   +G  YW+VKNSW+  W
Sbjct: 251 SIAIDASQFSFHLYDSGVYDEEDCSQTM--LNHGVLAVGYGTTPEGLDYWKVKNSWTNTW 308

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G +GY+LMS  KDN CGV T  +Y
Sbjct: 309 GMEGYILMSRNKDNQCGVATVASY 332


>gi|377656292|pdb|3QT4|A Chain A, Structure Of Digestive Procathepsin L 3 Of Tenebrio
           Molitor Larval Midgut
          Length = 329

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 89/218 (40%), Positives = 132/218 (60%), Gaps = 7/218 (3%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + DQ  CGS WSF TTGAVEG   ++  +L  LS+Q LIDCS  YGN
Sbjct: 117 AASVDWRSNAVSEVKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGN 176

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GCDGG    ++ +I  +G+ ++  Y PY  Q  YC   ++ +  T++G+ ++    E++
Sbjct: 177 AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENS 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+ + GPV+VAIDA+ +   FY  G++YD+ CN S   L+H VL VGYG  +G+ YW
Sbjct: 236 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNGQDYW 292

Query: 316 QVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
            +KNSW + WG  GY    +++  NNCG+ TA +Y  +
Sbjct: 293 ILKNSWGSGWGESGY-WRQVRNYGNNCGIATAASYPAL 329


>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 359

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 219 KSNGGLDTEEAY-PYTGEDGTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVRPVSIA 277

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 278 FEVIH-SFRLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKG 336

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 337 YFKMEMGKNMCGIATCASY 355


>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
 gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
          Length = 334

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++ ++ GL +++ Y PY+  D  C   +  + A  TGF  V    E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRSENSVANDTGFKVVPAGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW   WG+ GYV +   KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334


>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
          Length = 358

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354


>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
          Length = 333

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|118404242|ref|NP_001072435.1| cathepsin K precursor [Xenopus (Silurana) tropicalis]
 gi|113197688|gb|AAI21683.1| hypothetical protein MGC147539 [Xenopus (Silurana) tropicalis]
          Length = 331

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 106/254 (41%), Positives = 145/254 (57%), Gaps = 20/254 (7%)

Query: 113 VTSEGEKVSKYSLWV-RYNKAS---------KDAIP--VRYEMKGY-NSLLDQSVCGSCW 159
           +TSE    +   LWV R N+++         ++ IP  + Y  KGY   + +Q  CGSCW
Sbjct: 83  MTSEEVVRTMTGLWVHRRNRSTNFTSEDNTAQEKIPDSIDYRKKGYVTPIRNQGSCGSCW 142

Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQ 218
           +F + GA+EG    K  KL  LS Q L+DC     N+GC GG    +++++  + G+ ++
Sbjct: 143 AFSSVGALEGQLKKKKGKLVDLSPQNLVDCV--KKNDGCGGGYMTNAFEYVRDNKGIDSE 200

Query: 219 DDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFS 278
           + Y PY+G+D  C    T   A+  GF  V   SE ALK A+   GPVSV IDA   SF 
Sbjct: 201 NAY-PYVGEDQECMYNATGKAASCKGFKEVQEGSEKALKKAVGLVGPVSVGIDAGLSSFQ 259

Query: 279 FYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KD 337
           FY  GVYYD+ CN   + ++HAVLAVGYG      YW VKNSW   WGN+GY+LM+  KD
Sbjct: 260 FYSKGVYYDKDCN--AENINHAVLAVGYGTQKKTKYWIVKNSWGEDWGNKGYILMAREKD 317

Query: 338 NNCGVMTAPTYVTM 351
           N CG+ +  +Y  M
Sbjct: 318 NACGISSLASYPVM 331


>gi|1093503|prf||2104214A Cys protease
          Length = 255

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 92/214 (42%), Positives = 130/214 (60%), Gaps = 7/214 (3%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG   ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNG
Sbjct: 42  VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 101

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I  + G+ T+  Y PY   D  CH       AT  GF ++    E  +
Sbjct: 102 CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 160

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GPVSVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW
Sbjct: 161 PEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 218

Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            VKNSW T WG++G++ ++  K+N CG+ +  +Y
Sbjct: 219 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 252


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 132/209 (63%), Gaps = 14/209 (6%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           G   + +Q +CGSCW+F TTGAVEGA ++   KL  LS+Q L+DC    G+ GC+GG   
Sbjct: 128 GVTPVKNQGMCGSCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHN-GDMGCNGGLMD 186

Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            ++QWI  HG + ++DDY  Y  +   C   ++     +TGF +V P  E ALK+A+A+ 
Sbjct: 187 HAFQWIEDHGGICSEDDY-EYKAKAQVCRKCDSVVK--VTGFQDVNPQDEHALKVAVAQQ 243

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
            PVSVAI+A QK+F FY +GV+ +  C      LDH VLAVGYG  +G+ +W+VKNSW  
Sbjct: 244 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGNDNGQKFWKVKNSWGA 298

Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
            WG QGY+ ++ ++N     CG+ + P+Y
Sbjct: 299 SWGEQGYIRLAREENGPAGQCGIASVPSY 327


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF TTG+VEG +  K   L  LS+Q L+DCS   GN GC+GG    ++++I
Sbjct: 125 NQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYI 184

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +K+ G+ T+  Y PY      C        AT+  + ++   SE  L+ A+A  GPVSVA
Sbjct: 185 IKNGGIDTEASY-PYTATTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVA 243

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDAS  +F FY  GVY ++KC+ +   LDH VLAVGYG   +GK YW VKNSW   WG  
Sbjct: 244 IDASHINFQFYFTGVYNEKKCSTTQ--LDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKA 301

Query: 329 GYVLMSIK-DNNCGVMTAPTY 348
           GY+ MS   DN CG+ T+ +Y
Sbjct: 302 GYIWMSRNADNQCGIATSASY 322


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 125/203 (61%), Gaps = 6/203 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG    +++
Sbjct: 139 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 198

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I  + G+ T+  Y PY G D  C        A   GFV++    E  L  A+A  GPVS
Sbjct: 199 YIKDNGGIDTEQTY-PYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS 257

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWG 326
           VAIDAS  SF  Y +GVY +E+C+++   LDH VL VGYG +  G  YW VKNSW   WG
Sbjct: 258 VAIDASHTSFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWLVKNSWGRSWG 315

Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
             GY+ ++  K+N CG+ ++ +Y
Sbjct: 316 ELGYIKMIRNKNNRCGIASSASY 338


>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
 gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
          Length = 333

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 135/232 (58%), Gaps = 6/232 (2%)

Query: 120 VSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKL 178
           + +  L  +Y   S+    V +  KGY   + +Q  CGSC++F  TGAVEG ++ K  KL
Sbjct: 102 IERRGLVFKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEGQHFRKTGKL 161

Query: 179 AVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTT 237
             LS+Q ++DCS+  GN GC GG   +S+ +I   +G+ T++ Y PY  +D  C    + 
Sbjct: 162 VSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAY-PYEARDGPCRFRRSE 220

Query: 238 ATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
             AT+ G+V++  N E AL+ A+   GP+SVAID    +F FY +GV+ +  C  S   +
Sbjct: 221 VGATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDNPNC--SKTKI 278

Query: 298 DHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           +H VL VGYG  DG  YW VKNSW   WG +GY+LMS   DN C +  A +Y
Sbjct: 279 NHGVLVVGYGTRDGLDYWLVKNSWGERWGAEGYILMSRNNDNQCCITCAASY 330


>gi|449679414|ref|XP_002161570.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 353

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 129/214 (60%), Gaps = 5/214 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q +C SCW+F TTGA+EG  + K   L  LS+Q L+DCS  YGN G
Sbjct: 138 VDWRTEGYVTPVKNQGLCRSCWAFSTTGALEGQTFRKTGILPTLSEQNLVDCSKSYGNQG 197

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDA 255
           CDGG    ++++I  + GL +++ Y PY  ++  YC+       A+ +GFV +    EDA
Sbjct: 198 CDGGWTNNAFEYIKDNDGLDSENGY-PYDAKELGYCYYDEKYKEASDSGFVEIPYGDEDA 256

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           LK A+A  GP++V IDAS+ SF  Y +GVY +  C N    L HAVL VGYG   G  +W
Sbjct: 257 LKEAVATVGPIAVNIDASKPSFQSYKSGVYNEPTCGNGITNLTHAVLVVGYGTEKGHKFW 316

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WG+ GY+ MS  K N CG+ T  ++
Sbjct: 317 LVKNSWGKTWGDHGYIKMSRNKSNQCGIATRASF 350


>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
          Length = 350

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   +      + G VN+T  SED LK A+A   PVSVA
Sbjct: 210 KYNGGLETEETY-PYTGSNGLCKFTSENVALKVLGSVNITLGSEDELKHAVAFARPVSVA 268

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY    C N+P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGNTPMDVNHAVLAVGYGIEDGIPYWHIKNSWGGDWGDHG 327

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346


>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 333

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 128/217 (58%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY  QD  C      + A  TGF ++ P  E AL
Sbjct: 178 CNGGLMNNAFQYVKDNGGLDSEESY-PYHAQDESCKYKPQDSAANDTGFFDI-PQQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG----K 312
            +A+A  GP+SV IDAS  +F FY  G+YYD  C  S + LDH VL +GYG   G    K
Sbjct: 236 MVAVATKGPISVGIDASHFTFQFYHEGIYYDPDC--SSEDLDHGVLVIGYGTEIGQSINK 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG  GY+ M+  + N+CG+ T  ++
Sbjct: 294 TYWIVKNSWGANWGIDGYIKMAKDRKNHCGIATMASF 330


>gi|218478060|dbj|BAH03396.1| cathepsin L-like cysteine peptidase [Taenia saginata]
          Length = 338

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 124/200 (62%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EGA+  K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +H +  +  Y PY   D  C    +    T+T   ++   +E AL  A+A  GP+S+AI
Sbjct: 200 EEHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLA+GYG+ DGKPYW VKNSW T WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGTRWGMKGY 316

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CGV +   +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335


>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
          Length = 336

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 138/241 (57%), Gaps = 2/241 (0%)

Query: 109 KWRLVTSEGEKVSK-YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAV 167
           K R + SE +  S   S ++R +     ++  R + +  + + +Q  CGSCW+F TTGA+
Sbjct: 92  KRRYLWSEPQNCSATKSNYLRGHGPYPTSVDWRKKGRFVSPVKNQGGCGSCWTFSTTGAL 151

Query: 168 EGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQ 227
           E A  +K  K+  LS+Q L+DC+  + N+GC GG   +++++I  +    ++D  PY G+
Sbjct: 152 ESAIAIKTGKMLSLSEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMEEDSYPYEGK 211

Query: 228 DAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYD 287
           D+ C      A A +    N+T N E A+  A+A + PVS A + +   F  Y  G+Y  
Sbjct: 212 DSNCRFQPEKAIAFVKDVANITLNDEAAMVEAVALYNPVSFAFEVTS-DFMLYRKGIYSS 270

Query: 288 EKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPT 347
             C+ +PD ++HAVLAVGYGE +GKPYW VKNSW  YWG  GY L+    N CG+    +
Sbjct: 271 TSCHKTPDKVNHAVLAVGYGEQNGKPYWIVKNSWGPYWGMNGYFLIERGTNMCGLAACAS 330

Query: 348 Y 348
           Y
Sbjct: 331 Y 331


>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
           Full=Senescence-associated gene product 2; Flags:
           Precursor
 gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
 gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
 gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
 gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
 gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
 gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 358

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 336 YFKMEMGKNMCGIATCASY 354


>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
          Length = 942

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 92/200 (46%), Positives = 122/200 (61%), Gaps = 7/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG +EG ++ K KKL  LS+Q L+DC     N GC GG    +Y++I
Sbjct: 745 NQGYCGSCWAFATTGGLEGQHFRKTKKLVSLSEQNLVDCC--KENLGCTGGLPVTAYKYI 802

Query: 211 MKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++G + T++ Y PYLG++  C        AT  GFV V    E  L+ A+A  GPV+V+
Sbjct: 803 ARNGGIDTEESY-PYLGKNGNCTFRPPKIGATCQGFVRVPAGDEVGLQKAVASVGPVTVS 861

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS KSF  Y  GVY D+KC  S    +H VL VGYG+  GK YW VKNSW   +G  G
Sbjct: 862 IDASLKSFYLYKEGVYDDKKC--SKKMFNHFVLIVGYGKHLGKEYWLVKNSWGMSFGMDG 919

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y++M+  +DN CG+   P Y
Sbjct: 920 YIMMARNQDNQCGISNQPVY 939


>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
          Length = 334

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++ ++ GL +++ Y PY+  D  C      + A  TGF  V    E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRPENSVANDTGFEVVPAGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW   WG+ GYV +   KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334


>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
           Bound To Cathepsin K
          Length = 215

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/216 (44%), Positives = 127/216 (58%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + +Q  CGSCW+F + GA+EG        L  L+ Q L+DC     N+G
Sbjct: 5   IDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCV--SENDG 62

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++ ++ G+ ++D Y PY+GQD  C    T   A   G+  +   +E AL
Sbjct: 63  CGGGYMTNAFQYVQRNRGIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEAAL 121

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+A  GPVSVAIDAS  SF FY  GVYYDE C  S D L+HAVLAVGYG   G  +W 
Sbjct: 122 KRAVAAVGPVSVAIDASLTSFQFYSAGVYYDENC--SSDALNHAVLAVGYGIQAGNKHWI 179

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           +KNSW   WGN GY+LM+   NN CG+    ++  M
Sbjct: 180 IKNSWGESWGNAGYILMARNKNNACGIANLASFPKM 215


>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
          Length = 350

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 209

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY GQ+  C   +      + G VN+T  +ED LK A+A   PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGQNGPCKFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVSVA 268

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y  GVY    C N+P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 269 FEVVD-DFRLYKKGVYTSTTCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWGDHG 327

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346


>gi|312381834|gb|EFR27484.1| hypothetical protein AND_05795 [Anopheles darlingi]
          Length = 508

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 124/201 (61%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGAVEG ++ K  KL  LS+Q L+DC+  Y N GC GG  +RS+Q+I
Sbjct: 308 EQGKCGSCWAFSSTGAVEGQHFRKTNKLVSLSEQNLVDCTSNYRNKGCKGGAIYRSFQYI 367

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            + HG+ T+  Y PY  ++  C        A + G+V++    EDAL  A+A  GP+S+ 
Sbjct: 368 EQNHGIDTEKSY-PYQAKEGPCAYNPKAIGAKVKGYVHIPTGDEDALMKAVATVGPISIV 426

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           +D+   +F  Y +GVYYD +C  S   L HA+L VGYG    G+ +W VKNSW T WG +
Sbjct: 427 VDSRHHTFKHYADGVYYDSQC--SATNLTHAMLVVGYGTSKKGEDFWLVKNSWGTSWGIK 484

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  ++N+CG+     Y
Sbjct: 485 GYIKMARNRNNSCGIANKAYY 505



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 28/55 (50%), Positives = 38/55 (69%), Gaps = 2/55 (3%)

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           EDAL  A+A  GP+++++DA  KSF  Y  GVYY+ +CN  P  L H++L VGYG
Sbjct: 253 EDALMKAVATIGPIAISMDAGHKSFQSYGAGVYYEPQCN--PSSLTHSMLVVGYG 305


>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
 gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
 gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
          Length = 334

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 133/221 (60%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY  ++ +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++  + GL T++ Y PYLG++   C      + A  TGFV++ P  E A
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDTEESY-PYLGRETNSCTYKPECSAANDTGFVDI-PQREKA 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG    + + 
Sbjct: 236 LMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNS 293

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
             +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 294 SKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334


>gi|2239109|emb|CAA70694.1| cathepsin S-like cysteine proteinase [Heterodera glycines]
          Length = 353

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 9/203 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKH-KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           DQ  CGSCW+F  TGA+EGA   K   K+  LS+Q L+DCS  YGN GCDGG    ++++
Sbjct: 152 DQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCSSKYGNEGCDGGLMDSAFEY 211

Query: 210 IM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +   +GL T++ Y PY      C   N T   T+  F ++    E+ LK+A+A  GP+SV
Sbjct: 212 VRDNNGLDTEESY-PYEAVTGKCQFKNETVGGTVVSFKDLKKGDEEQLKIAVATIGPISV 270

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWG 326
           A+DAS  SF FY  GVYY+  C+N    LDH VL VGYG  E  G  YW VKNSW  +WG
Sbjct: 271 ALDASNLSFQFYKTGVYYERWCSNRY--LDHGVLLVGYGTDETHGD-YWLVKNSWGPHWG 327

Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
             GY+ ++  K N+CG+ T  +Y
Sbjct: 328 ENGYIRIARNKQNHCGIATMASY 350


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 131/209 (62%), Gaps = 14/209 (6%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           G   + +Q +CGSCW+F TTGAVEGA ++   KL  LS+Q L+DC    G+ GC+GG   
Sbjct: 128 GVTPVKNQGMCGSCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHN-GDMGCNGGLMD 186

Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            ++QWI  HG + ++DDY  Y  +   C   ++     +TGF +V P  E ALK+A+A+ 
Sbjct: 187 HAFQWIEDHGGICSEDDY-EYKAKAQVCRECDSVVK--VTGFQDVNPQDEHALKVAVAQQ 243

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
            PVSVAI+A QK+F FY +GV+ +  C      LDH VLAVGYG  +G  +W+VKNSW  
Sbjct: 244 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGNDNGHKFWKVKNSWGA 298

Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
            WG QGY+ ++ ++N     CG+ + P+Y
Sbjct: 299 SWGEQGYIRLAREENGPAGQCGIASVPSY 327


>gi|1698586|gb|AAB37252.1| cathepsin L, partial [Oncorhynchus mykiss]
          Length = 196

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 93/196 (47%), Positives = 129/196 (65%), Gaps = 10/196 (5%)

Query: 160 SFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQ 218
           +F TTGA+EG  + K  KL  LS+Q L+DCS   GN GC+GG   +++Q+I  + GL T+
Sbjct: 1   AFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTE 60

Query: 219 DDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSF 277
           + Y PY+G D   CH     + A  TGFV++    E A+  A+A  GPVSVAIDA  +SF
Sbjct: 61  ESY-PYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESF 119

Query: 278 SFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLM 333
            FY +G+YY+++C  S + LDH VL VGYG    ++DGK YW VKNSWS  WG++GY+ M
Sbjct: 120 QFYESGIYYEKEC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYM 177

Query: 334 SI-KDNNCGVMTAPTY 348
           +  + N+CG+ TA +Y
Sbjct: 178 AKDRKNHCGIATASSY 193


>gi|291383484|ref|XP_002708316.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 135/232 (58%), Gaps = 13/232 (5%)

Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           + KA    +P  V +  KGY   +  Q  CGSCW+F  TGA+EG  + K  +L  LS+Q 
Sbjct: 106 FGKAVFQEVPSSVDWRDKGYVTPVKKQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQN 165

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           LIDCSW  GN+GC GG    ++Q++  + GL ++D Y PY  ++  C      + A  TG
Sbjct: 166 LIDCSWPAGNHGCRGGLTDHAFQYVKDNGGLDSEDSY-PYEARNLPCRYDPQKSVANGTG 224

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           FV + P  E+AL  A+A  GP++VAIDA   SF FY  G+YY+  C  S    +HAVL V
Sbjct: 225 FVRI-PRQENALMEAVATVGPIAVAIDAGHPSFQFYKEGIYYEPNC--SSKHHNHAVLVV 281

Query: 305 GYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           GYG    E D   YW VKNSW   WG  GY+ ++  ++N+CG+ +  +Y T+
Sbjct: 282 GYGYEGAESDSNKYWLVKNSWGKRWGEAGYIRIAKDRNNHCGIASHASYPTV 333


>gi|189053498|dbj|BAG35664.1| unnamed protein product [Homo sapiens]
          Length = 334

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 94/217 (43%), Positives = 128/217 (58%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  C SCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCVSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      +  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSNNS 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331


>gi|261824891|pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex
 gi|261824892|pdb|3H6S|B Chain B, Strucure Of Clitocypin - Cathepsin V Complex
 gi|261824893|pdb|3H6S|C Chain C, Strucure Of Clitocypin - Cathepsin V Complex
 gi|261824894|pdb|3H6S|D Chain D, Strucure Of Clitocypin - Cathepsin V Complex
 gi|310942696|pdb|3KFQ|A Chain A, Unreduced Cathepsin V In Complex With Stefin A
 gi|310942697|pdb|3KFQ|B Chain B, Unreduced Cathepsin V In Complex With Stefin A
          Length = 221

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 128/217 (58%), Gaps = 10/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 5   VDWRKKGYVTPVKNQKQCGSXWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++Q++ ++ GL +++ Y PY+  D  C      + A  TGF  V P  E AL
Sbjct: 65  CNGGFMARAFQYVKENGGLDSEESY-PYVAVDEICKYRPENSVAQDTGFTVVAPGKEKAL 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 124 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNS 181

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW VKNSW   WG+ GYV +   K+N+CG+ TA +Y
Sbjct: 182 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218


>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
 gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
 gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++ ++ GL +++ Y PY+  D  C      + A  TGF  V    E AL
Sbjct: 178 CNGGFMNSAFRYVKENGGLDSEESY-PYVAMDGICKYRPENSVANDTGFEVVPAGKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA   SF FY +G+Y++  C  S   LDH VL VGYG      D  
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC--SSKNLDHGVLVVGYGFEGANSDNN 294

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW   WG+ GYV +   KDN+CG+ TA +Y T+
Sbjct: 295 KYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334


>gi|330801846|ref|XP_003288934.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
 gi|325081026|gb|EGC34558.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
          Length = 334

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 122/214 (57%), Gaps = 7/214 (3%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E    + + DQ  CGSC+SF   GA E AYY+ + ++  LS+Q ++DCS  YGN GC+GG
Sbjct: 122 EKGAVHDIKDQGHCGSCYSFSAIGAAESAYYIANGEMLTLSEQNILDCSRSYGNEGCNGG 181

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI--ANTTATATMTGFVNVTPNSEDALKLA 259
               S+Q+++  G    +   PY  +DA C      T   AT  G V +    E  L+ A
Sbjct: 182 YMLESFQFLLDQGGAVSEASYPYEAKDASCRFDSVKTPIVATFNGTVEIRRGDEGDLQQA 241

Query: 260 LAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY--GELDGKPYWQV 317
           +A HGPV+VAIDA   SF  Y  GVYY+  C  S   L HAVLAVGY    + GK YW V
Sbjct: 242 IATHGPVAVAIDAGHISFQLYKTGVYYEPYC--SSYSLSHAVLAVGYDTDSVTGKDYWIV 299

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
            NSW   WG+ G++ M+  + N+CG+ T  +Y+T
Sbjct: 300 ANSWGLKWGDSGFIKMARNRGNHCGISTMSSYIT 333


>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
          Length = 333

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 125/203 (61%), Gaps = 3/203 (1%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           +++ +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC+GG   ++
Sbjct: 128 SAVKNQGSCGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQA 187

Query: 207 YQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
           +++I+   G+  +D Y PY G+D +C      A A +    N+T N E A+  A+A + P
Sbjct: 188 FEYILYNKGIMGEDTY-PYRGKDGHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNP 246

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VS A + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE DG PYW VKNSW T W
Sbjct: 247 VSFAFEVTD-DFMLYQKGIYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVKNSWGTNW 305

Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
           G++GY L+    N CG+    +Y
Sbjct: 306 GDKGYFLIERGKNMCGLAACASY 328


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 131/239 (54%), Gaps = 27/239 (11%)

Query: 135 DAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
           D+  V +  KG  S + DQ  CGSCWSF TTG+VEGA+ +K   +  LS+Q L+DCS   
Sbjct: 266 DSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSE 325

Query: 194 GNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
           GN GC+GG    ++++I+  +G+ T+  Y         C      + AT++ + N+T  S
Sbjct: 326 GNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKANSGATISSYKNITAGS 385

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE---- 308
           E  L  A+   GPVSVAIDAS  SF  Y +G+YYD  C  S   LDH VL VGYG     
Sbjct: 386 ESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASC--SSVNLDHGVLVVGYGSGTPD 443

Query: 309 ------------------LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
                              D K YW VKNSW T WG++G++ MS  +DNNCG+ +  +Y
Sbjct: 444 SDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCASY 502


>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
          Length = 322

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 91/218 (41%), Positives = 133/218 (61%), Gaps = 8/218 (3%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P  V +  KGY   + +Q  CGSCW+F  TG++EG ++    KL  LS+Q L+DCS  
Sbjct: 102 ALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSLEGQHFNATGKLVSLSEQNLVDCSSA 161

Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
            GN GC+GG    ++++++K+ G+ T+  Y PY+ +D  CH ++    +T + +V++   
Sbjct: 162 EGNEGCNGGLPDDAFKYVIKNGGIDTEASY-PYVARDEKCHYSSANIGSTCSSYVDIESK 220

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           SE  L++A A  GP+ V IDAS   F  Y  GVY+ + C+ +   LDH VL VGYG    
Sbjct: 221 SEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYHSDLCSQTR--LDHGVLVVGYGVYKE 278

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           K YW VKNSW T WG  G ++MS  +DNNCG+ T  +Y
Sbjct: 279 KDYWMVKNSWGTNWGISGDMMMSRNRDNNCGIATMASY 316


>gi|75060921|sp|Q5E998.1|CATL2_BOVIN RecName: Full=Cathepsin L2; Flags: Precursor
 gi|59858409|gb|AAX09039.1| cathepsin L2 preproprotein [Bos taurus]
          Length = 334

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  +G    ++  PYL  D   C+     + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
 gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
          Length = 356

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 116/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY   H K   LS+Q L+DC  G+ N GC+GG   +++++I
Sbjct: 156 DQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYI 215

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G D  C          +   VN+T  +ED LK A+A   PVSVA
Sbjct: 216 KYNGGLDTEEAY-PYTGVDGSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVA 274

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y  GVY    C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 275 FEVVS-GFRLYSKGVYTSNSCGSTPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDNG 333

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 334 YFKMEMGKNMCGVATCASY 352


>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
          Length = 334

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 122/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG +E A  +   KL  L++Q L+DC+  + N+GC+GG   +++++I
Sbjct: 133 NQGGCGSCWTFSTTGGLESAVAIATGKLLSLAEQQLVDCAQDFNNHGCNGGLPSQAFEYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           M   G+  +D Y PY G+D  C      A A +    N+T   E+A+  A+A H PVS A
Sbjct: 193 MYNKGIMGEDTY-PYEGKDGTCKFQPNKAIAFVKDVANITAYDEEAMTEAVAHHNPVSFA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +    S++  G+Y + KC+ SPD ++HAVLAVGYG+ +G PYW VKNSW T WGN G
Sbjct: 252 FEVTDDFLSYH-KGIYSNPKCSKSPDKVNHAVLAVGYGKENGIPYWIVKNSWGTSWGNNG 310

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 311 YFLIERGKNMCGLADCASY 329


>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
          Length = 245

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/200 (45%), Positives = 120/200 (60%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 46  QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 105

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT +G++ +    EDALK A+A  GPVSV 
Sbjct: 106 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSGYIQLPFGDEDALKEAVATKGPVSVG 165

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 166 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 222

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 223 YIRMARNNKNHCGIASYCSY 242


>gi|13905172|gb|AAH06878.1| Cathepsin H [Mus musculus]
          Length = 333

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 80/200 (40%), Positives = 124/200 (62%), Gaps = 1/200 (0%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           +++Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   ++++
Sbjct: 130 VINQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFE 189

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +I+ +    ++D  PY+G+D+ C      A A +   VN+T N E A+  A+A + PVS 
Sbjct: 190 YILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSF 249

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           A + ++  F  Y +GVY  + C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WG  
Sbjct: 250 AFEVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGEN 308

Query: 329 GYVLMSIKDNNCGVMTAPTY 348
           GY L+    N CG+    +Y
Sbjct: 309 GYFLIERGKNMCGLAACASY 328


>gi|431917800|gb|ELK17041.1| Cathepsin L1 [Pteropus alecto]
          Length = 334

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRQKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRSQGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  +G    ++  PYL +++  C+     + A  TGFV++ P  E +L
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLAKESDTCNYKPEYSAANDTGFVDI-PQREKSL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP--- 313
             A+A  GP+SVAIDA   SF FY  G+YY+  C  S   LDH VL +GYG   G P   
Sbjct: 237 MKAVATVGPISVAIDAGHSSFQFYNKGIYYEPDC--SSKDLDHGVLVIGYGSEGGDPKSN 294

Query: 314 -YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGMNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
 gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   GA+EG   +K  KL  LS Q L+DCS  YGN GC GG   +++Q++
Sbjct: 135 NQGGCGSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYV 194

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D Y PY   D  CH   T   +T   +  + P +ED LK AL   GP+SVAI
Sbjct: 195 IDNKGIDSDSYYPYHAMDEKCHYDPTGKASTCAKYTEIVPGTEDNLKQALGSIGPISVAI 254

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D ++ SF  Y +GVY D  C++    ++H VLAVGYG L+G+ +W +KNSW T +G+QGY
Sbjct: 255 DGTRPSFFLYRSGVYSDPTCSHE---VNHGVLAVGYGNLNGQDFWLLKNSWGTKYGDQGY 311

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           V ++  K N CGV +   Y
Sbjct: 312 VRIARNKGNLCGVASYTCY 330


>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
          Length = 335

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/221 (43%), Positives = 133/221 (60%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F   GA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 119 VDWRQKGYVTPVKNQGQCGSCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHSQGNQG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++  + GL +++ Y PYLG+++  C+     + A  TGFV++ P  E  
Sbjct: 179 CNGGLMDNAFQYVKDNKGLDSEESY-PYLGRESNTCNYRPEYSAANDTGFVDI-PQHERG 236

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GP+SVAIDA   SF FY  G+YY+  C  S   LDH VL VGYG    + D 
Sbjct: 237 LMKAVATVGPISVAIDAGHSSFQFYSEGIYYEPNC--SSKDLDHGVLVVGYGSEGAQSDS 294

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             +W VKNSW T WG  GYV M+  + N+CG+ TA +Y T+
Sbjct: 295 NKFWIVKNSWGTGWGMSGYVKMARDQSNHCGIATAASYPTV 335


>gi|41323856|gb|AAS00027.1| cathepsin L-like cysteine proteinase [Taenia solium]
          Length = 339

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 124/200 (62%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EGA+  K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 141 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 200

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +H +  +  Y PY   D  C    +    T+T   ++   +E AL  A+A  GP+S+AI
Sbjct: 201 EEHFIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 259

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLA+GYG+ DGKPYW VKNSW T WG +GY
Sbjct: 260 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGTRWGMKGY 317

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CGV +   +
Sbjct: 318 IMMA-KDYHNMCGVASLADF 336


>gi|344257452|gb|EGW13556.1| Cathepsin L1 [Cricetulus griseus]
          Length = 290

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  C SCW+F   G++EG  + K  +L  LS+Q L+DCSW YGN GC GG    +++++
Sbjct: 91  DQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSWSYGNIGCFGGLMEYAFRYV 150

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T+  Y PY  ++  C      + A +T FV + P SEDAL  A+A  GP+SV 
Sbjct: 151 KENRGLDTRVSY-PYEARNGPCRYDPKNSAANVTDFVKI-PISEDALMKAVATVGPISVG 208

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           +D+   SF FY  G+YY+  C++S   LDHAVL VGYG E DG  YW VKNSW   WG  
Sbjct: 209 VDSHHHSFRFYKGGMYYEPHCSSS--NLDHAVLVVGYGEESDGNKYWMVKNSWGQGWGMN 266

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  ++NNCG+ T   Y T+
Sbjct: 267 GYIKMARDRNNNCGIATYAIYPTV 290


>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 94/211 (44%), Positives = 124/211 (58%), Gaps = 7/211 (3%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E      + DQ  CGSCWSF  TGA+EG   ++   L  LS+Q L+DCS   GN GC GG
Sbjct: 132 EQGAVTPVKDQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGG 191

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
              +++Q++ +  L T+D Y PY   D  C  A++     +  FV+VTPN+ + LK AL 
Sbjct: 192 WMDQAFQYVEQTALETEDQY-PYEAVDDTCR-ASSAGVVKVDSFVDVTPNNVNELKAALD 249

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSW 321
           K GPVSVAI+A Q  F FY  GV  D  C  +   LDH VLAVGYG   G+ Y+ VKNSW
Sbjct: 250 K-GPVSVAIEADQMVFQFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSW 305

Query: 322 STYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
              WG +GYV ++   DN CG+++  +Y  M
Sbjct: 306 GASWGEEGYVKIAASPDNICGILSQASYPIM 336


>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
          Length = 330

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 126/204 (61%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  C SCW+F   G++EG  + K  +L  LS+Q L+DCSW YGN GC GG    +++++
Sbjct: 131 DQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSWSYGNIGCFGGLMEYAFRYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL T+  Y PY  ++  C      + A +T FV + P SEDAL  A+A  GP+SV 
Sbjct: 191 KENRGLDTRVSY-PYEARNGPCRYDPKNSAANVTDFVKI-PISEDALMKAVATVGPISVG 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           +D+   SF FY  G+YY+  C++S   LDHAVL VGYG E DG  YW VKNSW   WG  
Sbjct: 249 VDSHHHSFRFYKGGMYYEPHCSSS--NLDHAVLVVGYGEESDGNKYWMVKNSWGQGWGMN 306

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  ++NNCG+ T   Y T+
Sbjct: 307 GYIKMARDRNNNCGIATYAIYPTV 330


>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
          Length = 374

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +      L  LS+Q LIDCS  YGN GC+GG    ++Q+I
Sbjct: 173 NQGMCGSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 232

Query: 211 MKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + G+  +  Y PY  +    C        AT +G+ ++    E+ LK+A+A  GPVSV
Sbjct: 233 KDNKGIDKETAY-PYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQGPVSV 291

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA  +SF  Y NGVY++++C+  P+ LDH VL VGYG +     YW VKNSW T WG 
Sbjct: 292 AIDAGHRSFQLYTNGVYFEKECD--PENLDHGVLVVGYGTDPTQGDYWIVKNSWGTRWGE 349

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  ++NNCG+ +  ++
Sbjct: 350 QGYIRMARNRNNNCGIASHASF 371


>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
 gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
          Length = 334

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 130/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I ++G    ++  PYL  D + C+     + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDI-PQREKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHASFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
          Length = 338

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 124/200 (62%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EGA+  K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +H +  +  Y PY   D  C    +    T+T   ++   +E AL  A+A  GP+S+AI
Sbjct: 200 EEHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLA+GYG+ +GKPYW VKNSW T WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAIGYGKQEGKPYWLVKNSWGTRWGMKGY 316

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CGV +   +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335


>gi|68399197|ref|XP_695425.1| PREDICTED: cathepsin L [Danio rerio]
          Length = 349

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/226 (42%), Positives = 126/226 (55%), Gaps = 8/226 (3%)

Query: 127 VRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           +R N        + Y  KGY   + DQ  CGSCWSF TTGA+EG  Y    +L  LS+Q 
Sbjct: 125 LRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQ 184

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTG 244
           L+DCS  YG  GC G     +Y +++ + L + D Y PY   D   C      A A ++ 
Sbjct: 185 LVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY-PYTSVDTQPCFYEKNLAMAGISD 243

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           +  V   +E AL  A+A  GPVSVAIDA   SF FY +G+Y +  CN  P+ L+HAVL V
Sbjct: 244 YRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCN--PNNLNHAVLVV 301

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
           GYG  +G  YW +KNSW T WG  GY+ M I++  N CG+ +   Y
Sbjct: 302 GYGSEEGTDYWIIKNSWGTGWGEGGYMRM-IRNGKNTCGIASYALY 346


>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
 gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
 gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
          Length = 333

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/225 (43%), Positives = 135/225 (60%), Gaps = 13/225 (5%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           +IP  V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS  
Sbjct: 113 SIPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQP 172

Query: 193 YGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
            GN GC GG    ++Q+++   GL +++ Y PY G    C      + A  TGFV++ P 
Sbjct: 173 EGNRGCHGGFIDNAFQYVLDVGGLDSEESY-PYTGLVGTCLYNPNNSAANETGFVDL-PK 230

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---- 307
            E AL  A+A  GP+SVA+DA   SF FY +G+YY+  C  S + +DHAVL VGYG    
Sbjct: 231 QEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPNC--SSESVDHAVLVVGYGFEGA 288

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           + D   YW VKNSW  +WG  GY+ M+  ++N+CG+ T  +Y T+
Sbjct: 289 DSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333


>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
          Length = 334

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 130/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  +G    ++  PYL +D   C+     + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDFAFQYIKDNGGLDSEESYPYLARDTDSCNYKPEYSVANDTGFVDI-PQRERAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY +G+Y+D  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGCNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 93/202 (46%), Positives = 122/202 (60%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF  TGA+EG   ++   L  LS+Q L+DCS   GN GC GG   +++Q++
Sbjct: 141 DQGQCGSCWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYV 200

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +  L T+D Y PY   D  C  A++     +  FV+VTPN+ + LK AL K GPVSVAI
Sbjct: 201 EQTALETEDQY-PYEAVDDTCR-ASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVSVAI 257

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           +A Q  F FY  GV  D  C  +   LDH VLAVGYG   G+ Y+ VKNSW   WG +GY
Sbjct: 258 EADQMVFQFYSGGVINDASCGTT---LDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGY 314

Query: 331 VLMSIK-DNNCGVMTAPTYVTM 351
           V ++   DN CG+++  +Y  M
Sbjct: 315 VKIAASPDNICGILSQASYPIM 336


>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
          Length = 339

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 131/220 (59%), Gaps = 12/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q +CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G
Sbjct: 118 VDWRKKGYVTPVKNQGLCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWAQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++  + GL ++  Y PYL +D +C      + A  TGF+++    E  L
Sbjct: 178 CSGGLMDYAFQYVKDNGGLDSEKSY-PYLAEDGFCKYKPEYSAANDTGFLDIQ-QQEKFL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK---- 312
             A+A  GP+S  IDAS +SF FY  G+YYD  C  S   LDH VL VGYG  +GK    
Sbjct: 236 MEAVATVGPISAGIDASLESFQFYKEGIYYDPDC--SSKYLDHGVLVVGYG-FEGKDSRN 292

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  ++N+CG+ T  +Y ++
Sbjct: 293 KYWLVKNSWGEDWGMNGYIKMAKDRENHCGIATMASYPSL 332


>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
          Length = 365

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQS CGSCW+F TTGA+E AY     K   LS+Q L+DC+ G+ N GC GG   +++++I
Sbjct: 164 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYI 223

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY G +  CH     A   +   VN+T N+ED LK A+    PVSVA
Sbjct: 224 KYNGGIDTEESY-PYKGVNGVCHYKAENAVVQVLDSVNITLNAEDELKNAVGLVRPVSVA 282

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +PD ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 283 FEVIN-GFRQYKSGVYSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 341

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N C V T  +Y
Sbjct: 342 YFKMEMGKNMCAVATCASY 360


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 137/237 (57%), Gaps = 8/237 (3%)

Query: 117 GEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYM 173
           G K++K S+   + +     +P  V +  KGY   + +Q  CGSCW+F +TG++EG  + 
Sbjct: 88  GYKMAKSSVGSSFLEPENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFR 147

Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCH 232
           K  +L  +S+Q L+DCS   GN GC GG    ++ +I K+ G+ ++  Y PY   D  C 
Sbjct: 148 KTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNMGIDSEKSY-PYEAVDGECR 206

Query: 233 IANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNN 292
              + +  T +GFV++    E AL+ A+A  GPVSVAIDAS  SF FY  GVY +  C++
Sbjct: 207 YKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSS 266

Query: 293 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           +   LDH VL VGYG  +G+ YW VKNSW   WG  GY+ L     N CG+ +  +Y
Sbjct: 267 TQ--LDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLARNHGNQCGIASQASY 321


>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
          Length = 343

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/199 (44%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 143 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 202

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY GQ+  C   +      + G VN+T  +ED LK A+A   PVSVA
Sbjct: 203 KYNGGLETEEVY-PYTGQNGLCKFTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVA 261

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y  GVY    C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 262 FQVVD-DFRLYKKGVYTGTTCGSTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWGDHG 320

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 321 YFKMEMGKNMCGVATCSSY 339


>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
          Length = 338

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG    K  KL  LS+Q L+DCS   GN GC+GG+   ++++ 
Sbjct: 139 DQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLVDCSTYTGNEGCNGGDMNDAFRYW 198

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           M++G  ++ DY PY   D  C   ++     ++ FV V    ED LKL++A+ GPVSVAI
Sbjct: 199 MRNGAESESDY-PYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAI 257

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGKPYWQVKNSWSTYWGNQG 329
           DA+   F  Y  G+Y D  C  S   LDHAVL VGY  +   + YW VKNSW   WG +G
Sbjct: 258 DATSSGFMLYKKGIYQDNTC--SQQYLDHAVLVVGYDADKTRQKYWIVKNSWGEDWGQRG 315

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  K N CG+ T  +Y
Sbjct: 316 YIWMARDKGNMCGIATMASY 335


>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
 gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
 gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
 gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
          Length = 334

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  +G    ++  PYL  D   C+     + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|288764223|emb|CAQ03432.1| silcatein 1 [Spongilla lacustris]
 gi|296168747|emb|CAQ54051.1| silicatein alpha 3 [Spongilla lacustris]
          Length = 327

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 125/199 (62%), Gaps = 5/199 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  QS CGS ++F   GA+EGA  +   KL  LS+Q +IDCS  YGN+GC GG+ + ++
Sbjct: 125 SVKTQSQCGSSYAFAAVGALEGASALATDKLVALSEQNIIDCSVPYGNHGCSGGDTYTAF 184

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T+  Y PY G+ + C   +  A AT TG V +   SE  L  A+A  GPV
Sbjct: 185 KYVVDNGGIDTESSY-PYKGKQSSCQYNSKNAGATATGVVKIASGSESDLMSAVASGGPV 243

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VA+DAS  SF FY +GV+    C+N+   L+HA+L  GYG ++GK YW VKNSW T WG
Sbjct: 244 AVAVDASVNSFMFYQSGVFDSSTCSNTK--LNHAMLVTGYGSVNGKDYWLVKNSWGTSWG 301

Query: 327 NQGYVLM-SIKDNNCGVMT 344
             GY+ M   K N CG+ +
Sbjct: 302 ESGYIRMVRNKYNQCGIAS 320


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 161 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 220

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  ++      +   VN+T  +ED L+ A+    PVSVA
Sbjct: 221 KYNGGLDTEEAY-PYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVRPVSVA 279

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F FY +GVY   KC N+P  ++HAV+AVGYG  DG PYW +KNSW   WG+ G
Sbjct: 280 FEVVD-GFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGDHG 338

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  + +  N CG+ T  +Y
Sbjct: 339 YFKIKMGKNMCGIATCASY 357


>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
 gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
          Length = 328

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 87/217 (40%), Positives = 128/217 (58%), Gaps = 5/217 (2%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + +Q  CGSCWSF TTGAVEG   +  + L  LS+Q L+DCS  YGN
Sbjct: 116 AAEVDWRNSAVSEVKNQGQCGSCWSFSTTGAVEGQLAISGRGLTSLSEQNLVDCSSAYGN 175

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GC+GG    ++ +I  +G+ ++  Y PY   +  C    + +  ++ G+ ++    E+A
Sbjct: 176 AGCNGGWMDSAFDYIHDNGIMSESAY-PYTASEGSCRFNPSESVTSLQGYYDLPSGDENA 234

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           LK A+A +GP++VA+DA+ +   FY  GV YD  C  S   L+H VL VGYG   G+ YW
Sbjct: 235 LKSAVANNGPIAVALDATDE-LQFYSGGVLYDTTC--SAQALNHGVLVVGYGSEGGQDYW 291

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            VKNSW + WG QGY   +  ++NNCG+ TA +Y  +
Sbjct: 292 IVKNSWGSGWGEQGYWRQARNRNNNCGIATAASYPAL 328


>gi|94733563|emb|CAK11015.1| novel protein similar to vertebrate cathepsin L (CTSL) [Danio
           rerio]
          Length = 334

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 96/226 (42%), Positives = 126/226 (55%), Gaps = 8/226 (3%)

Query: 127 VRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           +R N        + Y  KGY   + DQ  CGSCWSF TTGA+EG  Y    +L  LS+Q 
Sbjct: 110 LRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQ 169

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTG 244
           L+DCS  YG  GC G     +Y +++ + L + D Y PY   D   C      A A ++ 
Sbjct: 170 LVDCSRSYGTYGCSGAWMANAYDYVINNALESSDTY-PYTSVDTQPCFYEKNLAMAGISD 228

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           +  V   +E AL  A+A  GPVSVAIDA   SF FY +G+Y +  CN  P+ L+HAVL V
Sbjct: 229 YRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCN--PNNLNHAVLVV 286

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
           GYG  +G  YW +KNSW T WG  GY+ M I++  N CG+ +   Y
Sbjct: 287 GYGSEEGTDYWIIKNSWGTGWGEGGYMRM-IRNGKNTCGIASYALY 331


>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
          Length = 330

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 130/214 (60%), Gaps = 8/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +   GY   + +Q  CGSCW+F   G++EG  + K  KL  LS+Q L+DCSW +GN G
Sbjct: 118 VDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG    ++Q++  + GL T   Y PY   +  C      + A + GF+++ P SE+AL
Sbjct: 178 CDGGLPDFAFQYVKDNGGLDTSVSY-PYEALNGTCRYNPKYSAAKVVGFMSIPP-SENAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
             A+A  GP+SV ID   KSF FY  G+YY+  C+++   L+HAVL VGYG E DG+ YW
Sbjct: 236 MKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSST--NLNHAVLVVGYGEESDGRKYW 293

Query: 316 QVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
            VKNSW   WG  GY+ M+   +NNCG+ +  +Y
Sbjct: 294 LVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASY 327


>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
          Length = 218

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 89/198 (44%), Positives = 119/198 (60%), Gaps = 5/198 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCW+F  TGA+EG    K  KL  LS+Q L+DCS   GN GC+GG    +++
Sbjct: 22  IKDQGDCGSCWAFSATGALEGQLKRKKGKLISLSEQQLVDCSTDMGNEGCNGGYMNDAFR 81

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           + M++G  ++ DY PY   D  C   ++     ++ FV V    ED LKL++A+ GPVSV
Sbjct: 82  YWMQNGAESESDY-PYTAMDGKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSV 140

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGKPYWQVKNSWSTYWGN 327
           AIDA+   F  Y  G+Y D  C  S   LDHAVL VGY  ++ G+ YW VKNSW   WG 
Sbjct: 141 AIDAASSGFMLYKKGIYQDNTC--SQQYLDHAVLVVGYDADMAGQKYWIVKNSWGEDWGQ 198

Query: 328 QGYVLMSI-KDNNCGVMT 344
           +GY+ M+  K N CG+ T
Sbjct: 199 RGYIWMARDKGNMCGIAT 216


>gi|410960470|ref|XP_003986812.1| PREDICTED: pro-cathepsin H [Felis catus]
          Length = 321

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/210 (41%), Positives = 125/210 (59%), Gaps = 3/210 (1%)

Query: 140 RYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           R + K  + + +Q  CGSCW+F TTGA+E A  +K  KL  L++Q L+DC+  + N+GC 
Sbjct: 109 RTKGKYVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQLVDCAQNFNNHGCQ 168

Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
           GG   +++++I    G+  +D Y PY GQD  C    + A A +    N+T N E+A+  
Sbjct: 169 GGLPSQAFEYIRYNKGIMGEDTY-PYKGQDGDCKFQPSKAIAFVKDVANITINDEEAMVE 227

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
           A+A + PVS A + +   F  Y  GVY    C+ +PD ++HAVLAVGYGE DG PYW VK
Sbjct: 228 AVALYNPVSFAFEVTD-DFMMYRKGVYSSTSCHKTPDKVNHAVLAVGYGEKDGIPYWIVK 286

Query: 319 NSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           NSW   WG +GY L+    N CG+    +Y
Sbjct: 287 NSWGPQWGMKGYFLIERGKNMCGLAACASY 316


>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
           cathepsin; Flags: Precursor
 gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
 gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
 gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|225484|prf||1304284A cathepsin,prestalk
          Length = 376

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 102/241 (42%), Positives = 134/241 (55%), Gaps = 46/241 (19%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTG+ EGA+ +K KKL  LS+Q L+DCS    N GCDGG    ++ 
Sbjct: 138 IKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFD 197

Query: 209 WIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           +I+K+ G+ T+  Y PY  +  + C    +   AT+ G+VN+T  SE +L+   A+HGPV
Sbjct: 198 YIIKNKGIDTESSY-PYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENG-AQHGPV 255

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP------------- 313
           SVAIDAS  SF  Y +G+YY+ KC  SP  LDH VL VGYG + GK              
Sbjct: 256 SVAIDASHNSFQLYTSGIYYEPKC--SPTELDHGVLVVGYG-VQGKDDEGPVLNRKQTIV 312

Query: 314 -------------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
                                    YW VKNSW T WG +GY+LMS  + NNCG+ +  +
Sbjct: 313 IHKNEDNKVESSDDSSDSVRPKANNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372

Query: 348 Y 348
           Y
Sbjct: 373 Y 373


>gi|403302732|ref|XP_003942007.1| PREDICTED: cathepsin S isoform 2 [Saimiri boliviensis boliviensis]
          Length = 289

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 122/206 (59%), Gaps = 6/206 (2%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           G N L D   CG+CW+F   GA+E    +K  KL  LS Q L+DCS  YGN GC+GG   
Sbjct: 85  GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMT 144

Query: 205 RSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            ++Q+I+ + G+ ++  Y PY   D  C   +    AT + +  +    ED LK A+A  
Sbjct: 145 EAFQYIIDNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 203

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           GPV V +DAS  SF  Y +GVYYD  C      ++H VL +GYG+L+GK YW VKNSW +
Sbjct: 204 GPVCVGVDASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGS 260

Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
            +G QGY+ M+  K N+CG+ + P+Y
Sbjct: 261 NFGEQGYIRMARNKGNHCGIASYPSY 286


>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 118/192 (61%), Gaps = 3/192 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY+G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEEAY-PYIGKDGTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCG 341
           Y  M +  N CG
Sbjct: 336 YFKMEMGKNMCG 347


>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 132/221 (59%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q+I  + GL +++ Y PY   D   C      + A  TGFV++ P  E A
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESY-PYTAMDKQDCRYNPKYSAANDTGFVDIPP-QEKA 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GP+SVA+DA  +SF FY +G+YYD  C  S   L+H VL VGYG    +   
Sbjct: 236 LMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNC--SSKDLNHGVLVVGYGFEGIDSAN 293

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             YW VKNSW T WG  GY+ M+  ++N+CG+ TA +Y T+
Sbjct: 294 NRYWLVKNSWGTGWGTDGYIKMAKDRNNHCGIATAASYPTV 334


>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
          Length = 342

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 121/200 (60%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 144 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 203

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK A+A  GPVSV 
Sbjct: 204 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVYYD  C  +   ++H VL VGYG LDGK YW VKNSW  ++G+QG
Sbjct: 263 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 319

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+ + P+Y
Sbjct: 320 YIRMARNSGNHCGIASYPSY 339


>gi|339252572|ref|XP_003371509.1| cathepsin L1 [Trichinella spiralis]
 gi|316968239|gb|EFV52542.1| cathepsin L1 [Trichinella spiralis]
          Length = 448

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 129/238 (54%), Gaps = 43/238 (18%)

Query: 151 DQSVCGSCWSFGT---------------TGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           DQ  CGSCW+F                 TGA+EG    K  KL  LS+Q LIDCS  YGN
Sbjct: 211 DQGHCGSCWAFSAVNSNALHVHSRAFQQTGALEGQNKRKTGKLVSLSEQNLIDCSRKYGN 270

Query: 196 NGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQ----DAYCHIANTTATATMTGFVNVTP 250
            GC GG    +++++ + HG+ T++ Y PY       D  C   N+T  AT  GFV++ P
Sbjct: 271 KGCSGGLMDNAFEYVKENHGIDTEESY-PYEAAVRMLDKKCRFKNSTIGATDKGFVDIEP 329

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNG-------------------VYYDEKCN 291
            +E  L  A+A  GP+SVAIDAS +SF FY +G                   VY++  C 
Sbjct: 330 GNETYLMHAVATIGPLSVAIDASHESFQFYSSGMLLMVDIFNTVEVMWTNLGVYFEPMC- 388

Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            S   LDH VL VGYG L GK YW VKNSW T WGN GY+ M+  K+N+CG+ +  +Y
Sbjct: 389 -SSQFLDHGVLVVGYGSLKGKDYWIVKNSWGTSWGNDGYIFMARNKNNSCGIASFASY 445


>gi|426216526|ref|XP_004002513.1| PREDICTED: cathepsin S isoform 2 [Ovis aries]
          Length = 281

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 94/211 (44%), Positives = 125/211 (59%), Gaps = 8/211 (3%)

Query: 141 YEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCD 199
           YE+ G N L D   CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+
Sbjct: 73  YEL-GMNHLGDMGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCN 131

Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
           GG    ++Q+I+  +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK 
Sbjct: 132 GGFMTEAFQYIIDNNGIDSEASY-PYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALKE 190

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVK 318
           A+A  GPVSV IDA Q SF  Y  GVYYD  C  +   ++H VL VGYG L+GK YW VK
Sbjct: 191 AVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGSLNGKDYWLVK 247

Query: 319 NSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
           NSW   +G+QGY+ M+    N+CG+   P+Y
Sbjct: 248 NSWGLNFGDQGYIRMARNSGNHCGIANFPSY 278


>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 333

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 129/220 (58%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCSW  GN G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWREGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  +G    ++  PY   D   C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPP-QEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA Q SF FY +G+Y+D  C  +   ++H VLAVGYG    + D  
Sbjct: 237 MKAVATVGPISVAIDAGQVSFQFYSSGIYFDPACRLT---VNHGVLAVGYGFEGTDPDKN 293

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ +   ++N+CG+  A +Y T+
Sbjct: 294 KYWLVKNSWGKSWGADGYIKIAKDRNNHCGIARAASYPTV 333


>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
          Length = 344

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 90/202 (44%), Positives = 121/202 (59%), Gaps = 4/202 (1%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTGA+EG ++ +   L  LS+Q LIDCS  YGNNGC+GG    +++
Sbjct: 142 IKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFK 201

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +I  +G    +   PY G D  C        A   GFV++    E  L  A+A  GPVSV
Sbjct: 202 YIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSV 261

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDAS   F  Y +GVY +E+C+++   LDH VL VGYG +  G  YW VKNSW   WG 
Sbjct: 262 AIDASHTHFQLYSSGVYNEEECSSTD--LDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGE 319

Query: 328 QGYV-LMSIKDNNCGVMTAPTY 348
            GY+ ++  K+N CG+ ++ +Y
Sbjct: 320 LGYIKMIRNKNNRCGIASSASY 341


>gi|334314327|ref|XP_001368532.2| PREDICTED: cathepsin H-like [Monodelphis domestica]
          Length = 344

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 134/232 (57%), Gaps = 13/232 (5%)

Query: 118 EKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKK 177
            +V  Y  ++ + K      PV+          +Q  CGSCW+F TTG +E A  +   K
Sbjct: 120 RRVGPYPDFMDWRKKGNYVSPVK----------NQGGCGSCWTFSTTGGLESAVAIATGK 169

Query: 178 LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANT 236
           L  L++Q L+DC+  + N+GC+GG   +++++IM  +G+  +D Y PY G+D  C     
Sbjct: 170 LLSLAEQQLVDCAQAFNNHGCNGGLPSQAFEYIMYNNGIMGEDTY-PYEGKDGTCRFKPD 228

Query: 237 TATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
            A A +   VN+T   E+A+  A+A H PVS A + ++  F  Y +G+Y + +C+ SPD 
Sbjct: 229 KAIAFVKDVVNITIYDEEAMTEAVAHHNPVSFAFEVTE-DFMSYRDGIYSNPRCDKSPDK 287

Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           ++HAVLAVGYG+ +G  YW VKNSW T WGN GY L+    N CG+    +Y
Sbjct: 288 VNHAVLAVGYGKNNGILYWIVKNSWGTSWGNNGYFLIERGKNMCGLADCASY 339


>gi|148688953|gb|EDL20900.1| cathepsin H, isoform CRA_a [Mus musculus]
          Length = 291

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 80/197 (40%), Positives = 121/197 (61%), Gaps = 1/197 (0%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I+
Sbjct: 95  QGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYIL 154

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            +    ++D  PY+G+D+ C      A A +   VN+T N E A+  A+A + PVS A +
Sbjct: 155 YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFE 214

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
            ++  F  Y +GVY  + C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WG  GY 
Sbjct: 215 VTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYF 273

Query: 332 LMSIKDNNCGVMTAPTY 348
           L+    N CG+    +Y
Sbjct: 274 LIERGKNMCGLAACASY 290


>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
 gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
          Length = 333

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 134/221 (60%), Gaps = 13/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    +++++  + GL +++ Y PYLG+D   C+     + A  TGFV++ P  E A
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESY-PYLGRDTETCNYKPECSAANDTGFVDL-PQREKA 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG---- 311
           L  A+A  GP+SVAIDA  +SF FY +G+Y+D  C  S   LDH VL VGYG  +G    
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDC--SSKDLDHGVLVVGYG-FEGTDSN 292

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
             +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333


>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
 gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
           Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
 gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
 gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
 gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
 gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
          Length = 333

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 80/198 (40%), Positives = 122/198 (61%), Gaps = 1/198 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYI 191

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +    ++D  PY+G+D+ C      A A +   VN+T N E A+  A+A + PVS A 
Sbjct: 192 LYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           + ++  F  Y +GVY  + C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WG  GY
Sbjct: 252 EVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGY 310

Query: 331 VLMSIKDNNCGVMTAPTY 348
            L+    N CG+    +Y
Sbjct: 311 FLIERGKNMCGLAACASY 328


>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
          Length = 278

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 88/193 (45%), Positives = 122/193 (63%), Gaps = 5/193 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG ++ K   L  LS+Q L+DCS   GN G
Sbjct: 89  VDWRTKGYVTPVKNQKQCGSCWAFSATGSLEGQHFKKTGTLVSLSEQNLVDCSKKEGNEG 148

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   +++++I ++ G+ T+  Y PY   D  C  +     AT TG+ ++   SE  L
Sbjct: 149 CEGGLMDQAFEYIKRNKGIDTEQSY-PYRAVDEKCRFSRADVGATDTGYTDIHKGSEKDL 207

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GP+SVAIDAS+ SF  Y +GVYY+ KC+++   LDH VLAVGYG  D K YW 
Sbjct: 208 QSAVATVGPISVAIDASRDSFQLYKSGVYYEPKCSSTM--LDHGVLAVGYGTTDSKDYWI 265

Query: 317 VKNSWSTYWGNQG 329
           VKNSW T WG +G
Sbjct: 266 VKNSWGTQWGMKG 278


>gi|358334194|dbj|GAA34712.2| cathepsin L [Clonorchis sinensis]
          Length = 401

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 88/185 (47%), Positives = 117/185 (63%), Gaps = 9/185 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TGA+EG ++M  K+L  LS+Q L+DCS  +GN GC GG    +++++
Sbjct: 182 DQGQCGSCWAFSATGAIEGQHFMATKQLVSLSEQQLVDCSSHFGNFGCSGGWMDNAFKYV 241

Query: 211 MK-HGLPTQDDYGPYLGQDA-----YCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
              HG+ T+  Y PY+  +       C        AT+TG V++  ++E ALK A+  HG
Sbjct: 242 KHTHGITTETKY-PYISGETGTPNPRCEFHGQAIAATVTGIVDLPRSNEFALKQAVGLHG 300

Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
           P+SVAI AS +SF  Y +GVY DE+C  S D LDHAVL VGYGE +G PYW +KNSW   
Sbjct: 301 PISVAIHASLESFMGYKSGVYSDEEC--SSDQLDHAVLVVGYGEENGIPYWLIKNSWGFD 358

Query: 325 WGNQG 329
           WG  G
Sbjct: 359 WGEMG 363


>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
 gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
          Length = 334

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q+I  +G    ++  PYL  D   C+     + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA   SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDC--SCKDLDHGVLVVGYGFEGTDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
          Length = 318

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 7/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EGA+++KH  L  LS+Q L+DCS    N+GC+GG    +Y +I
Sbjct: 121 DQGQCGSCWAFSTTGALEGAHFLKHGDLVSLSEQNLVDCS--TENSGCNGGVVQWAYDYI 178

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              +G+ T+  Y PY  QD  C        AT+TG+ ++    E     A+   GPVSV 
Sbjct: 179 KSNNGIDTESSY-PYEAQDLTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGPVSVC 237

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA   SF  Y +GVYY+  CN  P  ++HAVL VGYG  +G  YW +KNSW T WG  G
Sbjct: 238 IDAGHNSFQLYSSGVYYEPNCN--PSSINHAVLPVGYGTEEGSDYWLIKNSWGTGWGLSG 295

Query: 330 YV-LMSIKDNNCGVMTAPTY 348
           Y+ L   K N+CGV T   Y
Sbjct: 296 YMKLTRNKSNHCGVATQSCY 315


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 88/201 (43%), Positives = 120/201 (59%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG+ EGAY  K  KL  LS+Q LIDC     + GCDGG    +++++
Sbjct: 129 DQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCCTDT-SAGCDGGSLDDNFKYV 187

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           MK GL +++ Y  Y G+D  C     +    ++ + ++    EDAL  A+A  GPVSV +
Sbjct: 188 MKDGLQSEESY-TYKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGM 246

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS    S Y +G+Y D+ C  SP GL+HA+LAVGYG  +GK YW +KNSW   WG QGY
Sbjct: 247 DASY--LSSYDSGIYEDQDC--SPAGLNHAILAVGYGTENGKDYWIIKNSWGASWGEQGY 302

Query: 331 VLMSIKDNNCGVMTAPTYVTM 351
             ++   N CG+     Y T+
Sbjct: 303 FRLARGKNQCGISEDTVYPTI 323


>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
          Length = 588

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++ ++ GL ++  Y PY+ +D  C      + A  TGFV V P  E  L
Sbjct: 178 CNGGFMNNAFQYVKENGGLDSEASY-PYVAKDGSCKYKPENSVANDTGFV-VIPAHEKEL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DAS  SF FY +G+Y+++ C  S   LDH VL VGYG      +  
Sbjct: 236 MKAVATVGPISVAVDASHSSFQFYKSGIYFEQDC--SSKNLDHGVLVVGYGFEGTNSNNN 293

Query: 313 PYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            YW +KNSW   WG+ GY+ +   ++N+CG+ TA +Y
Sbjct: 294 NYWLIKNSWGPEWGSNGYIKIAKDRNNHCGIATAASY 330


>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
 gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
          Length = 333

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 121/201 (60%), Gaps = 8/201 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++EG ++ K  +L  LS+Q LIDCS   GNNGC  G    ++++I
Sbjct: 135 DQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS--PGNNGCKNGAVEYAFRYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY      C     T  AT TGFV + P  E  L  A+A  GP+SV 
Sbjct: 193 QSNKGIDTEISY-PYEAAQNQCRFRRDTIGATSTGFVKLNPGDEMELAQAVATVGPISVL 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQ 328
           I++S  SF FY +GVY D  CN  P+ L HAVL VGYG  D G  +W VKNSWST+WG Q
Sbjct: 252 INSSLDSFKFYHDGVYNDPSCN--PNKLTHAVLVVGYGTDDRGGDFWLVKNSWSTHWGEQ 309

Query: 329 GYVLMSIKDNN-CGVMTAPTY 348
           GYV +    NN CG+ +   Y
Sbjct: 310 GYVKIKRNANNLCGIASNALY 330


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQS CGSCW+F TTGA+E AY     K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 162 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 221

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  CH     A   +   VN+T N+ED L+ A+    PVSVA
Sbjct: 222 KYNGGLDTEESY-PYKGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVRPVSVA 280

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +PD ++HAVLAVGYG  +G PYW +KNSW   WG++G
Sbjct: 281 FEVI-NGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLIKNSWGESWGDKG 339

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M    N C V T  +Y
Sbjct: 340 YFKMERGKNMCAVATCASY 358


>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
          Length = 330

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 98/235 (41%), Positives = 137/235 (58%), Gaps = 17/235 (7%)

Query: 130 NKASKDAI------PVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           NK  K+ I       + +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS
Sbjct: 100 NKKGKETIFASIPPSMDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLS 159

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DCS   GN GC GG    ++Q+++   GL +++ Y PY G    C      + A 
Sbjct: 160 EQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESY-PYTGLVGTCLYNPNNSAAN 218

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            TGFV++ P  E AL  A+A  GP+SVA+DA   SF FY +G+YY+  C  S + +DHAV
Sbjct: 219 ETGFVDL-PKQEKALMKAVATLGPISVAVDAHNPSFQFYKSGIYYEPNC--SSESVDHAV 275

Query: 302 LAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           L VGYG    + D   YW VKNSW  +WG  GY+ M+  ++N+CG+ T  +Y T+
Sbjct: 276 LVVGYGFEGADSDDNKYWLVKNSWGEHWGMDGYIKMAKDRNNHCGIATMASYPTV 330


>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
 gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
          Length = 358

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 158 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  ++      +   VN+T  +ED LK A+    PVSVA
Sbjct: 218 KYNGGLETEEAY-PYTGEDGACKFSSENVGIQVLDSVNITLGAEDELKEAVGLVRPVSVA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F FY +GVY  + C ++P  ++HAVLAVGYG  DG PYW VKNSW   WG+ G
Sbjct: 277 FEVVS-GFRFYKSGVYTSDTCGSTPMDVNHAVLAVGYGVEDGVPYWLVKNSWGENWGDHG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 336 YFKMEMGKNMCGVATCASY 354


>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
 gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
          Length = 333

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 90/223 (40%), Positives = 129/223 (57%), Gaps = 4/223 (1%)

Query: 128 RYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           +Y   S+    V +  KGY   + DQ  CGSC++F  TGA+EG ++ K  KL  LS+Q +
Sbjct: 110 KYTNLSEYPRQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNI 169

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           +DCS+  GN GC GG   +S+ +I  +    +++  PY  +D  C    +   AT  G+V
Sbjct: 170 VDCSFKEGNKGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVGATDRGYV 229

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           ++  N E AL+ A+A  GP+SVAID    +F FY +GV+ +  C  S   ++H VL VGY
Sbjct: 230 DLPENDETALRHAVATIGPISVAIDGHHFNFRFYDHGVFDNPNC--SKTKINHGVLVVGY 287

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           G  +G  YW VKNSW   WG +GY+LMS   DN C +  A +Y
Sbjct: 288 GTRNGLDYWMVKNSWGRGWGAKGYILMSRNNDNQCCIACAASY 330


>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
 gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
          Length = 320

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 120/199 (60%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   GA+EG   +K  K+  LS Q L+DCS  YGN GC GG   R++Q++
Sbjct: 122 NQGGCGSCWAFSAVGALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYV 181

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D Y PY   D  CH       ++   +  + P +ED LK AL   GP+SVAI
Sbjct: 182 IDNNGIDSDTYYPYHAMDEKCHYELAGKASSCVKYREIVPGTEDNLKQALGNIGPISVAI 241

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D ++ +F  Y +GVY D  C+     ++H VLAVGYG L+G+ +W +KNSW T +G+QGY
Sbjct: 242 DGTRPTFFLYKSGVYSDPSCSQE---VNHGVLAVGYGTLNGQDFWLLKNSWGTKYGDQGY 298

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           V ++  K+N CGV +  +Y
Sbjct: 299 VRIARNKENLCGVASYTSY 317


>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
          Length = 345

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 126/205 (61%), Gaps = 6/205 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS-WGYGNNGCDGGEDFRSYQW 209
           +Q  CGSCW+F +TGA+EG  + + ++L  LS+Q L+DC+   YGNNGC+GG+   ++Q+
Sbjct: 143 NQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQY 202

Query: 210 IMKHG-LPTQDDYGPYLGQDAYCHIANT--TATATMTGFVNVTPNSEDALKLALAKHGPV 266
           +   G L T+  Y    G +  C  +N+      ++ G   V P +E  L+ A+A  GP+
Sbjct: 203 VQDAGGLDTEARYPYRQGTNFQCQFSNSFEARRVSVNGHTRVPPRNERVLQDAVANVGPI 262

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           S+AI+AS ++F FY NG+Y +  C+  P GL+HAVL VGYGE  G PYW VKNSW   WG
Sbjct: 263 SIAINASPQTFMFYKNGIYGEPNCD--PRGLNHAVLLVGYGEERGVPYWIVKNSWGPGWG 320

Query: 327 NQGYVLMSIKDNNCGVMTAPTYVTM 351
             GY+ +    N CG+   P++  +
Sbjct: 321 EGGYIKILRNRNVCGMSQDPSFPNL 345


>gi|323454730|gb|EGB10599.1| hypothetical protein AURANDRAFT_4398, partial [Aureococcus
           anophagefferens]
          Length = 221

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 128/218 (58%), Gaps = 10/218 (4%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           ++ G   + DQ  CGSCW+FGT  A+EGA ++K      LS+QAL+DCSW  GNNGCDGG
Sbjct: 6   KVGGVAPVKDQGTCGSCWAFGTVAALEGALFVKTGDQLTLSEQALVDCSWPEGNNGCDGG 65

Query: 202 EDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATAT----MTGFVNVTPNSEDAL 256
           ED  +Y+W++ H G+P+  +YGPYL  D  CH+      A     +  +  V  ++  AL
Sbjct: 66  EDSMAYEWMLHHQGVPSTAEYGPYLNADGLCHVKAANPKAMPGLEIKAWTRVN-DTRAAL 124

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVY--YDEKCNNSPDGLDHAVLAVGYG-ELDGKP 313
           K AL+  G + V + A+  SF FY +G+Y     +C+      DH V  VGYG E  G  
Sbjct: 125 KDALSTRGTLVVRVQATPLSFYFYGSGLYDVPHAECSRDAADSDHIVALVGYGYEPSGLG 184

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           YW ++NSWS +WG  GY+ + + DN CGV    TY T+
Sbjct: 185 YWILRNSWSVHWGEGGYMKIGM-DNVCGVENTATYPTV 221


>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
          Length = 278

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 92/217 (42%), Positives = 136/217 (62%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 63  VDWRKKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGQLVSLSEQNLVDCSQPQGNQG 122

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    +++++ ++ GL ++  Y PY G+D  C      + A  TGFV++ P  E AL
Sbjct: 123 CNGGLMDFAFEYVKENKGLESEKSY-PYEGKDGSCRYKPELSAANDTGFVDI-PQREKAL 180

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGK-- 312
             A+A+ GP+SVA+DA   SF FY +G+Y+D +C  S   L+H VL VGYG  E+D +  
Sbjct: 181 MKAVAEKGPISVAVDAGLMSFQFYKDGIYFDPEC--SSKDLNHGVLVVGYGYEEVDTEKN 238

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG +GY+ ++  ++N+CG+ TA +Y
Sbjct: 239 EYWLVKNSWGPEWGAEGYIKIARNRNNHCGIATAASY 275


>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
 gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
 gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
 gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
          Length = 331

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK A+A  GPVSV 
Sbjct: 193 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVYYD  C  +   ++H VL VGYG LDGK YW VKNSW  ++G+QG
Sbjct: 252 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 308

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+   P+Y
Sbjct: 309 YIRMARNSGNHCGIANYPSY 328


>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
          Length = 358

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 116/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY    +K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 158 DQGHCGSCWTFSTTGALEAAYVQAFRKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY+G D  C  +       +   VN+T   E  LK A+A   PVSVA
Sbjct: 218 KYNGGLDTEAAY-PYVGTDGACKFSAENVGVQVLDSVNITLGDEQELKHAVAFVRPVSVA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                KSF  Y +GVY  + C +SP  ++HAVLAVGYGE  G P+W +KNSW   WG+ G
Sbjct: 277 FQVV-KSFRIYKSGVYTSDTCGSSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M    N CGV T  +Y
Sbjct: 336 YFKMEFGKNMCGVATCASY 354


>gi|440293514|gb|ELP86618.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
          Length = 319

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 124/204 (60%), Gaps = 7/204 (3%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGA--YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           + DQ+ CG C+SFG   ++EG   +     K   LS++  I CS    NNGC+GG  ++ 
Sbjct: 116 IRDQNPCGGCYSFGALASLEGRLLFIKGDPKTLDLSEEQAIRCSE---NNGCNGGFGYKV 172

Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Y +I ++G+  + DY P+   D  C +  +   AT++G+  V   + DALK AL   G V
Sbjct: 173 YDYIKENGIVQEKDY-PFTATDGECKVDLSKKFATVSGYTKVAKRNNDALKSALVD-GVV 230

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
            VAIDAS   F  Y  GVY+D KC+++ + L+H + AVGYG L+G+ YW V+NSW   WG
Sbjct: 231 DVAIDASSVKFQLYKTGVYHDTKCSSAINRLNHEISAVGYGNLNGEEYWIVRNSWGRSWG 290

Query: 327 NQGYVLMSIKDNNCGVMTAPTYVT 350
           ++GY+LM+I+ N CGV T P Y T
Sbjct: 291 DKGYILMAIEGNTCGVTTDPVYPT 314


>gi|351700981|gb|EHB03900.1| Cathepsin H [Heterocephalus glaber]
          Length = 334

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 124/203 (61%), Gaps = 3/203 (1%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           +++ +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   ++
Sbjct: 129 SAVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 188

Query: 207 YQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
           +++I+   G+  +D Y PY G+D +C      A A +   VN+T N E+A+  A+A + P
Sbjct: 189 FEYILYNKGIMGEDTY-PYEGKDGHCRFQPQKAIAFVKDIVNITLNDEEAMVEAVALYNP 247

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VS A + ++  F  Y  G+Y    C+ +PD ++HAVLAVGYG   G PYW VKNSW T W
Sbjct: 248 VSFAYEVTE-DFMSYKRGIYSSTSCHKTPDKVNHAVLAVGYGVDHGVPYWIVKNSWGTQW 306

Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
           GN GY L+    N CG+    +Y
Sbjct: 307 GNNGYFLIERGKNMCGLAACASY 329


>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
           occidentalis]
          Length = 642

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 94/215 (43%), Positives = 130/215 (60%), Gaps = 8/215 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + +  +GY   + +Q  CGSCW+F  TGAVEG ++    +L  LS+Q L+DC     + G
Sbjct: 432 IDWRQQGYVTPVKNQGNCGSCWAFSATGAVEGQHFKATGRLESLSEQNLVDCV--KESKG 489

Query: 198 CDGGEDFRSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG   +++Q+I  +G + T+D Y PY   D  C     +  AT++G+  +   SE  L
Sbjct: 490 CDGGFFEQAFQYIKDNGGINTEDSY-PYEAFDGSCRFREDSIGATVSGYQTIPKGSEADL 548

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A++  GP+SVAID S  SF  Y  GVYY+  C++S   LDHAVL VGYG   G+ YW 
Sbjct: 549 QKAVSTIGPISVAIDVSNPSFQNYREGVYYEPSCSSS--NLDHAVLVVGYGSDGGEDYWL 606

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           VKNSW T +G QGYV M+  K NNCG+ +A  Y T
Sbjct: 607 VKNSWGTSFGEQGYVRMARNKGNNCGIASAAAYPT 641



 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 79/216 (36%), Positives = 121/216 (56%), Gaps = 6/216 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CG+CW+F  TGA+EG ++     L  LS+Q ++DC     +NG
Sbjct: 110 VDWTQQGYVTPVKDQGKCGACWTFAATGAIEGQHFKATGNLVSLSEQNILDCVKTATSNG 169

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    ++ ++   G    ++  PY      C     +  AT++G+  ++  +E  L+
Sbjct: 170 CSGGLFVEAFDYLKNSGGIDAEESYPYEASGGTCRFRQDSVAATVSGYQAISAGNEAELQ 229

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GP+SV ID+    F  Y  G+YY+ +C    + L HAVL VGYG  +G+ YW V
Sbjct: 230 EAVATIGPISVGIDSGHPGFQHYTGGIYYEPECT---EHLSHAVLVVGYGTENGEDYWLV 286

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY-VTM 351
           KNSW   +G QGY+ M+  ++NNCG+ T   Y +TM
Sbjct: 287 KNSWGASYGLQGYIKMARNRNNNCGIATGAAYPITM 322


>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
          Length = 333

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           CDGG    ++Q++ ++ GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CDGGLMDYAFQYVQENGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKLEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++ +C  S + +DH VL VGYG      D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGIYFEPEC--SSEDMDHGVLVVGYGFERTGSDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYPTV 333


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E+AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEEALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 329

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 130/225 (57%), Gaps = 4/225 (1%)

Query: 127 VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           VR   A  D++  R +      + +Q  CG+CW+F  TGA+EG +++    L  LS+Q L
Sbjct: 105 VRSTFALADSVDWR-KSGAVTGVKNQGKCGACWAFSATGALEGQHFINTGTLISLSEQQL 163

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           +DCS  +GNNGC GG    +++++        ++  PYL +   C   ++ A    T + 
Sbjct: 164 MDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYPYLAEVGTCRYNSSEAKVKNTVYK 223

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
           ++    EDAL+ A+A  GP+SV+I++   SF  Y  GVYY+  C++S   LDH VL +GY
Sbjct: 224 DIPEGDEDALQEAVATIGPISVSINSEHSSFQLYDQGVYYEPTCSSSK--LDHGVLVIGY 281

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           G  D   YW VKNSW T WG  GY++MS  K+NNCG+ T  +Y T
Sbjct: 282 GTSDNNDYWLVKNSWGTNWGMDGYIMMSRNKENNCGIATRASYPT 326


>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
          Length = 340

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 201 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 318 YIRMARNNKNHCGIASDCSY 337


>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
 gi|227966|prf||1714236A cathepsin S
          Length = 217

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 19  QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 78

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK A+A  GPVSV 
Sbjct: 79  IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 137

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y  GVYYD  C  +   ++H VL VGYG LDGK YW VKNSW  ++G+QG
Sbjct: 138 IDASHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 194

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+   P+Y
Sbjct: 195 YIRMARNSGNHCGIANYPSY 214


>gi|256535831|gb|ACU82390.1| cathepsin L 2 [Pheronema raphanus]
          Length = 320

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 124/214 (57%), Gaps = 3/214 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F   G++EG Y+        LS+Q L+DCS   GN G
Sbjct: 109 VDWRTKGYVTKVKDQKQCGSCWAFSAAGSLEGQYFTGQTNY-FLSEQQLVDCSRKEGNEG 167

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CDGG+   ++++  K+G   +  Y  YLG+DA C      A A +TG+  +     ++LK
Sbjct: 168 CDGGDMVLAFKYYEKYGAMNESVY-SYLGRDAKCKFDENKAVAKVTGYKKIPKMDCNSLK 226

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A A  GP+SVA+DAS  SF  Y +G+Y  ++C      LDH VL VGYG   G+ +W +
Sbjct: 227 NAAAVTGPISVAMDASHNSFQIYSSGIYDPKRCGKRNRQLDHGVLVVGYGTESGEDFWLI 286

Query: 318 KNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           KNSW   WG QGY  +++  + CG+ T+ ++ T+
Sbjct: 287 KNSWGEDWGMQGYFKIAVGKDECGICTSASFPTV 320


>gi|1498185|dbj|BAA06738.1| cysteine proteinase-1 precursor [Drosophila melanogaster]
          Length = 254

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 128/214 (59%), Gaps = 7/214 (3%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG   ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNG
Sbjct: 41  VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 100

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I  + G+ T+  Y PY   D  CH       AT  GF ++    E  +
Sbjct: 101 CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 159

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
              +   GPVSVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW
Sbjct: 160 PEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 217

Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            VKNSW T WG++G++ ++  K+N CG+ +  +Y
Sbjct: 218 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 251


>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
          Length = 327

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 3/224 (1%)

Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           ++R +    +A+  R +      + +Q  CGSCW+F TTG +E A  +   KL  L++Q 
Sbjct: 101 FLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQ 160

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DC+  + N+GC GG   +++++I+   GL  +D Y PY  Q+  C      A A +  
Sbjct: 161 LVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAY-PYRAQNGTCKFQPDKAIAFVKD 219

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
            +N+T   E  +  A+ KH PVS A + +   F  Y  GVY + +C ++PD ++HAVLAV
Sbjct: 220 VINITQYDEAGMVEAVGKHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 278

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           GYGE DG+PYW VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 279 GYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 322


>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
          Length = 330

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 131 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 190

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 191 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 251 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 307

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 308 YIRMARNNKNHCGIASYCSY 327


>gi|344275472|ref|XP_003409536.1| PREDICTED: cathepsin S-like isoform 2 [Loxodonta africana]
          Length = 281

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 122/205 (59%), Gaps = 7/205 (3%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFR 205
           N L D   CG+CW+F   GA+E    +K  KL  LS Q L+DCS   Y N GC+GG   R
Sbjct: 78  NHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTR 137

Query: 206 SYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
           ++Q+I+  +G+ ++  Y PY   D  C        AT + +  +   SEDALK A+A  G
Sbjct: 138 AFQYIIDNNGIDSEASY-PYKATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKG 196

Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
           PVSV IDAS+ SF  Y +GVYYD  C    D ++H VL VGYG L+GK YW VKNSW   
Sbjct: 197 PVSVGIDASRPSFFLYKSGVYYDPSCT---DNVNHGVLVVGYGNLNGKDYWLVKNSWGLN 253

Query: 325 WGNQGYVLMSIKD-NNCGVMTAPTY 348
           +G QGY+ M+    N+CG+ + P+Y
Sbjct: 254 FGEQGYIRMARNSGNHCGIASFPSY 278


>gi|398258447|gb|AFO72231.1| cathepsin L-like protease, partial [Leucinodes orbonalis]
          Length = 207

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 96/190 (50%), Positives = 120/190 (63%), Gaps = 8/190 (4%)

Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           A+P  V +  KG    + DQ  CGSCWSF +TGA+EG ++ K   L  LS+Q LIDCS  
Sbjct: 20  ALPEEVDWRKKGAVTEVKDQGKCGSCWSFSSTGALEGQHFRKSGYLVSLSEQNLIDCSSA 79

Query: 193 YGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
           YGNNGC+GG    ++++I   HG+ T+  Y PY   D  C   +  + A   GFV+V   
Sbjct: 80  YGNNGCNGGLMDNAFKYIRDNHGIDTEKTY-PYEAVDDKCRYNSKNSGADDVGFVDVPAG 138

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELD 310
            ED LK A+A  GPV+VAIDASQ+SF FY  GVYYDE C++S   LDH VL VGYG + D
Sbjct: 139 DEDKLKEAVATVGPVAVAIDASQESFQFYSTGVYYDENCSSS--NLDHGVLVVGYGTDED 196

Query: 311 GKPYWQVKNS 320
           G  YW VKNS
Sbjct: 197 GGDYWLVKNS 206


>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
 gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
 gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
 gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
 gi|226475|prf||1514114A cathepsin H
          Length = 333

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 122/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 191

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY+G++  C      A A +   VN+T N E A+  A+A + PVS A
Sbjct: 192 LYNKGIMGEDSY-PYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFA 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y +GVY    C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WGN G
Sbjct: 251 FEVTE-DFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNG 309

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 310 YFLIERGKNMCGLAACASY 328


>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
          Length = 334

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 97/221 (43%), Positives = 134/221 (60%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQNLVDCSRAQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG    ++Q++  + GL T++ Y PYL +++  C+     + A  TGFV++ P  E A
Sbjct: 178 CNGGLMDNAFQYVKDNKGLDTEESY-PYLARESNTCNYRPEYSAANDTGFVDI-PQREKA 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK--- 312
           L  A+A  GP+SVAIDA   SF FY  G+YY+  C  S   LDH VL VGYG   G+   
Sbjct: 236 LLKAVATVGPISVAIDAGHSSFQFYNAGIYYEPNC--SSKDLDHGVLVVGYGSEGGESKN 293

Query: 313 -PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             +W VKNSW + WG  GYV M+  + N+CG+ TA +Y T+
Sbjct: 294 NKFWIVKNSWGSGWGMNGYVKMARDQSNHCGIATAASYPTV 334


>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
          Length = 295

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 95/221 (42%), Positives = 126/221 (57%), Gaps = 10/221 (4%)

Query: 136 AIPVR------YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           AIPV       +  KGY   + +Q  CGSCW+F   GA+EG ++ K  KL  LS+Q L+D
Sbjct: 74  AIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHFRKTGKLVSLSEQNLVD 133

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS  YGNNGC+GG    ++++I  +     +   PY   D  C        AT  G+ ++
Sbjct: 134 CSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCRFKRECVGATCRGYTDL 193

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
              +E  +K A+A  GPVSVAIDAS  SF  Y  GVY +++C  SP  LDH VL VGYG 
Sbjct: 194 PWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKEC--SPYQLDHGVLVVGYGT 251

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
             G  YW VKNSW T WG+QGY+ M+    N+CG+ +   Y
Sbjct: 252 EQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASMACY 292


>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
          Length = 350

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 150 DQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   +      + G VN+T  +ED LK A+A   PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGSNGLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVA 268

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY    C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWGDHG 327

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346


>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
 gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
 gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
          Length = 326

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 127 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 186

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 187 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 247 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 303

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 304 YIRMARNNKNHCGIASYCSY 323


>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
          Length = 331

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 91/200 (45%), Positives = 123/200 (61%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+EG   +K  KL  LS Q L+DCS   Y N GC GG    ++Q++
Sbjct: 133 QGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYV 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  CH  +    AT + +  +   SE+ALK A+A  GPVSVA
Sbjct: 193 IDNNGIDSETSY-PYKATDEKCHYDSKNRAATCSRYTELPYGSEEALKEAVANKGPVSVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +DAS+ SF  Y NGVY D  C  +   + H VLAVGYG L+GK YW VKNSW  Y+G+QG
Sbjct: 252 VDASRPSFFLYKNGVYDDPSCTQN---VTHGVLAVGYGNLNGKDYWLVKNSWGLYFGDQG 308

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  K N+CG+ +  +Y
Sbjct: 309 YIRMARNKGNHCGIASYSSY 328


>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
 gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
 gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
          Length = 342

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 143 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 202

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 203 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 263 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 319

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 320 YIRMARNNKNHCGIASYCSY 339


>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
          Length = 350

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 150 DQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 209

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   +      + G VN+T  +ED LK A+A   PVSVA
Sbjct: 210 KYNGGLETEEAY-PYTGSNGLCKFRSEHVAVKVLGSVNITLGAEDELKHAIAFARPVSVA 268

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY    C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG+ G
Sbjct: 269 FEVVH-DFRLYKSGVYTSTACGSTPMDVNHAVLAVGYGIEDGIPYWLIKNSWGGDWGDHG 327

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 328 YFKMEMGKNMCGVATCSSY 346


>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
 gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
          Length = 343

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 144 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 203

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 204 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 263

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 264 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 320

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 321 YIRMARNNKNHCGIASYCSY 340


>gi|203341|gb|AAA63484.1| cathepsin H [Rattus norvegicus]
          Length = 298

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 122/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 97  NQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 156

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY+G++  C      A A +   VN+T N E A+  A+A + PVS A
Sbjct: 157 LYNKGIMGEDSY-PYIGKNGQCKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFA 215

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y +GVY    C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WGN G
Sbjct: 216 FEVTE-DFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNG 274

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 275 YFLIERGKNMCGLAACASY 293


>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
          Length = 354

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 119/203 (58%), Gaps = 3/203 (1%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           +S+ DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC GG   ++
Sbjct: 150 SSVKDQGSCGSCWTFSTTGALEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQA 209

Query: 207 YQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
           +++I  + GL T++ Y PY G+D  C  +       +   VN+T  +ED LK A+A   P
Sbjct: 210 FEYIKYNGGLETEEAY-PYTGKDGVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRP 268

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSVA       F FY NGV+  + C ++   ++HAVLAVGYG  +G PYW +KNSW   W
Sbjct: 269 VSVAFQVVN-GFHFYENGVFTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESW 327

Query: 326 GNQGYVLMSIKDNNCGVMTAPTY 348
           G  GY  M +  N CGV T  +Y
Sbjct: 328 GENGYFKMELGKNMCGVATCASY 350


>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
          Length = 329

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 3/224 (1%)

Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           ++R +    +A+  R +      + +Q  CGSCW+F TTG +E A  +   KL  L++Q 
Sbjct: 103 FLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQL 162

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DC+  + N+GC GG   +++++I+   GL  +D Y PY  Q+  C      A A +  
Sbjct: 163 LVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAY-PYRAQNGTCKFQPDKAIAFVKD 221

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
            +N+T   E  +  A+ KH PVS A + +   F  Y  GVY + +C ++PD ++HAVLAV
Sbjct: 222 VINITQYDEAGMVEAVGKHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 280

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           GYGE DG+PYW VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 281 GYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 324


>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
          Length = 335

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 123/199 (61%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC+GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQDFNNHGCEGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D +C      A A +   VN+T N E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGHCRFQPQKAIAFVKDVVNITLNDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y +G+Y    C+ +PD ++HAVLAVGYG  +G PYW VKNSW T WG  G
Sbjct: 253 FEVTE-DFISYQSGIYSSTSCHKTPDKVNHAVLAVGYGVQNGVPYWIVKNSWGTAWGQDG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    ++
Sbjct: 312 YFLIERGKNMCGLAACASF 330


>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
          Length = 340

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 201 IIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337


>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
          Length = 360

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G++  C  ++      +   VN+T  +ED LK A+A   PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F  Y +GVY   +C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356


>gi|146386366|gb|ABQ23971.1| cathepsin K [Oryctolagus cuniculus]
          Length = 183

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/186 (48%), Positives = 116/186 (62%), Gaps = 6/186 (3%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
           CGSCW+F + GA+EG    K  KL  LS Q L+DC     N GC GG    ++Q++ ++ 
Sbjct: 1   CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYGCGGGYMTNAFQYVQRNR 58

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           G+ ++D Y PY+GQD  C    T   A   G+  +   +E ALK A+A+ GPVSVAIDAS
Sbjct: 59  GIDSEDAY-PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 117

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
             SF FY  GVYYDE C  S D ++HAVLAVGYG   G  +W +KNSW   WGN+GY+LM
Sbjct: 118 LTSFQFYSKGVYYDENC--SSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILM 175

Query: 334 SIKDNN 339
           +   NN
Sbjct: 176 ARNKNN 181


>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
          Length = 340

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 201 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337


>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
          Length = 340

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CG+CW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG    ++Q+
Sbjct: 141 QGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQY 200

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY   D  CH  +    AT + ++ +    EDALK A+A  GPVSV 
Sbjct: 201 IIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVG 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY +GVY D  C  +   ++H VL VGYG LDGK YW VKNSW   +G+QG
Sbjct: 261 IDASHSSFFFYKSGVYDDPSCTGN---VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQG 317

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N+CG+ +  +Y
Sbjct: 318 YIRMARNNKNHCGIASYCSY 337


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G++  C  ++      +   VN+T  +ED LK A+A   PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F  Y +GVY   +C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANGTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|2146900|pir||S67481 cathepsin L-like cysteine proteinase (EC 3.4.22.-) CP1 [similarity]
           - fruit fly (Drosophila melanogaster)  (fragment)
          Length = 218

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 128/214 (59%), Gaps = 7/214 (3%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG   ++ DQ  CGSCW+F +TGA+EG ++ K   L  LS+Q L+DCS  YGNNG
Sbjct: 5   VDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++ +I  + G+ T+  Y PY   D  CH       AT  GF ++    E  +
Sbjct: 65  CNGGLMDNAFPYIKDNGGIDTEKSY-PYEAIDDSCHFNRAQVGATDRGFTDIPQGDEKKM 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYW 315
              +   GPVSVAIDAS +SF FY  GVY + +C+     LDH VL VG+G +  G+ YW
Sbjct: 124 PEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCD--AQNLDHGVLVVGFGTDESGEDYW 181

Query: 316 QVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            VKNSW T WG++G++ ++  K+N CG+ +  +Y
Sbjct: 182 LVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSY 215


>gi|333827692|gb|AEG19548.1| cathepsin L-like cysteine protease [Taenia pisiformis]
          Length = 338

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 121/200 (60%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+E A   K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYL 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             H +  +  Y PY   D  C    +    T+T    +   +E AL  A+A  GP+S+AI
Sbjct: 200 EDHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLAVGYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGY 316

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CG+ +   +
Sbjct: 317 IMMA-KDYHNMCGIASLADF 335


>gi|209738038|gb|ACI69888.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
          Length = 367

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 125/205 (60%), Gaps = 9/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTGA+E  Y +K+ K+ + S+Q L+DC     + GC+GG    ++++I
Sbjct: 135 DQGNCGSCWSFSTTGAMESQYRLKYGKMKLFSEQQLVDCDRQNIDQGCNGGFPVAAFEYI 194

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIA---NTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
            + GL T+++Y PY      C      N    +T      V   +E+AL  A+ K GP+S
Sbjct: 195 REFGLLTEEEY-PYSAHSNQCRFKPDENGHLNSTKVTGYTVIEMNENALTEAIYKRGPIS 253

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYW 325
           VAIDAS   F FY +GVY +  C ++   LDHAVLAVG+G  ++   PY+ VKNSWS+ W
Sbjct: 254 VAIDASSSDFQFYHSGVYQNPSCGSAVSELDHAVLAVGFGVDKVHKTPYYIVKNSWSSGW 313

Query: 326 GNQGYVLMSIKD--NNCGVMTAPTY 348
           G+ GY+ M I++  NNCG+ T  TY
Sbjct: 314 GDHGYIKM-IRNGKNNCGIATFATY 337


>gi|449270628|gb|EMC81287.1| Cathepsin H, partial [Columba livia]
          Length = 261

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 128/224 (57%), Gaps = 3/224 (1%)

Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           ++R +    D+I  R +      + +Q  CGSCW+F TTG +E A  +   KL  L++Q 
Sbjct: 35  FLRSSGPYPDSIDWRKKGNYVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQ 94

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DC+  + N+GC GG   +++++I+   GL  +D Y PY  ++  C      A A +  
Sbjct: 95  LVDCAQAFNNHGCSGGLPSQAFEYILYNRGLMGEDTY-PYRAENGTCKFQPEKAIAFVRD 153

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
            +N+T   ED +  A+ KH PVS A + +  +F  Y  GVY + +C ++PD ++HAVLAV
Sbjct: 154 VINITQYDEDGMVEAVGKHNPVSFAFEVT-SNFMHYRKGVYSNPRCEHTPDKVNHAVLAV 212

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           GYGE DG P+W VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 213 GYGEEDGTPFWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256


>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIEIAKDRDNHCGLATAASY 330


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G++  C  ++      +   VN+T  +ED LK A+A   PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F  Y +GVY   +C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYSSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVA+DA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
          Length = 326

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 124/218 (56%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMK--GY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA+K A+P R + +  GY   + DQ  CGSCW+F TTGA+EG Y    +     S+Q L+
Sbjct: 102 KANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GN GC+GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSRDFGNYGCNGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V    E  L+  +   GP +VA+D  +  F  Y +G+Y  + C  SPD L+H VLAVGYG
Sbjct: 221 VHSGDEVELQNLVGAEGPAAVALDV-ESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
             DG  YW VKNSW T+WG  GY+ M  K  N CG+ +
Sbjct: 278 IQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
          Length = 291

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 121/207 (58%), Gaps = 6/207 (2%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGE 202
           G N + D   CGSCW+F   GA+EG   +K  KL  LS Q L+DCS    YGN GC GG 
Sbjct: 85  GMNHMGDMGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGF 144

Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
              ++Q+I+ +G    +   PY   D  CH       AT + ++ +    E+ALK A+A 
Sbjct: 145 MTEAFQYIIDNGGIDSEASYPYKAMDEKCHYDPKNRAATCSRYIELPFGDEEALKEAVAT 204

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
            GPVSV IDAS  SF  Y +GVY D  C    + ++H VL VGYG LDGK YW VKNSW 
Sbjct: 205 KGPVSVGIDASHSSFFLYQSGVYDDPSCT---ENVNHGVLVVGYGTLDGKDYWLVKNSWG 261

Query: 323 TYWGNQGYVLMSIKD-NNCGVMTAPTY 348
            ++G+QGY+ M+  + N+CG+ +  +Y
Sbjct: 262 LHFGDQGYIRMARNNKNHCGIASYCSY 288


>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
          Length = 334

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 130/220 (59%), Gaps = 10/220 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  +G    ++  PYL ++   C+     + A  TG+V++ P  E AL
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYKPEYSAANDTGYVDI-PQKEKAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY +G+YYD  C  S   LDH VL VGYG    + +  
Sbjct: 237 MKAVATVGPISVAIDAGHESFQFYKSGIYYDPDC--SSKDLDHGVLVVGYGFEGRDSNNN 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            +W VKNSW   WG  GYV M+   NN CG+ TA +Y T+
Sbjct: 295 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334


>gi|332384364|gb|AEE69034.1| cysteine protease [Taenia pisiformis]
          Length = 338

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 121/200 (60%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+E A   K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSNAFKYL 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             H +  +  Y PY   D  C    +    T+T    +   +E AL  A+A  GP+S+AI
Sbjct: 200 EDHSIEPESAY-PYRATDGPCRYNESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLAVGYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASSLGFMFYRHGIYKSHWC--SSKFLNHGVLAVGYGKLDGKPYWLVKNSWGSGWGMKGY 316

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CG+ +   +
Sbjct: 317 IMMA-KDYHNMCGIASLADF 335


>gi|111036376|dbj|BAF02517.1| cathepsin L-like proteinase [Echinococcus multilocularis]
          Length = 338

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 86/200 (43%), Positives = 123/200 (61%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EGA   K  KL  LS+Q L+DC+   GN+GC+GG    +++++
Sbjct: 140 NQGNCGSCWAFSSTGALEGALAKKTGKLISLSEQQLVDCTLENGNDGCNGGYMSNAFKYL 199

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             H +  +  Y PY   D  C    +    ++T   ++   +E AL  A+A  GP+S+AI
Sbjct: 200 EGHSIEPESAY-PYRATDGPCRYNESLGVGSVTDIGDIPEGNETALMEAVATVGPISIAI 258

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS   F FY +G+Y    C  S   L+H VLA+GYG+LDGKPYW VKNSW + WG +GY
Sbjct: 259 DASTLGFMFYHHGIYKSHWC--SSKFLNHGVLAIGYGKLDGKPYWLVKNSWGSRWGMKGY 316

Query: 331 VLMSIKD--NNCGVMTAPTY 348
           ++M+ KD  N CGV +   +
Sbjct: 317 IMMA-KDYHNMCGVASLADF 335


>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 105 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 164

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 165 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 222

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 223 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 280

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 281 GMEGYIKIAKDRDNHCGLATAASY 304


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 160 NQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G++  C  ++      +   VN+T  +ED LK A+A   PVS+A
Sbjct: 220 KSNGGLDTEEAY-PYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F  Y +GVY   +C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 279 FEVI-KGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 338 YFKMEMGKNMCGIATCASY 356


>gi|159464745|ref|XP_001690602.1| cystein endopsptidase [Chlamydomonas reinhardtii]
 gi|158280102|gb|EDP05861.1| cystein endopsptidase [Chlamydomonas reinhardtii]
          Length = 616

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 130/223 (58%), Gaps = 13/223 (5%)

Query: 133 SKDAIPVRYEMKGYNS---LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S+  +P   + +G  +   + DQ +CGSC++F  TG ++G +++   +    S+Q +IDC
Sbjct: 374 SRADLPPSVDWRGTGADPGVKDQGMCGSCYTFAATGTMDGTWFVATGQRRSFSEQQIIDC 433

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKH-----GLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           +W YG NGC GG     YQ ++ +     G+  + DY  Y G+  YC  +N T     +G
Sbjct: 434 AWDYGPNGCFGG----YYQPVLNYVAEQGGMALEQDYT-YRGEPGYCRASNHTRVGLFSG 488

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           ++NV   +E AL  A+AK+GP++V+++A  ++FSFY  GV+ +  C      LDH V   
Sbjct: 489 YMNVESRNELALMEAVAKYGPIAVSVNADPEAFSFYSEGVFDEPACTTRMRDLDHTVTLF 548

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPT 347
           GYG  DGK YW V+NSWS +WG+ GY+ +    ++CG+ T P 
Sbjct: 549 GYGSQDGKDYWLVRNSWSHFWGDDGYIKIVRGKHDCGIATDPA 591


>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
          Length = 333

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
          Length = 333

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/219 (42%), Positives = 128/219 (58%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++Q++  +G    ++  PY   +  C      + A  TGFV++ P  E AL 
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSVANDTGFVDI-PKQEKALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D   
Sbjct: 237 KAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNSK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 295 YWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|301769891|ref|XP_002920367.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
 gi|281346353|gb|EFB21937.1| hypothetical protein PANDA_009084 [Ailuropoda melanoleuca]
          Length = 333

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/217 (42%), Positives = 133/217 (61%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY + +  Q  C SCW+F  TGA+EG  + K  KL  LS+Q L+DCSW   N+G
Sbjct: 118 VDWREKGYVTPVKYQGHCQSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWPQNNDG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++  + GL + + Y PYLG++  C      + A +T F +V+ N ED L
Sbjct: 178 CRGGLMDNAFRYVKDNGGLDSAESY-PYLGRNESCKYRPEKSAANLTTFWSVS-NKEDGL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
              +A  GPVS A+D+S  SF FY  G+YYD  C +  + L+HAVL VGYG    E + K
Sbjct: 236 MTTVATVGPVSAAVDSSLHSFQFYKKGIYYDPNCRS--NRLNHAVLVVGYGFEGEESENK 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW +KNSW T WG +GY+L++  +DN+CG+ T  ++
Sbjct: 294 KYWIIKNSWGTNWGMKGYMLLAKDRDNHCGIATMASF 330


>gi|454101|gb|AAA82966.1| cathepsin H prepropeptide [Mus musculus]
          Length = 333

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 79/198 (39%), Positives = 121/198 (61%), Gaps = 1/198 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  C SCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 132 NQGACASCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYI 191

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +    ++D  PY+G+D+ C      A A +   VN+T N E A+  A+A + PVS A 
Sbjct: 192 LYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAF 251

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           + ++  F  Y +GVY  + C+ +PD ++HAVLAVGYGE +G  YW VKNSW + WG  GY
Sbjct: 252 EVTE-DFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGY 310

Query: 331 VLMSIKDNNCGVMTAPTY 348
            L+    N CG+    +Y
Sbjct: 311 FLIERGKNMCGLAACASY 328


>gi|432853333|ref|XP_004067655.1| PREDICTED: cathepsin L2-like [Oryzias latipes]
          Length = 352

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 132/236 (55%), Gaps = 8/236 (3%)

Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
            G++VS   L  RYN    +A  V Y   GY   + DQ  CGSCW+F TTGA+EG    K
Sbjct: 119 RGKEVSARRL--RYNARKSEAGFVDYRNMGYVTEVKDQGYCGSCWAFSTTGAIEGQIVKK 176

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHI 233
             +L  LS+Q L+DCS  YG +GC G     +Y +++ +GL T D Y PY   D   C  
Sbjct: 177 TGQLLSLSEQNLVDCSRPYGTHGCSGAWMASAYDYVLSNGLQTTDSY-PYTSVDTQPCFY 235

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
            +  A A +  +  +    E AL  A+A  GP++VAIDA   SF FY +G+Y +  C+  
Sbjct: 236 DSRLAVAHIKDYRFIPQGDEQALADAVATIGPITVAIDADHASFLFYSSGIYDEPNCD-- 293

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           P+ L HAVL VGYG  +G+ YW +KNSW + WG  GY+ ++    N CG+ +   Y
Sbjct: 294 PNRLSHAVLLVGYGSEEGQDYWIIKNSWGSSWGEGGYMRIIRNGSNTCGIASYALY 349


>gi|348542778|ref|XP_003458861.1| PREDICTED: digestive cysteine proteinase 3-like [Oreochromis
           niloticus]
          Length = 218

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/196 (45%), Positives = 126/196 (64%), Gaps = 6/196 (3%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
           CGSCW+F  TGA+EG ++ K   L  LS+Q L+DCS  + N+GCDGG    ++++I  + 
Sbjct: 24  CGSCWAFSATGALEGQHFKKTGNLVSLSEQQLVDCSRNFFNHGCDGGWMIPAFKYIKDNG 83

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           G+ T++ Y  Y  +D  CH       A  +G+  V  + E+ALK A+A  GP+S+A+DAS
Sbjct: 84  GIQTEESY-TYEARDGRCHYNANFVGAQCSGYGTVKQD-EEALKQAVAAIGPISIAVDAS 141

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
            +SF  Y +GVY +  C+N    L+HAVLAVGYG  +G  YW VKNSW + WGN+GY+ M
Sbjct: 142 HESFQLYQSGVYDEPWCSNI--NLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKM 199

Query: 334 SI-KDNNCGVMTAPTY 348
           +  KDN CG+ T  +Y
Sbjct: 200 TRNKDNQCGIATEASY 215


>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
           chain; Contains: RecName: Full=Cathepsin L1 light chain;
           Flags: Precursor
 gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
          Length = 333

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|301775254|ref|XP_002923050.1| PREDICTED: cathepsin H-like [Ailuropoda melanoleuca]
          Length = 307

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 13/227 (5%)

Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           Y  +V + K  K   PV+          +Q  CGSCW+F TTGA+E A  +K  KL  L+
Sbjct: 88  YPPFVDWRKKGKFVSPVK----------NQGGCGSCWTFSTTGALESAIAIKTGKLLSLA 137

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DC+  + N+GC GG   +++++I    G+  +D Y PY GQD  C    + A A 
Sbjct: 138 EQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSY-PYKGQDGDCKFQPSKAIAF 196

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
           +    N+T N E A+  A+A   PVS A + +   F  Y  GVY    C+ +PD ++HAV
Sbjct: 197 VKDVANITINDEQAMVEAVALFNPVSFAFEVT-GDFMMYRKGVYSSTSCHKTPDKVNHAV 255

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           LAVGYGE +G PYW VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 256 LAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACASY 302


>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
 gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
          Length = 333

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
 gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
          Length = 333

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|281350252|gb|EFB25836.1| hypothetical protein PANDA_012122 [Ailuropoda melanoleuca]
          Length = 294

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 13/227 (5%)

Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           Y  +V + K  K   PV+          +Q  CGSCW+F TTGA+E A  +K  KL  L+
Sbjct: 75  YPPFVDWRKKGKFVSPVK----------NQGGCGSCWTFSTTGALESAIAIKTGKLLSLA 124

Query: 183 QQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           +Q L+DC+  + N+GC GG   +++++I    G+  +D Y PY GQD  C    + A A 
Sbjct: 125 EQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSY-PYKGQDGDCKFQPSKAIAF 183

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
           +    N+T N E A+  A+A   PVS A + +   F  Y  GVY    C+ +PD ++HAV
Sbjct: 184 VKDVANITINDEQAMVEAVALFNPVSFAFEVT-GDFMMYRKGVYSSTSCHKTPDKVNHAV 242

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           LAVGYGE +G PYW VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 243 LAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACASY 289


>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
          Length = 328

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 123/198 (62%), Gaps = 4/198 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TG++EG Y++ + KL   S+  L+DCS  YGNNGC GG    ++++ 
Sbjct: 132 NQGQCGSCWAFSSTGSLEGQYFINNDKLLSFSESELVDCSRRYGNNGCKGGLMDNAFRYW 191

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +    + DY PY+ +D  C  +      T++ + NV   S+ +L+ A+   GP+SVA+
Sbjct: 192 EVYKEELESDY-PYVAKDGPCRYSQDKGVTTISSYKNVPHFSQISLQDAVRTIGPISVAM 250

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS KSF  Y +GVY + +C+ +   LDH VL VGYG    +P+W VKNSW   WG  GY
Sbjct: 251 DASHKSFQLYHSGVYSESECSQTK--LDHGVLVVGYG-TSSEPFWLVKNSWGAGWGMDGY 307

Query: 331 VLMSIKDNNCGVMTAPTY 348
             +++++N CG+ T P+Y
Sbjct: 308 FEIAMRNNMCGLETEPSY 325


>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
          Length = 331

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 126/217 (58%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KG    + +Q  CGSCW+F + GA+EG       KL  LS Q L+DC     NNG
Sbjct: 120 IDYRRKGMVTPVKNQGSCGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCV--TENNG 177

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++ ++    G+ ++  Y PY+GQD  C    +  TA+  G+  +   +E AL
Sbjct: 178 CGGGYMTNAFNYVRDNQGIDSEAAY-PYIGQDETCAYNVSGMTASCRGYKEIPEGNERAL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
            +A+AK GPVSV IDA+  +F FY  GVYYD  CN   D ++HAVLAVGYG    GK YW
Sbjct: 237 TVAVAKVGPVSVGIDATLSTFQFYQKGVYYDRNCN--KDDINHAVLAVGYGVTPKGKKYW 294

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            VKNSWS  WGN+GY+LM+    N CG+    +Y  M
Sbjct: 295 IVKNSWSESWGNKGYILMARNRGNLCGIANLASYPIM 331


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 129/217 (59%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TG++EG  + K   L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL  +  Y PY+G+D  C      + A  TGFV+V P  E  +
Sbjct: 178 CNGGLMDFAFQYVKDNKGLEAEKSY-PYVGKDGECKYKPELSAANDTGFVDV-PQREKVV 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
           + ALA  GP+SVAIDA  +SF FY  G+YYD  C  S   L+H VL VGYG    E    
Sbjct: 236 QKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGC--SSRDLNHGVLLVGYGTDASETGKG 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW +KNSW T WG  GYV ++  ++N+CGV TA +Y
Sbjct: 294 DYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASY 330


>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
 gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
          Length = 352

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 83/198 (41%), Positives = 113/198 (57%), Gaps = 1/198 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+ CGSCW+F TTGA+E AY   H K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 152 DQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 211

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G    +   PY  +D  C          +   VN+T  +ED LK A+A   PVSVA 
Sbjct: 212 KYNGGIALEKEYPYTAKDEACKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVAF 271

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
                 F  Y  GVY  + C N+P  ++HAVLAVGYG  +  PYW +KNSW + WG+ GY
Sbjct: 272 QVVD-GFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGY 330

Query: 331 VLMSIKDNNCGVMTAPTY 348
             M +  N CGV T  +Y
Sbjct: 331 FKMELGKNMCGVATCASY 348


>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
 gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
          Length = 323

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 120/201 (59%), Gaps = 6/201 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F    A+EGA+++K   L  LS+Q L+DCS  YGN GC+GG  +++YQ+I
Sbjct: 123 DQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYI 182

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+ T+  Y PY   D  C        AT++ +V      E AL+ A+   GPVSV 
Sbjct: 183 IANRGIDTESSY-PYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVC 241

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQ 328
           IDA Q SF  Y  GVYY+  C++     +HAV AVGYG + +G  YW VKNSW  +WG  
Sbjct: 242 IDAGQSSFGSYGGGVYYEPNCDSWY--ANHAVTAVGYGTDANGGDYWIVKNSWGAWWGES 299

Query: 329 GYVLMSI-KDNNCGVMTAPTY 348
           GY+ M+  +DNNC + T   Y
Sbjct: 300 GYIKMARNRDNNCAIATYSVY 320


>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
          Length = 331

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 120/200 (60%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   Y N GC+GG   R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + +  +   SEDALK A+A  GPVSV 
Sbjct: 193 IDNNGIDSEASY-PYKATDGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVG 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS+ SF  Y +GVYYD  C    D ++H VL VGYG L+GK YW VKNSW   +G QG
Sbjct: 252 IDASRPSFFLYKSGVYYDPSCT---DNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQG 308

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+ + P+Y
Sbjct: 309 YIRMARNSGNHCGIASFPSY 328


>gi|410042826|ref|XP_003951516.1| PREDICTED: cathepsin L1 [Pan troglodytes]
          Length = 278

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 63  VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 122

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 123 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 180

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 181 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 238

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 239 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 278


>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 379

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 133/243 (54%), Gaps = 36/243 (14%)

Query: 139 VRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG  S + +Q  CGSCWSF TTG+VEGA+Y+   K+  LS+Q L+DCS   GN G
Sbjct: 137 VDWRAKGAVSPIKNQGQCGSCWSFSTTGSVEGAHYISTGKMVPLSEQNLVDCSGSEGNMG 196

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDA 255
           C GG    ++ +I+K+ G+ T+D Y PY  +    C    T   AT++ + N+T   E  
Sbjct: 197 CQGGLMNLAFDYIIKNEGIDTEDSY-PYSAETGKKCLFNKTNVGATISSYKNITSGDESN 255

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD----- 310
           L  A+   GPVSVAIDAS  SF  Y +G+YY++ C  S   LDH VL VGYG  D     
Sbjct: 256 LADAVKNAGPVSVAIDASHNSFQLYSHGIYYEKDC--SSVNLDHGVLVVGYGSGDPSSLA 313

Query: 311 ------------------------GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTA 345
                                      YW VKNSW + WG+ G++ MS+ +DNNCG+ T+
Sbjct: 314 NNVGGRSGPKMVVFNNRMVKTPSSNGDYWIVKNSWGSTWGSHGFIFMSMNRDNNCGIATS 373

Query: 346 PTY 348
            +Y
Sbjct: 374 ASY 376


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 92/212 (43%), Positives = 121/212 (57%), Gaps = 4/212 (1%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCW+F  TG++EG ++ K   L  LS+Q L+DC+  YGN G
Sbjct: 112 VDWRNQGYVTPVKDQGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYG 171

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    +Y +I   G    +   PY  +D  C    +   AT  G+V +    E AL 
Sbjct: 172 CNGGLMESAYDYIKGVGGVELESAYPYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALM 231

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+   GPV+V+IDAS  SF  Y +GVY   +C  S   LDH VLAVGYG   G+ YW V
Sbjct: 232 QAVGTIGPVAVSIDASGYSFQLYESGVYDFRRC--SSTNLDHGVLAVGYGTEGGQNYWLV 289

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           KNSW   WG+QGY+ MS  K+N CG+ T   Y
Sbjct: 290 KNSWGPGWGDQGYIKMSKDKNNQCGIATDSCY 321


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS G Y N GC+GG    ++Q+I
Sbjct: 144 QGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYI 203

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C   +    AT + +  +  +SEDALK A+A  GPVSVA
Sbjct: 204 IDNNGIESEASY-PYKAMDGKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVSVA 262

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYYD  C      ++H VL VGYG L+GK YW VKNSW  ++G+QG
Sbjct: 263 IDASHPSFFLYRSGVYYDPACTLH---VNHGVLVVGYGNLNGKDYWLVKNSWGLHFGDQG 319

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+ +  +Y
Sbjct: 320 YIRMARNSGNHCGIASYASY 339


>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
           boliviensis]
 gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
           boliviensis]
 gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 333

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 132/220 (60%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++ ++ GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQENGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKLEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++ +C  S + +DH VL VGYG      D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFQFYKEGIYFEPEC--SSEDMDHGVLVVGYGFERTGSDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYPTV 333


>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
          Length = 232

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 82/195 (42%), Positives = 118/195 (60%), Gaps = 3/195 (1%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KH 213
           CGSCW+F TTGA+E A  +K  K+  L++Q L+DC+  + N+GC GG   +++++I    
Sbjct: 35  CGSCWTFSTTGALESAIAIKTGKMLSLAEQQLVDCAQNFNNHGCKGGLPSQAFEYIRYNK 94

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           G+  +D Y PY G+D  C      A A +    N+T N E+A+  A+A + PVS A + +
Sbjct: 95  GIMGEDTY-PYQGKDGTCKFQPEKAIAFVKDVANITINDEEAMVEAVALYNPVSFAFEVT 153

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
           +  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +GKPYW VKNSW   WG  GY L+
Sbjct: 154 E-DFMLYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGKPYWIVKNSWGPQWGMNGYFLI 212

Query: 334 SIKDNNCGVMTAPTY 348
               N CG+    +Y
Sbjct: 213 ERGKNMCGLAACASY 227


>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
          Length = 337

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/223 (42%), Positives = 131/223 (58%), Gaps = 7/223 (3%)

Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S  A+P  V +  KG   S+  Q  CGSCW+F   GA+EG       KL  LS Q L+DC
Sbjct: 118 SGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDC 177

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
           S  YGN+GC+GG   R++Q+++ +     D   PY GQ   CH       A  + +  + 
Sbjct: 178 SSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQCHYNPAYRAANCSRYSFLP 237

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
              E ALK ALA  GP+SVAIDA++ SF+FY +GVY D+ C  +   ++H VLAVGYG L
Sbjct: 238 EGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTCTRN---VNHGVLAVGYGTL 294

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +GK YW VKNSW + +G++G++ M+  K++ CG+     Y  M
Sbjct: 295 NGKDYWLVKNSWGSTFGDKGFIRMARNKNDQCGIALYGCYPIM 337


>gi|410303012|gb|JAA30106.1| cathepsin L1 [Pan troglodytes]
          Length = 333

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|294889035|ref|XP_002772673.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239877094|gb|EER04489.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 358

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 92/211 (43%), Positives = 126/211 (59%), Gaps = 19/211 (9%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           +Q  CGSCW+F  TGA+EG Y + + K  L   S+Q L+DCS   GN GC+GG    ++ 
Sbjct: 134 NQQSCGSCWAFSATGALEGRYEIANPKHELVEFSEQQLVDCSSEEGNMGCNGGLMDNAFA 193

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATM-----TGFVNVTPNSEDALKLALAKH 263
           ++M+HGL T++DY  Y   D  C  +     A +     TGFV+V     +A+K AL + 
Sbjct: 194 YVMQHGLCTEEDYA-YEAIDEPCRNSTVKEKARLHPHDVTGFVDVHSKDGEAMKEAL-QS 251

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           GPVSVAI+A    F FY  GV   E C +S   LDH VL VGYGELDGK YW+VKNSW  
Sbjct: 252 GPVSVAIEADMPDFQFYHEGVLTGE-CGDS---LDHGVLLVGYGELDGKKYWKVKNSWGA 307

Query: 324 YWGNQGYVLMS------IKDNNCGVMTAPTY 348
            WG++GY+L+        +++ CG++   +Y
Sbjct: 308 EWGHEGYILLERERADGTEEDECGILLQGSY 338


>gi|410990002|ref|XP_004001239.1| PREDICTED: cathepsin L1-like isoform 1 [Felis catus]
          Length = 333

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY     +Q  C S W+F  TGA+EG  + K  KL  LS+Q L+DC+W  GN G
Sbjct: 118 VNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQNLLDCAWSEGNGG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q++  + GL +++ Y PY  Q   C     ++ A +TGF+ + P +E  L
Sbjct: 178 CSGGSMGNAFQYVKDNGGLDSEESY-PYHAQLQSCKYNPESSAADITGFLTI-PKTEYKL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             ++A  GPVS AIDAS  +F FY  G+YYD  C  S + L H VL VGYG    ELD K
Sbjct: 236 MRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNC--SHEILHHGVLIVGYGFEGAELDNK 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+LM+  +DN+CG+ +  ++ T+
Sbjct: 294 KYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 333


>gi|46251290|gb|AAS84611.1| cathepsin L-like cysteine proteinase I variant form precursor
           [Heterodera glycines]
          Length = 374

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 123/202 (60%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q +CGSCW+F  TGA+EG +      L  LS+Q LIDCS  YGN GC+GG    ++Q+I
Sbjct: 173 NQGMCGSCWAFSATGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYI 232

Query: 211 MKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
             + G+  +  Y PY  +    C        AT +G+ ++    E+ L++A+A  GPVSV
Sbjct: 233 KDNKGIDKETAY-PYKAKTGKKCLFKRNDVGATDSGYNDIAEGDEEDLRMAVATQGPVSV 291

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA  +SF  Y NGVY++++C+  P  LDH VL  GYG +     YW VKNSW T WG 
Sbjct: 292 AIDAGHRSFQLYTNGVYFEKECD--PQNLDHGVLVEGYGTDPTQGDYWIVKNSWGTRWGE 349

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           QGY+ M+  ++NNCG+ +  ++
Sbjct: 350 QGYIRMARNRNNNCGIASHASF 371


>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
          Length = 331

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 121/200 (60%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K   L  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  CH  +    AT + +  +   SE+ALK A+A  GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAMDQKCHYDSKHRAATCSKYTELPFGSEEALKEAVANKGPVSVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+  C  +   ++H VLAVGYG L GK YW VKNSW  ++G QG
Sbjct: 252 IDASHSSFFLYRSGVYYEPSCTQN---VNHGVLAVGYGNLKGKDYWLVKNSWGIHFGEQG 308

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+   P+Y
Sbjct: 309 YIRMARNSKNHCGIANYPSY 328


>gi|193786743|dbj|BAG52066.1| unnamed protein product [Homo sapiens]
          Length = 333

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
          Length = 350

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 127/228 (55%), Gaps = 14/228 (6%)

Query: 123 YSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLS 182
           Y  +V + K  K   PV+          +Q  CGSCW+F TTGA+E A  +K  KL  L+
Sbjct: 130 YPPFVDWRKKGKFVSPVK----------NQGSCGSCWTFSTTGALESAIAIKSGKLLSLA 179

Query: 183 QQALIDCSWGYGNNGCDG-GEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATA 240
           +Q L+DC+  + N+GC G G   +++++I    G+  +D Y PY GQD  C    + A A
Sbjct: 180 EQQLVDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSY-PYKGQDGDCKYQPSKAIA 238

Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
            +    N+T N E A+  A+A + PVS A + +   F  Y  G+Y    C+ +PD ++HA
Sbjct: 239 FVKDVANITINDEQAMVEAVALYNPVSFAFEVTS-DFMMYRKGIYSSTSCHKTPDKVNHA 297

Query: 301 VLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           VLAVGYGE +G PYW VKNSW   WG  GY LM    N CG+    +Y
Sbjct: 298 VLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 345


>gi|302831502|ref|XP_002947316.1| hypothetical protein VOLCADRAFT_79608 [Volvox carteri f.
           nagariensis]
 gi|300267180|gb|EFJ51364.1| hypothetical protein VOLCADRAFT_79608 [Volvox carteri f.
           nagariensis]
          Length = 326

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 82/212 (38%), Positives = 124/212 (58%), Gaps = 12/212 (5%)

Query: 130 NKASKDAIPVRYEMKGYNS----LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           N + +  +P   + +G  +    + DQ  CGSCW+F   GA++GA++    +    S+Q 
Sbjct: 115 NGSLRGRLPTFVDWRGTGADGPGVKDQVSCGSCWAFSAAGAMQGAWFKATGQALSFSEQQ 174

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCH-------IANTTA 238
           L+DC+W YGN+GC GG    + Q+++  G   Q+   PYL Q+++C+        A    
Sbjct: 175 LVDCAWDYGNDGCAGGFVEPTLQYVVDAGGIAQESEYPYLAQNSFCNDVTQRGGPATVPV 234

Query: 239 TATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLD 298
           +A  +G++N+    E AL  A+A HGPV+V ++A+   F FY  GVYY+E+C   PD +D
Sbjct: 235 SARFSGYLNIPSRDEAALMEAVALHGPVAVLMNAALAPFRFYSEGVYYNEECGQDPDSMD 294

Query: 299 HAVLAVGYGEL-DGKPYWQVKNSWSTYWGNQG 329
           HA+L VGYG   +G  YW +KNSWS YWG  G
Sbjct: 295 HAILLVGYGTTPEGVDYWLIKNSWSKYWGMDG 326


>gi|162815|gb|AAA30435.1| cathepsin S, partial [Bos taurus]
 gi|312895|emb|CAA43971.1| cathepsin S [Bos taurus]
          Length = 196

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/197 (45%), Positives = 119/197 (60%), Gaps = 7/197 (3%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWIM-K 212
           CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I+  
Sbjct: 1   CGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDN 60

Query: 213 HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDA 272
           +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK A+A  GPVSV IDA
Sbjct: 61  NGIDSEASY-PYKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDA 119

Query: 273 SQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVL 332
           S  SF  Y  GVYYD  C  +   ++H VL VGYG LDGK YW VKNSW  ++G+QGY+ 
Sbjct: 120 SHSSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIR 176

Query: 333 MSIKD-NNCGVMTAPTY 348
           M+    N+CG+   P+Y
Sbjct: 177 MARNSGNHCGIANYPSY 193


>gi|148745204|gb|AAI42984.1| Cathepsin L1 [Homo sapiens]
          Length = 333

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
          Length = 334

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 92/219 (42%), Positives = 128/219 (58%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++Q++  +G    ++  PY   +  C      + A  TGFV++ P  E AL 
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEKALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D   
Sbjct: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 295 YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|15214962|gb|AAH12612.1| Cathepsin L1 [Homo sapiens]
 gi|61363426|gb|AAX42388.1| cathepsin L [synthetic construct]
 gi|123988681|gb|ABM83856.1| cathepsin L [synthetic construct]
 gi|123999196|gb|ABM87178.1| cathepsin L [synthetic construct]
          Length = 333

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|344284284|ref|XP_003413898.1| PREDICTED: pro-cathepsin H-like [Loxodonta africana]
          Length = 335

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 87/213 (40%), Positives = 125/213 (58%), Gaps = 5/213 (2%)

Query: 139 VRYEMKGY--NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
           V +  KG+  + + +Q  CGSCW+F TTGA+E A  +   KL  L++Q L+DC+  + N+
Sbjct: 120 VDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNH 179

Query: 197 GCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
           GC GG   +++++I+   G+  +D Y PY GQD  C      A A +    N+T N E+A
Sbjct: 180 GCQGGLPSQAFEYILYNKGIMGEDTY-PYKGQDDVCKFQPKKAIAFVKDVANITLNDEEA 238

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           +  A+A + PVS A + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE  G PYW
Sbjct: 239 MVEAVALYNPVSFAFEVTD-DFMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYW 297

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
            VKNSW  YWG  GY L+    N CG+    +Y
Sbjct: 298 IVKNSWGPYWGMDGYFLIERGKNMCGLAACASY 330


>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
 gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
           proteinase II; Flags: Precursor
 gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
          Length = 337

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/298 (37%), Positives = 163/298 (54%), Gaps = 20/298 (6%)

Query: 68  SDFKVNIYRLFFLR-PRFHENEKIRYNWTYIGEELVNGIIL------EKWRLV---TSEG 117
           S+ K   ++ F  R   F +N    +NW   G + V G+        E++RL    T   
Sbjct: 40  SNNKAYTHKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAH 99

Query: 118 EKVSKY---SLWVRYNKAS-KDAIPVRY-EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYY 172
            K++ Y   +L +R N+   K  + V + E      + DQ  CGSC+SF TTG+VEG   
Sbjct: 100 IKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTA 159

Query: 173 MKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYC 231
           +K  KL  LS+Q ++DCS  +GN GC+GG    ++++I+K +GL +++ Y   +  +  C
Sbjct: 160 IKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDEC 219

Query: 232 HIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCN 291
                +  A +T +  +    E+ L+ AL  + PVSVAIDAS  SF  Y  GVYY+  C 
Sbjct: 220 KFQEGSVAAKITSYKEIEAGDENDLQNALLLN-PVSVAIDASHNSFQLYTAGVYYEPAC- 277

Query: 292 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            S + LDH VLAVG G  +G+ Y+ VKNSW   WG  GY+ M+  KDNNCG+ T  +Y
Sbjct: 278 -SSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334


>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
 gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
 gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
 gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
 gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
 gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
 gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
           chain; Contains: RecName: Full=Cathepsin L1 light chain;
           Flags: Precursor
 gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
 gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
 gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
 gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
 gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
 gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
 gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
 gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
          Length = 333

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|440910969|gb|ELR60703.1| Cathepsin H, partial [Bos grunniens mutus]
          Length = 329

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   KL  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 128 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 187

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD  C    + A A +    N+T N E+A+  A+A H PVS A
Sbjct: 188 RYNKGIMGEDTY-PYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFA 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE  G PYW VKNSW   WG +G
Sbjct: 247 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKG 305

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    ++
Sbjct: 306 YFLIERGKNMCGLAACASF 324


>gi|328875652|gb|EGG24016.1| counting factor associated protein [Dictyostelium fasciculatum]
          Length = 529

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 96/202 (47%), Positives = 132/202 (65%), Gaps = 5/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           DQ VCGSCW+FG+  ++E    +KH K L  LS+Q L+DC++   + GC+GG    +YQ+
Sbjct: 325 DQGVCGSCWTFGSLASLETVACLKHNKDLISLSEQELVDCAYVGQSMGCNGGFASNAYQY 384

Query: 210 IMKHG-LPTQDDYGPYLGQDAYCHIANTTATAT-MTGFVNVTPNSEDALKLALAKHGPVS 267
           IM  G + T+ DY PYL Q+AYC  +    +   +  +VNVT  SE AL+ A+A  G V+
Sbjct: 385 IMNAGGIATESDY-PYLMQNAYCKASTVQNSGVRVQSYVNVTAFSEAALQNAVATVGVVA 443

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDAS   F +Y +GVYY   C +  D LDH V  +GYG  +G+ YW VKNSWSTY+GN
Sbjct: 444 VAIDASAPDFRYYSSGVYYSTVCQSGLDYLDHEVAVLGYGTDNGQQYWIVKNSWSTYYGN 503

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           +G V M++ +DNNCG+ +  TY
Sbjct: 504 EGIVWMAMNRDNNCGIASQATY 525


>gi|77735725|ref|NP_001029557.1| pro-cathepsin H precursor [Bos taurus]
 gi|115312126|sp|Q3T0I2.1|CATH_BOVIN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|74267711|gb|AAI02387.1| Cathepsin H [Bos taurus]
 gi|296475480|tpg|DAA17595.1| TPA: cathepsin H precursor [Bos taurus]
          Length = 335

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   KL  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD  C    + A A +    N+T N E+A+  A+A H PVS A
Sbjct: 194 RYNKGIMGEDTY-PYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE  G PYW VKNSW   WG +G
Sbjct: 253 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    ++
Sbjct: 312 YFLIERGKNMCGLAACASF 330


>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
          Length = 341

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 119/200 (59%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS--WGYGNNGCDGGEDFRSYQW 209
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS    YGN GCDGG   R++Q+
Sbjct: 142 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQY 201

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +G    D   PY      CH  + +  AT + ++ +    E+ALK A+A  GPVSV 
Sbjct: 202 IIDNGGIDSDASYPYKAVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVG 261

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVY +  C    + ++H VL VGYG LDGK YW VKNSW  ++G+QG
Sbjct: 262 IDASHPSFFLYKSGVYDEPSCT---ENVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQG 318

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+  + N CG+ +  +Y
Sbjct: 319 YIRMARNNKNQCGIASYGSY 338


>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
          Length = 339

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 6/199 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS  YGN GC+GG    ++Q+I+
Sbjct: 142 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 201

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ ++  Y PY   D  C   +    AT + +  +    ED LK A+A  GPV V +
Sbjct: 202 DNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 260

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYYD  C      ++H VL +GYG+L+GK YW VKNSW + +G QGY
Sbjct: 261 DASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGY 317

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 318 IRMARNKGNHCGIASYPSY 336


>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
 gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
          Length = 362

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q+ CGSCW+F TTGA+E AY     K   LS+Q L+DC+ G+ N GC+GG   +++++I
Sbjct: 161 NQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYI 220

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY G +  CH     A   +   VN+T N+ED LK A+    PVSVA
Sbjct: 221 KYNGGIDTEESY-PYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVA 279

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  +PD ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 280 FQVID-GFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 338

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N C + T  +Y
Sbjct: 339 YFKMEMGKNMCAIATCASY 357


>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
          Length = 331

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 122/200 (61%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   Y N GC+GG    ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY  QD  C   +    AT + +  +   SE+ALK A+A  GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAQDGKCQYDSKFRAATCSKYTELPFGSEEALKEAVANKGPVSVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYYD+ C      ++H VL VGYG LDGK YW VKNSW   +G++G
Sbjct: 252 IDASHPSFFLYRSGVYYDQSCTLK---VNHGVLVVGYGNLDGKDYWLVKNSWGLNFGDKG 308

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+ + P+Y
Sbjct: 309 YIRMARNSGNHCGIASYPSY 328


>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
          Length = 324

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 125/202 (61%), Gaps = 7/202 (3%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + +Q  CGSCWSF  TGA+E    ++   L  LS+Q L+DCS  YGN GC+GG   +++Q
Sbjct: 125 IKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQ 184

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
           +I  +G    + Y PY  +   CH  +  + AT +G+ +VTP  SE AL+  +A  GP+S
Sbjct: 185 YIQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           +AIDAS   +  Y +GV+ D  C+ +    DHAVL VGYG  +G+ YW VKNSW T+WG 
Sbjct: 245 IAIDAS--GWQSYQSGVFNDPSCSQTA---DHAVLLVGYGTYNGQDYWLVKNSWGTWWGE 299

Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
           QGY++M+   NN CG+    +Y
Sbjct: 300 QGYIMMTRNANNQCGIANHASY 321


>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 7/225 (3%)

Query: 129 YNKASKDAIPVRYEMKGYNSLLD---QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           ++ + ++A+P   + +    + D   Q  CGSCW+F TTG++EG + +K   L  LS+Q 
Sbjct: 103 FSSSLRNAMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTTGSLEGLHALKTGHLVSLSEQQ 162

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DCS  YGNNGCDGG    ++Q+I   G    ++  PY  ++  C        AT  G+
Sbjct: 163 LMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPYTAKNESCRFDPKKVGATDEGY 222

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
           V +    E +L  AL + GP+SVA+DA  K+F FY  G+Y D  C+N+   L+H V  +G
Sbjct: 223 VRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGIYSDYLCSNTH--LNHGVTLIG 280

Query: 306 YGE-LDGKPYWQVKNSWSTYWGNQGYVLMS-IKDNNCGVMTAPTY 348
           YGE  DG PYW VKNSW   WG  GY +++    N CGV T  +Y
Sbjct: 281 YGESSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGVATDASY 325


>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
          Length = 330

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 6/199 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS  YGN GC+GG    ++Q+I+
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 192

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ ++  Y PY   D  C   +    AT + +  +    ED LK A+A  GPV V +
Sbjct: 193 DNKGIDSEASY-PYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 251

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYYD  C      ++H VL +GYG+L+GK YW VKNSW + +G QGY
Sbjct: 252 DASHPSFFLYRSGVYYDPACTQK---VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGY 308

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 309 IRMARNKGNHCGIASYPSY 327


>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 125/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY  G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSLGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
 gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
 gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 157 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYI 216

Query: 211 -MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               GL T++ Y PY G+D  C  ++      +   VN+T  +ED LK A+A   PVSVA
Sbjct: 217 KFNGGLDTEEAY-PYTGKDDACKFSSENVGVRVVESVNITLGAEDELKHAVAFVRPVSVA 275

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y  GVY    C ++P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 276 FEVV-GSFRLYKEGVYTTSTCGSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNG 334

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 335 YFKMEMGKNMCGIATCASY 353


>gi|381283083|gb|AFG19440.1| cathepsin L, partial [Larimichthys crocea]
          Length = 257

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 90/177 (50%), Positives = 115/177 (64%), Gaps = 9/177 (5%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EG ++ K  KL  LS+Q L+DCS   GN GC+GG   +++Q++
Sbjct: 84  DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV 143

Query: 211 M-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
              HGL ++D Y PYLG  D  CH      +A  TGFV+V    E AL  A+A  GPVSV
Sbjct: 144 KDNHGLDSEDSY-PYLGTDDQPCHYDPNYNSANDTGFVDVPSGKEHALMKAVAAVGPVSV 202

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSW 321
           AIDA  +SF FY +G+YY++ C  S + LDH VL VGYG    ++DGK YW V NSW
Sbjct: 203 AIDAGHESFQFYQSGIYYEKDC--SSEELDHGVLVVGYGFEGEDVDGKKYWIVNNSW 257


>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
 gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
          Length = 360

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q LIDC + + N GC+GG   +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   N      +   VN+T  +ED LK A+    PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356


>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q+ CGSCW+F TTGA+E AY     K   LS+Q L+DC+ G+ N GC+GG   +++++I
Sbjct: 161 NQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYI 220

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY G +  CH     A   +   VN+T N+ED LK A+    PVSVA
Sbjct: 221 KYNGGIDTEESY-PYKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVA 279

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  +PD ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 280 FQVID-GFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 338

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N C + T  +Y
Sbjct: 339 YFKMEMGKNMCAIATCASY 357


>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 126/204 (61%), Gaps = 10/204 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           ++  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ GL +++ Y PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 191 KENGGLDSEESY-PYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW + W
Sbjct: 249 MDASHPSLQFYSSGIYYEPNC--SSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTY 348
           G +GY+ +   +DN+CG+ TA +Y
Sbjct: 307 GMEGYIKIAKDRDNHCGLATAASY 330


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 123/200 (61%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS G Y N GC+GG    ++Q+I
Sbjct: 142 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYI 201

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + +V +   +E+ALK A+A  GPVSVA
Sbjct: 202 IDNNGIDSEASY-PYKAMDGKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVSVA 260

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYYD+ C  +   ++H VLAVGYG  +GK YW VKNSW  ++G QG
Sbjct: 261 IDASHPSFFLYRSGVYYDKACTLN---VNHGVLAVGYGNYNGKDYWLVKNSWGLHFGEQG 317

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+ + P+Y
Sbjct: 318 YIRMARNSGNHCGIASYPSY 337


>gi|410256886|gb|JAA16410.1| cathepsin L1 [Pan troglodytes]
          Length = 333

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/209 (42%), Positives = 131/209 (62%), Gaps = 14/209 (6%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           G   + +Q +CGSCW+F TTGAVEGA ++   KL  LS+Q L+DC    G+ GC+GG   
Sbjct: 129 GVTPVKNQGMCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMD 187

Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            ++ WI  +G + ++DDY  Y  +   C   +      ++GF +V P  E ALK+A+A+ 
Sbjct: 188 HAFAWIEDNGGICSEDDY-EYKAKAQVCR--DCEKVVKISGFQDVNPQDEHALKVAVAQQ 244

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
            PVSVAI+A QK+F FY +GV+ +  C      LDH VLAVGYG  +G+ +W+VKNSW +
Sbjct: 245 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGSENGQKFWKVKNSWGS 299

Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
            WG +GY+ ++ ++N     CG+ + P+Y
Sbjct: 300 SWGEKGYIRLAREENGPAGQCGIASVPSY 328


>gi|410256882|gb|JAA16408.1| cathepsin L1 [Pan troglodytes]
 gi|410256884|gb|JAA16409.1| cathepsin L1 [Pan troglodytes]
          Length = 333

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333


>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
          Length = 326

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/218 (40%), Positives = 123/218 (56%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +YQ++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +   GP +VA+D  +  F  Y  G+Y  + C  SP GL+HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
              G  YW VKNSW +YWG +GY+ M+  + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIAS 315


>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 361

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 84/192 (43%), Positives = 116/192 (60%), Gaps = 3/192 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCG 341
           Y  M +  N CG
Sbjct: 336 YFKMEMGKNMCG 347


>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
          Length = 333

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 93/217 (42%), Positives = 126/217 (58%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    +++++ ++G    +   PY  +D  C      + A  TGFV V P  E  L 
Sbjct: 178 CNGGFMNYAFRYVKENGGLDSEASYPYEAKDGICKYKPENSVANDTGFV-VIPTHEKELM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-----ELDGK 312
            A+A  GP+SVA+DAS  SF FY +G+Y+++KC  S   LDH VL VGYG       D K
Sbjct: 237 KAVATVGPISVAVDASHSSFQFYKSGIYFEKKC--SSKNLDHGVLVVGYGFEGANSKDNK 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
            YW +KNSW   WG  GY+ ++   NN CG+ TA +Y
Sbjct: 295 -YWLIKNSWGPEWGLNGYIKIAKDQNNHCGIATAASY 330


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 88/209 (42%), Positives = 131/209 (62%), Gaps = 14/209 (6%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           G   + +Q +CGSCW+F TTGAVEGA ++   KL  LS+Q L+DC    G+ GC+GG   
Sbjct: 129 GVTPVKNQGMCGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMD 187

Query: 205 RSYQWIMKHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            ++ WI  +G + ++DDY  Y  +   C   +      ++GF +V P  E ALK+A+A+ 
Sbjct: 188 HAFAWIEDNGGICSEDDY-EYKAKAQVCR--DCEKVVKISGFQDVNPQDEHALKVAVAQQ 244

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
            PVSVAI+A QK+F FY +GV+ +  C      LDH VLAVGYG  +G+ +W+VKNSW +
Sbjct: 245 -PVSVAIEADQKAFQFYKSGVF-NLTCGTR---LDHGVLAVGYGSENGQKFWKVKNSWGS 299

Query: 324 YWGNQGYVLMSIKDN----NCGVMTAPTY 348
            WG +GY+ ++ ++N     CG+ + P+Y
Sbjct: 300 SWGEKGYIRLAREENGPAGQCGIASVPSY 328


>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
          Length = 340

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 131/228 (57%), Gaps = 10/228 (4%)

Query: 127 VRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V Y   S   +P  V +  KG  + +  Q  CG+CW+F   GA+E    +K  KL  LS 
Sbjct: 114 VTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSA 173

Query: 184 QALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           Q L+DCS   YGN GC+GG    ++Q+I+  +G+ ++  Y PY   D  C   +    AT
Sbjct: 174 QNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASY-PYKATDGKCRYDSKNRAAT 232

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            + +  +   SED LK A+A  GPVSVAIDA   SF  Y +GVYYD  C  +   ++H V
Sbjct: 233 CSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQN---VNHGV 289

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
           L VGYG L+GK YW VKNSW   +G+QGY+ M+    N+CG+ + P+Y
Sbjct: 290 LVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 337


>gi|86279349|gb|ABC88770.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 416

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 124/203 (61%), Gaps = 7/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTGA+EG   ++  +L  LS+Q LIDCS  YGN GCDGG    ++ +I
Sbjct: 219 DQGQCGSCWSFSTTGAIEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYI 278

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G+ ++  Y PY  Q  YC   ++    T++G+ ++    E++L  A+ + GPV+VAI
Sbjct: 279 PDYGIMSEFAY-PYEAQGDYCRFDSSQFVTTLSGYYDLPSGGENSLADAVGQAGPVAVAI 337

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA  +   FY  G++YD+ CN S   L+H V  VGYG  +G+ YW +KNSW   WG  GY
Sbjct: 338 DAPDE-LQFYSGGLFYDQTCNQSD--LNHGVFVVGYGSDNGQDYWILKNSWGFGWGESGY 394

Query: 331 VLMSIKD--NNCGVMTAPTYVTM 351
               +++  NNCG+ TA +Y  +
Sbjct: 395 -WRQVRNYGNNCGIATAASYPAL 416



 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 4/176 (2%)

Query: 136 AIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           A  V +     + + DQ  CGSCWSF TTGA+EG   ++  +L  LS+Q LIDCS  YGN
Sbjct: 36  AASVDWRSNAVSEVKDQGQCGSCWSFSTTGAIEGQLALQRGRLTSLSEQNLIDCSSSYGN 95

Query: 196 NGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
            GCDGG    ++ +I  +G+ ++  Y PY  Q  YC   ++ +  T++G+ ++    E++
Sbjct: 96  AGCDGGWMDSAFSYIHDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGGENS 154

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           L  A+ + GPV+VAIDA+ +   FY  G++YD+ CN S   L+H VL VGYG  +G
Sbjct: 155 LADAVGQAGPVAVAIDATDE-LQFYSGGLFYDQTCNQSD--LNHGVLVVGYGSDNG 207


>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
 gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
          Length = 333

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 119/199 (59%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   GA+EG   +K  KL  LS Q L+DC+  YGN GC GG    ++Q++
Sbjct: 135 NQGGCGSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYV 194

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D Y PY   D  CH       ++   +  + P +ED LK AL   GP+SVAI
Sbjct: 195 IDNNGIDSDSYYPYHAMDEKCHYELAGKASSCVKYTEIVPGTEDNLKQALGTIGPISVAI 254

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D ++ +F  Y +GVY D  C+     ++H VLA+GYG L+G+ +W +KNSW TY+G++G+
Sbjct: 255 DGTRPTFFLYKSGVYSDPSCSQE---VNHGVLAIGYGTLNGQDFWLLKNSWGTYYGDKGF 311

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           V ++  K N CGV +  +Y
Sbjct: 312 VRIARNKGNLCGVASYTSY 330


>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
          Length = 328

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 131/228 (57%), Gaps = 10/228 (4%)

Query: 127 VRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           V Y   S   +P  V +  KG  + +  Q  CG+CW+F   GA+E    +K  KL  LS 
Sbjct: 102 VTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSA 161

Query: 184 QALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATAT 241
           Q L+DCS   YGN GC+GG    ++Q+I+  +G+ ++  Y PY   D  C   +    AT
Sbjct: 162 QNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASY-PYKATDGKCRYDSKNRAAT 220

Query: 242 MTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
            + +  +   SED LK A+A  GPVSVAIDA   SF  Y +GVYYD  C  +   ++H V
Sbjct: 221 CSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQN---VNHGV 277

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
           L VGYG L+GK YW VKNSW   +G+QGY+ M+    N+CG+ + P+Y
Sbjct: 278 LVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 325


>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
          Length = 331

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C        AT + ++ +   SE+ALK A+A  GPVSV 
Sbjct: 193 IDNNGIDSEASY-PYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA Q SF  Y  GVYYD  C  +   ++H VL VGYG L+GK YW VKNSW   +G+QG
Sbjct: 252 IDAKQTSFFLYKTGVYYDPSCTQN---VNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQG 308

Query: 330 YVLMSIKD-NNCGVMTAPTY 348
           Y+ M+    N+CG+   P+Y
Sbjct: 309 YIRMARNSGNHCGIANFPSY 328


>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
 gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
          Length = 360

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC + + N GC+GG   +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   N      +   VN+T  +ED LK A+    PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356


>gi|410990004|ref|XP_004001240.1| PREDICTED: cathepsin L1-like isoform 2 [Felis catus]
          Length = 334

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 95/221 (42%), Positives = 131/221 (59%), Gaps = 12/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY     +Q  C S W+F  TGA+EG  + K  KL  LS+Q L+DC+W  GN G
Sbjct: 118 VNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQNLLDCAWSEGNGG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPNSEDA 255
           C GG    ++Q++  + GL +++ Y PY  QD   C     ++ A +TGF+ + P +E  
Sbjct: 178 CSGGSMGNAFQYVKDNGGLDSEESY-PYHAQDLQSCKYNPESSAADITGFLTI-PKTEYK 235

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  ++A  GPVS AIDAS  +F FY  G+YYD  C  S + L H VL VGYG    ELD 
Sbjct: 236 LMRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNC--SHEILHHGVLIVGYGFEGAELDN 293

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           K YW VKNSW   WG  GY+LM+  +DN+CG+ +  ++ T+
Sbjct: 294 KKYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 334


>gi|399764976|gb|AFP50672.1| cathepsin L-like protease, partial [Earias vitella]
          Length = 222

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 90/174 (51%), Positives = 113/174 (64%), Gaps = 5/174 (2%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTGA+EG +Y K   L  LS+Q LIDCS  YGNNGC+GG    +++
Sbjct: 52  VKDQGKCGSCWSFSTTGALEGQHYRKTGVLVSLSEQNLIDCSTAYGNNGCNGGLMDNAFK 111

Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I   HGL T+  Y PY   D  C      + A   GFV++    ED LK A+A  GPVS
Sbjct: 112 YIKDNHGLDTEKSY-PYEAVDDKCRCNPKNSGADDVGFVDIPSGDEDKLKAAVATVGPVS 170

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNS 320
           VAIDASQ+SF FY +GVY +++C+++   LDH VL VGYG   DG+ YW VKNS
Sbjct: 171 VAIDASQQSFQFYSDGVYDEDQCSSTE--LDHGVLVVGYGTSEDGQDYWLVKNS 222


>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 359

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 159 NQGHCGSCWTFSTTGALEAAYVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYI 218

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G D  C  ++      +   VN+T  +ED LK A+A   PVSVA
Sbjct: 219 KANGGLDTEEAY-PYTGVDGVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVA 277

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C N+P  ++HAV+AVGYG  +  PYW +KNSW   WG+ G
Sbjct: 278 FEVVS-GFRLYKSGVYTSDTCGNTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWGDNG 336

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 337 YFKMEMGKNMCGVATCASY 355


>gi|426365294|ref|XP_004049712.1| PREDICTED: LOW QUALITY PROTEIN: putative cathepsin L-like protein
           6-like [Gorilla gorilla gorilla]
          Length = 333

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 134/220 (60%), Gaps = 12/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F  TGA+EG  + K  KL  L++Q L+DCS   GN G
Sbjct: 119 VDWREKGYMTPVKDQGQCGSCWAFSATGALEGQVFWKTGKLISLNEQNLVDCSGPQGNEG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG     ++++ ++ GL ++  Y PY G+   C      + A  TGFV++ P+ E  L
Sbjct: 179 CNGGFMDNPFRYVQENRGLDSEASY-PYEGKVKTCGYNPKYSAANDTGFVDI-PSREKDL 236

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY----GELDGK 312
             A+A  GP+SVA+ AS  SF FY  G+Y++ +C+  P+GLDHA+L VGY     + D K
Sbjct: 237 AKAVATVGPISVAVGASHVSFQFYKKGIYFEPRCD--PEGLDHAMLVVGYSYEGADWDNK 294

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ M+  + NNCG+ TA +Y T+
Sbjct: 295 -YWLVKNSWGKNWGMDGYIKMAKDRRNNCGITTAASYPTV 333


>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
 gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
          Length = 353

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A+     K+ VLS+Q L+DC+ GY N GC+GG   +++++I
Sbjct: 149 NQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCNGGLPSQAFEYI 208

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+D Y PY G D  C     +  A +   VN+T  +ED L  A+A + PVS+A
Sbjct: 209 RYNGGLDTEDSY-PYTGHDGKCTYNQNSIGAKVYDVVNITEGAEDELIHAVAFNRPVSIA 267

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F FY +GVY    C   PD ++HAVLAVGY      PYW +KNSW   +G  G
Sbjct: 268 YEV-LKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDG 326

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 327 YFYMEMGKNMCGIATCASY 345


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 123/201 (61%), Gaps = 9/201 (4%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+ DQ  CGSCW+F  TGA+EG Y  K  KL  LSQQ L+DCS   GN GCDGG    ++
Sbjct: 190 SVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAF 249

Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++++++G     +  PY+ +D  C  +  T+ AT+TG+ +V   SE ++K ALA   PVS
Sbjct: 250 EYVVENGGICSGENYPYMRKDGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVS 309

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY-GELDGK-PYWQVKNSWSTYW 325
           VAI A+Q +F FY +G+ +D  C  +   LDH VL VGY  E  G+  YW +KNSW   W
Sbjct: 310 VAIQANQAAFQFYYDGI-FDAPCGTN---LDHGVLLVGYSAETAGQGDYWIMKNSWGAAW 365

Query: 326 GNQGYVLMSIKD---NNCGVM 343
           G  GY+LM++       CGV+
Sbjct: 366 GKGGYMLMAMHKGPAGQCGVL 386


>gi|473159|emb|CAA83538.1| cathepsin L [Schistosoma mansoni]
          Length = 317

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 88/220 (40%), Positives = 133/220 (60%), Gaps = 8/220 (3%)

Query: 133 SKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S D +P +++ + + ++    +Q +CGSCW+F   GAVEG    KHKKL  LS+Q L+DC
Sbjct: 99  SNDPLPSKWDWRDHGAVTPVKNQGLCGSCWAFSAAGAVEGQLVKKHKKLISLSEQQLVDC 158

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
           S+ YGN+GC GG   +S+ ++ K+ + ++ DY  Y+G D+ CH   +     +  FV++ 
Sbjct: 159 SYKYGNDGCQGGTMDQSFAYLEKYPIESEKDY-KYIGHDSSCHFRKSKGVVKVKKFVDLP 217

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
              E+ L+ AL  +GP+SVAIDA       Y +G+Y  ++C++    L+H VLAVGYG  
Sbjct: 218 ARDEEKLQKALYHYGPISVAIDALD-DLILYKSGIYESKQCSSFL--LNHGVLAVGYGRE 274

Query: 310 DGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           + K YW +KNSW T WG  GY  L   K N CG+ T  ++
Sbjct: 275 NRKDYWLIKNSWGTTWGMNGYFKLRRNKHNMCGIATNASF 314


>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
          Length = 331

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 125/216 (57%), Gaps = 7/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V Y  KG   S+ +Q  CGSCW+F + GA+EG    K   L  LS Q L+DC     N+G
Sbjct: 120 VDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCV--TENDG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    +++++ ++G    +   PY+G+D  C    +   A + G+  V    E AL 
Sbjct: 178 CGGGYMTNAFKYVQENGGIDSEAAYPYMGEDQPCRYNVSGLAAQIKGYKEVPEGDEHALA 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
           +AL K GPVSV IDASQ SF +Y  G+Y+D  CN   + ++HAVLAVGYG    GK +W 
Sbjct: 238 VALFKAGPVSVGIDASQNSFLYYQKGIYFDRNCNK--EDINHAVLAVGYGVNAKGKKFWI 295

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VKNSW   WGN+GYVLM+  + N CG+    +Y  M
Sbjct: 296 VKNSWGETWGNKGYVLMARNRGNVCGIANLASYPVM 331


>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
          Length = 333

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 91/217 (41%), Positives = 132/217 (60%), Gaps = 11/217 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS    N+G
Sbjct: 118 VDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNSG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG   +++Q++  + GL +++ Y PY   ++ C  +   + AT+TGF ++ P  E AL
Sbjct: 178 CKGGLVIKAFQYVKDNGGLDSEESY-PYEEMESTCRYSPGNSAATVTGFKHI-PAEEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL----DGK 312
           + A+A  GP+SVAIDA   SF FY  G+ ++  C  SP  L+HAVL VGYG +    +  
Sbjct: 236 EKAVASVGPISVAIDAHHHSFQFYTGGILHEPNC--SPKWLNHAVLVVGYGVMQEGSNNN 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            YW VKNSW   WG  GY++M+  K+N+CG+ +   Y
Sbjct: 294 TYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALY 330


>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
          Length = 360

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q LIDC + + N GC+GG   +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C   N      +   VN+T  +ED LK A+    PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGICKFKNENVGFKVLDSVNITLGAEDELKDAVGLVRPVSVA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356


>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
          Length = 307

 Score =  164 bits (414), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 99/223 (44%), Positives = 134/223 (60%), Gaps = 19/223 (8%)

Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           A  V +  KG    + +Q  CGSCWSF TTG+ EGA+++K   L  LS+Q L+DCS   G
Sbjct: 83  AANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNLVSLSEQNLMDCSKPEG 142

Query: 195 NNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNS 252
           N GC+GG    ++++I+K +G+ T+  Y PY  +D   C      + AT++ +VNVT  S
Sbjct: 143 NQGCNGGLMTAAFEYIIKNNGIDTESSY-PYKAEDGKKCLYNPANSAATLSSYVNVTTGS 201

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
           E  L +  +  GPVSVAIDAS  SF  Y +GVYY+ KC+ +   LDH VL VGYG  D  
Sbjct: 202 ESDLAVK-SGLGPVSVAIDASHNSFQLYSSGVYYEPKCSQTQ--LDHGVLVVGYGS-DAL 257

Query: 313 P----------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
           P          +W VKNSW T WG +GY+ MS  ++NNCG+ T
Sbjct: 258 PSAGVSAGSGDWWIVKNSWGTTWGVEGYIYMSRNRNNNCGIAT 300


>gi|261824899|pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824900|pdb|3H89|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824901|pdb|3H89|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824902|pdb|3H89|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824903|pdb|3H89|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824904|pdb|3H89|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
 gi|261824905|pdb|3H8B|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|261824906|pdb|3H8B|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|261824907|pdb|3H8B|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|261824908|pdb|3H8B|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|261824909|pdb|3H8B|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|261824910|pdb|3H8B|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
 gi|317455049|pdb|2XU3|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|317455050|pdb|2XU4|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|317455051|pdb|2XU5|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|358009432|pdb|2YJ2|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|358009433|pdb|2YJ8|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|358009434|pdb|2YJ9|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|358009435|pdb|2YJB|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|358009436|pdb|2YJC|A Chain A, Cathepsin L With A Nitrile Inhibitor
          Length = 220

 Score =  164 bits (414), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 65  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score =  164 bits (414), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 90/226 (39%), Positives = 130/226 (57%), Gaps = 12/226 (5%)

Query: 130 NKASKDAIPVRYEMKGY------NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           +K +++A+P   EMK +      + + DQ  CGSCW+F TTGA+E AY     K   LS+
Sbjct: 134 HKLTEEALP---EMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKGISLSE 190

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATM 242
           Q L+DC+  + N GC GG   ++++++  + GL T++ Y PY G++  C  ++      +
Sbjct: 191 QQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAY-PYTGKNGECKFSSENVGVQV 249

Query: 243 TGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVL 302
              VN+T  +ED LK A+A   PVSVA       F  Y  GVY  + C  +P  ++HAVL
Sbjct: 250 LDSVNITLGAEDELKHAVAFVRPVSVAFQVVN-GFRLYKEGVYTSDTCGRTPMDVNHAVL 308

Query: 303 AVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           AVGYG  +G PYW +KNSW   WG+ GY  M +  N CGV T  +Y
Sbjct: 309 AVGYGVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCASY 354


>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
          Length = 359

 Score =  164 bits (414), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+ CGSCW+F TTGA+E AY     K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 158 DQASCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY G +  C      A   +   VN+T N+ED LK A+    PVSVA
Sbjct: 218 KYNGGIDTEESY-PYKGVNGVCKYRPENAAVQVADSVNITLNAEDELKNAVGLVRPVSVA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +PD ++HAVLAVGYG  +G PYW +KNSW   WG  G
Sbjct: 277 FEVID-GFKQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N C V T  +Y
Sbjct: 336 YFKMEMGKNMCAVATCASY 354


>gi|313754424|pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible
           Inhibition Of Human Cathepsin L By Their Respective
           Dipeptidyl Glyoxal And Diazomethylketone Inhibitors
 gi|313754425|pdb|3OF9|A Chain A, Structural Basis For Irreversible Inhibition Of Human
           Cathepsin L By A Diazomethylketone Inhibitor
          Length = 221

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 6   VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 65

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 66  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 124 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 181

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221


>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
          Length = 299

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 110/296 (37%), Positives = 157/296 (53%), Gaps = 36/296 (12%)

Query: 86  ENEKIRYNWTYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEM 143
           E+ + R N+T +       +  E++RLV + G +  K+     + + +   IP  V +  
Sbjct: 10  EHSQGRRNFT-LAMNAFGDMTNEEFRLVMN-GFQNQKHKKGDMFQEPALAEIPPSVDWRK 67

Query: 144 KG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KG    + DQ  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN GC GG 
Sbjct: 68  KGCVTPVKDQGGCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCSGGL 127

Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
              ++Q++  + GL T++ Y PY G D  C      + A  TGFV++  + E +L  A+A
Sbjct: 128 MDNAFQYVKDNEGLDTEESY-PYYGTDDTCKYKPEFSAANDTGFVDIHKD-ERSLMKAVA 185

Query: 262 KHGPVSVAIDASQKSFSFYVN---------------------GVYYDEKCNNSPDGLDHA 300
             GP+SVA+DAS +SF FY                       G+YYD  C  S + L+H 
Sbjct: 186 SVGPISVALDASLESFQFYEKGKVTVSSYLEIFTPAMTSVFLGIYYDPDC--SSEDLNHG 243

Query: 301 VLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
           VL VGYG    E+D   YW VKNSW T WG  GY+ M+   DN+CG+ +  +Y T+
Sbjct: 244 VLVVGYGFEGVEMDNNKYWIVKNSWGTKWGMDGYIKMAKDLDNHCGIASMASYPTV 299


>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
          Length = 331

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 91/216 (42%), Positives = 125/216 (57%), Gaps = 8/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KGY   + DQ  CGSCW+F T GA+EG    K  KL  +S Q L+DC     N G
Sbjct: 121 IDYRKKGYVTPIRDQGECGSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCV--KDNFG 178

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++ K+ G+ +++ Y PY+G D  C    +   A + GF  V   SE AL
Sbjct: 179 CGGGYMTTAFKYVKKNKGIDSEEAY-PYVGMDQKCKYNVSGRAAEIKGFKEVKKGSETAL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           K A+   GP+SV IDA   +F  Y  G+YYD+ C+   D ++HAVLAVGYG+     YW 
Sbjct: 238 KKAVGLVGPISVGIDAGLDTFFLYKKGIYYDKSCDG--DSINHAVLAVGYGKQKKGKYWI 295

Query: 317 VKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +KNSW   WGN+GY+LM+  K N CG+    +Y  M
Sbjct: 296 IKNSWGEDWGNKGYILMAREKGNACGIANLASYPVM 331


>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 357

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY G+D  C  +       +   VN+T  +ED LK A+    PVS+A
Sbjct: 218 KSNGGLDTEKAY-PYTGKDETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +    SF  Y +GVY D  C ++P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 277 FEVIH-SFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N C + T  +Y
Sbjct: 336 YFKMEMGKNMC-IATCASY 353


>gi|28194643|gb|AAO33583.1|AF479265_1 cathepsin P [Meriones unguiculatus]
          Length = 334

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/233 (40%), Positives = 138/233 (59%), Gaps = 13/233 (5%)

Query: 128 RYNKASKDAIPV--RYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQ 184
           R +K   D +P    +  +GY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS Q
Sbjct: 105 RKDKKIVDGLPKFKNWINEGYVTPVRNQGTCGSCWAFAATGAIEGQMFWKTGKLTPLSVQ 164

Query: 185 ALIDCSWGYGNNGCDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMT 243
            L+DCS   GN GC  G  FR++ ++ +  GL  +  Y PY G+   C   ++ + A +T
Sbjct: 165 NLVDCSEKQGNKGCAQGSAFRAFMYVNETKGLQDEISY-PYEGKQGTCRYNSSNSRAYVT 223

Query: 244 GFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLA 303
            F  + P +E  L +A+A  GPV+ A+DASQ SF FY  G+YY+ KC  S   ++HAVL 
Sbjct: 224 DF-RLLPQNEIYLLVAVASIGPVAAAVDASQDSFRFYRGGIYYEPKC--SQYSVNHAVLV 280

Query: 304 VGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VGYG    E DGK YW +KNSW   WG +GY+ ++  ++N+CG+ +  ++V +
Sbjct: 281 VGYGYEGNETDGKDYWLIKNSWGENWGMRGYMKIARDRNNHCGIASQASFVDI 333


>gi|440291372|gb|ELP84641.1| cysteine proteinase 3 precursor, putative [Entamoeba invadens IP1]
          Length = 247

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 124/222 (55%), Gaps = 17/222 (7%)

Query: 131 KASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL--SQQALID 188
           + S    PVR          DQ+ CGSC+SFG+  ++EG   M     + L  +++ ++ 
Sbjct: 36  RTSGKVTPVR----------DQAQCGSCYSFGSLASLEGRLLMIKGSASTLDLAEEQIVR 85

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS    NNGC+GG     Y +I  +G+  + DY PY   D  C +  +   AT+TG+  V
Sbjct: 86  CS---DNNGCNGGTGSVVYDYIKANGIVQEKDY-PYTATDGSCTVDTSKKYATITGYKKV 141

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T    +ALK AL   G V V+IDAS  +   Y  GVY+D KC      L+H V +VGYG 
Sbjct: 142 TKKDNNALKTALV-DGVVDVSIDASSVALQLYKTGVYHDTKCKTGLLDLNHEVASVGYGA 200

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
           L+G+ YW VKNSW T WG++GYVLM++  N CG+ T P Y T
Sbjct: 201 LNGEEYWIVKNSWGTSWGDKGYVLMAMAGNTCGITTDPIYPT 242


>gi|10946820|ref|NP_067420.1| cathepsin 6 precursor [Mus musculus]
 gi|9931384|gb|AAG02172.1|AF223401_1 cathepsin-6 [Mus musculus]
 gi|12838129|dbj|BAB24093.1| unnamed protein product [Mus musculus]
 gi|16445021|gb|AAK00510.1| cathepsin 6 precursor [Mus musculus]
 gi|68534635|gb|AAH99455.1| Cathepsin 6 [Mus musculus]
 gi|148709368|gb|EDL41314.1| cathepsin 6 [Mus musculus]
          Length = 334

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/226 (39%), Positives = 132/226 (58%), Gaps = 11/226 (4%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A  D +P  V +  KGY   +  Q  C SCW+F   GA+EG  + K  KL  LS Q L+
Sbjct: 109 RAVGDVLPKFVDWRKKGYVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKLTPLSVQNLV 168

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DC+   GN+GC  G+ + +Y++++ +G    +   PY G++  C      + A +TGFV+
Sbjct: 169 DCTKTQGNDGCQWGDPYIAYEYVLNNGGLEAEATYPYEGKEGPCRYNPKNSKAEITGFVS 228

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           + P SED L  A+A  GP+S A+DAS   FSFY  G+Y+   C+N  + ++HAVL VGYG
Sbjct: 229 L-PESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYHQPNCSN--NTVNHAVLVVGYG 285

Query: 308 ----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
               E DG  YW +KNSW   WG  GY+ ++  ++N+CG+ T   Y
Sbjct: 286 TEGNETDGNKYWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATYAHY 331


>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
 gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
 gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY+    K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G+D  C  +       +   VN+T  +ED LK A+    PVSVA
Sbjct: 218 KYNGGLDTEEAY-PYTGKDGGCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   + F FY  GV+    C N+P  ++HAVLAVGYG  D  PYW +KNSW   WG+ G
Sbjct: 277 FEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNG 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 336 YFKMEMGKNMCGVATCSSY 354


>gi|45550334|gb|AAS67923.1| cathepsin L [Artemia franciscana]
          Length = 226

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  C SC +F  TGA+E   + K  KL  LS+Q LIDCS  YGN GC GG   +++++I 
Sbjct: 26  QGQCASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIK 85

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T++ Y  Y  ++ +C        A   GFVN+    ED LK A+A  GPVS  I
Sbjct: 86  DNKGIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVI 144

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D S + F FY  GVYY+  C  S + L+HAVL +G G  +G+ YW VKNSWS +WG++GY
Sbjct: 145 DVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGCGSDNGEDYWLVKNSWSKHWGDEGY 204

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + ++  + N+CGV TA  Y
Sbjct: 205 LKIARNRKNHCGVATAALY 223


>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
          Length = 311

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 88/218 (40%), Positives = 124/218 (56%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 87  EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 146

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +YQ++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 147 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 205

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +   GP +VA+D  +  F  Y +G+Y  + C  SP  ++HAVLAVGYG
Sbjct: 206 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 262

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
             DG  YW VKNSW +YWG +GY+ M+  + N CG+ +
Sbjct: 263 TQDGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIAS 300


>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
          Length = 333

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 178 CNGGLMDYAFQYVADNGGLDSEESY-PYEATEESCKYNPEYSVANDTGFVDI-PKQEKAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 236 MKAVATVGPISVAIDAGHESFMFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNS 293

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW  KNSW   WG  GY+ M+  + N+CG+ +A +Y T+
Sbjct: 294 KYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333


>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
 gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
 gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 116/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 162 DQGHCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 221

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G D  C  ++      +   VN+T  +ED LK A+A   PVSVA
Sbjct: 222 KYNGGLDTEEAY-PYTGLDGTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVA 280

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F FY  GVY    C ++P  ++HAVLAVGYG  DG  YW +KNSW   WG+ G
Sbjct: 281 FEVVH-DFRFYKKGVYTSGTCGSTPMDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNG 339

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 340 YFKMELGKNMCGVATCSSY 358


>gi|338717354|ref|XP_001492337.3| PREDICTED: pro-cathepsin H-like [Equus caballus]
          Length = 323

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   KL  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 122 NQGGCGSCWTFSTTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 181

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD  C      A A +    N+T N E A+  A+A + PVS A
Sbjct: 182 RYNKGIMGEDTY-PYKGQDGDCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFA 240

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW  +WG  G
Sbjct: 241 FEVTE-DFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNG 299

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318


>gi|258588539|pdb|3HWN|A Chain A, Cathepsin L With Az13010160
 gi|258588540|pdb|3HWN|B Chain B, Cathepsin L With Az13010160
 gi|258588541|pdb|3HWN|C Chain C, Cathepsin L With Az13010160
 gi|258588542|pdb|3HWN|D Chain D, Cathepsin L With Az13010160
          Length = 258

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 43  VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 102

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A   GFV++ P  E AL
Sbjct: 103 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 160

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 161 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 218

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 219 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 258


>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
 gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
          Length = 328

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 91/223 (40%), Positives = 131/223 (58%), Gaps = 7/223 (3%)

Query: 128 RYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           R    + DA+  R E      + +Q  CG+CW+F   GA+E    +K  KL  LS Q L+
Sbjct: 108 RRRGGAPDAMDWR-EKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLV 166

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           DCS  YGN GC GG   R++Q+I+  +G+ +++ Y PY+ Q+  C    +T  AT + +V
Sbjct: 167 DCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESY-PYMAQNGTCQYNVSTRAATCSKYV 225

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
            +    E ALK A+A  GPVSVAIDA+Q +F  Y +GVY D +C      ++H VL VGY
Sbjct: 226 ELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQE---VNHGVLVVGY 282

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
           G L+ K +W VKNSW   +G+ GY+ MS    N+CG+ +  +Y
Sbjct: 283 GTLNEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASY 325


>gi|139002720|dbj|BAF51966.1| cathepsin K [Carassius auratus]
 gi|139002725|dbj|BAF51967.1| tartrate-resistant acid phosphatase [Carassius auratus]
          Length = 332

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/217 (42%), Positives = 128/217 (58%), Gaps = 9/217 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y   GY  S+ +Q  CGSCW+F + GA+EG       +L  LS Q L+DC     N+G
Sbjct: 121 IDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLKKTKGQLVDLSPQNLVDCV--TDNDG 178

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    +++++    G+ +++ Y PY+G D  C   ++   AT  GF  +   +E AL
Sbjct: 179 CGGGYMTNAFRYVKDNQGIDSEEGY-PYVGTDQQCAYNSSARAATCKGFKEIPQGNEKAL 237

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYW 315
             A+AK GPVSV IDA Q +F +Y +GVYYD  CN   D ++HAVLAVGYG    GK YW
Sbjct: 238 TAAVAKVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK--DDVNHAVLAVGYGATPKGKKYW 295

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
            VKNSW   WG +GYVLM+   NN CG+ +  ++  M
Sbjct: 296 IVKNSWGEDWGKKGYVLMARNRNNACGIASLASFPVM 332


>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
          Length = 324

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 124/202 (61%), Gaps = 7/202 (3%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + +Q  CGSCWSF  TGA+E    ++   L  LS+Q L+DCS  YGN GC+GG    ++Q
Sbjct: 125 IKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQ 184

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
           ++  +G    + Y PY  +   CH  +  + AT +G+ +VTP  SE AL+  +A  GP+S
Sbjct: 185 YVQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           +AIDAS   +  Y +GV+ D  C+ +    DHAVL VGYG  +G+ YW VKNSW T+WG 
Sbjct: 245 IAIDAS--GWQSYQSGVFNDPSCSQT---ADHAVLLVGYGTYNGQDYWLVKNSWGTWWGE 299

Query: 328 QGYVLMSIKDNN-CGVMTAPTY 348
           QGY++M+   NN CG+    +Y
Sbjct: 300 QGYIMMARNANNQCGIANHASY 321


>gi|226821419|gb|ACO82385.1| cathepsin K [Lutjanus argentimaculatus]
          Length = 330

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 123/216 (56%), Gaps = 7/216 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + Y  KG   S+ +Q  CGSCW+F + GA+EG    K  +L  LS Q L+DC     N+G
Sbjct: 119 IDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLAKKTGQLVDLSPQNLVDCV--TENDG 176

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG   +++Q++  +G    ++  PY+G+D  C    T   A   G+  +   +E AL 
Sbjct: 177 CGGGYMTKAFQYVADNGGIDSEEAYPYIGEDQPCRYNATGMAAQCKGYKEIPEGNEHALA 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL-DGKPYWQ 316
           +AL K GPVSV IDA+  SF FY  GVYYD  CN   + ++HAVLAVGYG    GK YW 
Sbjct: 237 VALFKAGPVSVGIDATLSSFQFYSKGVYYDPSCNK--EDINHAVLAVGYGVTGKGKKYWI 294

Query: 317 VKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           VKNSW   WG  GY+LM+    N CG+    +Y  M
Sbjct: 295 VKNSWGESWGKGGYILMARNRGNLCGIANLASYPIM 330


>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
          Length = 330

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 120/199 (60%), Gaps = 6/199 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS  YGN GC+GG    ++Q+I+
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYII 192

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ ++  Y PY   D  C   +    AT + +  +    ED LK A+A  GPV V +
Sbjct: 193 DNKGIDSEASY-PYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGV 251

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYYD  C  +   ++H VL +GYG+L+G+ YW VKNSW + +G +GY
Sbjct: 252 DASHSSFFLYRSGVYYDPACTQN---VNHGVLVIGYGDLNGEEYWLVKNSWGSNFGERGY 308

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 309 IRMARNKGNHCGIASYPSY 327


>gi|47169476|tpe|CAE48375.1| TPA: cathepsin Q-like 2 [Rattus norvegicus]
          Length = 342

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 91/223 (40%), Positives = 130/223 (58%), Gaps = 13/223 (5%)

Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           +DA+P  + +  +GY   + +Q  C SCW+F   GA+EG  + K  KL  LS Q L+DCS
Sbjct: 122 RDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCS 181

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              GN GC GG  + ++Q+++++G    +   PY G++  C      A A +T FV + P
Sbjct: 182 KPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVAL-P 240

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--- 307
             ED L  ALA  GPV+  I AS  SF F V+G+Y++ KCNN    ++HAVL VGYG   
Sbjct: 241 EDEDVLMDALATKGPVAAGIHASHGSFHF-VSGIYHEPKCNNR---VNHAVLVVGYGFEG 296

Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            E DG  YW +KNSW   WG +GY+ +   ++N+CG+ T   Y
Sbjct: 297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQY 339


>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 88/214 (41%), Positives = 128/214 (59%), Gaps = 6/214 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + +Q  C SCW+F   GA+EG ++ K  +L  LS Q L+DC+     + 
Sbjct: 121 VDWREEGYVTPVKNQGQCLSCWAFSAVGAIEGQWFRKTGELVSLSIQNLVDCTTSDSISS 180

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG   R++Q++  +G    ++  PY+G+   C      + A + GFV++    E AL 
Sbjct: 181 CHGGFMDRAFQYVQDNGGIDTEECYPYVGEVNECKYQPECSGANVVGFVDIPSMDERALM 240

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYW 315
            A+A  GP+SVAID    SF FY +GVYYD +C++S   L+HA L VGYG   +DG+ YW
Sbjct: 241 EAVATVGPISVAIDGGNPSFKFYESGVYYDPQCSSSQ--LNHAGLVVGYGSEGIDGRKYW 298

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW   WGN GY+LM+  +DN+CG+ T  +Y
Sbjct: 299 IVKNSWGELWGNNGYILMAKDEDNHCGIATEASY 332


>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
          Length = 342

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 93/217 (42%), Positives = 129/217 (59%), Gaps = 8/217 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F +TG++EG ++    +L  LS+Q L+DC+  YGN G
Sbjct: 128 VDWRDKGYVTPVKNQGACGSCWAFSSTGSLEGQHFRLTGQLVSLSEQNLVDCTKKYGNAG 187

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTAT--ATMTGFVNVTPNSED 254
           C+GG    ++ ++   +G+ T+  Y PY G D +C    +     A  TG V+V    E 
Sbjct: 188 CNGGWMDNAFNYVKANNGIDTEAFY-PYEGHDDWCGYDGSPGHKGANCTGHVDVQQGDEL 246

Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
           ALK A+A  GPVSV IDA+ +SF  Y +G+Y +  C+NS    DHAVL VGYG   G  Y
Sbjct: 247 ALKQAVATVGPVSVGIDATHRSFQLYKSGIYDEVACSNSS--TDHAVLVVGYGSQGGHDY 304

Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
           W VKNSW T WG  GY++MS  K N C + +  +Y T
Sbjct: 305 WLVKNSWGTSWGMDGYIMMSRNKGNQCAIASYASYPT 341


>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
          Length = 357

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K    S+Q L+DC+  + N GC GG   +++++I
Sbjct: 157 DQGHCGSCWTFSTTGALEAAYVQAFGKQISPSEQQLVDCAGAFNNFGCSGGLPSQAFEYI 216

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+  Y PY   D  C  ++      +   VN+T N E+ LK A+A   PVSVA
Sbjct: 217 KYNGGLDTEQAY-PYTAVDGACKFSSENVGVRVLDSVNITLNDEEELKHAVAFVRPVSVA 275

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
               Q  F  Y +GVY  E C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 276 FQVVQ-DFRLYKSGVYTSETCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGQSWGDNG 334

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M    N CGV T  +Y
Sbjct: 335 YFKMEYGKNMCGVATCASY 353


>gi|432910514|ref|XP_004078393.1| PREDICTED: cathepsin S-like [Oryzias latipes]
          Length = 339

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 122/201 (60%), Gaps = 4/201 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG    K  KL  LS Q L+DCS  YGN+GC+GG   +++Q+++
Sbjct: 142 QGSCGSCWAFSAVGALEGQLCRKTGKLVDLSPQNLVDCSSKYGNHGCNGGFMHQAFQYVI 201

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            +     D   PY+G    CH ++    A  + +  +    E ALK A+A  GP+SVAID
Sbjct: 202 DNQGIDSDAGYPYVGVTQNCHYSSEYRAANCSQYSFLPEGDEGALKEAIATIGPISVAID 261

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
           A++  F+FY +GVY D  C+ +   ++H VLAVGYG L+G+ YW VKNSW T +G QGY+
Sbjct: 262 ATRPRFAFYRSGVYDDSSCSQN---VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGEQGYI 318

Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
            M+  K++ CG+     Y  M
Sbjct: 319 RMARNKNDQCGIAMYGCYPIM 339


>gi|208972988|dbj|BAG74343.1| silicatein-M2 [Ephydatia fluviatilis]
          Length = 326

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 127/199 (63%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+ ++F  TGA+EGA  + + K   LS+Q +IDCS  YGN+GC GG+ + ++++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 187

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T+  Y  + G+ + C   N T+ A+ TG V++   SE+ L  A+A  GPV+VAI
Sbjct: 188 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIAYGSENDLLAAVATVGPVAVAI 246

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+  +F FY +GV+    C+++   L+HA+L  GYG  +GK YW VKNSWS  WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGY 304

Query: 331 VLM-SIKDNNCGVMTAPTY 348
           +LM   K N CG+ +   Y
Sbjct: 305 ILMVRNKYNQCGIASDALY 323


>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
 gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
 gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
 gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
 gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
 gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
 gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
 gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
 gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
 gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
          Length = 335

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 121/201 (60%), Gaps = 4/201 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN+GC+GG   +++Q+++
Sbjct: 140 QGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVI 199

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            +     D   PY G++  C   +    A  + +  +   +E ALK ALA  GP+SVAID
Sbjct: 200 DNQGIDSDASYPYTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAID 259

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
           A++ +F+FY +GVY D  C+     ++H VLAVGYG LDG+ YW VKNSW   +G+QGY+
Sbjct: 260 ATRPTFTFYRSGVYNDPNCSQK---VNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYI 316

Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
            MS  K++ CG+     Y  M
Sbjct: 317 RMSRNKNDQCGIALYGCYPIM 337


>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
 gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
 gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
 gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
          Length = 331

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 128/233 (54%), Gaps = 7/233 (3%)

Query: 122 KYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV 180
           + + W+  N   K    + Y  KG    + +Q  CGSCW+F + GA+EG       KL  
Sbjct: 103 RSNTWIPDNNVVKIPRSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLID 162

Query: 181 LSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATA 240
           LS Q L+DC     NNGC GG    +++++ ++G    ++  PYLGQD  C    +   A
Sbjct: 163 LSPQNLVDCV--TENNGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAYNASGMGA 220

Query: 241 TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHA 300
              GF  +    E AL  A+ K GPV+V IDA+  +F FY  GVYYD  CN   D ++HA
Sbjct: 221 QCRGFKEIPEGDEWALTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNK--DDINHA 278

Query: 301 VLAVGYGE-LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           VLAVGYG+   G  +W VKNSWS  WG QGY++M+  + N CG+    +Y  M
Sbjct: 279 VLAVGYGQTAKGMKFWIVKNSWSESWGKQGYIMMARNRGNACGIANLASYPIM 331


>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
          Length = 336

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
          Length = 336

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|47522632|ref|NP_999094.1| pro-cathepsin H precursor [Sus scrofa]
 gi|5915886|sp|O46427.1|CATH_PIG RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|2735659|gb|AAB93957.1| preprocathepsin H [Sus scrofa]
 gi|172050733|gb|ACB70168.1| cathepsin H [Sus scrofa]
          Length = 335

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD +C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 194 RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|410921048|ref|XP_003973995.1| PREDICTED: digestive cysteine proteinase 2-like [Takifugu rubripes]
          Length = 290

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 92/223 (41%), Positives = 126/223 (56%), Gaps = 8/223 (3%)

Query: 130 NKASK-DAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           N A K D + + Y   GY   + DQ  CGSCW+F TTGA+EG  + K  +L  LS+Q L+
Sbjct: 69  NSAKKFDGMVIDYRQMGYVTEVKDQGYCGSCWAFSTTGAIEGQIFKKTGQLMSLSEQNLV 128

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  YG  GC G     +Y +++ +GL +   Y PY      C+  +  A A +  +  
Sbjct: 129 DCSKSYGTYGCSGAWMANAYDYVVNNGLESTITY-PYTSDTQPCYYDSRLAVAHIKDYRF 187

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           +    E AL  A+A  GP++VAIDAS  SF FY +G+Y +  CN  P+ L HAVL VGYG
Sbjct: 188 IPKGDEQALADAVATIGPITVAIDASHSSFLFYSSGIYEESNCN--PNNLSHAVLLVGYG 245

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTY 348
              G+ YW +KNSW   WG  GY+ + I+D  N CG+ +   Y
Sbjct: 246 SEGGQDYWLIKNSWGPSWGEGGYMRL-IRDGKNPCGIASYALY 287


>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
          Length = 335

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|4139678|pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1
           Angstrom Resolution: Location Of The Mini-Chain
           C-Terminal Carboxyl Group Defines Cathepsin H
           Aminopeptidase Function
 gi|28948781|pdb|1NB3|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H:
           N-Terminal Residues Of Inhibitors Can Adapt To The
           Active Sites Of Endo-And Exopeptidases
 gi|28948784|pdb|1NB3|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H:
           N-Terminal Residues Of Inhibitors Can Adapt To The
           Active Sites Of Endo-And Exopeptidases
 gi|28948787|pdb|1NB3|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H:
           N-Terminal Residues Of Inhibitors Can Adapt To The
           Active Sites Of Endo-And Exopeptidases
 gi|28948790|pdb|1NB3|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H:
           N-Terminal Residues Of Inhibitors Can Adapt To The
           Active Sites Of Endo-And Exopeptidases
 gi|28948793|pdb|1NB5|A Chain A, Crystal Structure Of Stefin A In Complex With Cathepsin H
 gi|28948796|pdb|1NB5|B Chain B, Crystal Structure Of Stefin A In Complex With Cathepsin H
 gi|28948799|pdb|1NB5|C Chain C, Crystal Structure Of Stefin A In Complex With Cathepsin H
 gi|28948802|pdb|1NB5|D Chain D, Crystal Structure Of Stefin A In Complex With Cathepsin H
          Length = 220

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 19  NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 78

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD +C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 79  RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 137

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 138 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 196

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 197 YFLIERGKNMCGLAACASY 215


>gi|218478069|dbj|BAH03395.1| cathepsin L-like cysteine peptidase [Taenia solium]
          Length = 346

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +TGA+EGA+  K  KL  LS+Q L+DCS   GN+GC+GG    +++++
Sbjct: 141 NQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLKNGNDGCNGGYMSYAFKYL 200

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +H +  +  Y PY   D  C    +    T+T   ++   +E AL  A+A  GP+S+AI
Sbjct: 201 EEHFIEPESAY-PYRATDGPCRYNESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAI 259

Query: 271 DASQKSFSFYV-------NGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           DAS   F FY        +G+Y    C  S   L+H VLA+GYG+ DGKPYW VKNSW T
Sbjct: 260 DASSLGFMFYRQVATNPHHGIYKSHWC--SSKFLNHGVLAIGYGKQDGKPYWLVKNSWGT 317

Query: 324 YWGNQGYVLMSIKD--NNCGVMT 344
            WG +GY++M+ KD  N CGV +
Sbjct: 318 RWGMKGYIMMA-KDYHNMCGVAS 339


>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
 gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
 gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
 gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
 gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
          Length = 330

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 91/210 (43%), Positives = 119/210 (56%), Gaps = 5/210 (2%)

Query: 144 KGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY + +  Q  CGSCW+F   GA+EG       KL  +S Q L+DCS  YGN GC+GG 
Sbjct: 124 KGYVTEVKMQGSCGSCWAFSAVGALEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGF 183

Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
             +++Q+++ +     D   PY G    C        A  + +  +    E  LK ALA 
Sbjct: 184 MSQAFQYVIDNQGIDSDQSYPYKGVQQQCSYNPAQRAANCSKYSFLPEGDEGVLKEALAT 243

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
            GP+SVAIDA++  F+FY +GVY D  C      ++HAVLAVGYG L G+ YW VKNSWS
Sbjct: 244 IGPISVAIDATRPLFTFYRSGVYNDPTCTKK---INHAVLAVGYGTLGGQDYWLVKNSWS 300

Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             WG+QGY+ MS  KDN CG+     Y  M
Sbjct: 301 LSWGDQGYIRMSRNKDNQCGIALYGCYPVM 330


>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
          Length = 335

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
          Length = 358

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 121/199 (60%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY  K  K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 158 NQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 217

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G++  C  ++      +T  VN+T  +ED LK A+A   PVSVA
Sbjct: 218 KSNGGLETEEAY-PYTGKNGLCKFSSQNVGVKVTDSVNITLGAEDELKYAVALVRPVSVA 276

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F  Y +GVY   +C  +P  ++HAVLAVGYG   G P+W +KNSW   WG+  
Sbjct: 277 FEVV-KGFKQYKSGVYTSTECGTTPMDVNHAVLAVGYGVEYGVPFWLIKNSWGADWGDNA 335

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M + ++ CG+ T  +Y
Sbjct: 336 YFKMEMGNDMCGIATCASY 354


>gi|326926970|ref|XP_003209669.1| PREDICTED: cathepsin H-like [Meleagris gallopavo]
          Length = 323

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/198 (41%), Positives = 117/198 (59%), Gaps = 3/198 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F TTG +E A  +   KL  L++Q L+DC+  + N+GC GG   +++++I+
Sbjct: 123 QGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQAFNNHGCSGGLPSQAFEYIL 182

Query: 212 -KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              GL  +D Y PY  Q+  C      A A +   +N+T   E ++  A+ KH PVS A 
Sbjct: 183 YNKGLMGEDAY-PYRAQNGTCKFQPDKAVAFVRDVINITQYDEASMVEAVGKHNPVSFAF 241

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           + +   F  Y  GVY + +C ++PD ++HAVLAVGYGE DG PYW VKNSW + WG  GY
Sbjct: 242 EVTND-FMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGLPYWIVKNSWGSLWGMDGY 300

Query: 331 VLMSIKDNNCGVMTAPTY 348
            L+    N CG+    +Y
Sbjct: 301 FLIERGKNMCGLAACASY 318


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 127/207 (61%), Gaps = 10/207 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   GA+EG   +K   L  LS+Q L+DCS G GN GC+GG    ++Q++
Sbjct: 131 NQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + GL +++ Y PY  +D  C      A A  TG+V++ P  E AL  A+A  GP++VA
Sbjct: 191 LNNKGLDSEESY-PYEAKDGTCKYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           IDAS  SF FY +G+Y++  C  S   LDH VL +GYG    + + K YW VKNSW T W
Sbjct: 249 IDASHPSFQFYSSGIYFEPNC--SSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           G  G+  +   K+N+CG+ TA +Y T+
Sbjct: 307 GMGGFFHIAKDKNNHCGIATAASYPTV 333


>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
          Length = 326

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 123/218 (56%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMK--GY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA+K A+P R + +  GY   + DQ  CGSCW+F TTGA+EG Y    K     S+Q L+
Sbjct: 102 KANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQYMKNEKTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GN GC+GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSGPFGNYGCNGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V    E  L+  +    P +VA+D  +  F  Y +G+Y  + C  SPD L+H VLAVGYG
Sbjct: 221 VHSGDEVELQNLVGCRRPAAVALDV-ESDFMMYRSGIYQSQTC--SPDRLNHGVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
             DG  YW VKNSW T+WG  GY+ M  K  N CG+ +
Sbjct: 278 IQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIAS 315


>gi|402856107|ref|XP_003892641.1| PREDICTED: cathepsin S isoform 2 [Papio anubis]
          Length = 281

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 5/206 (2%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
           G N L D   CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG  
Sbjct: 76  GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135

Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
            R++Q+I+ +     D   PY   D  C   +    AT + +  +    ED LK  +A  
Sbjct: 136 TRAFQYIIDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANK 195

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           GPVSV +DAS  SF  Y +GVYY+  C  +   ++H VL VGYG L+GK YW VKNSW  
Sbjct: 196 GPVSVGVDASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGR 252

Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
            +G +GY+ M+  K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278


>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
          Length = 358

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 115/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+E AY     K   LS+Q L+DC++ + N GC+GG   +++++I
Sbjct: 157 DQGSCGSCWTFSTTGALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYI 216

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G + +CH         +   VN+T  +ED L  A+    PVS+A
Sbjct: 217 KYNGGLDTEESY-PYAGVNGFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIA 275

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F FY  GVY  + C  +   ++HAVLAVGYG  +G PYW +KNSW   WG  G
Sbjct: 276 FEVVS-GFRFYKGGVYTSDTCGRTQMDVNHAVLAVGYGVENGVPYWLIKNSWGEEWGVDG 334

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 335 YFKMELGKNMCGIATCASY 353


>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
 gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
          Length = 335

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|86285730|gb|ABC94586.1| silicatein alpha [Hymeniacidon perlevis]
          Length = 331

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 123/206 (59%), Gaps = 4/206 (1%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHK-KLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           ++ DQ  CG+ ++F   GA+EGAY + H       S+Q +IDCS  YGN GC GG  + +
Sbjct: 128 AVKDQGDCGASYAFSAMGALEGAYALAHNGNQESFSEQNIIDCSIPYGNYGCHGGNMYDA 187

Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           + +++ +G   ++   P+LG+ + C+    T    M+G V +   SED L+ A+A  GPV
Sbjct: 188 FLYVIANGGVAKESAYPFLGKQSSCNYNRNTRGTGMSGSVAIKSESEDDLQTAVANVGPV 247

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VAID +  +F FY +GVY   +C  S   L+HA++  GYG   GK YW  KNSWST WG
Sbjct: 248 AVAIDGANNAFRFYYSGVYDSSRC--SSTSLNHAMVVTGYGTYAGKKYWLAKNSWSTNWG 305

Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVTM 351
             GYV+M+  K N CG+ T  +Y T+
Sbjct: 306 QSGYVMMARGKYNQCGIATDASYPTL 331


>gi|48145879|emb|CAG33162.1| CTSH [Homo sapiens]
          Length = 335

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|172050735|gb|ACB70169.1| cathepsin H transcript variant 3 [Sus scrofa]
          Length = 251

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 50  NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 109

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY GQD +C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 110 RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 168

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 169 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 227

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 228 YFLIERGKNMCGLAACASY 246


>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
          Length = 355

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   KL  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 154 NQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 213

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               G+  +D Y PY G+D  C    + A A +    N+T N E+A+  A+A + PVS A
Sbjct: 214 RYNKGIMGEDTY-PYRGEDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 272

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE  G PYW VKNSW  +WG +G
Sbjct: 273 FEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPHWGMKG 331

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    ++
Sbjct: 332 YFLIERGKNMCGLAACASF 350


>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 120/201 (59%), Gaps = 4/201 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN+GC+GG    ++Q+++
Sbjct: 140 QGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVI 199

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            +     D   PY G++  C   +    A  + +  +   +E ALK ALA  GP+SVAID
Sbjct: 200 DNQGIDSDASYPYTGRNGECRYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAID 259

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
           A++ +F+FY +GVY D  C+     ++H VLAVGYG LDG+ YW VKNSW   +G+QGY+
Sbjct: 260 ATRPTFTFYRSGVYNDPNCSQK---VNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYI 316

Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
            MS  K++ CG+     Y  M
Sbjct: 317 RMSRNKNDQCGIALYGCYPIM 337


>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
          Length = 331

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 10/234 (4%)

Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
           S++   V Y   S   +P  V +  KG  + +  Q  CG+CW+F   GA+E    +K  K
Sbjct: 99  SQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGK 158

Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
           L  LS Q L+DCS   YGN GC+GG    ++Q+I+  +G+ ++  Y PY   +  C   +
Sbjct: 159 LVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASY-PYKAMNGKCRYDS 217

Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
               AT + +  +   SEDALK A+A  GPVSVAIDAS  SF  Y +GVYY+  C  +  
Sbjct: 218 KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQN-- 275

Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
            ++H VL VGYG L+GK YW VKNSW   +G+QGY+ M+    N+CG+ + P+Y
Sbjct: 276 -VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 328


>gi|253722774|pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform
          Length = 312

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 97  VDWREKGYVTPVKNQGQCGSSWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPEGNEG 156

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A   GFV++ P  E AL
Sbjct: 157 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 214

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E DG 
Sbjct: 215 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDGN 272

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 273 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 312


>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
          Length = 336

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/213 (39%), Positives = 126/213 (59%), Gaps = 5/213 (2%)

Query: 139 VRYEMKGY--NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
           V +  KG+  + + +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+
Sbjct: 121 VDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNH 180

Query: 197 GCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
           GC GG   +++++I+  +G+  +D Y PY G+D+ C      A   +    N+T   EDA
Sbjct: 181 GCQGGLPSQAFEYILYNNGIMGEDTY-PYQGKDSDCKFQPGKAIGFVKDVANITIYDEDA 239

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           +  A+A + PVS A + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW
Sbjct: 240 MVEAVALYNPVSFAFEVTQ-DFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYW 298

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
            VKNSW   WG  GY L+    N CG+    +Y
Sbjct: 299 IVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331


>gi|241913450|pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With
           Az12878478
 gi|241913451|pdb|3HHA|B Chain B, Crystal Structure Of Cathepsin L In Complex With
           Az12878478
 gi|241913452|pdb|3HHA|C Chain C, Crystal Structure Of Cathepsin L In Complex With
           Az12878478
 gi|241913453|pdb|3HHA|D Chain D, Crystal Structure Of Cathepsin L In Complex With
           Az12878478
 gi|317455045|pdb|2XU1|A Chain A, Cathepsin L With A Nitrile Inhibitor
 gi|317455046|pdb|2XU1|B Chain B, Cathepsin L With A Nitrile Inhibitor
 gi|317455047|pdb|2XU1|C Chain C, Cathepsin L With A Nitrile Inhibitor
 gi|317455048|pdb|2XU1|D Chain D, Cathepsin L With A Nitrile Inhibitor
          Length = 220

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A   GFV++ P  E AL
Sbjct: 65  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 134/234 (57%), Gaps = 10/234 (4%)

Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
           S++   V Y   S   +P  V +  KG  + +  Q  CG+CW+F   GA+E    +K   
Sbjct: 107 SQWQANVTYKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWAFSAVGALEAQLKLKTGN 166

Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
           L  LS Q L+DCS   YGN GC+GG   +++Q+I+  +G+ ++  Y PY   D  C   +
Sbjct: 167 LVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSY-PYKAMDGNCRYDS 225

Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
               AT + +  +   SEDALK A+A  GPVSVAIDA   SF  Y +GVYYD  C  +  
Sbjct: 226 KHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSGVYYDPSCTQN-- 283

Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
            ++H VL VGYG L+G+ YW VKNSW   +G QGY+ M+    N+CG+ + P+Y
Sbjct: 284 -VNHGVLVVGYGNLNGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASYPSY 336


>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
          Length = 327

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 128/218 (58%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A  + +P   + + Y     + DQ  CGSCW+F +TGA+EG Y  K +     S+Q L+
Sbjct: 103 QAKGNDVPESIDWRDYGYVTEVKDQGQCGSCWAFSSTGAMEGQYIKKFRTTVSFSEQQLV 162

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DC+  YGN+GC+GG   R+++++ ++GL T+  Y PY   D +C   +    A +TG+  
Sbjct: 163 DCTRNYGNSGCNGGWMERAFEYLRRNGLETESSY-PYRAVDDHCRYESQLGVAKVTGYYT 221

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
               +E +L   +   GPV+VA+D  Q  FS Y +G+Y  E C  S   ++HAVLAVGYG
Sbjct: 222 EHSGNEVSLMNMVGGEGPVAVAVDV-QSDFSMYKSGIYQSETC--STYYVNHAVLAVGYG 278

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 344
              G  YW +KNSW ++WG+QGY+  +   NN CG+ +
Sbjct: 279 TESGTDYWILKNSWGSWWGDQGYIRFARNRNNMCGIAS 316


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EGA+  K  KL  LS+Q L+DCS   GN  C GGE   ++Q++
Sbjct: 222 DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 281

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           +  G    +D  PYL +D  C   +      + GF +V   SE A+K ALAK  PVS+AI
Sbjct: 282 LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAI 340

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
           +A Q  F FY  GV +D  C      LDH VL VGYG  +   K +W +KNSW T WG  
Sbjct: 341 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 396

Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
           GY+ M++   ++  CG++   ++  M
Sbjct: 397 GYMYMAMHKGEEGQCGLLLDASFPVM 422


>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
          Length = 339

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 136/234 (58%), Gaps = 10/234 (4%)

Query: 121 SKYSLWVRYNKASKDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYMKHKK 177
           S++   V Y   S   +P  V +  KG  + +  Q  CG+CW+F   GA+E    +K  K
Sbjct: 107 SQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGK 166

Query: 178 LAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIAN 235
           L  LS Q L+DCS   YGN GC+GG    ++Q+I+  +G+ ++  Y PY   +  C   +
Sbjct: 167 LVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASY-PYKAMNGKCRYDS 225

Query: 236 TTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPD 295
               AT + +  +   SEDALK A+A  GPVSVAIDAS  SF  Y +GVYY+  C  +  
Sbjct: 226 KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQN-- 283

Query: 296 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 348
            ++H VL VGYG L+GK YW VKNSW   +G+QGY+ M+    N+CG+ + P+Y
Sbjct: 284 -VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 336


>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
          Length = 352

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 82/198 (41%), Positives = 112/198 (56%), Gaps = 1/198 (0%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ+ CGSCW+F TTGA+E AY   H K   LS+Q L+DC+  + N GC+GG   +++++I
Sbjct: 152 DQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 211

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G    +   PY  +D             +   VN+T  +ED LK A+A   PVSVA 
Sbjct: 212 KYNGGIALEKEYPYTAKDEASKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVAF 271

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
                 F  Y  GVY  + C N+P  ++HAVLAVGYG  +  PYW +KNSW + WG+ GY
Sbjct: 272 QVVD-GFRLYKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGY 330

Query: 331 VLMSIKDNNCGVMTAPTY 348
             M +  N CGV T  +Y
Sbjct: 331 FKMELGKNMCGVATCASY 348


>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
          Length = 369

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/240 (39%), Positives = 139/240 (57%), Gaps = 26/240 (10%)

Query: 134 KDAIPVRYEMKGYNS-LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           KD   V +  +G  S + +Q  CGSCWSF +TGA+EGA+ +K  ++  LS+Q L+DCS  
Sbjct: 134 KDNPNVDWRKQGAVSHVKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKR 193

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQD-AYCHIANTTATATMTGFVNVTPN 251
           YGNNGC+GG    ++ +++  G    ++  PY   D + C   +T A  +++   N+   
Sbjct: 194 YGNNGCNGGLMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAG 253

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
           +E  L+  L   GPVSVAIDAS +SF FY +G++Y  +C++S   LDH VLAVG+G+  G
Sbjct: 254 NEKHLETVLRNVGPVSVAIDASPRSFRFYKSGIFYAPECSSSQ--LDHGVLAVGFGK--G 309

Query: 312 KP-------------------YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            P                   Y+ VKNSW + WG+ G++ MS  + NNCG+ T  TY T+
Sbjct: 310 NPESNFENKVSFIHDDTKNNEYYIVKNSWGSDWGSNGFIYMSKNRKNNCGIATMATYPTI 369


>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
          Length = 333

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 132/228 (57%), Gaps = 6/228 (2%)

Query: 126 WVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           W R   ++ D +  R E     ++ +Q  CGSCW+F   GA+E    +K   L  LS Q 
Sbjct: 110 WARQGASAPDTVDWR-EKGCVTNVKNQGSCGSCWAFSAVGALECQLKLKTGNLVSLSPQN 168

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTG 244
           L+DCS  +GN+GC+GG    ++Q+++  +G+ ++  Y PY GQ   C        AT + 
Sbjct: 169 LVDCSSAFGNHGCNGGYISAAFQYVIYNNGIDSEASY-PYTGQSGTCRYNLQGRAATCSR 227

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
           +V++   +E ALK A+A  GPVSVAIDAS+ SF  +  GVY D  C ++   ++H VL V
Sbjct: 228 YVDLPSGNEAALKDAVANFGPVSVAIDASRPSFFLFRKGVYDDPSCTSAH--INHGVLVV 285

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 351
           GYG  DG  YW VKNSW   +G+QGY+ ++   DN CG+ +  TY  M
Sbjct: 286 GYGTEDGIDYWLVKNSWGVSFGDQGYIKIARNHDNRCGIASQCTYPLM 333


>gi|395755765|ref|XP_002833453.2| PREDICTED: putative cathepsin L-like protein 6-like, partial [Pongo
           abelii]
          Length = 213

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/215 (43%), Positives = 130/215 (60%), Gaps = 11/215 (5%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY   + DQ+ CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN GC+GG 
Sbjct: 3   KGYVTPVKDQAQCGSCWAFSATGALEGQMFWKTGKLTSLSEQNLVDCSGPQGNEGCNGGF 62

Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
              S+Q++ ++ GL ++  Y  Y G+   C      + A  TGF ++ P+ E  L  A+A
Sbjct: 63  MDNSFQYVQENGGLDSEASY-SYEGKVKTCRYNPKYSAANDTGFADI-PSWEKDLAKAVA 120

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY----GELDGKPYWQV 317
             GP+SVA+DAS  SF FY  G+Y++  C+  P+GLDHA+L V Y     + D   YW V
Sbjct: 121 TVGPISVAVDASHVSFQFYKKGIYFEPCCD--PEGLDHAMLVVDYSYEGADSDNNKYWLV 178

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           KNSW   WG  GY+ M+  + NNCG+ TA +Y T+
Sbjct: 179 KNSWGKNWGMDGYIKMAKDRKNNCGIATAASYPTV 213


>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
          Length = 337

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 88/205 (42%), Positives = 120/205 (58%), Gaps = 4/205 (1%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGSCW+F   GA+EG    K  KL  LS Q L+DCS  YGN+GC+GG    ++
Sbjct: 136 SVKTQGSCGSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAF 195

Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           Q+++ +     D   PY G+   CH   +   A  + +  +    E ALK ALA  GP+S
Sbjct: 196 QYVIDNQGIDSDASYPYTGRSDQCHYNPSYRAANCSSYNFLPEGDEGALKQALATIGPIS 255

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VAIDA++  F FY +GVY D  C+     ++H VLAVGYG L+G+ YW VKNSW T +G+
Sbjct: 256 VAIDATRPRFIFYRSGVYNDPSCSQE---VNHGVLAVGYGTLNGQDYWLVKNSWGTKFGD 312

Query: 328 QGYVLMSIKDNN-CGVMTAPTYVTM 351
           QGY+ M+   N+ CG+     Y  M
Sbjct: 313 QGYIRMARNQNDQCGIAMYGCYPIM 337


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 121/203 (59%), Gaps = 8/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 190

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            ++G    ++  PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA+
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAM 249

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
           DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW   WG
Sbjct: 250 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 307

Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
             GY+ +   ++N+CG+ TA +Y
Sbjct: 308 MDGYIKIAKDRNNHCGLATAASY 330


>gi|318844127|ref|NP_001187181.1| cathspsin H precursor [Ictalurus punctatus]
 gi|196475594|gb|ACG76366.1| cathspsin H [Ictalurus punctatus]
          Length = 326

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/215 (39%), Positives = 126/215 (58%), Gaps = 3/215 (1%)

Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           D+I  R +      + +Q  CGSCW+F TTG +E    +   KL +L++Q L+DC+  + 
Sbjct: 111 DSIDWRKKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFN 170

Query: 195 NNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
           N+GC+GG   +++++IM   GL T+DDY PY+G+D  C      A A +   VN+T   E
Sbjct: 171 NHGCNGGLPSQAFEYIMYNKGLMTEDDY-PYVGRDGPCKFDPKLAAAFVKDVVNITKYDE 229

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
             +  A+A+  PVS+A +   + F  Y +GVY   +C+N+ + ++HAVLAVGY E +G P
Sbjct: 230 MGIVDAVARLNPVSIAFEVLPE-FMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTP 288

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           YW VKNSW   WG  GY  +    N CG+    +Y
Sbjct: 289 YWIVKNSWGPQWGIDGYFYIERGQNMCGLAACASY 323


>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 398

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 119/202 (58%), Gaps = 7/202 (3%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCW+F   G++EG +++    L  LS+Q L+DCS    N+GC+GG    +++
Sbjct: 198 IKDQGQCGSCWAFSAIGSLEGQHFINTGNLVSLSEQQLVDCS--LKNDGCNGGMLSTAFK 255

Query: 209 WIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I    G  ++ DY PY  ++  C    + A A +TG+  +    ED+L  A+   GP+S
Sbjct: 256 YIESVAGEESETDY-PYTAKNGTCQYDPSKAVAKVTGYTALPSGDEDSLNDAVTSKGPIS 314

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           V IDAS KSF  Y  GVYY++ C  S   LDH VL VGYG  D   YW VKNSW T WG 
Sbjct: 315 VCIDASHKSFQLYSEGVYYEKSC--SYFLLDHCVLVVGYGTEDTADYWLVKNSWGTSWGM 372

Query: 328 QGYVLMSI-KDNNCGVMTAPTY 348
           +GY+ MS  + NNCG+ T   Y
Sbjct: 373 KGYIRMSRNRKNNCGIATNAAY 394


>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 326

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 127/220 (57%), Gaps = 8/220 (3%)

Query: 137 IPVRY---EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGY 193
           +P ++   E      + DQ  CGSCWSF TTG VEGAY++K  KL  LS+Q L+DC+   
Sbjct: 110 LPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCA-KE 168

Query: 194 GNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
              GC GG   ++ ++I    G+ +++DY PY G D  C   ++   A ++ F  +  N 
Sbjct: 169 DCYGCSGGYMDKALEYIETAGGIMSENDY-PYEGIDDKCRFDSSKVAAKISNFTYIKKND 227

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
           ED LK A+   GP+SVAIDAS  +F  Y +G+  D  C +  + L+H VL VGYG    +
Sbjct: 228 EDDLKNAVIAKGPISVAIDASF-NFQLYDSGILDDSSCYSDFNSLNHGVLVVGYGTEKEQ 286

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GY+ MS  K+N CG+ T  TY T+
Sbjct: 287 DYWIVKNSWGADWGMDGYIWMSRNKNNQCGIATDATYPTI 326


>gi|157128512|ref|XP_001661463.1| cathepsin l [Aedes aegypti]
 gi|91992510|gb|ABE72971.1| cathepsin L [Aedes aegypti]
 gi|108872552|gb|EAT36777.1| AAEL011167-PA [Aedes aegypti]
          Length = 327

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 94/224 (41%), Positives = 134/224 (59%), Gaps = 6/224 (2%)

Query: 131 KASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           +A+     + +  KG    + DQ  CGSC++F   GA+EGA + K  KL  LS+Q ++DC
Sbjct: 107 QATTTVTSIDWRTKGAVTPVKDQGRCGSCYAFSALGALEGATFTKTGKLVNLSEQNIVDC 166

Query: 190 SWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           +  YGN GC+GG     +++I   +G+ T   Y       A C        AT TG+V +
Sbjct: 167 TSTYGNYGCNGGSMTSVFKYIKTNNGVDTGAFYPYKAAVAATCGFNPAYVGATDTGYV-L 225

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
            P +E AL+ A+A  GPVSVAIDAS  SF  Y +G+YY+  C++S   L+H VL VGYG 
Sbjct: 226 LPANETALQTAVANIGPVSVAIDASNPSFQQYKSGIYYEPLCSSSK--LNHGVLVVGYGT 283

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            +G  YWQVKNSW T WG +GY+ M+  K+N+CG+ +  +Y T+
Sbjct: 284 ENGTDYWQVKNSWGTTWGEKGYIKMARNKNNHCGIASFASYPTV 327


>gi|308322047|gb|ADO28161.1| cathepsin H [Ictalurus furcatus]
          Length = 326

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/215 (39%), Positives = 126/215 (58%), Gaps = 3/215 (1%)

Query: 135 DAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           D+I  R +      + +Q  CGSCW+F TTG +E    +   KL +L++Q L+DC+  + 
Sbjct: 111 DSIDWRKKGNYVTEVKNQGACGSCWTFSTTGCLESVTAIATGKLPLLAEQQLVDCAGAFN 170

Query: 195 NNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSE 253
           N+GC+GG   +++++IM   GL T+DDY PY+G+D  C      A A +   VN+T   E
Sbjct: 171 NHGCNGGLPSQAFEYIMYNKGLMTEDDY-PYVGRDGPCKFDPKLAAAFVKDVVNITKYDE 229

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKP 313
             +  A+A+  PVS+A +   + F  Y +GVY   +C+N+ + ++HAVLAVGY E +G P
Sbjct: 230 MGIVDAVARLNPVSIAFEVLPE-FMHYKDGVYTSNECHNTTETVNHAVLAVGYAEENGTP 288

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           YW VKNSW   WG  GY  +    N CG+    +Y
Sbjct: 289 YWIVKNSWGPQWGIDGYFYIERGQNMCGLAACASY 323


>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
 gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
          Length = 353

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 116/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A+     K+ VLS+Q L+DC+ GY N GC GG   +++++I
Sbjct: 149 NQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCSGGLPSQAFEYI 208

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T+D Y PY   D  C     +  A +   VN+T  +ED L  A+A + PVS+A
Sbjct: 209 RYNGGLDTEDSY-PYTAHDGKCMYNQNSIGAKVYDVVNITEGAEDELIHAVAFNRPVSIA 267

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +   K F FY +GVY    C   PD ++HAVLAVGY      PYW +KNSW   +G  G
Sbjct: 268 YEV-LKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDAPVPYWIIKNSWGESFGLDG 326

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 327 YFYMEMGKNMCGIATCASY 345


>gi|16506815|gb|AAL23962.1|AF426248_1 truncated cathepsin H [Homo sapiens]
          Length = 323

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N GC GG   +++++I
Sbjct: 122 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 181

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 182 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 240

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 241 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 299

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318


>gi|27532972|ref|NP_083912.2| cathepsin Q precursor [Mus musculus]
 gi|27960482|gb|AAO27845.1|AF456461_1 cathepsin Q [Mus musculus]
 gi|16445011|gb|AAK00505.1| cathepsin Q precursor [Mus musculus]
 gi|71050990|gb|AAH99415.1| Cathepsin Q [Mus musculus]
 gi|148709365|gb|EDL41311.1| cathepsin Q, isoform CRA_a [Mus musculus]
          Length = 343

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 92/227 (40%), Positives = 130/227 (57%), Gaps = 14/227 (6%)

Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           KDA+P  V +  +GY   + +Q  C SCW+F  TGA+EG  + K  KL  LS Q L+DCS
Sbjct: 122 KDALPKFVDWRNEGYVTRVRNQRNCNSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLVDCS 181

Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
              GN GC  G  +  +Q+++ + GL  Q  Y PY G++  C      + A +TGFV V 
Sbjct: 182 RPQGNRGCRWGNTYNGFQYVLHNGGLEAQATY-PYEGKEGLCRYNPKNSAAKITGFV-VL 239

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-- 307
           P SED L  A+A  GP++  I     SF FY  GVYY+  C +S   ++HAVL +GYG  
Sbjct: 240 PESEDVLMDAVATKGPIATGIHVVSSSFRFYDGGVYYEPNCTSS---VNHAVLIIGYGYV 296

Query: 308 --ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
             E DG  YW +KNSW   WG  GY++++  ++N+C + +   Y T+
Sbjct: 297 GNETDGNNYWLIKNSWGRRWGLSGYMMIAKDRNNHCAIASLAQYPTV 343


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EGA+  K  KL  LS+Q L+DCS   GN  C GGE   ++Q++
Sbjct: 221 DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 280

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           +  G    +D  PYL +D  C   +      + GF +V   SE A+K ALAK  PVS+AI
Sbjct: 281 LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAI 339

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
           +A Q  F FY  GV +D  C      LDH VL VGYG  +   K +W +KNSW T WG  
Sbjct: 340 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 395

Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
           GY+ M++   ++  CG++   ++  M
Sbjct: 396 GYMYMAMHKGEEGQCGLLLDASFPVM 421


>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
 gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
 gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
          Length = 330

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 94/224 (41%), Positives = 131/224 (58%), Gaps = 7/224 (3%)

Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           +S DA+P  + +  KGY +S+  Q  CGSCW+F + GA+EG       KL  LS Q L+D
Sbjct: 110 SSGDAVPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVD 169

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS  YGN GC+GG    ++Q+++ +G    D   PY G    C  +++   A  T +  V
Sbjct: 170 CSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGVQQQCSYSSSQRAANCTKYYFV 229

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               E+ALK A+A  GP+SVAIDA++  F  Y +GVY D  C+     ++HAVL VGYG 
Sbjct: 230 RQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYNDPTCSKR---VNHAVLVVGYGT 286

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           L G+ YW VKNSW T +G+ GY+ M+   NN CG+ +   Y  M
Sbjct: 287 LSGQDYWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACYPVM 330


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 94/222 (42%), Positives = 125/222 (56%), Gaps = 12/222 (5%)

Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           AIP  V + +KG    + DQ  CGSCW+F T  A EG   +   KL  LS+Q L+DC   
Sbjct: 123 AIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTK 182

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
             + GC+GG     +++I+K+G  T +   PY   D  C+ A TT  A +TG+  V  NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCNTATTTPVAKITGYEKVPVNS 242

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
           E +L  A+A   P+SV+IDAS  SF FY +G+Y  E C      LDH V AVGYG  +G 
Sbjct: 243 EKSLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297

Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
            YW VKNSW T WG +GY+ M    + K+  CG+    +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIAAKEGLCGIAMDSSYPT 339


>gi|444522624|gb|ELV13407.1| Cathepsin L1 [Tupaia chinensis]
          Length = 307

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 91/219 (41%), Positives = 128/219 (58%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F +TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 92  VDWREKGYVTPVKNQGDCGSCWAFSSTGALEGQMFRKTGKLVSLSEQNLVDCSISEGNFG 151

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++ ++  +G    ++  PY   D  C      + A  TGFV++ P  E AL+
Sbjct: 152 CNGGIMDNAFLYVKDNGGLDSEESYPYEAVDDSCKYNPKNSAANDTGFVHL-PVEEKALE 210

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD----GKP 313
            A+A  GP+SV IDAS  SF FY  G+Y++  C  S   LDHAVL VGYG ++       
Sbjct: 211 KAVATVGPISVGIDASADSFQFYKEGIYFEPNC--SSVELDHAVLVVGYGVMEEASTNNK 268

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +W VKNSW   WG  GY++M+  ++NNCG+ +   Y T+
Sbjct: 269 FWLVKNSWGKNWGMDGYIMMAKDRNNNCGIASYAMYPTV 307


>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
 gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 356

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/198 (42%), Positives = 116/198 (58%), Gaps = 3/198 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI- 210
           Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  + N GC+GG   +++++I 
Sbjct: 157 QGKCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 216

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              GL T++ Y PY G++  C  +       +   VN+T  +E  LK A+A   PVSVA 
Sbjct: 217 FNGGLDTEEAY-PYTGKNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAF 275

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           +   K F  Y +GVY   +C ++P  ++HAVLAVGYG  +G PYW +KNSW   WG  GY
Sbjct: 276 EVV-KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGY 334

Query: 331 VLMSIKDNNCGVMTAPTY 348
             M +  N CGV T  +Y
Sbjct: 335 FKMEMGKNMCGVATCASY 352


>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 329

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 106/292 (36%), Positives = 150/292 (51%), Gaps = 19/292 (6%)

Query: 67  VSDFKVNIYRLFFLRPRFHENEKIRYNWTYIGEELVNGIIL------EKWRLVTSEGEKV 120
           V D + + Y        F +N    + W  + +E   G  +      E++R V   G  V
Sbjct: 35  VQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATVFADLTNEEYRAVY-LGMNV 93

Query: 121 SKYSLWVRYNKASKDAIPVRYEMKGYNS-----LLDQSVCGSCWSFGTTGAVEGAYYMKH 175
              +   +     +   PVR  +   N+     + DQ  CGSCW+F TTGAVEGA+ +  
Sbjct: 94  DASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKDQGQCGSCWAFSTTGAVEGAHQIAT 153

Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHI 233
                LS+Q L+DCS  YGN+GC GG    +  +I+K  G+ T++ Y PY  +D+Y C  
Sbjct: 154 GNFVSLSEQQLMDCSRSYGNHGCQGGLMDSAMSYIVKQGGINTEESY-PYEMRDSYTCKY 212

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
                 A ++G+ N+   SE  L   L   GPV++A+DAS  SF  Y +GV+YD  C  S
Sbjct: 213 NPANNGAKLSGYSNIKRGSEADLAAKL-NIGPVAIALDASHSSFQLYKSGVFYDPAC--S 269

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMT 344
              L H VLAVGYG      YW VKNSW T WG+ GY+ +   ++N+CGV T
Sbjct: 270 STSLSHGVLAVGYGTEGSSAYWIVKNSWGTRWGDAGYIWIAKDRNNHCGVAT 321


>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 335

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 92/221 (41%), Positives = 132/221 (59%), Gaps = 13/221 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  C SCW+F  TGA+EG  + K  KL  LS+Q L+DCS    NNG
Sbjct: 120 VNWTQRGYVTPVKDQGSCHSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPESNNG 179

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C GG   +++Q++  + GL +++ Y PY  +++  C      + A  TGFVN+ P  E A
Sbjct: 180 CSGGLMDKAFQYVKNNGGLDSEESY-PYTAKESRNCLYKPEFSAANNTGFVNIPP-QEKA 237

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GP+SVA+DAS KSF FY +G+Y+D  C  +   ++H VL VGYG    + D 
Sbjct: 238 LMNAVASVGPISVAVDASLKSFRFYKSGIYFDPACRLA---VNHGVLVVGYGFEGTDPDK 294

Query: 312 KPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
             YW VKNSW   WG  GY+ +   ++N+CG+  A +Y T+
Sbjct: 295 NKYWLVKNSWGKSWGADGYIKIAKDRNNHCGIARAASYPTV 335


>gi|164519063|ref|NP_001002813.2| cathepsin Q-like 2 precursor [Rattus norvegicus]
 gi|67678196|gb|AAH97257.1| Ctsql2 protein [Rattus norvegicus]
 gi|149039735|gb|EDL93851.1| rCG24202 [Rattus norvegicus]
          Length = 343

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 88/223 (39%), Positives = 126/223 (56%), Gaps = 12/223 (5%)

Query: 134 KDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           +DA+P  + +  +GY   + +Q  C SCW+F   GA+EG  + K  KL  LS Q L+DCS
Sbjct: 122 RDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCS 181

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP 250
              GN GC GG  + ++Q+++++G    +   PY G++  C      A A +T FV + P
Sbjct: 182 KPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVAL-P 240

Query: 251 NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--- 307
             ED L  ALA  GPV+  I     S  FY  G+Y++ KCNN    ++HAVL VGYG   
Sbjct: 241 EDEDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNR---VNHAVLVVGYGFEG 297

Query: 308 -ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
            E DG  YW +KNSW   WG +GY+ +   ++N+CG+ T   Y
Sbjct: 298 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQY 340


>gi|54400500|ref|NP_001005999.1| cathepsin L, like [Danio rerio]
 gi|53734591|gb|AAH83200.1| Cathepsin L, like [Danio rerio]
          Length = 261

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 141/222 (63%), Gaps = 13/222 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + +  KGY   + DQ  CGSCW+F +TGA+EG  + K  KL  LS+Q L+DCS   GNNG
Sbjct: 44  IDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNG 103

Query: 198 CDGGEDFRSYQWIM-KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           CDGG   +++Q++   +GL +++ Y PYL   D  CH     + A +TGFV++    E A
Sbjct: 104 CDGGLMDQAFQYVQDNNGLDSEESY-PYLATDDQPCHYDPRYSAANVTGFVDIPSGKEHA 162

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPV+VAIDA  +SF FY +G+YY + C  S + LDH VL VGYG    ++ G
Sbjct: 163 LMKAVAAVGPVAVAIDAGHESFQFYQSGIYYGKAC--STEELDHGVLVVGYGYEGVDVAG 220

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKD--NNCGVMTAPTYVTM 351
           + YW VKNSW+  WG++GY+ M+ KD  N+ G+ T+ +Y  M
Sbjct: 221 RRYWIVKNSWTDRWGDKGYIYMA-KDLKNHRGIATSASYPLM 261


>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
 gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
 gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
          Length = 330

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/207 (42%), Positives = 120/207 (57%), Gaps = 5/207 (2%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY  S+ +Q  CGSCW+F + GA+EG       KL  LS Q L+DCS  YGN GC+GG 
Sbjct: 124 KGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGY 183

Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
             +++Q+++ +G    +   PY G    C    +   A  T +  V+   E ALK ALA 
Sbjct: 184 MSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALKEALAN 243

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
            GPVSVAIDA++  F FY +GVY D  C      ++H VLAVGYG L G+ YW VKNSW 
Sbjct: 244 IGPVSVAIDATRPQFIFYRSGVYDDPSCTQK---VNHGVLAVGYGTLSGQDYWLVKNSWG 300

Query: 323 TYWGNQGYVLMSIKDNN-CGVMTAPTY 348
             +G+ GY+ ++   NN CG+ +   Y
Sbjct: 301 AGFGDGGYIRIARNKNNMCGIASEACY 327


>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
          Length = 334

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 94/200 (47%), Positives = 128/200 (64%), Gaps = 6/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG++EG ++ K +KL  LS+Q L+DC    GNNGC GG    ++++I
Sbjct: 136 NQGQCGSCWAFSTTGSLEGQHFRKMRKLVSLSEQNLVDCMQKLGNNGCGGGLMDNAFKYI 195

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+  Y PY   D  CH   +   AT TGF ++    E++   A+A  GPVSVA
Sbjct: 196 KANKGIDTELSY-PYNATDGVCHFKKSGVGATATGFEDIPARDENSWD-AVAPVGPVSVA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS +SF FY  GV  + +C  S D LDH VL VGYG  DG+ YW VKNSW T WG++G
Sbjct: 254 IDASHESFQFYSEGVLDEPEC--SSDQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEG 311

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  KDN CG+ ++ +Y
Sbjct: 312 YIYMTRNKDNQCGIASSASY 331


>gi|39930363|ref|NP_058817.1| cathepsin J precursor [Rattus norvegicus]
 gi|84028185|sp|Q63088.2|CATJ_RAT RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
           protein; AltName: Full=Cathepsin P; AltName:
           Full=Catlrp-p; Flags: Precursor
 gi|28196048|gb|AAL26793.2| cathepsin P [Rattus norvegicus]
 gi|66910531|gb|AAH97263.1| Cathepsin J [Rattus norvegicus]
 gi|149039736|gb|EDL93852.1| cathepsin J [Rattus norvegicus]
          Length = 334

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 91/215 (42%), Positives = 129/215 (60%), Gaps = 11/215 (5%)

Query: 141 YEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           +  +GY   + +Q  CGSCW+F   GA+EG  + K   L  LS Q L+DCS   GNNGC 
Sbjct: 120 WRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCR 179

Query: 200 GGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
            G   +++ +++K+ GL  +  Y PY G+D  C   +  A+A +TGFVN+ PN E  L +
Sbjct: 180 WGTAHQAFNYVLKNKGLEAEATY-PYEGKDGPCRYHSENASANITGFVNLPPN-ELYLWV 237

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPY 314
           A+A  GPVS AIDAS  SF FY  GVY++  C  S   ++HAVL VGYG    E DG  Y
Sbjct: 238 AVASIGPVSAAIDASHDSFRFYSGGVYHEPNC--SSYVVNHAVLVVGYGFEGNETDGNNY 295

Query: 315 WQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           W +KNSW   WG  G++ +   ++N+CG+ +  ++
Sbjct: 296 WLIKNSWGEEWGINGFMKIAKDRNNHCGIASQASF 330


>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
          Length = 247

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 121/203 (59%), Gaps = 8/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 44  NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 103

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            ++G    ++  PY  +D  C      A A  TGFV++ P  E AL  A+A  GP+SVA+
Sbjct: 104 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAM 162

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
           DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW   WG
Sbjct: 163 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 220

Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
             GY+ +   ++N+CG+ TA +Y
Sbjct: 221 MDGYIKIAKDRNNHCGLATAASY 243


>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
          Length = 335

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
          Length = 336

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 135 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 194

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D+ C      A   +    N+T   EDA+  A+A + PVS A
Sbjct: 195 LYNKGIMGEDTY-PYQGKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFA 253

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 254 FEVTQ-DFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 312

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 313 YFLIERGKNMCGLAACASY 331


>gi|94448670|emb|CAI91573.1| silicatein a4 [Lubomirskia baicalensis]
 gi|312386085|gb|ADQ74587.1| silicatein alpha 4 [Lubomirskia baicalensis]
          Length = 326

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 128/202 (63%), Gaps = 5/202 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+ ++F  TGA+EGA  + + K  +LS+Q +IDCS  YGN+GC GG+ + + ++++
Sbjct: 128 QGQCGASYAFAATGALEGASALSNDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVI 187

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T+  Y  + G+ + C  ++  + A+ TG +++   SE  L  A+A  GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFQGKQSSCQYSSKNSGASATGVISIASGSETDLFAAVATVGPVAVAV 246

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+  +F FY +GV+    C+N+   L+HA+L  GYG  +GK YW VKNSWS  WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSNTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGY 304

Query: 331 VLM-SIKDNNCGVMTAPTYVTM 351
           ++M   K N CG+ T   Y T+
Sbjct: 305 IMMVRNKYNQCGIATDALYPTL 326


>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
 gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
 gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
 gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
 gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
 gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
 gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
 gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
          Length = 334

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/219 (40%), Positives = 126/219 (57%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   +  Q  C +CW+F  TGA+E     +  KL  LS Q L+DCS   GNNG
Sbjct: 119 VDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNG 178

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG+ + ++Q+++ +G    +   PY G+D  C      + A +TGFV++ P SED L 
Sbjct: 179 CLGGDTYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKAEITGFVSL-PQSEDILM 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP++  IDAS +SF  Y  G+Y++  C  S D + H VL VGYG    E DG  
Sbjct: 238 AAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHGVLVVGYGFKGIETDGNH 295

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           YW +KNSW   WG +GY+ L   K+N+CG+ +   Y T+
Sbjct: 296 YWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334


>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
          Length = 330

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 89/207 (42%), Positives = 120/207 (57%), Gaps = 5/207 (2%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY  S+ +Q  CGSCW+F + GA+EG       KL  LS Q L+DCS  YGN GC+GG 
Sbjct: 124 KGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGY 183

Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
             +++Q+++ +G    +   PY G    C    +   A  T +  V+   E ALK ALA 
Sbjct: 184 MSQAFQYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALKEALAN 243

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
            GPVSVAIDA++  F FY +GVY D  C      ++H VLAVGYG L G+ YW VKNSW 
Sbjct: 244 IGPVSVAIDATRPQFIFYRSGVYDDPSCTQK---VNHGVLAVGYGTLSGQDYWLVKNSWG 300

Query: 323 TYWGNQGYVLMSIKDNN-CGVMTAPTY 348
             +G+ GY+ ++   NN CG+ +   Y
Sbjct: 301 AGFGDGGYIRIARNKNNMCGIASEACY 327


>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
          Length = 335

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|354504284|ref|XP_003514207.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
          Length = 334

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 87/216 (40%), Positives = 127/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY  S+  Q+ C SCW+F   GA+EG  + K   L  LS Q L+DCS  +GNNG
Sbjct: 119 VNWRKKGYVTSVKKQAYCNSCWAFAVNGAIEGQMFKKTGNLTRLSVQNLVDCSKPHGNNG 178

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CD G+ + +Y++++ +G    +   PY G++  C      + A +TGFV++ P SE++L 
Sbjct: 179 CDWGDPYIAYEYVLHNGGVEAEATYPYEGKEGPCRYNPKYSAANITGFVSL-PKSEESLM 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+S  ID +   F FY  G++YD KC+N  D ++H VL VGYG    E DG  
Sbjct: 238 AAVATIGPISAGIDIASDFFMFYKKGIFYDPKCHN--DTVNHVVLVVGYGFEGNETDGNN 295

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           YW VKNS+   WG +GY+ ++   NN C + +   Y
Sbjct: 296 YWLVKNSYGKKWGLRGYMKIAKDQNNHCAIASYAHY 331


>gi|29708|emb|CAA30428.1| cathepsin H [Homo sapiens]
          Length = 248

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N GC GG   +++++I
Sbjct: 47  NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYI 106

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D YC      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 107 LYNKGIMGEDTY-PYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 165

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 166 FEVTQ-DFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNG 224

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 225 YFLIERGKNMCGLAACASY 243


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 126/207 (60%), Gaps = 10/207 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F   GA+EG   +K   L  LS+Q L+DCS   GN GC+GG    ++Q++
Sbjct: 131 NQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + GL +++ Y PY  +D  C      A A  TG+V++ P  E AL  A+A  GP+++A
Sbjct: 191 LNNKGLDSEESY-PYEAKDGTCKYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAIA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           IDAS  SF FY +G+YY+  C  S   LDH VL VGYG    + + K YW VKNSW + W
Sbjct: 249 IDASHPSFQFYSSGIYYEPNC--SSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSW 306

Query: 326 GNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           G  G+  +   K+N+CGV TA +Y T+
Sbjct: 307 GMGGFFHIAKDKNNHCGVATAASYPTV 333


>gi|463046|gb|AAA49207.1| cysteine proteinase [Cyprinus carpio]
          Length = 331

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 91/220 (41%), Positives = 127/220 (57%), Gaps = 7/220 (3%)

Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           +S  AIP  + +  KGY +S+ +Q  CGSCW+F + GA+EG       KL  LS Q L+D
Sbjct: 110 SSGAAIPDSLDWREKGYVSSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVD 169

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS  YGN GC GG    ++Q+++ +G    +   PY G    C    +   A  T +  V
Sbjct: 170 CSSSYGNYGCGGGLMSAAFQYVIDNGGIDSESSYPYEGVQGQCRYNPSQLAANCTKYYYV 229

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               E+ALK A+A  GP+SVAIDA+   F  Y +GVY D  C  +   ++HAVLAVGYG 
Sbjct: 230 RQGDEEALKQAVANIGPISVAIDATHPQFILYRSGVYNDPSCTTN---INHAVLAVGYGA 286

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPT 347
           + G+ +W VKNSW T +G+ GY+ M+   NN CG+   P+
Sbjct: 287 IAGQDFWLVKNSWGTGFGDGGYIRMARNQNNMCGIAHTPS 326


>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
          Length = 347

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 86/200 (43%), Positives = 116/200 (58%), Gaps = 6/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG--YGNNGCDGGEDFRSYQW 209
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS    Y N+GC+GG    ++Q+
Sbjct: 148 QGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQY 207

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           I+ +     D   PY  +D  C        AT + +  +   SEDALK A+A  GPVSV 
Sbjct: 208 IIDNNGIDSDASYPYKAKDGKCQYNPANRAATCSRYTELPYGSEDALKEAVANKGPVSVG 267

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYYD  C  +   ++H VL  GYG LDGK YW VKNSW   +G++G
Sbjct: 268 IDASLPSFFLYKSGVYYDPSCTQN---VNHGVLVTGYGNLDGKDYWLVKNSWGLSFGDKG 324

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ ++  + N+CG+   P+Y
Sbjct: 325 YIRIARNRGNHCGIANFPSY 344


>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 336

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 118/200 (59%), Gaps = 4/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q+ CGSCW+F TTGA+E A+     K+ +LS+Q L+DC+  + N GC GG   +++++I
Sbjct: 128 NQASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYI 187

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T+D Y PY  +D+ C     T  A +   VN+T  +E  LK A+A   PVSVA
Sbjct: 188 RYNGGIDTEDSY-PYNAKDSQCRFHKNTIGAQVWDVVNITEGAETQLKHAIATMRPVSVA 246

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE-LDGKPYWQVKNSWSTYWGNQ 328
            +     F  Y  GVY    C+  P  ++HAVLAVGYGE  +G PYW +KNSW   WG  
Sbjct: 247 FEVVH-DFRLYNGGVYTSLNCHTGPQTVNHAVLAVGYGEDENGVPYWIIKNSWGADWGMN 305

Query: 329 GYVLMSIKDNNCGVMTAPTY 348
           GY  M +  N CGV T  +Y
Sbjct: 306 GYFNMEMGKNMCGVATCASY 325


>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
          Length = 335

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 86/197 (43%), Positives = 123/197 (62%), Gaps = 6/197 (3%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+ +Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN+GCDGG    ++
Sbjct: 134 SVKNQGSCGSCWAFSAVGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAF 193

Query: 208 QWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Q++++  G+ ++  Y PY+G +  CH     + A  + +  +    E+ALK A+A  GP+
Sbjct: 194 QYVIENQGIESEASY-PYIGLEQQCHYNPEESAANCSQYHFLPEKDEEALKEAIATIGPI 252

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           SVAIDAS+ +F+FY +GVY D  C+   + ++H VLAVGYG    +  W VKNSW TY+G
Sbjct: 253 SVAIDASKPTFTFYSSGVYDDPTCS---EVINHGVLAVGYGTQSTQDSWLVKNSWGTYFG 309

Query: 327 NQGYVLMSI-KDNNCGV 342
           + GY+ MS  K N CG+
Sbjct: 310 DSGYIRMSRNKGNQCGI 326


>gi|215261455|pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex
           With Its Propeptide
          Length = 224

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 10/206 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EGA+  K  KL  LS+Q L+DCS   GN  C GGE   ++Q++
Sbjct: 24  DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 83

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           +  G    +D  PYL +D  C   +      + GF +V   SE A+K ALAK  PVS+AI
Sbjct: 84  LDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKS-PVSIAI 142

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGNQ 328
           +A Q  F FY  GV +D  C      LDH VL VGYG  +   K +W +KNSW T WG  
Sbjct: 143 EADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRD 198

Query: 329 GYVLMSI---KDNNCGVMTAPTYVTM 351
           GY+ M++   ++  CG++   ++  M
Sbjct: 199 GYMYMAMHKGEEGQCGLLLDASFPVM 224


>gi|148575301|gb|ABQ95351.1| secreted cathepsin L2 [Fasciola hepatica]
          Length = 326

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 89/218 (40%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA+K A+P   + + Y     + DQ  CGSCW+F TTGAVEG +    +  A  S+Q L+
Sbjct: 102 KANKLAVPESIDWRDYYYVTEVKDQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DC+  +GN GC GG    +Y+++  +GL T+  Y PY   +  C      A A +TG+  
Sbjct: 162 DCTRDFGNYGCGGGYMENAYEYLKHNGLETESYY-PYQAVEGPCQYDGRLAYAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V    E  LK  +   GP +VA+DA    F  Y +G+Y  + C   PD L HAVLAVGYG
Sbjct: 221 VHSGDEIELKNLVGTEGPAAVALDAD-SDFMMYQSGIYQSQTC--LPDRLTHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
             DG  YW VKNSW T+WG  GY+  +  + N CG+ +
Sbjct: 278 SQDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIAS 315


>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
 gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
          Length = 347

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 132/237 (55%), Gaps = 28/237 (11%)

Query: 136 AIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG 194
           A  + +  KG    + +Q  CG CWSF TTG+ EGA  +    L  LS+Q LIDCS  YG
Sbjct: 112 AASIDWRTKGAVTPIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYG 171

Query: 195 NNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNS 252
           NNGC+GG    ++++I+ + G+ T+  Y PY  +D   C        AT++ + NVT  S
Sbjct: 172 NNGCNGGLMTLAFEYIINNKGIDTESSY-PYTAKDGKTCKYNPANIGATLSSYSNVTSGS 230

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----- 307
           E +L+ A A  GPVSVAIDAS  SF  Y +G+YY+  C  S   LDH VL VGY      
Sbjct: 231 EPSLESA-ANIGPVSVAIDASHNSFQLYSSGIYYEPAC--STTSLDHGVLVVGYASGSGS 287

Query: 308 ------------ELDGKP---YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
                        ++G     YW VKNSW T WG +GY+LMS  ++NNCG+ T  ++
Sbjct: 288 GSGSGSGSGSGLAVEGASSGNYWIVKNSWGTSWGIEGYILMSKDRNNNCGIATMASF 344


>gi|440290877|gb|ELP84181.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
          Length = 315

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 120/202 (59%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGA--YYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           DQ  CGSC+SFG   ++EG   +   ++    LS++ ++ CS    NNGC+GG     Y 
Sbjct: 114 DQKSCGSCYSFGALASLEGRLLFLGGNENTLDLSEEQIMRCS---ANNGCNGGTGSIVYD 170

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +I ++G+  + DY PY   +  C +  T   AT+ G+  +T    DALK AL   G V V
Sbjct: 171 YIKENGVVQEKDY-PYTATEGNCGVDLTKKFATIEGYKYITRKDNDALKSALVD-GVVDV 228

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           +IDAS   FS Y +GVY+D KC  S   L+H + AVGYG+L+G+ YW VKNSW T WG  
Sbjct: 229 SIDASSPMFSLYKSGVYHDTKCKTSLFSLNHEIAAVGYGKLNGEEYWIVKNSWGTNWGES 288

Query: 329 GYVLMSIKDNNCGVMTAPTYVT 350
           GY+LM+++ N CGV + P   T
Sbjct: 289 GYILMAMEGNTCGVTSEPIVPT 310


>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
          Length = 367

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++E AY     K   LS+Q L+DC+  Y N GC GG   +++++I
Sbjct: 167 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 226

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  CH         +   VN+T  +ED LK A+    PVSVA
Sbjct: 227 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 285

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  SP  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 286 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 344

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 345 YFKMEMGKNMCGIATCASY 363


>gi|315075311|ref|NP_001186668.1| cathepsin S isoform 2 preproprotein [Homo sapiens]
 gi|194376464|dbj|BAG62991.1| unnamed protein product [Homo sapiens]
          Length = 281

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 5/206 (2%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
           G N L D   CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG  
Sbjct: 76  GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135

Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
             ++Q+I+ +     D   PY   D  C   +    AT + +  +    ED LK A+A  
Sbjct: 136 TTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 195

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           GPVSV +DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW  
Sbjct: 196 GPVSVGVDARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGH 252

Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
            +G +GY+ M+  K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278


>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
          Length = 363

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++E AY     K   LS+Q L+DC+  Y N GC GG   +++++I
Sbjct: 163 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 222

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  CH         +   VN+T  +ED LK A+    PVSVA
Sbjct: 223 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 281

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  SP  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 282 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 340

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 341 YFKMEMGKNMCGIATCASY 359


>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
          Length = 328

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 119/202 (58%), Gaps = 3/202 (1%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           ++ +Q  CGSCW+F TTG +E    +   KL  LS+Q L+DC+  + N+GC+GG   +++
Sbjct: 124 AVKNQGSCGSCWTFSTTGCLESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAF 183

Query: 208 QWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I    G+ T+DDY PY   D  C      A A +   VN+T   E  +  A+A+  PV
Sbjct: 184 EYIKFNKGIMTEDDY-PYTAHDDTCKFKTDLAAAFVKDVVNITKYDEMGMVDAVARFNPV 242

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           S+A + +   F  Y  GVY  ++C+N+ D ++HAVLAVGYGE  G PYW VKNSW + WG
Sbjct: 243 SLAYEVTS-DFMHYDGGVYTSKECHNTTDTVNHAVLAVGYGEEKGTPYWIVKNSWGSSWG 301

Query: 327 NQGYVLMSIKDNNCGVMTAPTY 348
            +GY  +    N CG+    +Y
Sbjct: 302 MKGYFFIERGKNMCGLAACSSY 323


>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
          Length = 344

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 99/267 (37%), Positives = 147/267 (55%), Gaps = 38/267 (14%)

Query: 104 GIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFG 162
           G   +   L+ ++ EKV   S     +  S+ A+ PV+          +Q  CG CWSF 
Sbjct: 91  GTKFDASSLIGTQEEKVHTNSSAASKDWRSEGAVTPVK----------NQGQCGGCWSFS 140

Query: 163 TTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDY 221
           TTG+ EGA++    +L  LS+Q LIDCS    N+GCDGG    ++++I+  +G+ T+  Y
Sbjct: 141 TTGSTEGAHFQSKGELVSLSEQNLIDCS--TENSGCDGGLMTYAFEYIINNNGIDTESSY 198

Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
            PY  ++  C   +  + AT++ +  VT  SE +L+ A+  + PVSVAIDAS +SF  Y 
Sbjct: 199 -PYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVNVN-PVSVAIDASHQSFQLYT 256

Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGY-------------------GELDGKPYWQVKNSWS 322
           +G+YY+ +C  S + LDH VLAVGY                          YW VKNSW 
Sbjct: 257 SGIYYEPEC--SSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWG 314

Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           T WG +GY+LMS  +DNNCG+ ++ ++
Sbjct: 315 TSWGIEGYILMSRNRDNNCGIASSASF 341


>gi|344250850|gb|EGW06954.1| Cathepsin R [Cricetulus griseus]
          Length = 279

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 87/216 (40%), Positives = 127/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY  S+  Q+ C SCW+F   GA+EG  + K   L  LS Q L+DCS  +GNNG
Sbjct: 64  VNWRKKGYVTSVKKQAYCNSCWAFAVNGAIEGQMFKKTGNLTRLSVQNLVDCSKPHGNNG 123

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CD G+ + +Y++++ +G    +   PY G++  C      + A +TGFV++ P SE++L 
Sbjct: 124 CDWGDPYIAYEYVLHNGGVEAEATYPYEGKEGPCRYNPKYSAANITGFVSL-PKSEESLM 182

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+S  ID +   F FY  G++YD KC+N  D ++H VL VGYG    E DG  
Sbjct: 183 AAVATIGPISAGIDIASDFFMFYKKGIFYDPKCHN--DTVNHVVLVVGYGFEGNETDGNN 240

Query: 314 YWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           YW VKNS+   WG +GY+ ++   NN C + +   Y
Sbjct: 241 YWLVKNSYGKKWGLRGYMKIAKDQNNHCAIASYAHY 276


>gi|291463491|pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
 gi|291463492|pdb|3IV2|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
 gi|291463519|pdb|3K24|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
           Complex With Gln-Leu-Ala Peptide
 gi|291463520|pdb|3K24|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
           Complex With Gln-Leu-Ala Peptide
          Length = 220

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 65  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>gi|118388791|ref|XP_001027491.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89309261|gb|EAS07249.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 356

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 84/200 (42%), Positives = 119/200 (59%), Gaps = 3/200 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYM-KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           DQ  CGSCW+F TTGA+E  Y + +  +   LS+Q LIDC+  + NNGC GG   +++++
Sbjct: 144 DQQNCGSCWTFSTTGAIESHYAIFEDVEPTSLSEQQLIDCAGAFNNNGCSGGLPSQAFEY 203

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTG-FVNVTPNSEDALKLALAKHGPVSV 268
           I  +G  + ++   Y+ QD  C  +  T  A + G   N+T   ED LK A+   GPVS+
Sbjct: 204 IKYNGGISYENSYYYIAQDQECQFSPETVGARVRGGSFNITQGDEDQLKQAVGTVGPVSI 263

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           A       F  Y +GVY +  C++SP  ++HAVLAVGYG  +G  YW VKNSWS +WG++
Sbjct: 264 AFQV-MGDFKLYKSGVYSNPDCSSSPQTVNHAVLAVGYGSENGVDYWYVKNSWSEFWGDE 322

Query: 329 GYVLMSIKDNNCGVMTAPTY 348
           GY  +    N CGV T  +Y
Sbjct: 323 GYFKIQRGVNMCGVATCASY 342


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 128/222 (57%), Gaps = 11/222 (4%)

Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCS 190
           +A P   + +  N++    DQ  CGSCW+F  TG++EGA+ ++ K  L  LS+Q L+DCS
Sbjct: 123 EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS 182

Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
             YGN GC+GG    ++++I+ + G+  +  Y PY G    C   + T   T++G+ +V 
Sbjct: 183 TSYGNAGCNGGLMDYAFEYIIANKGICAESAY-PYKGVGGLCQ-KSCTKVVTISGYKDVA 240

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
              E +L  A+   GPVSVAI+A Q  F FY +GV+     +N    LDH VLAVGYG  
Sbjct: 241 SGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHN----LDHGVLAVGYGTT 296

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
             + YW VKNSW T WG  GY+ M    N CG+   P+Y T+
Sbjct: 297 GSQDYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYPTV 338


>gi|157835400|pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor,
           Chagasin, In Complex With Human Cathepsin L
          Length = 221

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 6   VDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 65

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 66  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 123

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 124 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 181

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221


>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
 gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
           Group]
 gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 362

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++E AY     K   LS+Q L+DC+  Y N GC GG   +++++I
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  CH         +   VN+T  +ED LK A+    PVSVA
Sbjct: 222 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 280

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  SP  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 281 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 339

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 340 YFKMEMGKNMCGIATCASY 358


>gi|45550332|gb|AAS67922.1| cathepsin L [Artemia franciscana]
          Length = 226

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 85/196 (43%), Positives = 118/196 (60%), Gaps = 3/196 (1%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
           C SC +F  TGA+E   + K  KL  LS+Q LIDCS  YGN GC GG   +++++I  + 
Sbjct: 29  CASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCSGEYGNLGCKGGWISQAFEYIKDNK 88

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           G+ T++ Y  Y  ++ +C        A   GFVN+    ED LK A+A  GPVS  ID S
Sbjct: 89  GIDTENKY-HYEAKENFCRDNPRNRGAVALGFVNIPSGEEDKLKAAVATVGPVSAVIDVS 147

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
            + F FY  GVYY+  C  S + L+H VL +G G  +G+ YW VKNSWS +WG++GY+ +
Sbjct: 148 HEGFQFYSKGVYYEPSCKTSFEHLNHEVLVIGCGSDNGEDYWLVKNSWSKHWGDEGYLKI 207

Query: 334 SI-KDNNCGVMTAPTY 348
           +  + N+CGV TA  Y
Sbjct: 208 ARNRKNHCGVATAALY 223


>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
          Length = 330

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG +E A  +K  KL  L++Q L+DC+  Y N+GC+GG   +++++I
Sbjct: 129 NQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYI 188

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL  + DY PY  QD +C      A A +   VN+T   E+ +  A+A+  PVS+A
Sbjct: 189 KYNGGLEAEKDY-PYTAQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIA 247

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  GVY +  C+++PD ++HAVLAVGYG  +G  YW VKNSW   WG  G
Sbjct: 248 FEVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNG 306

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  +    N CG+   P+Y
Sbjct: 307 YFYIIRGKNMCGLAACPSY 325


>gi|327322928|gb|AEA48885.1| cathepsin S [Oplegnathus fasciatus]
          Length = 337

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/223 (41%), Positives = 130/223 (58%), Gaps = 7/223 (3%)

Query: 133 SKDAIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S  A+P  V +  KG   S+  Q  CGSCW+F   GA+EG       KL  LS Q L+DC
Sbjct: 118 SGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDC 177

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
           S  YGN+GC+GG   R++Q+++ +     D   PY GQ   CH       A  + + ++ 
Sbjct: 178 SSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQCHYNPAYRAANCSRYSSLP 237

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
              E ALK ALA  GP+SVAIDA++ SF+FY +GVY D+    +   ++H VLAVGYG L
Sbjct: 238 EGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTYTRN---VNHGVLAVGYGTL 294

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           +GK YW VKN W + +G++G++ M+  K++ CG+     Y  M
Sbjct: 295 NGKDYWLVKNGWGSTFGDKGFIRMARNKNDQCGIALYGCYPIM 337


>gi|84660244|emb|CAI43319.1| silicatein alpha [Lubomirskia baicalensis]
 gi|85677148|emb|CAI46306.1| silicatein alpha [Lubomirskia baicalensis]
 gi|220675708|emb|CAP69653.1| silcatein [Lubomirskia baicalensis]
          Length = 326

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 126/199 (63%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+ ++F  TGA+EGA  + + K   LS+Q +IDCS  YGN+GC GG+ + ++++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 187

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T+  Y  + G+ + C   N T+ A+ TG V++   SE  L  A+A  GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAV 246

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+  +F FY +GV+    C+++   L+HA+L  GYG  +GK YW VKNSWS  WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGY 304

Query: 331 VLM-SIKDNNCGVMTAPTY 348
           +LM   K N CG+ +   Y
Sbjct: 305 ILMVRNKYNQCGIASDALY 323


>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
          Length = 334

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 122/206 (59%), Gaps = 6/206 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGSCW+F   GA+EG       +L  LS Q L+DCS  YGN+GC+GG   R++
Sbjct: 133 SVKKQGSCGSCWAFSAAGALEGQLAKTTGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAF 192

Query: 208 QWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           Q+++   G+ ++  Y PY GQ   CH       A  + +  +T   E  L+ A+A  GP+
Sbjct: 193 QYVIDNQGIDSEASY-PYRGQVQQCHYNPAFRAANCSQYRFLTQGDEGNLQAAVASIGPI 251

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           SVAIDA Q  F FY +GVY D  C+ +   ++HAVLAVGYG L+G+ YW VKNSW   +G
Sbjct: 252 SVAIDAKQPKFYFYKSGVYDDPSCSQT---INHAVLAVGYGTLNGQDYWLVKNSWGVKFG 308

Query: 327 NQGYVLM-SIKDNNCGVMTAPTYVTM 351
           ++GY+ M   K++ CG+     +  M
Sbjct: 309 DKGYIRMVRNKNDQCGIAQFACFPIM 334


>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
          Length = 450

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/216 (43%), Positives = 123/216 (56%), Gaps = 16/216 (7%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG+   + +Q  CGSCW+F  TGA+EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 242 VDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGNLG 301

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q+I  + GL +++ Y PY G D  C      A A  TGF       E AL
Sbjct: 302 CQGGLMDNAFQYIKDNGGLDSEESY-PYKGMDGTCQYKAEWAVANDTGF-------EKAL 353

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---ELDGKP 313
             A+A  GP+SVAIDA   SF FY +G+YY+  C  S + LDH VL VGYG         
Sbjct: 354 MKAVASVGPISVAIDAGHASFQFYKDGIYYEPDC--SSENLDHGVLVVGYGVEKRNSNDK 411

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           YW +KNSW   WG  GYV +   ++N+CGV +A +Y
Sbjct: 412 YWLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASY 447


>gi|449471885|ref|XP_004186123.1| PREDICTED: LOW QUALITY PROTEIN: pro-cathepsin H [Taeniopygia
           guttata]
          Length = 334

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 125/224 (55%), Gaps = 9/224 (4%)

Query: 132 ASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW 191
           A  D+I  R +      +  Q  CGSCW+F TTG +E A  +   KL  L++Q L+DC+ 
Sbjct: 108 AVPDSIDWRKKGNFVTPVKIQGACGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQ 167

Query: 192 GYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCH------IANTTATATMTG 244
            + N+GC GG   +++++I+   GL  +D Y PY  ++  C       I    A A +  
Sbjct: 168 AFNNHGCSGGLPSQAFEYILYNRGLMGEDSY-PYRAKNGTCRFQPDNDIRVGKAIAFVKD 226

Query: 245 FVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV 304
            +N+T   ED +  A+ +H PVS A + +   F  Y  GVY + +C ++PD ++HAVLAV
Sbjct: 227 VINITQYDEDGMVEAVGRHNPVSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAV 285

Query: 305 GYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           GYG+ DG PYW VKNSW   WG QGY L+    N CG+    +Y
Sbjct: 286 GYGQEDGTPYWIVKNSWGRLWGMQGYFLIERGKNMCGLAACASY 329


>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
          Length = 330

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/224 (41%), Positives = 131/224 (58%), Gaps = 7/224 (3%)

Query: 132 ASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           +S DA+P  + +  KGY +S+  Q  CGSCW+F + GA+EG       KL  LS Q L+D
Sbjct: 110 SSGDAVPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVD 169

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS  YGN GC+GG    ++Q+++ +G    D   PY G    C  +++   A  T +  V
Sbjct: 170 CSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGVQQQCSYSSSQRAANCTKYYFV 229

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
               E+ALK A+A  GP+SVAIDA++  F  Y +GVY D  C+     ++HAVL VGYG 
Sbjct: 230 RQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYNDPTCSKR---VNHAVLVVGYGT 286

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           L G+ +W VKNSW T +G+ GY+ M+   NN CG+ +   Y  M
Sbjct: 287 LSGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACYPVM 330


>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
          Length = 371

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 95/227 (41%), Positives = 129/227 (56%), Gaps = 15/227 (6%)

Query: 135 DAIP---VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCS 190
           DA P   V +  KG    + +Q  CGSCW+F  TG +EG +Y+   KL  LS+Q L+DCS
Sbjct: 150 DAAPPAEVDWRTKGAVTPVKNQGDCGSCWAFSATGGIEGQHYLATGKLVSLSEQQLVDCS 209

Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYL-GQDAY---CHIANTTATATMTGF 245
               N+GCDGG    +++++ +H G+ T+  Y PY+ G   Y   C      A   +TG+
Sbjct: 210 --SSNDGCDGGLMDLAFEYVKEHKGIDTEVHY-PYVSGNTGYARQCSFDPKYAAVNVTGY 266

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
           V++    E  L+ A+  HGP+SV I+A   SF  Y +G+Y D +CN  P  LDH VL VG
Sbjct: 267 VDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAYESGIYSDHRCN--PHDLDHGVLVVG 324

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           YG  +G PYW +KNSW   WG  GYV +    NN CGV T  +Y  M
Sbjct: 325 YGVDNGVPYWLIKNSWGEDWGENGYVRILRNHNNLCGVATMASYPLM 371


>gi|395844675|ref|XP_003795081.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/219 (40%), Positives = 128/219 (58%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F +TGA+EG  + K  KL  LS+Q L+DCS   GN+G
Sbjct: 118 VDWRKKGYVTPVKNQGQCGSCWAFSSTGALEGQMFRKTGKLISLSEQNLVDCSQRQGNHG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG    ++ ++ ++G    +   PY+ +D  C      + A  TGFVN+ P  E AL 
Sbjct: 178 CSGGLMNFAFNYVKENGGLDSEVSYPYVARDEKCKYKPEYSVANDTGFVNI-PTQEKALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+S+AIDAS  S  FY +G+YY+  C  S   LDH VL +GYG    + D   
Sbjct: 237 KAVAIIGPISIAIDASHISIQFYKSGIYYEPNC--SSKNLDHGVLLIGYGFEGTDSDDNK 294

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           +W +KNSW   WG  G + +   K+N+CG+ +A +Y T+
Sbjct: 295 FWFIKNSWGIEWGLDGCIKIAKDKNNHCGIASAASYPTV 333


>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
 gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
 gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
          Length = 344

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/267 (37%), Positives = 147/267 (55%), Gaps = 38/267 (14%)

Query: 104 GIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAI-PVRYEMKGYNSLLDQSVCGSCWSFG 162
           G   +   L+ ++ EKV   S     +  S+ A+ PV+          +Q  CG CWSF 
Sbjct: 91  GTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVK----------NQGQCGGCWSFS 140

Query: 163 TTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDY 221
           TTG+ EGA++    +L  LS+Q LIDCS    N+GCDGG    ++++I+  +G+ T+  Y
Sbjct: 141 TTGSTEGAHFQSKGELVSLSEQNLIDCS--TENSGCDGGLMTYAFEYIINNNGIDTESSY 198

Query: 222 GPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYV 281
            PY  ++  C   +  + AT++ +  VT  SE +L+ A+  + PVSVAIDAS +SF  Y 
Sbjct: 199 -PYKAENGKCEYKSENSGATLSSYKTVTAGSESSLESAVNVN-PVSVAIDASHQSFQLYT 256

Query: 282 NGVYYDEKCNNSPDGLDHAVLAVGY-------------------GELDGKPYWQVKNSWS 322
           +G+YY+ +C  S + LDH VLAVGY                          YW VKNSW 
Sbjct: 257 SGIYYEPEC--SSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWG 314

Query: 323 TYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           T WG +GY+LMS  +DNNCG+ ++ ++
Sbjct: 315 TSWGIEGYILMSRNRDNNCGIASSASF 341


>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
          Length = 330

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/201 (43%), Positives = 112/201 (55%), Gaps = 4/201 (1%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F + GA+EG       KL  LS Q L+DCS  YGN GC GG   +++Q+++
Sbjct: 133 QGSCGSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSKYGNKGCHGGFMTKAFQYVI 192

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
            +     D   PY G    C        A  + +  +    E  LK ALA  GP+SV ID
Sbjct: 193 DNQGIASDQSYPYKGVQQQCIYNPAQRAANCSRYSFLPEGDEGVLKEALATIGPISVGID 252

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
           A++ SF+FY +GVY D  C       +HAVLAVGYG L G+ YW VKNSW   WG+QGY+
Sbjct: 253 ATRPSFAFYRSGVYNDPTCTKK---TNHAVLAVGYGTLGGQDYWLVKNSWGLSWGDQGYI 309

Query: 332 LMSI-KDNNCGVMTAPTYVTM 351
            MS  KDN CG+     Y  M
Sbjct: 310 RMSRNKDNQCGIALYGCYPVM 330


>gi|337255596|gb|AEI61876.1| cathepsin K [Gadus morhua]
          Length = 331

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 125/225 (55%), Gaps = 7/225 (3%)

Query: 130 NKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALID 188
           +K  K    V Y  KG   S+ +Q  CGSCW+F + GA+EG        L  LS Q L+D
Sbjct: 111 DKLEKLPKSVDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLAKTTGTLRDLSPQNLVD 170

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           C     N+GC GG    ++ ++M++G    D+  PY+GQD  C    +   A   G+  +
Sbjct: 171 CV--TENSGCGGGYMTNAFSYVMQNGGIDSDESYPYVGQDQQCGFNVSGVAAECKGYKQI 228

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG- 307
               E AL +AL K GPVSV IDA   +F FY +GVYYD  CN   + ++HAVLAVG+G 
Sbjct: 229 PVGDERALAVALFKAGPVSVGIDAGLGTFQFYQHGVYYDRNCN--AEDINHAVLAVGFGV 286

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
              GK YW +KNSW   WG++GY+LM+    N CG+    +Y  M
Sbjct: 287 TAKGKKYWIIKNSWGEDWGHKGYILMARNRGNLCGIANMASYPIM 331


>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
 gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 120/203 (59%), Gaps = 8/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  +G +EG  ++K  KL  LS+Q L+DCS   GN GC+GG    ++Q+I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYI 190

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            ++G    ++  PY  +D  C      A A  TGFV++ P  E AL   +A  GP+SVA+
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKPVATVGPISVAM 249

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
           DAS  S  FY +G+YY+  C  S   LDH VL VGYG    + +   YW VKNSW   WG
Sbjct: 250 DASHPSLQFYSSGIYYEPNC--SSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWG 307

Query: 327 NQGYV-LMSIKDNNCGVMTAPTY 348
             GY+ +   ++N+CG+ TA +Y
Sbjct: 308 MDGYIKIAKDRNNHCGLATAASY 330


>gi|27960480|gb|AAO27844.1|AF456460_1 cathepsin Q2 [Rattus norvegicus]
          Length = 343

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/243 (37%), Positives = 135/243 (55%), Gaps = 19/243 (7%)

Query: 124 SLWVRYNKAS-------KDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYM 173
           SLW R   +S       +DA+P  V +  +GY + +  Q  C SCW+F   GA+EG  + 
Sbjct: 105 SLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWAFPVVGAIEGQMFK 164

Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI 233
           K  KL  LS Q L+DCS   GN GC GG  + ++Q+++++G    +   PY G++  C  
Sbjct: 165 KTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKEGLCRY 224

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
               ++A +T FV + P +ED L  A+A  GPV+  I     S  FY  G+Y++ KCNN 
Sbjct: 225 NPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFYKKGIYHEPKCNNY 283

Query: 294 PDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
              ++HAVL VGYG    E DG  YW ++NSW   WG  GY+ +   ++N+CG+ T   Y
Sbjct: 284 ---VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQY 340

Query: 349 VTM 351
            T+
Sbjct: 341 PTV 343


>gi|170292465|pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
           L
 gi|170292466|pdb|3BC3|B Chain B, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
           L
 gi|261824911|pdb|3H8C|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
 gi|261824912|pdb|3H8C|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
           Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
          Length = 220

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A  TGFV++ P  E AL
Sbjct: 65  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 180

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
          Length = 358

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/226 (40%), Positives = 125/226 (55%), Gaps = 33/226 (14%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF  TG+ EGA+ +    L  LS+Q LIDCS  YGN+GC+GG    ++++I
Sbjct: 133 NQGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYI 192

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ T+  Y PY+ +   C      + AT++ +V+VT  SE AL+    K GPVSVA
Sbjct: 193 IANGGIDTEASY-PYVAKVQKCKYNPANSGATLSSYVDVTSGSESALQSQTVK-GPVSVA 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG---------------------- 307
           IDAS +SF  Y +GVYY+  C  S   LDH VL VGYG                      
Sbjct: 251 IDASHQSFQLYDSGVYYEPAC--SSTNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSE 308

Query: 308 -----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 347
                   G  +W+VKNSW   WG  GY+ M+  +DNNCG+ T  +
Sbjct: 309 SSDDQATQGAQFWKVKNSWGPEWGLSGYIQMARNRDNNCGIATTAS 354


>gi|74765984|sp|Q24940.1|CATLL_FASHE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
 gi|497700|gb|AAA29136.1| cathepsin [Fasciola hepatica]
          Length = 326

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +YQ++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSGPWGNNGCSGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +    P +VA+D  +  F  Y +G+Y  + C  SP  ++HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGARRPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
              G  YW VKNSW TYWG +GY+ M+  + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIAS 315


>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
          Length = 335

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/214 (41%), Positives = 124/214 (57%), Gaps = 9/214 (4%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KGY   +  Q  C +CW+F  TGA+E     +  KL  LS Q L+DCS   GNNGC GG+
Sbjct: 125 KGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGD 184

Query: 203 DFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK 262
            + ++Q+++ +G    +   PY G+D  C      + A +TGFV++ P SED L  A+A 
Sbjct: 185 TYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKAEITGFVSL-PQSEDILMAAVAT 243

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVK 318
            GP++  IDAS +SF  Y  G+Y++  C  S D + H VL VGYG    E DG  YW +K
Sbjct: 244 IGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHGVLVVGYGFKGIETDGNHYWLIK 301

Query: 319 NSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           NSW   WG +GY+ L   K+N+CG+ +   Y T+
Sbjct: 302 NSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 335


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 130/217 (59%), Gaps = 9/217 (4%)

Query: 136 AIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           AIP  V +  KG    + +Q  CGSCW+F TTG++EGA+  K  KL  LS+Q L+DC   
Sbjct: 113 AIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCD-- 170

Query: 193 YGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN 251
             ++GC GG    ++++I ++ G+ T++ Y PY  ++  C        AT+   V++   
Sbjct: 171 KKDHGCQGGLMTTAFKYIEENKGIDTEESY-PYKAKNGRCEFKKDDIGATVERHVSILTT 229

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
             +ALK A+A+ GP+SVA+DAS  SF  Y +G+Y  + C  S   LDH VL VGYG+ DG
Sbjct: 230 DCEALKKAVAEIGPISVAMDASHSSFQLYKSGIYDPKIC--SSRKLDHGVLVVGYGKEDG 287

Query: 312 KPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           + YW VKNSW   WG +GY  ++ K N CG+ T+  Y
Sbjct: 288 EEYWLVKNSWGKNWGMEGYFKIASKKNLCGICTSACY 324


>gi|296168728|emb|CAQ54044.1| silicatein alpha 4 [Baikalospongia intermedia]
          Length = 197

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 127/199 (63%), Gaps = 5/199 (2%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
           CG+ ++F  TGA+EGA  + + K  +LS+Q +IDCS  YGN+GC GG+ + + ++++ + 
Sbjct: 2   CGASYAFAATGALEGASALSNDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVIDNG 61

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           G+ T+  Y  + G+ + C  ++  + A+ TG +++   SE  L  A+A  GPV+VA+DA+
Sbjct: 62  GIDTESSY-SFQGKQSSCQYSSKNSGASATGVISIASGSETDLFAAVATVGPVAVAVDAN 120

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM 333
             +F FY +GV+    C+N+   L+HA+L  GYG  +GK YW VKNSWS  WG+ GY++M
Sbjct: 121 TNAFRFYQSGVFDSSSCSNTK--LNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGYIMM 178

Query: 334 -SIKDNNCGVMTAPTYVTM 351
              K N CG+ T   Y T+
Sbjct: 179 VRNKYNQCGIATDALYPTL 197


>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
          Length = 333

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 139/252 (55%), Gaps = 12/252 (4%)

Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTT 164
           E++R V + G K+ K   W  +       IP  V +  KGY   + DQ  C  CW+F  T
Sbjct: 86  EEFRQVMN-GLKIQKRKKWKVFQAPFFVEIPSSVDWREKGYVTPVKDQGYCLCCWAFSAT 144

Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPY 224
           GA+EG  + K  KL  LS+Q L+DCS   GN G  GG    ++Q++  +G    ++  PY
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGYSGGLIDDAFQYVKDNGGLDSEESYPY 204

Query: 225 LGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGV 284
             Q   C      + A +T + ++ P+ E+ L + LA  GP+S AIDAS  +F FY  G+
Sbjct: 205 HAQGDSCKYRPENSVANVTDYWDI-PSKENELMITLAAVGPISAAIDASLDTFRFYKEGI 263

Query: 285 YYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNN 339
           YYD  C  S + +DH VL VGYG    E + K YW +KNSW T WG  GY+ M+  +DN+
Sbjct: 264 YYDPSC--SSEDVDHGVLVVGYGADGTETENKKYWIIKNSWGTDWGMDGYIKMAKDRDNH 321

Query: 340 CGVMTAPTYVTM 351
           CG+ +  ++ T+
Sbjct: 322 CGIASLASFPTV 333


>gi|346574373|gb|AEO36958.1| silicatein-alpha 1 [Baikalospongia fungiformis]
          Length = 324

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 126/199 (63%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+ ++F  TGA+EGA  + + K   LS+Q +IDCS  YGN+GC GG+ + ++++++
Sbjct: 126 QGQCGASYAFAATGALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVI 185

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T+  Y  + G+ + C   N T+ A+ TG V++   SE  L  A+A  GPV+VA+
Sbjct: 186 DNGGIDTESSY-SFKGKQSSCQYNNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAV 244

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+  +F FY +GV+    C+++   L+HA+L  GYG  +GK YW +KNSWS  WG+ GY
Sbjct: 245 DANTNAFRFYQSGVFDSSSCSSTK--LNHAMLVTGYGSYNGKDYWLIKNSWSKNWGDSGY 302

Query: 331 VLM-SIKDNNCGVMTAPTY 348
           +LM   K N CG+ +   Y
Sbjct: 303 ILMVRNKYNQCGIASDALY 321


>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
          Length = 385

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 122/208 (58%), Gaps = 10/208 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG  ++    L  LS+Q L+DCS  YGNN C+GG    +++++
Sbjct: 181 NQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYV 240

Query: 211 M-KHGLPTQDDYGPYLG-----QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
              +G+ T+  Y PY+       +  C      A   +TG++++       LK A+  +G
Sbjct: 241 KDSNGIDTEASY-PYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYG 299

Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
           P+SVAI+A   SF  Y +GVY D++C  S D LDH VL VGYGE +G PYW +KNSW  +
Sbjct: 300 PISVAINAGLPSFMSYKSGVYSDDQC--SSDDLDHGVLLVGYGEENGIPYWLIKNSWGPH 357

Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           WG  GYV +    NN CGV +  +Y  M
Sbjct: 358 WGENGYVKILRDHNNLCGVASMASYPLM 385


>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
          Length = 333

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC+GG   +++++I
Sbjct: 132 NQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYI 191

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY   +  C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 192 LYNKGIMGEDSY-PYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW ++WG  G
Sbjct: 251 FEVTE-DFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNG 309

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  +    N CG+    +Y
Sbjct: 310 YFYIERGKNMCGLAACASY 328


>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
          Length = 335

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 119/199 (59%), Gaps = 2/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG +E  +++K  +L  LS+Q L+DC+  + NNGC+GG   +++++I
Sbjct: 135 NQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYI 194

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G    ++  PY   D  CH   +  +AT++  VN+T   E  L  A+   GPVS+A 
Sbjct: 195 HYNGGLDSEESYPYRAHDEKCHFVPSEVSATVSNVVNITSKDEMQLYNAVGTVGPVSIAY 254

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD-GKPYWQVKNSWSTYWGNQG 329
           D S   F FY  GVY  ++C   P+ ++HAVLAVGY   + G+ YW VKNSW T +G  G
Sbjct: 255 DVS-ADFRFYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESGEDYWIVKNSWGTKFGING 313

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  ++  +N CG+    +Y
Sbjct: 314 YFWIARGENMCGLADCASY 332


>gi|146386356|gb|ABQ23966.1| cathepsin H [Oryctolagus cuniculus]
          Length = 215

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC+GG   +++++I
Sbjct: 15  NQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYI 74

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY   +  C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 75  LYNKGIMGEDSY-PYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFA 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + ++  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW ++WG  G
Sbjct: 134 FEVTE-DFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNG 192

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  +    N CG+    +Y
Sbjct: 193 YFYIERGKNMCGLAACASY 211


>gi|222641669|gb|EEE69801.1| hypothetical protein OsJ_29533 [Oryza sativa Japonica Group]
          Length = 314

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTG++E AY     K   LS+Q L+DC+  Y N GC GG   +++++I
Sbjct: 114 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 173

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  CH         +   VN+T  +ED LK A+    PVSVA
Sbjct: 174 KYNGGLDTEEAY-PYTGVNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVA 232

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C  SP  ++HAVLAVGYG  +G PYW +KNSW   WG+ G
Sbjct: 233 FQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNG 291

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CG+ T  +Y
Sbjct: 292 YFKMEMGKNMCGIATCASY 310


>gi|5822035|pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L
          Length = 316

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 101 VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 160

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A   GFV++ P  E AL
Sbjct: 161 CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 218

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 219 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNN 276

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 277 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316


>gi|348504496|ref|XP_003439797.1| PREDICTED: digestive cysteine proteinase 2-like [Oreochromis
           niloticus]
          Length = 352

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 91/236 (38%), Positives = 130/236 (55%), Gaps = 8/236 (3%)

Query: 116 EGEKVSKYSLWVRYNKASKDAIPVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMK 174
            G++VS   L  R +    D   V Y   G+   + DQ  CGSCW+F TTGA+E   Y K
Sbjct: 119 RGKEVSARRL--RNSARHLDGFAVDYRSMGFVTEVKDQGFCGSCWAFSTTGAIEAQLYKK 176

Query: 175 HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY-CHI 233
             +L  LS+Q L+DCS  +G  GC G     +Y +++ +GL + + Y PY   D   C  
Sbjct: 177 TGQLISLSEQNLVDCSKSFGTYGCSGAWMANAYDYVVSNGLESSNTY-PYTSVDTQPCFY 235

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
            ++ A A +  +  +    E A+  ALA  GP++V IDA   SF FY +G+Y +  CN  
Sbjct: 236 DSSLAVAHIRDYRFIPRGDEQAMADALATIGPITVTIDADHASFLFYSSGIYDEPNCN-- 293

Query: 294 PDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
           P+ L+HAVL VGYG  +G+ YW +KNSW T WG  GY+ ++    N CG+ +   Y
Sbjct: 294 PNNLNHAVLLVGYGSQEGQDYWIIKNSWGTGWGEGGYMRIVRNGQNACGLASYALY 349


>gi|211909240|gb|ACJ12893.1| cathepsin L1D [Fasciola hepatica]
          Length = 326

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +AS  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EASNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSGPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNRQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           +   +E  LK  +   GP +VA+D  +  F  Y +G+Y  + C  SP GL+HAVLAVGYG
Sbjct: 221 LHSGNEAGLKSLVGSEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
              G  YW VKNSW   WG +GY+ M+  + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 315


>gi|440297812|gb|ELP90453.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
          Length = 304

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/238 (37%), Positives = 133/238 (55%), Gaps = 14/238 (5%)

Query: 123 YSLWVRYN-------KASKDAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYY 172
           +SL  R+N       K+ K   P   +++  N +    +Q  CG C++FG+  ++EG   
Sbjct: 61  FSLLKRFNSDPPTNVKSLKMGFPASLDLRADNKVTPVRNQGDCGGCYAFGSLSSLEGRLL 120

Query: 173 M--KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAY 230
           M   + K   LS++ ++ CS   GNNGC GG   RS+++I K+G+  + DY PY   D  
Sbjct: 121 MIGGNYKTLDLSEEQIVQCSLIEGNNGCWGGLGKRSFEYIKKNGVVQESDY-PYTAVDGK 179

Query: 231 CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC 290
           C +  T   AT+ G++NV     + LK AL   G V V+IDA       Y +GVY D+ C
Sbjct: 180 CLVDLTKKFATIGGYINVPSKDVNELKSALV-GGVVDVSIDARTVKLQLYKSGVYTDKNC 238

Query: 291 NNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
            +    L+H V AVGYG  +G+ YW VKNSW   WG++G++LM+I++N CGV T   Y
Sbjct: 239 KSEMSDLNHEVSAVGYGNENGQEYWIVKNSWGEEWGDKGFILMAIENNTCGVATDALY 296


>gi|313213752|emb|CBY40632.1| unnamed protein product [Oikopleura dioica]
          Length = 440

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T  ++E  + +    L  LS+Q L+DCS  +GN GC GG   + + +I
Sbjct: 241 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 300

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              +G+ T+  Y PY  QD  C         ++T   N+    E AL  A+   GP+SVA
Sbjct: 301 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 359

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+  C  S   LDH V AVGYG   G  ++ VKNSW+  WG+ G
Sbjct: 360 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNG 417

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y++MS  K+NNCG+ T+ +Y
Sbjct: 418 YIMMSRNKNNNCGIATSASY 437


>gi|148669360|gb|EDL01307.1| mCG12220, isoform CRA_b [Mus musculus]
          Length = 335

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/216 (41%), Positives = 127/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + ++ +GY   +  Q  C SCW+F  TGA+EG  + K  +L  LS Q L+DCS   GN G
Sbjct: 120 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 179

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C  G  + +  ++M++G    +   PY  +D  C  +   +TA +TGF    P +EDAL 
Sbjct: 180 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 238

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA   SF FY  G+YY+  C++S   + H++L VGYG    E DG+ 
Sbjct: 239 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFTGRESDGRK 296

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNS  T WGN+GY+ +S  K N+CG+ T   Y
Sbjct: 297 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 332


>gi|385298943|gb|AFI60244.1| cysteine protease/senescence-enhanced 1, partial [Panicum virgatum]
          Length = 282

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 118/200 (59%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC+  Y N GC+GG   +++++I
Sbjct: 82  NQGHCGSCWTFSTTGALEAAYTQATGKPVSLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 141

Query: 211 MKH--GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
            KH  GL T++ Y PY G +  C    +     +   VN+T  +E+ LK A+    PVSV
Sbjct: 142 -KHNGGLDTEESY-PYKGVNGLCQFKASNVGVKVLDSVNITLGAENELKDAVGLVRPVSV 199

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           A +     F  Y +GVY  + C  +P  ++HAVLAVGYG  +G PYW +KNSW   WG++
Sbjct: 200 AFEVIN-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDE 258

Query: 329 GYVLMSIKDNNCGVMTAPTY 348
           GY  M +  N CGV T  +Y
Sbjct: 259 GYFKMEMGKNMCGVATCASY 278


>gi|308322193|gb|ADO28234.1| cathepsin K [Ictalurus furcatus]
          Length = 331

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 125/228 (54%), Gaps = 9/228 (3%)

Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           Y   S D +P  + Y   GY   + +Q  CGSCW+F + GA+EG       KL  LS Q 
Sbjct: 108 YYPDSLDKLPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQLMKTTGKLVNLSPQN 167

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DC     N+GC GG    ++ ++  +G    ++  PY+GQD  C    +   A    F
Sbjct: 168 LVDCV--TENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQCAYNKSGKAAECRRF 225

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
             V   SE AL  A+AK GPVSV IDA Q +F FY  GVYYD  C+   + ++HAVLAVG
Sbjct: 226 KEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCDK--ESINHAVLAVG 283

Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           YG    GK +W VKNSW   WG +GYVLM+   NN CG+    ++  M
Sbjct: 284 YGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331


>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
          Length = 331

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 117/199 (58%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +  + ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYSREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGRNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
          Length = 326

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/225 (38%), Positives = 124/225 (55%), Gaps = 8/225 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSRPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNKQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +   GP +VA+D  +  F  Y  G+Y  + C  SP GL+HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYSGGIYQSQTC--SPLGLNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
              G  YW VKNSW   WG +GY+ M+  + N CG+ +  + + +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLLMV 322


>gi|114559420|ref|XP_001171183.1| PREDICTED: cathepsin S isoform 1 [Pan troglodytes]
 gi|397492868|ref|XP_003817342.1| PREDICTED: cathepsin S isoform 2 [Pan paniscus]
          Length = 281

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 5/206 (2%)

Query: 145 GYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGED 203
           G N L D   CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG  
Sbjct: 76  GMNHLGDMGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFM 135

Query: 204 FRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKH 263
             ++Q+I+ +     D   PY   D  C   +    AT + +  +    ED LK A+A  
Sbjct: 136 TTAFQYIIDNKGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANK 195

Query: 264 GPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWST 323
           GPVSV +DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW  
Sbjct: 196 GPVSVGVDALHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGH 252

Query: 324 YWGNQGYVLMSI-KDNNCGVMTAPTY 348
            +G +GY+ M+  K N+CG+ + P+Y
Sbjct: 253 NFGEEGYIRMARNKGNHCGIASFPSY 278


>gi|31981229|ref|NP_071721.2| cathepsin M precursor [Mus musculus]
 gi|12837968|dbj|BAB24022.1| unnamed protein product [Mus musculus]
 gi|12838184|dbj|BAB24116.1| unnamed protein product [Mus musculus]
 gi|16445013|gb|AAK00506.1| cathepsin M precursor [Mus musculus]
 gi|148669359|gb|EDL01306.1| mCG12220, isoform CRA_a [Mus musculus]
 gi|148921940|gb|AAI46421.1| Cathepsin M [synthetic construct]
 gi|151555373|gb|AAI48858.1| Cathepsin M [synthetic construct]
          Length = 333

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/216 (41%), Positives = 127/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + ++ +GY   +  Q  C SCW+F  TGA+EG  + K  +L  LS Q L+DCS   GN G
Sbjct: 118 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C  G  + +  ++M++G    +   PY  +D  C  +   +TA +TGF    P +EDAL 
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA   SF FY  G+YY+  C++S   + H++L VGYG    E DG+ 
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFTGRESDGRK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNS  T WGN+GY+ +S  K N+CG+ T   Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330


>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
          Length = 333

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/215 (41%), Positives = 120/215 (55%), Gaps = 5/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F + GA+EG       KL  LS Q L+DCS  YGN G
Sbjct: 122 VDWRQKGYVTQVKNQGSCGSCWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKG 181

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG    ++Q+++ +     D   PY G    CH   +  +A  T +  +    E  LK
Sbjct: 182 CNGGFMSEAFQYVIDNKGIDSDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLK 241

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GP+SVAIDA++ SF  + +GVY D  C      ++HAVL VGYG LDG+ YW V
Sbjct: 242 QAVAMIGPISVAIDATRPSFILWRSGVYNDLTCTQK---INHAVLVVGYGTLDGQDYWLV 298

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           KNSW T +G  GY+ MS   NN CG+     Y  M
Sbjct: 299 KNSWGTRFGENGYIRMSRNRNNQCGIALYGCYPIM 333


>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
 gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
          Length = 360

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC   + N GC+GG   +++++I
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGLAFNNFGCNGGLPSQAFEYI 219

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +      N      +   VN+T  +ED LK A+    PVSVA
Sbjct: 220 KYNGGLDTEESY-PYQGVNGISKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 278

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y +GVY  + C  +P  ++HAVLAVGYG  DG PYW +KNSW   WG++G
Sbjct: 279 FEVIT-GFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEG 337

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 338 YFKMEMGKNMCGVATCASY 356


>gi|208972992|dbj|BAG74345.1| silicatein-M4 [Ephydatia fluviatilis]
 gi|296168739|emb|CAQ54047.1| silicatein alpha 3 [Ephydatia muelleri]
          Length = 327

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/202 (41%), Positives = 118/202 (58%), Gaps = 3/202 (1%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGS ++F   GA+EGA  +   KL  LS+Q +IDCS  YGN+GC GG+ + ++
Sbjct: 125 SVQSQGSCGSSYAFAAAGALEGANALAADKLVALSEQNIIDCSVAYGNHGCSGGDVYTAF 184

Query: 208 QWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++++ +G    D   PY G+   C   +    A  TG V +T  SE  L  A+A  GP++
Sbjct: 185 KYVVDNGGIDTDSSYPYKGKQYSCQYNSKNLGAVATGVVKITSGSETDLLSAVASVGPIA 244

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VA+DA+  SF FY +GV+    C+ +   L+HA+L  GYG  +GK YW VKNSW T WG 
Sbjct: 245 VAVDATVNSFMFYQSGVFDSSSCSTTK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWGE 302

Query: 328 QGYVLM-SIKDNNCGVMTAPTY 348
            GY+ M   K N CG+ +   Y
Sbjct: 303 SGYIKMVRNKYNQCGIASDALY 324


>gi|318065049|ref|NP_001187379.1| cathepsin K precursor [Ictalurus punctatus]
 gi|308322859|gb|ADO28567.1| cathepsin K [Ictalurus punctatus]
          Length = 331

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 125/228 (54%), Gaps = 9/228 (3%)

Query: 129 YNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQA 185
           Y   S D +P  + Y   GY   + +Q  CGSCW+F + GA+EG       KL  LS Q 
Sbjct: 108 YYPDSLDKLPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQLMKTTGKLVNLSPQN 167

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DC     N+GC GG    ++ ++  +G    ++  PY+GQD  C    +   A    F
Sbjct: 168 LVDCV--TENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQCAYNKSGKAAECRRF 225

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
             V   SE AL  A+AK GPVSV IDA Q +F FY  GVYYD  C+   + ++HAVLAVG
Sbjct: 226 KEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCDK--ESINHAVLAVG 283

Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           YG    GK +W VKNSW   WG +GYVLM+   NN CG+    ++  M
Sbjct: 284 YGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331


>gi|355692920|gb|EHH27523.1| Cathepsin H, partial [Macaca mulatta]
          Length = 305

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 118/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 104 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 163

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T  +E+A+  A+A + PVS A
Sbjct: 164 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYAEEAMVEAVALYNPVSFA 222

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 223 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 281

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 282 YFLIERGKNMCGLAACASY 300


>gi|313246319|emb|CBY35240.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T  ++E  + +    L  LS+Q L+DCS  +GN GC GG   + + +I
Sbjct: 127 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 186

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              +G+ T+  Y PY  QD  C         ++T   N+    E AL  A+   GP+SVA
Sbjct: 187 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 245

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+  C  S   LDH V AVGYG   G  ++ VKNSW+  WG+ G
Sbjct: 246 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNG 303

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y++MS  K+NNCG+ T+ +Y
Sbjct: 304 YIMMSRNKNNNCGIATSASY 323


>gi|313235127|emb|CBY24999.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 116/200 (58%), Gaps = 5/200 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T  ++E  + +    L  LS+Q L+DCS  +GN GC GG   + + +I
Sbjct: 127 DQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYI 186

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              +G+ T+  Y PY  QD  C         ++T   N+    E AL  A+   GP+SVA
Sbjct: 187 HDNNGVDTEASY-PYTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVA 245

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+  C  S   LDH V AVGYG  +G  ++ VKNSW+  WG+ G
Sbjct: 246 IDASHMSFQLYTSGVYYEPNC--SSQFLDHGVTAVGYGSSNGNDFFIVKNSWAATWGDNG 303

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y++MS  K NNCG+ T+ +Y
Sbjct: 304 YIMMSRNKSNNCGIATSASY 323


>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
          Length = 331

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG   R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK  +A  GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYY+  C  +   ++H VL VGYG L+GK YW VKNSW   +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
 gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
          Length = 331

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG   R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK  +A  GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYY+  C  +   ++H VL VGYG L+GK YW VKNSW   +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|297297049|ref|XP_002804951.1| PREDICTED: cathepsin H [Macaca mulatta]
          Length = 323

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 122 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 181

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 182 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 240

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 241 FEVTQ-DFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 299

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 300 YFLIERGKNMCGLAACASY 318


>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
          Length = 331

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/199 (42%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG   R++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK  +A  GPVSV +
Sbjct: 193 IDNNGIDSDASYPYKATDQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVYY+  C  +   ++H VL VGYG L+GK YW VKNSW   +G +GY
Sbjct: 253 DASHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
          Length = 348

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/200 (43%), Positives = 124/200 (62%), Gaps = 8/200 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCWSF  TGA+E  ++ K  KL  LS+Q L+DCS  YGN+GC GG    ++ +I
Sbjct: 152 NQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYI 211

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++ G+ T+  Y PY  +D  C        AT++  + V P  E+ L   ++  GP+S+A
Sbjct: 212 KENGGIDTEQSY-PYTAKDGRCAYKPGNKAATVSQVIMV-PRGENQLAAKVSSVGPISIA 269

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + S K F FY +GVY + +C +S   L+HA+LAVGYG + GK +W VKNSW T WG+QG
Sbjct: 270 AEVSHK-FQFYHSGVYDEPQCGHS---LNHAMLAVGYGSMGGKNFWLVKNSWGTGWGDQG 325

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  K+N CG+    +Y
Sbjct: 326 YIRMAKDKNNQCGIALMASY 345


>gi|405958752|gb|EKC24846.1| Cathepsin L1 [Crassostrea gigas]
          Length = 290

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 123/199 (61%), Gaps = 4/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG    K  KL VLS Q ++DCS  +GN+GC+GG    ++++I
Sbjct: 91  NQFACGSCWAFTATGALEGQVRNKTGKLIVLSVQQMMDCSEKWGNHGCEGGLMDAAFKYI 150

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              G    +   PY   +  C    +   A + G+ ++ P SE++L +A+A  GP+S A+
Sbjct: 151 HDVGGIESNASYPYKPAEEKCKFNKSAVVAKVKGYKDL-PKSEESLMVAVATVGPISAAL 209

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF  Y +GVY +  C++    +DH+++ VGYG +DGK YW  KNSW T WG++GY
Sbjct: 210 DASHSSFQLYKSGVYDEPNCSSGQ--VDHSLVVVGYGLMDGKKYWIAKNSWGTSWGDKGY 267

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           +L+S  K+N CG+    +Y
Sbjct: 268 ILLSKDKNNQCGIANTLSY 286


>gi|301609080|ref|XP_002934105.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
          Length = 334

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/198 (41%), Positives = 118/198 (59%), Gaps = 4/198 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSC++FG  GA+E  +  K  +L   S Q L+DCS+  GNNGC GG    S+ ++ 
Sbjct: 137 QGSCGSCYAFGAVGALECQWKRKTGRLVTFSPQELVDCSYTVGNNGCKGGGSNASFTYMK 196

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
           K+G+  +  Y PY G++A C     +    +  F  +   +E  LK A+   GPV VAID
Sbjct: 197 KYGVMEESAY-PYTGKEAQCKKEKPSNVGVVKQFYRLPTGNEVLLKKAVGTVGPVYVAID 255

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
           +S++ F  Y +GVYYD  C+ +   L HAVL VGY + +G+ YW VKNSW  Y+G++GY+
Sbjct: 256 SSRQGFRMYKSGVYYDPYCSTT--SLSHAVLIVGYSKENGQYYWLVKNSWGEYFGDKGYI 313

Query: 332 LMSIKDNN-CGVMTAPTY 348
            M+ K NN CG+ T   Y
Sbjct: 314 KMARKRNNHCGIATRAAY 331


>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
          Length = 337

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/218 (41%), Positives = 129/218 (59%), Gaps = 7/218 (3%)

Query: 137 IPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGN 195
           + V + ++G   S+  Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN
Sbjct: 124 VSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLKYGN 183

Query: 196 NGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSED 254
            GC+GG   R++Q+++ + G+ ++  Y PY GQ   C    +   A  + +  +    E 
Sbjct: 184 KGCNGGFMDRAFQYVIDNKGIDSEASY-PYRGQLQQCSYNPSYRAANCSRYSFLPEGDEG 242

Query: 255 ALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPY 314
           ALK ALA  GP+SVAIDA++ +F+FY +GVY D  C      ++H VLAVGYG   G+ Y
Sbjct: 243 ALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQR---VNHGVLAVGYGTESGQDY 299

Query: 315 WQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           W VKNSW T +G++GY+ MS  K++ CG+    +Y  M
Sbjct: 300 WLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYPIM 337


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 93/222 (41%), Positives = 125/222 (56%), Gaps = 12/222 (5%)

Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           AIP  V + +KG    + DQ  CGSCW+F T  A+EG   +   KL  LS+Q L+DC   
Sbjct: 123 AIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTK 182

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
             + GC+GG     +++I+K+G  T +   PY   D  C+ A T   A +TG+  V  NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCNTATTAPVAKITGYEKVPVNS 242

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
           E +L  A+A   P+SV+IDAS  SF FY +G+Y  E C      LDH V AVGYG  +G 
Sbjct: 243 EISLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297

Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
            YW VKNSW T WG +GY+ M    + K+  CG+    +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCGIAMDSSYPT 339


>gi|440291600|gb|ELP84863.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
          Length = 315

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 127/222 (57%), Gaps = 8/222 (3%)

Query: 131 KASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVL--SQQALID 188
           K+  +++ +R + K    + DQ+ CGSC+SFG   ++EG        +  L  S++ ++ 
Sbjct: 95  KSFPESLDLRQQGK-VTKVKDQNPCGSCYSFGALASLEGRLLFVGGNVDTLDLSEEQIVR 153

Query: 189 CSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV 248
           CS    NNGC+GG     Y +I  +G+  + DY PY   D  C    T   AT+ G+  +
Sbjct: 154 CS---PNNGCNGGTGSVVYDYIKGNGILQEKDY-PYTASDGKCEADLTYIFATVEGYKYI 209

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE 308
           T    DALK AL   G V V+IDAS   FS Y +GVY+D KC  S   L+H + AVGYG+
Sbjct: 210 TRKDNDALKSALVD-GVVDVSIDASSPKFSLYKSGVYHDTKCKTSLLSLNHEIAAVGYGK 268

Query: 309 LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
           L+G+ YW VKNSW T WG  GY+LM+++ N CGV + P   T
Sbjct: 269 LNGEEYWIVKNSWGTNWGENGYILMAMEGNTCGVTSEPIIPT 310


>gi|60649669|gb|AAH90560.1| LOC594890 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 355

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)

Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
           + SK + P   + +  N   S+ DQ  C + W+F + GA+E    MK +  KL  LS Q 
Sbjct: 133 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 191

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DCS  YGNNGC GG    S+++I+ +G+  + +Y PY G+D  C        +  T +
Sbjct: 192 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 250

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
             +    E  LK  +   GPVSVAIDAS+K+F  Y NGVYYD  C++S    DH+VL VG
Sbjct: 251 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 308

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
           YG  DG  YW VKNSW T +G++GY+ M+    NNCG+
Sbjct: 309 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 346


>gi|62945374|ref|NP_001017509.1| uncharacterized protein LOC498688 precursor [Rattus norvegicus]
 gi|60552853|gb|AAH91563.1| Similar to cathepsin R [Rattus norvegicus]
 gi|149039732|gb|EDL93848.1| similar to cathepsin R [Rattus norvegicus]
          Length = 334

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/219 (39%), Positives = 127/219 (57%), Gaps = 9/219 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   +  Q  C +CW+F  TGA+E     +  KL  LS Q L+DCS   GNNG
Sbjct: 119 VDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNG 178

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG+ + ++Q+++ +G    +   PY G+D  C      ++A +TGFV++ P SED L 
Sbjct: 179 CLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKNSSAEITGFVSL-PESEDILM 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
           +A+A  GP+S  IDAS +SF FY  G+Y++  C  S + + H VL VGYG    +  G  
Sbjct: 238 VAVATIGPISAGIDASHESFKFYKKGIYHEPNC--SSNSVTHGVLVVGYGFKGNDTGGDH 295

Query: 314 YWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           YW +KNSW   WG +GY+ +   K+N+C + +   Y T+
Sbjct: 296 YWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYPTI 334


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 92/214 (42%), Positives = 127/214 (59%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG+  ++ +Q  CGSCW+F     +EG ++     L  LS+Q L+DCS   GN G
Sbjct: 112 VDWRTKGFVTAIKNQGQCGSCWAFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQG 171

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN-SEDA 255
           C+GG    ++Q+++K+ G+ T+  Y PY   D  C        +T +GF ++ P+ SE A
Sbjct: 172 CNGGLMDNAFQYVIKNGGIDTEASY-PYKAVDQKCKFNAANVGSTCSGFSDILPHKSEAA 230

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L++A+A  GP+SVAIDAS  SF  Y +GVY +  C  S   LDH V AVGY    G  YW
Sbjct: 231 LQVAVAVVGPISVAIDASHTSFQLYKSGVYSESAC--SQTSLDHGVTAVGYDSSSGVAYW 288

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW T WG  GY+ MS  K+N CG+ TA +Y
Sbjct: 289 IVKNSWGTTWGQAGYIWMSRNKNNQCGIATAASY 322


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/230 (43%), Positives = 128/230 (55%), Gaps = 14/230 (6%)

Query: 128 RYNKASKDAIPVRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQAL 186
           RY  A      V +  KG   ++ DQ  CGSCW+F T  AVEG   ++  KL  LS+Q L
Sbjct: 124 RYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQEL 183

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATA-TMTGF 245
           +DC     N GCDGG    ++Q+I K+G+ T+ +Y PY G+   C +A   A A T+ G+
Sbjct: 184 MDCD-NVNNQGCDGGLMDYAFQFIHKNGITTESNY-PYQGEQGSCDLAKEKAHAVTIDGY 241

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
            +V  N E AL+ A+A   PVSVAIDAS   F FY  GV+  E C+     LDH V AVG
Sbjct: 242 EDVPANDESALQKAVAGQ-PVSVAIDASGNDFQFYSEGVFTGE-CSTD---LDHGVAAVG 296

Query: 306 YGEL-DGKPYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
           YG   DG  YW VKNSW   WG +GY+ M    S  +  CG+    +Y T
Sbjct: 297 YGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEGQCGIAMQASYPT 346


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 128/222 (57%), Gaps = 11/222 (4%)

Query: 135 DAIPVRYEMKGYNSLL---DQSVCGSCWSFGTTGAVEGAYYMKHKK-LAVLSQQALIDCS 190
           +A P   + +  N++    DQ  CGSCW+F  TG++EGA+ ++ K  L  LS+Q L+DCS
Sbjct: 123 EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCS 182

Query: 191 WGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
             YG+ GC+GG    ++++I+ + G+  +  Y PY G    C   + T   T++G+ +V 
Sbjct: 183 TSYGDAGCNGGLMDYAFEYIIANKGICAESAY-PYKGVGGLCQ-KSCTKVVTISGYKDVA 240

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGEL 309
              E +L  A+   GPVSVAI+A Q  F FY +GV+     +N    LDH VLAVGYG  
Sbjct: 241 SGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHN----LDHGVLAVGYGTT 296

Query: 310 DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
             + YW VKNSW T WG  GY+ M    N CG+   P+Y T+
Sbjct: 297 GSQDYWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQPSYPTV 338


>gi|387915678|gb|AFK11448.1| cathepsin L1 [Callorhinchus milii]
          Length = 336

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/213 (41%), Positives = 128/213 (60%), Gaps = 6/213 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F +TGA+EG  + K  KL  LS+Q L+DCS   GN+G
Sbjct: 124 VDWRDKGYVTPVKNQGACGSCWAFSSTGALEGQTFKKTGKLIPLSEQNLVDCSQKQGNHG 183

Query: 198 CDGGEDFRSYQWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   R++ +I + +G+ T+  Y PY  ++  C+       AT  G+       E AL
Sbjct: 184 CNGGMMDRAFTYIQQNNGIDTEASY-PYTAKEHPCNYDPRHNAATCHGYRYSEQYDEMAL 242

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
              +A  GP+SVAIDA   SF FY +G+Y + +C +    ++HAVL VGY    G  YW 
Sbjct: 243 AETVATIGPISVAIDAKHISFQFYKSGIYQEPRCQSY--NINHAVLVVGYNSQGGNNYWI 300

Query: 317 VKNSWSTYWGNQGYVLM-SIKDNNCGVMTAPTY 348
           VKNS+ + WGN+GY+ M   K+N+CG+ + PTY
Sbjct: 301 VKNSFGSRWGNKGYIWMPKDKNNHCGIASYPTY 333


>gi|28932708|gb|AAO60048.1| midgut cysteine proteinase 5 [Rhipicephalus appendiculatus]
          Length = 329

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 118/200 (59%), Gaps = 7/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F  TG++EG + +K  +L  LS+Q L+DCS  +GNNGC+GG    ++ +I
Sbjct: 132 DQGQCGSCWAFSATGSLEGQHLLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFNYI 191

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+ T++ Y PY   D  C        AT TGFV++    ED LK A     P  + 
Sbjct: 192 KANDGIDTEEGY-PYEAVDGECRFKKEDVGATDTGFVDIPGGIEDDLKKASFCWPPPWLW 250

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
              S  SF  Y  GVY +  C  S + LDH VL VGYG   GK YW VKNSW+  WG+QG
Sbjct: 251 --RSPSSFQLYSEGVYDESDC--SSEQLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQG 306

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+LMS  K+N CG+ +A +Y
Sbjct: 307 YILMSRDKNNQCGIASAASY 326


>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 83/208 (39%), Positives = 121/208 (58%), Gaps = 3/208 (1%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
           E  G   + DQ  CGSCW+F T G +E  + +K+++   LS+Q L+DC+  Y N GC+GG
Sbjct: 143 EHNGVTPVKDQGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAGAYDNYGCNGG 202

Query: 202 EDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTT-ATATMTGFVNVTPNSEDALKLAL 260
               ++Q+I  +G    +   PY  +D  C I  +  +   + G VN+T  SED L +A+
Sbjct: 203 LPSHAFQYISDNGGIATEAAYPYFAKDRPCTIQQSQKSVGVVGGSVNLT-KSEDELAIAI 261

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNS 320
            +HGPVS+A +     F  Y +GVY  + C N PD ++HAV+AVG+G  +G  YW VKNS
Sbjct: 262 FQHGPVSIAYEVI-DDFMDYHSGVYTTKDCKNGPDDVNHAVVAVGFGTENGVDYWLVKNS 320

Query: 321 WSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           WST WG+ GY  +    N CG+    +Y
Sbjct: 321 WSTKWGDNGYFKIQRGVNMCGINNCNSY 348


>gi|208972990|dbj|BAG74344.1| silicatein-M3 [Ephydatia fluviatilis]
          Length = 326

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 127/202 (62%), Gaps = 5/202 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CG+ ++F  TGA+EGA  + + K  +LS+Q +IDCS  YGN+GC GG+ + + ++++
Sbjct: 128 QGQCGASYAFAATGALEGASALANDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVI 187

Query: 212 KH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + G+ T+  Y  + G+ + C  ++  + A+ TG +++T  SE  L  A+A  GPV+VA+
Sbjct: 188 DNGGIDTESSY-SFQGKQSSCQYSSKNSGASATGVISITSGSETDLLAAVATVGPVAVAV 246

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+  +F FY +GV+    C+N+    +HA+L  GYG  +GK YW VKNSWS  WG+ GY
Sbjct: 247 DANTNAFRFYQSGVFDSSSCSNTKP--NHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGY 304

Query: 331 VLM-SIKDNNCGVMTAPTYVTM 351
           +LM   K N C + T   Y T+
Sbjct: 305 ILMVRNKYNQCAIATDALYPTL 326


>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
          Length = 331

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 125/214 (58%), Gaps = 7/214 (3%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCWSF  TG++EG ++    KL  LS+Q LIDCS   GN+G
Sbjct: 118 VDWRDKGYVTPVKNQGHCGSCWSFSATGSLEGQHFKSTGKLVSLSEQNLIDCSKKEGNHG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDA 255
           C GG    ++++I K+ G+ T+  Y PY  +D   C        AT  G V++   SE A
Sbjct: 178 CKGGLMDFAFEYIQKNDGIDTEQSY-PYTAKDGIECRFKKADVGATDKGKVDLPRQSEKA 236

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L+ A+A  GP+SVA+DA  +SF  Y  G+Y +  C+++   LDH VLAVGYG      YW
Sbjct: 237 LQEAVATVGPISVAMDAGHRSFQLYKRGIYTEPMCSSTK--LDHGVLAVGYGSEGEGDYW 294

Query: 316 QVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
            VKNSW   WG +G+ +++    N CG+ T  +Y
Sbjct: 295 LVKNSWGATWGMEGFFMLARNHRNECGIATQASY 328


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 121/205 (59%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG  ++    L  LS+Q L+DCS  YGNN C+GG    +++++
Sbjct: 169 NQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYV 228

Query: 211 M-KHGLPTQDDYGPYLG-----QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
              +G+ T+  Y PY+       +  C      A   +TG++++       LK A+  +G
Sbjct: 229 KDSNGIDTEASY-PYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYG 287

Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
           P+SVAI+A   SF  Y +GVY D++C  S D LDH VL VGYGE +G PYW +KNSW  +
Sbjct: 288 PISVAINAGLPSFMSYKSGVYSDDQC--SSDDLDHGVLLVGYGEENGIPYWLIKNSWGPH 345

Query: 325 WGNQGYVLMSIKDNN-CGVMTAPTY 348
           WG  GYV +    NN CGV +  +Y
Sbjct: 346 WGENGYVKILRDHNNLCGVASMASY 370


>gi|195984441|gb|ACG63793.1| silicatein A1 [Latrunculia oparinae]
          Length = 329

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 125/206 (60%), Gaps = 5/206 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+ DQS CG+ ++F    A+EGA  +    L  LS+Q LIDCS  YGN+GC GG    ++
Sbjct: 127 SVKDQSQCGASYAFSAMAALEGANALATDTLVNLSEQNLIDCSVPYGNHGCKGGNMLYAF 186

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T + Y P+ G+ + C      A   ++G V ++  SE  L  A+A  GPV
Sbjct: 187 KYVIANEGVDTANSY-PFYGKQSSCVYNEKYAAVKISGMVRISQGSESDLLGAVANVGPV 245

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VAID S  +F FY +GVY   +C++S   L+HA++  GYG   GK YW VKNSW   WG
Sbjct: 246 AVAIDGSSNAFRFYSSGVYDSSRCSSSK--LNHAIVVTGYGSYSGKKYWLVKNSWGKNWG 303

Query: 327 NQGYVLMSI-KDNNCGVMTAPTYVTM 351
           N GY++M+  K N CG+ +  +Y T+
Sbjct: 304 NYGYIMMARGKYNQCGIASDASYPTL 329


>gi|334324657|ref|XP_003340546.1| PREDICTED: cathepsin S-like isoform 2 [Monodelphis domestica]
          Length = 281

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 118/204 (57%), Gaps = 5/204 (2%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFR 205
           N L D   CGSCW+F   GA+E    +K  KL  LS Q L+DCS   Y N+GC+GG    
Sbjct: 78  NHLGDMGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTS 137

Query: 206 SYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
           ++Q+++ +     D   PY   D  C     +  AT + +  +   SE+ALK A+A  GP
Sbjct: 138 AFQYVIDNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGP 197

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYW 325
           VSV IDA   SF  Y +GVYYD  C      ++H VL +GYG LDG+ YW VKNSW  ++
Sbjct: 198 VSVGIDAKTPSFFLYKSGVYYDPSCTQK---VNHGVLVIGYGNLDGQDYWLVKNSWGLHF 254

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTY 348
           G++GYV ++  + N+CG+   P+Y
Sbjct: 255 GDKGYVRIARNRGNHCGIANFPSY 278


>gi|392873946|gb|AFM85805.1| cathepsin H [Callorhinchus milii]
          Length = 259

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTG +E A  +K  KL  L++Q L+DC+  Y N+GC+GG   +++++I
Sbjct: 58  NQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYI 117

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL  + DY PY  QD +C      A A +   VN+T   E+ +  A+A+  PVS+A
Sbjct: 118 KYNGGLEAEKDY-PYTAQDQHCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIA 176

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  GVY +  C+++PD ++HAVLAVGYG  +G  YW VKNSW   WG  G
Sbjct: 177 FEVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNG 235

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  +    N CG+   P+Y
Sbjct: 236 YFYIIRGKNMCGLAACPSY 254


>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
          Length = 335

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|402875039|ref|XP_003901328.1| PREDICTED: pro-cathepsin H [Papio anubis]
          Length = 335

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 134 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 193

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 194 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 252

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 253 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 312 YFLIERGKNMCGLAACASY 330


>gi|293342574|ref|XP_002725265.1| PREDICTED: cathepsin Q-like isoform 2 [Rattus norvegicus]
 gi|79152841|gb|AAI07914.1| Ctsq protein [Rattus norvegicus]
 gi|149039734|gb|EDL93850.1| rCG24269 [Rattus norvegicus]
          Length = 343

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 133/240 (55%), Gaps = 19/240 (7%)

Query: 124 SLWVRYNKAS-------KDAIP--VRYEMKGYNSLLD-QSVCGSCWSFGTTGAVEGAYYM 173
           SLW R   +S       +DA+P  V +  +GY + +  Q  C SCW+F   GA+EG  + 
Sbjct: 105 SLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWAFPVVGAIEGQMFK 164

Query: 174 KHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHI 233
           K  KL  LS Q L+DCS   GN GC GG  + ++Q+++++G    +   PY G++  C  
Sbjct: 165 KTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKEGLCRY 224

Query: 234 ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNS 293
               ++A +T FV + P +ED L  A+A  GPV+  I     S  FY  G+Y++ KCNN 
Sbjct: 225 NPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFYKKGIYHEPKCNNY 283

Query: 294 PDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 348
              ++HAVL VGYG    E DG  YW ++NSW   WG  GY+ +   ++N+CG+ T   Y
Sbjct: 284 ---VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNHCGIATFAQY 340


>gi|355778231|gb|EHH63267.1| Cathepsin H, partial [Macaca fascicularis]
          Length = 305

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 104 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 163

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 164 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 222

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 223 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 281

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 282 YFLIERGKNMCGLAACASY 300


>gi|291234299|ref|XP_002737086.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
          Length = 808

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/186 (45%), Positives = 118/186 (63%), Gaps = 5/186 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY   + DQ  CGSCWSF  TG++EG  + K  KL  LS+Q L+DCS   GN G
Sbjct: 594 VDWRAQGYVTEVKDQGQCGSCWSFSATGSLEGQTFKKTGKLPDLSEQNLVDCSTTQGNAG 653

Query: 198 CDGGEDFRSYQWI-MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG   ++++++ + +G+ +++ Y PY G D  C    +   AT TG+ +V   SE AL
Sbjct: 654 CNGGLMDQAFEYVKVNNGIDSEESY-PYEGVDGDCRFKASDVAATCTGYTDVESGSESAL 712

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ 316
           + A+A  GPVSVAIDAS +SF FY +GVY + +C+++   LDH VL VGYG  D   YW 
Sbjct: 713 QQAVASVGPVSVAIDASHQSFQFYSSGVYIEPECSSTE--LDHGVLTVGYGTEDSSDYWL 770

Query: 317 VKNSWS 322
           VKNS S
Sbjct: 771 VKNSES 776


>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
          Length = 311

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    K     S+Q L+
Sbjct: 87  EANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNEKTSISFSEQQLV 146

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 147 DCSGPWGNNGCSGGLMENAYEYLKRFGLETESSY-PYRAVEGQCRYNEQLGVAKVTGYYT 205

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +   GP ++A++A +  F  Y +G+Y  + C   P  L+HAVLAVGYG
Sbjct: 206 VHSGSEVELKNLVGSEGPAAIAVEA-ESDFMMYRSGIYQSQTC--LPFALNHAVLAVGYG 262

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
             DG  YW VKNSW   WG +GY+ M+  + N CG+ +
Sbjct: 263 TQDGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 300


>gi|384251069|gb|EIE24547.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
          Length = 548

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 82/211 (38%), Positives = 127/211 (60%), Gaps = 11/211 (5%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYG-NNGCDGGEDFRSYQW 209
           DQ+ CGSCW+FG T A++GAY+    K    S+Q L+DCSW +G N+ CDGG+   +  +
Sbjct: 322 DQATCGSCWAFGATAALQGAYWAATGKATSFSEQQLMDCSWKFGANHACDGGDYDNAIDY 381

Query: 210 IMK-HGLPTQDDYGPYLGQDAYC---HIANT----TATATMTGFVNVTPNSEDALKLALA 261
           ++   G+  + DY  YLGQD +C    +A+T    ++   + G+  V P+ ++AL  A+ 
Sbjct: 382 LVNVGGIAKEKDY-EYLGQDDFCGNPFMADTKRPSSSLVKVKGYSYVPPHDDEALMEAVY 440

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNS 320
             G ++V++DASQ +F FY +G Y +  C    D LDHAV  VGYG +  G  YW +KNS
Sbjct: 441 SRGTIAVSLDASQPTFRFYASGTYDEPNCMWKSDDLDHAVALVGYGTDEAGVDYWIIKNS 500

Query: 321 WSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
           WS +WG+ G++ ++   + CG+ + P Y   
Sbjct: 501 WSNHWGDGGFIKIARGHHGCGITSDPVYAVF 531


>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 330

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 91/200 (45%), Positives = 118/200 (59%), Gaps = 7/200 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCWSF TTG++EGA  +     A LS+Q L+DCS  YGN GC+GG    + +++
Sbjct: 131 DQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNEGCNGGLMDAAMKYV 190

Query: 211 MKH-GLPTQDDYGPYLGQDAY-CHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +   GL T++ Y PY   D+Y C        A ++ +++V   SE  L   L K GPVSV
Sbjct: 191 IAQGGLDTEESY-PYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDLAAKLNK-GPVSV 248

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AIDAS  SF  Y +GVYY+  C  S   LDH VLAVGYG      YW VKNSW   WG  
Sbjct: 249 AIDASHSSFQLYKSGVYYEPAC--SSYNLDHGVLAVGYGTEGSSNYWIVKNSWGPNWGLS 306

Query: 329 GYVLMSI-KDNNCGVMTAPT 347
           GY+ M+  K N+CG+ +  +
Sbjct: 307 GYIWMAKDKSNHCGISSMAS 326


>gi|211909242|gb|ACJ12894.1| cathepsin L1D [Fasciola hepatica]
          Length = 326

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +Y+++ + GL T+  Y PY   +  C        A +TG+  
Sbjct: 162 DCSGPWGNNGCGGGLMENAYEYLKQFGLETESSY-PYRAVEGQCRYNRQLGVAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           +   +E  LK  +   GP +VA+D  +  F  Y +G+Y  + C  SP GL+HAVLAVGYG
Sbjct: 221 LHSGNEAGLKSLVGSEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLGLNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
              G  YW VKNSW   WG +GY+ M+  + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIAS 315


>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
 gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
          Length = 363

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 116/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC   + N GC+GG   +++++I
Sbjct: 163 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYI 222

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C          +   VN+T  +ED LK A+A   PVSVA
Sbjct: 223 KYNGGLDTEESY-PYKGVNGICDFKAENVGVKVLDSVNITLGAEDELKDAVALVRPVSVA 281

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
                  F  Y +GVY  + C N+P  ++HAVLAVGYG  +G PYW +KNSW   WG++G
Sbjct: 282 FQVVN-GFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKG 340

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 341 YFKMEMGKNMCGVATCASY 359


>gi|380798253|gb|AFE71002.1| pro-cathepsin H preproprotein, partial [Macaca mulatta]
          Length = 242

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 117/199 (58%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 41  NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 100

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+  +D Y PY G+D  C      A   +    N+T   E+A+  A+A + PVS A
Sbjct: 101 LYNKGIMGEDTY-PYQGKDGDCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFA 159

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +Q  F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 160 FEVTQ-DFMMYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 218

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 219 YFLIERGKNMCGLAACASY 237


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 93/222 (41%), Positives = 124/222 (55%), Gaps = 12/222 (5%)

Query: 136 AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG 192
           AIP  V + +KG    + DQ  CGSCW+F T  A+EG   +   KL  LS+Q L+DC   
Sbjct: 123 AIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTK 182

Query: 193 YGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNS 252
             + GC+GG     +++I+K+G  T +   PY   D  C  A T   A +TG+  V  NS
Sbjct: 183 GEDQGCEGGLMEDGFEFIIKNGGITSETNYPYKAADGSCSAATTAPVAKITGYEKVPVNS 242

Query: 253 EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGK 312
           E +L  A+A   P+SV+IDAS  SF FY +G+Y  E C      LDH V AVGYG  +G 
Sbjct: 243 EISLLKAVANQ-PISVSIDASDSSFMFYSSGIYTGE-CGTE---LDHGVTAVGYGSANGT 297

Query: 313 PYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 350
            YW VKNSW T WG +GY+ M    + K+  CG+    +Y T
Sbjct: 298 DYWIVKNSWGTVWGEKGYIRMQRGIADKEGLCGIAMDSSYPT 339


>gi|259089092|ref|NP_001158584.1| Cathepsin L precursor [Oncorhynchus mykiss]
 gi|225705034|gb|ACO08363.1| Cathepsin L precursor [Oncorhynchus mykiss]
          Length = 330

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 88/224 (39%), Positives = 121/224 (54%), Gaps = 12/224 (5%)

Query: 133 SKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDC 189
           S  A P  ++ + Y     + +Q  CGSC++F   GA EG  + +   L  LS+Q L+DC
Sbjct: 114 SLSAAPETWDRRQYGYVTPVKNQGSCGSCYAFAAVGAPEGQLFKQTGTLLPLSEQNLVDC 173

Query: 190 SWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVT 249
           S  Y NNGCDGG   R + ++  HG+ ++  Y PY  +   C   N T  A   GF  V 
Sbjct: 174 SGDYHNNGCDGGLAMRCFSYVSDHGIMSERKY-PYTAEVGPCQYQNATKEAWCKGFNRVP 232

Query: 250 PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE- 308
              E   +  L + GP++V+++A+  SF FY +GV Y+  C+      +HAVLAVGYG  
Sbjct: 233 SLDEKVFRDTLYEVGPIAVSVNATHPSFKFYKDGVLYEPDCSTR---TNHAVLAVGYGSS 289

Query: 309 -LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
            LD   YW VKNSW T WG  GY+LM+   N CG+   P Y  M
Sbjct: 290 YLD---YWIVKNSWGTGWGRDGYILMARGYNRCGIARRPVYPIM 330


>gi|7271893|gb|AAF44677.1|AF239266_1 cathepsin L [Fasciola gigantica]
          Length = 326

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 87/218 (39%), Positives = 121/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA+K A+P   + + Y     + DQ  CGSCW+F TTGA+EG +    +  A  S+Q L+
Sbjct: 102 KANKPAVPESIDWRDYYYVTEVKDQGQCGSCWAFSTTGAMEGQFRKNERASASFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DC+  +GN+GC GG    +Y+++   GL T D Y PY   +  C      A A +T +  
Sbjct: 162 DCTRNFGNHGCGGGYMENAYEYLKHSGLET-DSYYPYQAVEGPCQYDGRLAYAKVTDYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V    E  LK  +   GP +VA+D     F  Y +G+Y+ E C   PD L HAVLAVGYG
Sbjct: 221 VHSGDEVELKNLVGTEGPAAVALDVDY-DFMMYESGIYHSETC--LPDRLTHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
             DG  YW VKNSW + WG +GY+  +  + N CG+ +
Sbjct: 278 AQDGTDYWIVKNSWGSSWGEKGYIRFARNRGNMCGIAS 315


>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
          Length = 338

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 120/203 (59%), Gaps = 7/203 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  CGSCW+F   GA+EG       KL  LS Q L+DCS  YGN+GC+GG   +++Q+++
Sbjct: 140 QGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVI 199

Query: 212 -KHGLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             HG+ +   Y PY G Q   CH +     A  + +  +    E ALK ALA  GP+SVA
Sbjct: 200 DNHGIDSDAAY-PYTGRQSQECHYSPKFRAANCSQYSFLPEGDEGALKQALATIGPISVA 258

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA +  F+FY +GVY D  C+     ++H VLAVGYG L+G+ YW VKNSW   +G+ G
Sbjct: 259 IDARRPRFAFYSSGVYDDPSCSQD---VNHGVLAVGYGTLNGQDYWLVKNSWGQTFGDNG 315

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+ M+  K++ CG+     Y  M
Sbjct: 316 YIRMARNKNDQCGIARYGCYPIM 338


>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
          Length = 330

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 89/248 (35%), Positives = 132/248 (53%), Gaps = 16/248 (6%)

Query: 105 IILEKWRLVTSEGEKVSKYSLW---VRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSF 161
           ++ E      ++G  +S +  +   V + +      PV+Y          Q  CGSCW+F
Sbjct: 90  LLTEPQNCSATKGSHISSHGPYPGSVDWREKGNYVSPVKY----------QGHCGSCWTF 139

Query: 162 GTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI-MKHGLPTQDD 220
            TTG +E    +   KL +LS+Q L+DC+  + N+GC GG   ++++++   +GL T+DD
Sbjct: 140 STTGCLESVTAIATGKLPLLSEQQLVDCAQDFNNHGCMGGLPSQAFEYVKYNNGLMTEDD 199

Query: 221 YGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFY 280
           Y PY G D  C+     A A +   VN+T   E  +  A+A+  PVS   + +   F  Y
Sbjct: 200 Y-PYTGHDGSCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVTD-DFLHY 257

Query: 281 VNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNC 340
            +GVY    C N+ D ++HAVLAVGYGE +  PYW VKNSW T WG  GY L+    N C
Sbjct: 258 KDGVYSSTTCKNTTDNVNHAVLAVGYGEKNSTPYWIVKNSWGTNWGMDGYFLIERGRNMC 317

Query: 341 GVMTAPTY 348
           G+    +Y
Sbjct: 318 GLAACSSY 325


>gi|134025544|gb|AAI35768.1| LOC594890 protein [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)

Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
           + SK + P   + +  N   S+ DQ  C + W+F + GA+E    MK +  KL  LS Q 
Sbjct: 111 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 169

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DCS  YGNNGC GG    S+++I+ +G+  + +Y PY G+D  C        +  T +
Sbjct: 170 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 228

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
             +    E  LK  +   GPVSVAIDAS+K+F  Y NGVYYD  C++S    DH+VL VG
Sbjct: 229 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 286

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
           YG  DG  YW VKNSW T +G++GY+ M+    NNCG+
Sbjct: 287 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 324


>gi|410990006|ref|XP_004001241.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 328

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 119/200 (59%), Gaps = 10/200 (5%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH- 213
           C S W+F   GA+EG  + K+ K   LS Q L+DCS   GN GC+GG    ++Q++  + 
Sbjct: 130 CASGWAFSAAGAIEGQIFRKYGKRVSLSVQNLLDCSQAEGNEGCNGGLMSNAFQYVRNNR 189

Query: 214 GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDAS 273
           GL T++ Y PY+ +D  C      + A +T F  + P  E+AL +A+   G +S AIDAS
Sbjct: 190 GLDTEESY-PYVARDGPCKYRPEYSAANVTAFQTI-PRREEALLVAMKNMGSISAAIDAS 247

Query: 274 QKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWGNQG 329
             +F FY  G+YYD KC  S + L+H VL VGYG    E D + YW VKNSW T WG  G
Sbjct: 248 LDTFRFYKGGIYYDPKC--SSEDLNHGVLVVGYGFQGKESDNQKYWFVKNSWGTDWGMGG 305

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  ++NNCG+ T  ++
Sbjct: 306 YIKMARERNNNCGIATRASF 325


>gi|197205900|gb|ACH48003.1| cathepsin [Latrunculia oparinae]
          Length = 351

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 78/206 (37%), Positives = 113/206 (54%), Gaps = 3/206 (1%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRS 206
           +++ DQ  C SC++F TT A+EG + +   +L  LS+Q +IDCS  YGN GC GG    +
Sbjct: 148 SAVKDQGRCKSCYAFATTAALEGMHALATGRLVPLSEQNVIDCSVPYGNRGCSGGSRMAT 207

Query: 207 YQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
             + + +G        PYLG+   C     +     TG V +    E  LK A+A  GPV
Sbjct: 208 IMYAVDNGGIDGTSSYPYLGRQYLCKFTEESIATGCTGMVRIKRGKEQDLKKAVAVVGPV 267

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VA+D+   SF FY +G+Y +  C+ +   L H ++ +GYG   G  YW +KNSW T WG
Sbjct: 268 TVAVDSRHTSFQFYASGIYSEPSCSRTK--LTHTLIIIGYGSKSGHDYWLLKNSWGTSWG 325

Query: 327 NQGYVLMSIK-DNNCGVMTAPTYVTM 351
             GY++MS    N CG+ T   Y T+
Sbjct: 326 EDGYIMMSRNYANQCGIATKAMYTTI 351


>gi|350606375|ref|NP_001076821.2| uncharacterized protein LOC594890 precursor [Xenopus (Silurana)
           tropicalis]
          Length = 333

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 127/218 (58%), Gaps = 10/218 (4%)

Query: 131 KASKDAIPVRYEMKGYN---SLLDQSVCGSCWSFGTTGAVEGAYYMKHK--KLAVLSQQA 185
           + SK + P   + +  N   S+ DQ  C + W+F + GA+E    MK +  KL  LS Q 
Sbjct: 111 EISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGALE-CQNMKRRTGKLESLSVQN 169

Query: 186 LIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGF 245
           L+DCS  YGNNGC GG    S+++I+ +G+  + +Y PY G+D  C        +  T +
Sbjct: 170 LLDCSQTYGNNGCKGGWVVSSFRYIIDNGIELESNY-PYQGKDGKCSYTPVKKASVCTSY 228

Query: 246 VNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
             +    E  LK  +   GPVSVAIDAS+K+F  Y NGVYYD  C++S    DH+VL VG
Sbjct: 229 RQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDPNCSSSTP--DHSVLVVG 286

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGV 342
           YG  DG  YW VKNSW T +G++GY+ M+    NNCG+
Sbjct: 287 YGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGI 324


>gi|94448668|emb|CAI91572.1| silicatein a3 [Lubomirskia baicalensis]
          Length = 344

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGS ++F   GA+EGA  +   KL  LS+Q +IDCS  YGN+GC GG+ + ++
Sbjct: 142 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 201

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T+  Y PY G+ + C   +    A  TG V +   SE  L  A+A  GP+
Sbjct: 202 KYVVDNGGIDTESSY-PYKGKKSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 260

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VA+DAS  +F FY +GV+    C+ S   L+HA+L  GYG  +GK YW VKNSW T WG
Sbjct: 261 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 318

Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
             GY+ M   K N CG+ +   Y
Sbjct: 319 ESGYIKMVRNKYNQCGIASDALY 341


>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
 gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|346574377|gb|AEO36960.1| silicatein-alpha 3 [Baikalospongia fungiformis]
          Length = 324

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGS ++F   GA+EGA  +   KL  LS+Q +IDCS  YGN+GC GG+ + ++
Sbjct: 122 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 181

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T+  Y PY G+ + C   +    A  TG V +   SE  L  A+A  GP+
Sbjct: 182 KYVVDNGGIDTESSY-PYKGKQSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 240

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VA+DAS  +F FY +GV+    C+ S   L+HA+L  GYG  +GK YW VKNSW T WG
Sbjct: 241 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 298

Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
             GY+ M   K N CG+ +   Y
Sbjct: 299 ESGYIKMVRNKYNQCGIASDALY 321


>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 344

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/212 (43%), Positives = 128/212 (60%), Gaps = 11/212 (5%)

Query: 144 KGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           +GY   + +Q +CGS W+F  TGA+EG  + K  +L  LS+Q L+DCSW  GN GC GG 
Sbjct: 123 RGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWPQGNQGCSGGL 182

Query: 203 DFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
              ++Q++    GL +++ Y PY  +   C      + A +TGFV+V+ + E AL  A+A
Sbjct: 183 MDYAFQYVKDNRGLDSEESY-PYEQRKGSCKYNPRFSAANVTGFVDVSKD-EKALMEAVA 240

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGK--PYWQV 317
             GPVSV I  + +SF FY  G+YYD KC  S + ++HAVL VGYG  E+  K   YW +
Sbjct: 241 TVGPVSVGIATTPESFLFYEGGIYYDPKC--SSENVNHAVLVVGYGFEEVGSKNNKYWLI 298

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           KNSW   WG  GY+ M+   NN CG+ TA +Y
Sbjct: 299 KNSWGKDWGMGGYMKMAKDQNNHCGIATAASY 330


>gi|312386083|gb|ADQ74586.1| silicatein alpha 3 [Lubomirskia baicalensis]
          Length = 330

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)

Query: 148 SLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           S+  Q  CGS ++F   GA+EGA  +   KL  LS+Q +IDCS  YGN+GC GG+ + ++
Sbjct: 128 SVQSQGQCGSSYAFAAAGALEGATALAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAF 187

Query: 208 QWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++++ + G+ T+  Y PY G+ + C   +    A  TG V +   SE  L  A+A  GP+
Sbjct: 188 KYVVDNGGIDTESSY-PYKGKKSSCQYNSKNVGAISTGVVKIASGSETDLLSAVASVGPI 246

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           +VA+DAS  +F FY +GV+    C+ S   L+HA+L  GYG  +GK YW VKNSW T WG
Sbjct: 247 AVAVDASVNAFMFYQSGVFDSSTCSTSK--LNHAMLVTGYGSTNGKDYWLVKNSWGTGWG 304

Query: 327 NQGYVLM-SIKDNNCGVMTAPTY 348
             GY+ M   K N CG+ +   Y
Sbjct: 305 ESGYIKMVRNKYNQCGIASDALY 327


>gi|70912393|ref|NP_783171.2| cathepsin R precursor [Rattus norvegicus]
 gi|66911479|gb|AAH97484.1| Cathepsin R [Rattus norvegicus]
 gi|149039731|gb|EDL93847.1| cathepsin R [Rattus norvegicus]
          Length = 334

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 96/267 (35%), Positives = 149/267 (55%), Gaps = 21/267 (7%)

Query: 103 NGIILE--KWRLVTSEGEKVSKYSLWVRYNKASK--------DAIP--VRYEMKGY-NSL 149
           NG I+E  ++  +T+E  +    ++ +R ++  K        + +P  V +  KGY   +
Sbjct: 71  NGFIMEMNEFGDLTAEEFRKMMVNIPIRSHRKGKIIRKRDVGNVLPKFVDWRKKGYVTRV 130

Query: 150 LDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
            +Q  C SCW+F  TGA+EG  + K  +L  LS Q L+DC+   GN GC  G+   +Y++
Sbjct: 131 QNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEY 190

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           ++ +G    +   PY G++  C      + A +TGFV++ P SED L  A+A  GP+SVA
Sbjct: 191 VLNNGGLEAEATYPYKGKEGVCRYNPKHSKAEITGFVSL-PESEDILMEAVATIGPISVA 249

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           +DAS  SF FY  G+Y +  C+N+   ++H+VL VGYG    E DG  YW +KNSW   W
Sbjct: 250 VDASFNSFGFYKKGLYDEPNCSNNT--VNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKW 307

Query: 326 GNQGYVLMSIKDNN-CGVMTAPTYVTM 351
           G +GY+ +    NN C + +   Y T+
Sbjct: 308 GLRGYMKIPKDQNNFCAIASYAHYPTV 334


>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
          Length = 328

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 134/245 (54%), Gaps = 10/245 (4%)

Query: 105 IILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTT 164
           ++ E      ++G  VS   L+        D++  R +     ++ +Q  CGSCW+F TT
Sbjct: 90  LLTEPQNCSATKGTHVSSKGLY-------PDSVDWRKKGNYVTNVKNQGPCGSCWTFSTT 142

Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGP 223
           G +E    +   KL  LS+Q L+DC+  + N+GC+GG   +++++I    GL T+DDY P
Sbjct: 143 GCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYNKGLMTEDDY-P 201

Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
           Y  QD  C      A A +   VN+T   E  +  A+A+  PVS+A + +   F  Y +G
Sbjct: 202 YTAQDGTCKFKPERAAAFVKDVVNITMYDEMGMVDAVARLNPVSMAYEVT-SDFMHYHSG 260

Query: 284 VYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
           VY   +C+N+ D ++HAVLAVGY E +  PYW VKNSW  +WG +GY  +    N CG+ 
Sbjct: 261 VYSSSECHNTTDTVNHAVLAVGYDEENVTPYWIVKNSWGPFWGMKGYFFIERGKNMCGLS 320

Query: 344 TAPTY 348
              +Y
Sbjct: 321 ACSSY 325


>gi|313103779|pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A
 gi|313103780|pdb|3KSE|B Chain B, Unreduced Cathepsin L In Complex With Stefin A
 gi|313103781|pdb|3KSE|C Chain C, Unreduced Cathepsin L In Complex With Stefin A
          Length = 220

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 11/220 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGS W+F  TGA+EG  + K  +L  LS+Q L+DCS   GN G
Sbjct: 5   VDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 64

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C+GG    ++Q++  + GL +++ Y PY   +  C      + A   GFV++ P  E AL
Sbjct: 65  CNGGLMDYAFQYVQDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKAL 122

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGK 312
             A+A  GP+SVAIDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D  
Sbjct: 123 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDDN 180

Query: 313 PYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
            YW VKNSW   WG  GYV M+  + N+CG+ +A +Y T+
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>gi|116488416|gb|AAB41670.2| secreted cathepsin L 1 [Fasciola hepatica]
          Length = 326

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 87/218 (39%), Positives = 120/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           +A+  A+P  + +   GY   + DQ  CGSCW+F TTG +EG Y    +     S+Q L+
Sbjct: 102 EANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           DCS  +GNNGC GG    +YQ++ + GL T+  Y PY   +  C        A +TGF  
Sbjct: 162 DCSRPWGNNGCGGGLMENAYQYLKQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGFYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V   SE  LK  +   GP +VA+D  +  F  Y +G+Y  + C  SP  ++HAVLAVGYG
Sbjct: 221 VHSGSEVELKNLVGAEGPAAVAVDV-ESDFMMYRSGIYQSQTC--SPLRVNHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLM-SIKDNNCGVMT 344
              G  YW VKNSW   WG +GY+ M   + N CG+ +
Sbjct: 278 TQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIAS 315


>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
          Length = 329

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 85/215 (39%), Positives = 122/215 (56%), Gaps = 5/215 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + +Q  CGSCW+F   GA+EG       ++  LS Q L+DCS  YGN G
Sbjct: 118 VDWREKGYVTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C+GG   +++Q+++  G    D+  PY   D  C    +   A  + +  V+   E+ALK
Sbjct: 178 CNGGFMTQAFQYVIDDGGIDSDEAYPYTAMDGQCRYDQSQRAANCSSYNYVSEGDEEALK 237

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
            A+A  GP+SVAIDA++  F  Y +GVY D  C  +   ++H VL VGYG L+G+ YW V
Sbjct: 238 QAVATIGPISVAIDATRPMFILYHSGVYSDPTCTQN---VNHGVLVVGYGSLNGEDYWLV 294

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           KNSW T +G+ GY+ ++  K N CG+     Y  M
Sbjct: 295 KNSWGTRFGDGGYIRIARNKGNMCGIANYACYPLM 329


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 95/216 (43%), Positives = 121/216 (56%), Gaps = 13/216 (6%)

Query: 139 VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KG    + DQ  CGSCW+F T  AVEG   +K K L  LS+Q L+DC    GN G
Sbjct: 140 VDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCD-TKGNAG 198

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           CDGG    ++Q+I KHG    +D  PY  + A C  +   A  T+ G+ +V  N E ALK
Sbjct: 199 CDGGLMDYAFQYIAKHGGVAAEDAYPYKARQASCKKSPAPAV-TIDGYEDVPANDESALK 257

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQ 316
            A+A H PVSVAI+AS   F FY  GV+   +C      LDH V AVGYG   DG  YW 
Sbjct: 258 KAVA-HQPVSVAIEASGSHFQFYSEGVFAG-RCGTE---LDHGVTAVGYGVAADGTKYWV 312

Query: 317 VKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTY 348
           VKNSW   WG +GY+ M    + K+ +CG+    +Y
Sbjct: 313 VKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASY 348


>gi|22653680|sp|Q9JL96.1|CATM_MOUSE RecName: Full=Cathepsin M; Flags: Precursor
 gi|7715970|gb|AAF68224.1|AF202528_1 cathepsin M [Mus musculus]
          Length = 333

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 89/216 (41%), Positives = 126/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + ++ +GY   +  Q  C SCW+F  TGA+EG  + K  +L  LS Q L+DCS   GN G
Sbjct: 118 INWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C  G  + +  ++M++G    +   PY  +D  C  +   +TA +TGF    P +EDAL 
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGF-EFVPKNEDALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA   SF FY  G+YY+  C++    + H++L VGYG    E DG+ 
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSCV--VTHSMLLVGYGFTGRESDGRK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNS  T WGN+GY+ +S  K N+CG+ T   Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330


>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
          Length = 331

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 89/212 (41%), Positives = 121/212 (57%), Gaps = 9/212 (4%)

Query: 141 YEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCD 199
           Y  KG   ++ DQ  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+GC 
Sbjct: 122 YRKKGIVTAVKDQGQCGSCWAFSSAGALEGMQAKKTGKLVDLSPQNLVDCV--KENDGCG 179

Query: 200 GGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKL 258
           GG    +++++    G+ ++  Y PY+ Q+  C    +   A  + +  V   +E  L  
Sbjct: 180 GGYMTNAFRYVATNRGIDSEASY-PYVAQEQSCQYKESGKAAECSSYEEVPQGNEKQLAY 238

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQV 317
           AL KHGP++V IDA+  +F  Y  GVYYD  CN  P+ ++HAVL VGYG    G+ YW V
Sbjct: 239 ALFKHGPIAVGIDATLSTFQLYSKGVYYDPNCN--PENINHAVLLVGYGVNSRGQHYWIV 296

Query: 318 KNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 348
           KNSWST WGN GYVLM+    N CG+    +Y
Sbjct: 297 KNSWSTNWGNGGYVLMARNRGNLCGIANLASY 328


>gi|410990014|ref|XP_004001245.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 334

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 122/203 (60%), Gaps = 13/203 (6%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q  C S W+F  TGA+EG  + ++    V   Q L+DCSW  GN GC+GG    ++Q++ 
Sbjct: 136 QGRCASGWAFSATGALEGQMFPENWPNVV---QNLLDCSWPQGNEGCNGGLMSNAFQYVK 192

Query: 212 KHG-LPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            +G L T++ Y PY+ +D  C      + A +T F N+ P  E+AL +A+A  GP+S AI
Sbjct: 193 NNGGLDTEESY-PYVARDGPCKYRPEHSAANVTAFENI-PQQEEALMMAVANMGPISAAI 250

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYWG 326
           DAS  +F FY  G+YYD KC  S + L+H VL VGYG    E D + YW VKNSW   WG
Sbjct: 251 DASLDTFRFYKEGIYYDPKC--SSEDLNHGVLVVGYGFQGKESDNQKYWFVKNSWGADWG 308

Query: 327 NQGYVLMSI-KDNNCGVMTAPTY 348
             GY+ M+  +DN+CG+ T  ++
Sbjct: 309 MDGYIKMAKDRDNHCGIATMASF 331


>gi|47199802|emb|CAF88807.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 261

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 90/188 (47%), Positives = 121/188 (64%), Gaps = 10/188 (5%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   + DQ  CGSCW+F TTGA+EG ++ +  KL  LS+Q L+DCS   GN G
Sbjct: 77  VDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRQTGKLVSLSEQNLVDCSRPEGNEG 136

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDA 255
           C+GG   +++Q+I  + GL ++  Y PYL   D  CH   +  +A  TGFV+V   SE A
Sbjct: 137 CNGGLMDQAFQYIKDNGGLDSEASY-PYLATDDQPCHYDPSNNSANETGFVDVPSGSERA 195

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDG 311
           L  A+A  GPVSVAIDA  +SF FY +G+YY+++C  S + LDH VL VGYG    ++DG
Sbjct: 196 LMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKEC--SSEELDHGVLVVGYGFQGEDVDG 253

Query: 312 KPYWQVKN 319
           K +W VKN
Sbjct: 254 KKFWIVKN 261


>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
          Length = 366

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 123/230 (53%), Gaps = 39/230 (16%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + DQ  CGSCWSF TTG+VEGA+ +K   +  LS+Q L+DCS   GN GC+GG    ++ 
Sbjct: 129 IKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVELSEQNLVDCSSAEGNMGCNGGLMNNAFD 188

Query: 209 WIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +I+  HG+ T+  Y       + C    T   AT++ + ++TP SE  L  A+   GPVS
Sbjct: 189 YIISNHGIDTEQSYPYTANTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVS 248

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-------ELD---------- 310
           VAIDAS +SF  Y +G+YY+  C  S   LDH VL VGYG       ++D          
Sbjct: 249 VAIDASHRSFQLYSHGIYYEWLC--SSTRLDHGVLVVGYGSGNPPNSDMDHMILKKTAKT 306

Query: 311 ------------------GKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCG 341
                              K YW VKNSWS  WG++GY+ MS  + NNCG
Sbjct: 307 DHYHGKKSLKVEKVDTTSSKNYWIVKNSWSDTWGDKGYIYMSKDRKNNCG 356


>gi|387915178|gb|AFK11198.1| cathepsin L1 [Callorhinchus milii]
          Length = 331

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 80/212 (37%), Positives = 127/212 (59%), Gaps = 6/212 (2%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  KGY   +  Q  CGSC+ F  TG++E   + K+ KL  LS+Q+++DCS  YGN G
Sbjct: 122 VDWRKKGYVTPVYYQGTCGSCYGFAATGSLEAMIFKKYGKLIKLSEQSIVDCSSYYGNLG 181

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C GG   R+Y+++M++G+ + D Y PY  +   C    ++  ATM+ ++ V  N E+AL 
Sbjct: 182 CSGGSIIRAYKFVMENGIQSADTY-PYTAKAGTCRHNKSSTVATMSHYIKVHSN-EEALA 239

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQV 317
             ++  GPV+V +    +S+ FY +G+ YD  C N     DH +L VG+G    + YW V
Sbjct: 240 QTVSTVGPVAVCVHTKTRSWQFYKSGILYDPLCKNYT--YDHGILVVGFGIQKDEHYWIV 297

Query: 318 KNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           KNSW + WG +GY+ ++  ++N+CG+ +  TY
Sbjct: 298 KNSWGSRWGMKGYIWIAKDRNNHCGISSYATY 329


>gi|392354135|ref|XP_225128.6| PREDICTED: LOW QUALITY PROTEIN: cathepsin M [Rattus norvegicus]
          Length = 333

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 126/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +  +GY  S+ +Q  C SCW+F   GA+EG  + K  +L  LS Q L+DCS   GN G
Sbjct: 118 VDWRRRGYVTSVKNQGRCNSCWAFSVAGAIEGQMFRKTGRLVSLSAQNLVDCSRPEGNRG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C  G  F +++++  +G    +   PY G++ +C      + A + GF ++  ++E+AL 
Sbjct: 178 CISGHTFYTFKYVWNNGGLEAESTYPYEGREGHCRYLPERSAARIKGF-SIISSTEEALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SV IDAS +SF+FY  G+YY+ KC N    ++HAVL VGYG    E DG+ 
Sbjct: 237 NAVATIGPISVGIDASHESFTFYSGGIYYEPKCRNKT--VNHAVLLVGYGYEGRESDGRK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTY 348
           YW +KNS    WG  GY+ ++   + +CG+ T   Y
Sbjct: 295 YWLIKNSHGVGWGMNGYMKLARGWNKHCGIATCAFY 330


>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
 gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
 gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
 gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
 gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
 gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
          Length = 331

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|414589597|tpg|DAA40168.1| TPA: hypothetical protein ZEAMMB73_868349 [Zea mays]
          Length = 252

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 114/199 (57%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E AY     K   LS+Q L+DC + + N GC GG   +++++I
Sbjct: 52  NQGHCGSCWTFSTTGALEAAYTQATGKAISLSEQQLVDCGFAFNNFGCKGGLPSQAFEYI 111

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL T++ Y PY G +  C          +   VN+T  +ED LK A+    PVSVA
Sbjct: 112 KYNGGLDTEESY-PYQGVNGICQFKAENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVA 170

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            +     F  Y  GVY  + C  +P  ++HAVLAVGYG  +G PYW +KNSW   WG++G
Sbjct: 171 FEVI-SGFRLYKTGVYTSDHCGTTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDEG 229

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y  M +  N CGV T  +Y
Sbjct: 230 YFKMEMGKNMCGVATCASY 248


>gi|308160643|gb|EFO63118.1| Cathepsin L precursor [Giardia lamblia P15]
          Length = 577

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 88/219 (40%), Positives = 129/219 (58%), Gaps = 17/219 (7%)

Query: 147 NSLLDQSVCGSCWSFGTTGAVEGAYY--------MKHKKLAVLSQQALIDCSWGYGNNGC 198
           N   DQ  CGSCW+FG  G +EG           ++H+ L   S+Q+++DC WG+G+ GC
Sbjct: 357 NMAKDQVACGSCWTFGAIGTIEGRINKLRVVEEGLRHEPLKAYSEQSIVDCYWGFGSFGC 416

Query: 199 DGGEDFRSYQWIMKHG---LPTQDDYGPYLGQDAYCHIA--NTTATATMTGFVNVTPNSE 253
           DGG+   + +W++++    +  + +Y PYLGQ+  C  A  +   + ++TG+  V   S 
Sbjct: 417 DGGDTLAALKWLVENNRGRVAFESEY-PYLGQNDLCKEALFDQEPSYSVTGYSAVKQYSI 475

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG-K 312
            +LK AL + GPV+V+I  ++ S  FY  GVY D  C +  D L HAVLAVGYG  D   
Sbjct: 476 PSLKAAL-QDGPVAVSIGITE-SLLFYSGGVYNDPACPHEYDDLAHAVLAVGYGTDDTYG 533

Query: 313 PYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVTM 351
            YW V+NSWS  WG  GY  +S+KDN CG++T  +Y  +
Sbjct: 534 DYWIVRNSWSPLWGMDGYFYLSMKDNICGILTDASYAVV 572


>gi|171948778|gb|ACB59246.1| cathepsin H [Sus scrofa]
          Length = 297

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 82/202 (40%), Positives = 118/202 (58%), Gaps = 6/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGED---FRSY 207
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG      +++
Sbjct: 93  NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPGLPSQAF 152

Query: 208 QWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPV 266
           ++I    G+  +D Y PY GQD +C      A A +    N+T N E+A+  A+A + PV
Sbjct: 153 EYIRYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPV 211

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWG 326
           S A + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG
Sbjct: 212 SFAFEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG 270

Query: 327 NQGYVLMSIKDNNCGVMTAPTY 348
             GY L+    N CG+    +Y
Sbjct: 271 MNGYFLIERGKNMCGLAACASY 292


>gi|27960487|gb|AAO27847.1|AF456463_1 cathepsin M [Mus musculus]
 gi|16323039|gb|AAL15416.1| cathepsin M [Mus musculus]
          Length = 333

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 89/216 (41%), Positives = 126/216 (58%), Gaps = 9/216 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           + ++ +GY   +  Q  C SCW+F  TGA+EG  + K   L  LS Q L+DCS   GN G
Sbjct: 118 INWKKRGYVTPVRTQGRCNSCWAFSVTGAIEGQMFRKTGPLIPLSVQNLVDCSRPQGNWG 177

Query: 198 CDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
           C  G  + +  ++M++G    +   PY  ++  C  +   +TA +TGF    P +EDAL 
Sbjct: 178 CYLGNTYLALHYVMENGGLESEATYPYEEKEGSCRYSPENSTANITGF-EFVPKNEDALM 236

Query: 258 LALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKP 313
            A+A  GP+SVAIDA   SF FY  G+YY+  C++S   + H++L VGYG    E DG+ 
Sbjct: 237 NAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSSV--VTHSMLLVGYGFAGRESDGRK 294

Query: 314 YWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
           YW VKNS  T WGN+GY+ +S  K N+CG+ T   Y
Sbjct: 295 YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330


>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
 gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
          Length = 330

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 90/245 (36%), Positives = 131/245 (53%), Gaps = 10/245 (4%)

Query: 105 IILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTT 164
           ++ E      + G  VS   L+        DAI  R +      + +Q  CGSCW+F TT
Sbjct: 90  LLTEPQNCSATRGNHVSSNGLY-------PDAIDWRTKGHYITDVKNQGPCGSCWTFSTT 142

Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGP 223
           G +E    +   KL  L++Q LIDC+  + N+GC+GG    ++++IM   GL T+DDY P
Sbjct: 143 GCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDY-P 201

Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
           Y  +   C      A A +   VN+T   E  +  A+A+  PVS A + +   F  Y +G
Sbjct: 202 YQAKGGQCRFKPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVT-SDFMHYKDG 260

Query: 284 VYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
           +Y   +C+N+ D ++HAVLAVGY E +G PYW VKNSW T WG +GY  +    N CG+ 
Sbjct: 261 IYTSTECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLA 320

Query: 344 TAPTY 348
              +Y
Sbjct: 321 ACSSY 325


>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
          Length = 332

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
          Length = 331

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 116/199 (58%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG-YGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +K  KL  LS Q L+DCS   Y N+GC+GG    ++Q++
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYV 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C     +  AT + +  +   SE+ALK A+A  GPVSV I
Sbjct: 193 IDNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGI 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYYD  C      ++H VL +GYG LDG+ YW VKNSW  ++G++GY
Sbjct: 253 DAKTPSFFLYKSGVYYDPSCTQK---VNHGVLVIGYGNLDGQDYWLVKNSWGLHFGDKGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           V ++  + N+CG+   P+Y
Sbjct: 310 VRIARNRGNHCGIANFPSY 328


>gi|354502591|ref|XP_003513367.1| PREDICTED: cathepsin L1-like isoform 1 [Cricetulus griseus]
          Length = 330

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 126/218 (57%), Gaps = 10/218 (4%)

Query: 139 VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNG 197
           V +   GY   + DQ  C SCW+F   G++EG  + K  KL  LS+Q L+DCS    NNG
Sbjct: 118 VDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHNNG 177

Query: 198 CDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDAL 256
           C GG    ++Q+I  + GL T + Y PY  QD  C      + A +TGFV V P++E+AL
Sbjct: 178 CHGGLFTSAFQYIKDNGGLDTSESY-PYEAQDGPCRYDPKHSAANITGFV-VVPSNEEAL 235

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC-NNSPDGLDHAVLAVGYG-ELDGKPY 314
             A+A  GP+S+ I    +S  FY +G YYD  C N+ P   +H+VL VGYG E DG+ Y
Sbjct: 236 MKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYP---NHSVLLVGYGEESDGQKY 292

Query: 315 WQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTYVTM 351
           W VKNSW   WG  GY+ +   ++N+C + T   Y T+
Sbjct: 293 WLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 330


>gi|391333246|ref|XP_003741030.1| PREDICTED: digestive cysteine proteinase 2-like [Metaseiulus
           occidentalis]
          Length = 327

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 125/214 (58%), Gaps = 9/214 (4%)

Query: 138 PVRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNN 196
            V +   GY   + DQ  CGSCW+F  TGAVEG Y+ K  +L  LS+Q L+DC     ++
Sbjct: 117 AVDWRQNGYVTPVKDQGKCGSCWAFAATGAVEGQYFKKTGQLVSLSEQNLVDCD--RSSD 174

Query: 197 GCDGGEDFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDA 255
           GC+GG  + S+++I  + G+ T+  YG Y      C     +  AT++G  +V    E+A
Sbjct: 175 GCEGGYFYESFEYIRSNGGIATESSYG-YEATAGSCRFTADSIGATVSGRDSVASGDEEA 233

Query: 256 LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYW 315
           L  A+A  GP+SV ID    +F  Y +GVYYD +C++S    +HAVL VGYG   G  YW
Sbjct: 234 LLKAVASIGPISVTIDVID-TFRHYSSGVYYDAECSSSS--RNHAVLVVGYGTEAGGDYW 290

Query: 316 QVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 348
            VKNSW T +G QGY+ M+  K NNCG+ +   Y
Sbjct: 291 LVKNSWGTSFGEQGYIKMARNKGNNCGIASEAGY 324


>gi|197205894|gb|ACH48000.1| silicatein A2 [Latrunculia oparinae]
          Length = 329

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 121/203 (59%), Gaps = 5/203 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQS CG+ ++F   GA+EGA  +    L+VLS+Q +IDCS  YGN+GC GG    ++++I
Sbjct: 130 DQSQCGASYAFSAVGALEGANALATGSLSVLSEQNIIDCSVPYGNHGCKGGNMLYAFKYI 189

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + GL     Y P+ G+   C   +      ++G V +   SE  L  A+A  GPVSVA
Sbjct: 190 IANDGLDVAKSY-PFQGKQQSCVYDDQDTGGKISGMVRIKQGSESDLIGAVANVGPVSVA 248

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           ID S  +F FY +GVY   +C++S   L+HA++  GYG   GK YW VKNSW T WG  G
Sbjct: 249 IDGSSNAFRFYASGVYDSSRCSSSK--LNHAMVVTGYGTYGGKDYWLVKNSWGTNWGQSG 306

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y++M+  K N CG+ +   Y T+
Sbjct: 307 YIMMARGKYNQCGIASDACYPTL 329


>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDLKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
          Length = 331

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 119/200 (59%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    +   KL  LS Q L+DCS   Y N GC GG    ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C   +    AT + +  +   SE+ALK A+A  GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAMDEKCQYDSKNRAATCSKYTELPFGSEEALKEAVASKGPVSVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF  Y +GVYY+  C      ++H VL VGYG L+G  YW VKNSW  Y+G++G
Sbjct: 252 IDASHSSFFLYRSGVYYEPACTQV---VNHGVLVVGYGNLNGNDYWLVKNSWGLYFGDKG 308

Query: 330 YVLMSI-KDNNCGVMTAPTY 348
           Y+ M+  ++N+CG+ +  +Y
Sbjct: 309 YIRMARNRENHCGIASYSSY 328


>gi|205689966|sp|A6NFJ7.3|CATL6_HUMAN RecName: Full=Putative cathepsin L-like protein 6
          Length = 277

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 100/254 (39%), Positives = 146/254 (57%), Gaps = 15/254 (5%)

Query: 108 EKWRLVTSEGEKVSKYSLWVRYNKASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTT 164
           E++R V + G +  K+    ++ +     IP  V +  KGY   + DQ  CGSCW+F  T
Sbjct: 29  EEFRQVMN-GFQYQKHRKGKQFQERLLPEIPTSVDWREKGYMTPVKDQGQCGSCWAFSAT 87

Query: 165 GAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKH-GLPTQDDYGP 223
           GA+EG  + K  KL  L++Q L+DCS   GN GC+GG     ++++ ++ GL ++  Y P
Sbjct: 88  GALEGQMFWKTGKLISLNEQNLVDCSGPQGNEGCNGGFMDNPFRYVQENGGLDSEASY-P 146

Query: 224 YLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNG 283
           Y G+   C      + A  TGFV++ P+ E  L  A+A  GP+SVA  AS  SF FY  G
Sbjct: 147 YEGKVKTCRYNPKYSAANDTGFVDI-PSQEKDLAKAVATVGPISVAAGASHVSFQFYKKG 205

Query: 284 VYYDEKCNNSPDGLDHAVLAVGY----GELDGKPYWQVKN-SWSTYWGNQGYVLMSI-KD 337
           +Y++ +C+  P+GLDHA+L VGY     + D   YW VKN S    WG  GY+ M+  + 
Sbjct: 206 IYFEPRCD--PEGLDHAMLLVGYSYEGADSDNNKYWLVKNSSEGKNWGMDGYIKMAKDRR 263

Query: 338 NNCGVMTAPTYVTM 351
           NNCG+ TA +Y T+
Sbjct: 264 NNCGIATAASYPTV 277


>gi|391341652|ref|XP_003745141.1| PREDICTED: counting factor associated protein D-like [Metaseiulus
           occidentalis]
          Length = 751

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 111/195 (56%), Gaps = 6/195 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKK--LAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           +Q  CGSC+SF +   +E  Y +++ K      S+Q ++DCSW   N GC GG    +++
Sbjct: 551 NQGTCGSCYSFASVAYLESQYIIRNGKGNTTRFSEQQIVDCSWDSLNIGCKGGFPHGAFE 610

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP-NSEDALKLALAKHGPVS 267
           ++ K+GL T+D YGPYL  +  C  A       +    + T     + L   +  HGP++
Sbjct: 611 YVQKYGLFTEDQYGPYLDDEGKCRDAEMKGEPIIPTLKSFTMMEGAECLLRHVGLHGPIA 670

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           V I  S  SF  Y  G+Y D  C++S   L HAVL VGYG L G+PYW VKNSW   WG 
Sbjct: 671 VGIHGSSDSFRAYSRGIYNDPTCDHS---LTHAVLVVGYGSLRGEPYWLVKNSWGPKWGA 727

Query: 328 QGYVLMSIKDNNCGV 342
           +GY+L+S K+N CG+
Sbjct: 728 EGYILVSRKENYCGI 742


>gi|1185459|gb|AAA87849.1| preprocathepsin cathepsin L [Schistosoma japonicum]
          Length = 331

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 79/198 (39%), Positives = 118/198 (59%), Gaps = 5/198 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           Q +CGSCW+F  TGA+EG    KHKKL  LS+Q L+DC + YGN+GC+GG    ++ ++ 
Sbjct: 134 QGLCGSCWAFSATGAIEGQLRRKHKKLVKLSEQQLVDCRYNYGNDGCEGGTMDLAFNYLE 193

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAID 271
           KH + +++DY  YLG DA CH   +     +  F ++    E  L+ A+ ++GP+SV I 
Sbjct: 194 KHYIESENDY-KYLGHDANCHYRKSKGVVKVKKFGDLPARDEKTLEKAVYQYGPISVGI- 251

Query: 272 ASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYV 331
            +  S   Y +G+Y  + C  +   ++H VLAVGYG  +GK YW +KNSW   WG  GY 
Sbjct: 252 VALDSLILYKSGIYESKDCKYAD--INHGVLAVGYGRENGKDYWLIKNSWGDLWGMNGYF 309

Query: 332 -LMSIKDNNCGVMTAPTY 348
            L   K + CG+ +  ++
Sbjct: 310 KLRRNKPHMCGISSNSSF 327


>gi|148283737|gb|ABN50361.2| cathepsin L [Fasciola hepatica]
          Length = 326

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/218 (39%), Positives = 122/218 (55%), Gaps = 8/218 (3%)

Query: 131 KASKDAIPVRYEMKGY---NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI 187
           KA+K A+P   + + Y     + +Q  CGSCW+F TTGAVEG +    +  A  S+Q L+
Sbjct: 102 KANKLAVPESIDWRDYYYVTEVKNQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLV 161

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVN 247
           +C+  +GN GC GG    +Y+++  +GL T+  Y PY   +  C      A A +TG+  
Sbjct: 162 NCTRDFGNYGCGGGYVENAYEYLKHNGLETESYY-PYQAVEGPCQYDGRLAYAKVTGYYT 220

Query: 248 VTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG 307
           V    E  LK  +   GP +VA+DA    F  Y +G+Y  + C   PD L HAVLAVGYG
Sbjct: 221 VHSGDEIELKNLVGTEGPAAVALDAD-SDFMMYQSGIYQSQTC--LPDRLTHAVLAVGYG 277

Query: 308 ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMT 344
             DG  YW VKNSW T+WG  GY+  +  + N CG+ +
Sbjct: 278 SQDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIAS 315


>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
          Length = 331

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 118/200 (59%), Gaps = 7/200 (3%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CGSCW+F   GA+E    MK  +L  LS Q L+DCS   Y N GC+GG    ++Q+I
Sbjct: 133 QGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYI 192

Query: 211 M-KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +  +G+ ++  Y PY   D  C   +    AT + +  +    E ALK A+A  GPVSVA
Sbjct: 193 IDNNGIDSEASY-PYKAVDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVA 251

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDA   SF FY +GVYYD  C  +   ++H VL VGYG L+GK YW VKNSW   +G+ G
Sbjct: 252 IDAKHSSFFFYRSGVYYDPSCTQN---VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGG 308

Query: 330 YVLMSIK-DNNCGVMTAPTY 348
           Y+ M+   +N+CG+   P+Y
Sbjct: 309 YIRMARNSENHCGIANYPSY 328


>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
          Length = 251

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/219 (40%), Positives = 126/219 (57%), Gaps = 10/219 (4%)

Query: 131 KASKDAIP--VRYEMKGY-NSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQAL 186
           +A   A+P  + +   GY   + DQ  CGSCW+F TTGA+EG  YMK +++ +  S+Q L
Sbjct: 27  RAKNRAVPTSIDWRESGYVTEVKDQGGCGSCWAFSTTGAMEGQ-YMKSQRINISFSEQQL 85

Query: 187 IDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFV 246
           +DCS  +GN+GC GG   ++Y+++   GL T+  Y PY   +  C        A ++ + 
Sbjct: 86  VDCSGDFGNHGCSGGLMEKAYEYLRHFGLETESSY-PYRADEGPCQYDKQLGVAQLSDYY 144

Query: 247 NVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGY 306
            V    E ALK  +   GP +VA+D +   F  Y +G+Y DE C  S   L+HA+LAVGY
Sbjct: 145 IVHSQDEVALKNLIGVEGPAAVALDVN-IDFMMYKSGIYQDEIC--SSRYLNHALLAVGY 201

Query: 307 GELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMT 344
           G  DG  YW VKNSW + WG  GY+ L   +DN CG+ T
Sbjct: 202 GTEDGTEYWIVKNSWGSRWGEHGYIRLARNRDNMCGIAT 240


>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
 gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
 gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
 gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
          Length = 333

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 122/206 (59%), Gaps = 10/206 (4%)

Query: 149 LLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           + +Q  CGSCW+F   GA+EG  + K   L  LS Q L+DCS   GN GC  G   ++++
Sbjct: 128 VRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFE 187

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           +++K+ GL  +  Y PY G+D  C   +  A+A +T +VN+ PN E  L +A+A  GPVS
Sbjct: 188 YVLKNKGLEAEATY-PYEGKDGPCRYRSENASANITDYVNLPPN-ELYLWVAVASIGPVS 245

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE----LDGKPYWQVKNSWST 323
            AIDAS  SF FY  G+YY+  C  S   ++HAVL VGYG      DG  YW +KNSW  
Sbjct: 246 AAIDASHDSFRFYNGGIYYEPNC--SSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGE 303

Query: 324 YWGNQGYVLMSIKDNN-CGVMTAPTY 348
            WG  GY+ ++   NN CG+ +  +Y
Sbjct: 304 EWGMNGYMQIAKDHNNHCGIASLASY 329


>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
 gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
 gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
 gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
 gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
          Length = 331

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 115/199 (57%), Gaps = 5/199 (2%)

Query: 152 QSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQWI 210
           Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+I
Sbjct: 133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
           + +     D   PY   D  C   +    AT + +  +    ED LK A+A  GPVSV +
Sbjct: 193 IDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGV 252

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +GY
Sbjct: 253 DARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 309

Query: 331 VLMSI-KDNNCGVMTAPTY 348
           + M+  K N+CG+ + P+Y
Sbjct: 310 IRMARNKGNHCGIASFPSY 328


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,976,626,717
Number of Sequences: 23463169
Number of extensions: 256321855
Number of successful extensions: 525954
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5628
Number of HSP's successfully gapped in prelim test: 1683
Number of HSP's that attempted gapping in prelim test: 505102
Number of HSP's gapped (non-prelim): 8766
length of query: 351
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 208
effective length of database: 9,003,962,200
effective search space: 1872824137600
effective search space used: 1872824137600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)